Introduction to Information Geometry

Size: px
Start display at page:

Download "Introduction to Information Geometry"

Transcription

1 Introduction to Information Geometry based on the book Methods of Information Geometry written by Shun-Ichi Amari and Hiroshi Nagaoka Yunshu Liu

2 Outline 1 Introduction to differential geometry Manifold and Submanifold Tangent vector, Tangent space and Vector field Riemannian metric and Affine connection Flatness and autoparallel 2 The Fisher metric and α-connection Exponential family Divergence and Geometric statistical inference Yunshu Liu (ASPITRG) Introduction to Information Geometry 2 / 79

3 Part I Introduction to differential geometry Yunshu Liu (ASPITRG) Introduction to Information Geometry 3 / 79

4 Basic concepts in differential geometry Basic concepts Manifold and Submanifold Tangent vector, Tangent space and Vector field Riemannian metric and Affine connection Flatness and autoparallel Yunshu Liu (ASPITRG) Introduction to Information Geometry 4 / 79

5 Manifold Manifold S Manifold: a set with a coordinate system, a one-to-one mapping from S to R n, supposed to be locally looks like an open subset of R n Elements of the set(points): points in R n, probability distribution, linear system. Figure : A coordinate system ξ for a manifold S Yunshu Liu (ASPITRG) Introduction to Information Geometry 5 / 79

6 Manifold Manifold S Definition: Let S be a set, if there exists a set of coordinate systems A for S which satisfies the condition (1) and (2) below, we call S an n-dimensional C differentiable manifold. (1) Each element ϕ of A is a one-to-one mapping from S to some open subset of R n. (2) For all ϕ A, given any one-to-one mapping ψ from S to R n, the following hold: ψ A ψ ϕ 1 is a C diffeomorphism. Here, by a C diffeomorphism we mean that ψ ϕ 1 and its inverse ϕ ψ 1 are both C (infinitely many times differentiable). Yunshu Liu (ASPITRG) Introduction to Information Geometry 6 / 79

7 Examples of Manifold Examples of one-dimensional manifold A straight line: a manifold in R 1, even if it is given in R k for k 2. Any open subset of a straight line: one-dimensional manifold A closed subset of a straight line: not a manifold A circle: locally the circle looks like a line Any open subset of a circle: one-dimensional manifold A closed subset of a circle: not a manifold. Yunshu Liu (ASPITRG) Introduction to Information Geometry 7 / 79

8 Examples of Manifold: surface of a sphere Surface of a sphere in R 3, defined by S = {(x, y, z) R 3 x 2 + y 2 + z 2 = 1}, locally it can be parameterized by using two coordinates, for example, we can use latitude and longitude as the coordinates. nd sphere(n-1 sphere): S = {(x 1, x 2,..., x n ) R n x x x2 n = 1}. Yunshu Liu (ASPITRG) Introduction to Information Geometry 8 / 79

9 Examples of Manifold: surface of a torus The torus in R 3 (surface of a doughnut): x(u, v) = ((a + b cos u)cos v, (a + b cos u)sin v, b sin u), 0 u, v < 2π. where a is the distance from the center of the tube to the center of the torus, and b is the radius of the tube. A torus is a closed surface defined as product of two circles: T 2 = S 1 S 1. n-torus: T n is defined as a product of n circles: T n = S 1 S 1 S 1. Yunshu Liu (ASPITRG) Introduction to Information Geometry 9 / 79

10 Coordinate systems for manifold Parametrization of unit hemisphere Parametrization: map of unit hemisphere into R 2 (1) by latitude and longigude; x(ϕ, θ) = (sin ϕ cos θ, sin ϕ sin θ, cos ϕ), 0 < ϕ < π/2, 0 θ < 2π (1) Yunshu Liu (ASPITRG) Introduction to Information Geometry 10 / 79

11 Coordinate systems for manifold Parametrization of unit hemisphere Parametrization: map of unit hemisphere into R 2 (2) by stereographic projections. 2u x(u, v) = ( 1 + u 2 + v 2, 2v 1 + u 2 + v 2, 1 u2 v u 2 + v 2 ) where u2 + v 2 1 (2) Yunshu Liu (ASPITRG) Introduction to Information Geometry 11 / 79

12 Examples of Manifold: colors Parametrization of color models 3 channel color models: RGB, CMYK, LAB, HSV and so on. The RGB color model: an additive color model in which red, green, and blue light are added together in various ways to reproduce a broad array of colors. The Lab color model: three coordinates of Lab represent the lightness of the color(l), its position between red and green(a) and its position between yellow and blue(b). Yunshu Liu (ASPITRG) Introduction to Information Geometry 12 / 79

13 Coordinate systems for manifold Parametrization of color models Parametrization: map of color into R 3 Examples: Yunshu Liu (ASPITRG) Introduction to Information Geometry 13 / 79

14 Submanifolds Submanifolds Definition: a submanifold M of a manifold S is a subset of S which itself has the structure of a manifold An open subset of n-dimensional manifold forms an n-dimensional submanifold. One way to construct m(<n)dimensional manifold: fix n-m coordinates. Examples: Yunshu Liu (ASPITRG) Introduction to Information Geometry 14 / 79

15 Submanifolds Examples: color models 3-dimensional submanifold: any open subset; 2-dimensional submanifold: fix one coordinate; 1-dimensional submanifold: fix two coordinates. Note: In Lab color model, we set a and b to 0 and change L from 0 to 100(from black to white), then we get a 1-dimensional submanifold. Yunshu Liu (ASPITRG) Introduction to Information Geometry 15 / 79

16 Basic concepts in differential geometry Basic concepts Manifold and Submanifold Tangent vector, Tangent space and Vector field Riemannian metric and Affine connection Flatness and autoparallel Yunshu Liu (ASPITRG) Introduction to Information Geometry 16 / 79

17 Curves and Tangent vector of Curves Curves Curve γ: I S from some interval I( R) to S. Examples: curve on sphere, set of probability distribution, set of linear systems. Using coordinate system {ξ i } to express the point γ(t) on the curve(where t I): γ i (t) = ξ i (γ(t)), then we get γ(t) = [γ 1 (t),, γ n (t)]. C Curves C : infinitely many times differentiable(sufficiently smooth). If γ(t) is C for t I, we call γ a C on manifold S. Yunshu Liu (ASPITRG) Introduction to Information Geometry 17 / 79

18 Tangent vector of Curves A tangent vector is a vector that is tangent to a curve or surface at a given point. When S is an open subset of R n, the range of γ is contained within a single linear space, hence we consider the standard derivative: γ(a) = lim h 0 γ(a + h) γ(a) h (3) In general, however, this is not true, ex: the range of γ in a color model Thus we use a more general derivative instead: γ(a) = n i=1 γ i (a)( ξ i ) p (4) where γ i (t) = ξ i γ(t), γ i (a) = d dt γi (t) t=a and ( ξ i ) p is an operator which maps f ( f ξ i ) p for given function f : S R. Yunshu Liu (ASPITRG) Introduction to Information Geometry 18 / 79

19 Tangent space Tangent space Tangent space at p: a hyperplane T p containing all the tangents of curves passing through the point p S. (dim T p (S) = dim S) n T p (S) = { c i ( ξ i ) p [c 1,, c n ] R n } i=1 Examples: hemisphere and color Yunshu Liu (ASPITRG) Introduction to Information Geometry 19 / 79

20 Vector fields Vector fields Vector fields: a map from each point in a manifold S to a tangent vector. Consider a coordinate system {ξ i } for a n-dimensional manifold, clearly i = ξ i are vector fields for i = 1,, n. Yunshu Liu (ASPITRG) Introduction to Information Geometry 20 / 79

21 Basic concepts in differential geometry Basic concepts Manifold and Submanifold Tangent vector, Tangent space and Vector field Riemannian metric and Affine connection Flatness and autoparallel Yunshu Liu (ASPITRG) Introduction to Information Geometry 21 / 79

22 Riemannian Metrics Riemannian Metrics: an inner product of two tangent vectors(d and D T p (S)) which satisfy D, D p R, and the following condition hold: Linearity : ad + bd, D p = a D, D p + b D, D p Symmetry : D, D p = D, D p Positive definiteness : If D 0 then D, D p > 0 The components {g ij } of a Riemannian metric g w.r.t. the coordinate system {ξ i } are defined by g ij = i, j, where i = ξ i. Yunshu Liu (ASPITRG) Introduction to Information Geometry 22 / 79

23 Riemannian Metrics Examples of inner product: For X = (x 1,, x n ) and Y = (y 1,, y n ), we can define inner product as X, Y 1 = X Y = n i=1 x iy i, or X, Y 2 = YMX, where M is any symmetry positive-definite matrix. For random variables X and Y, the expected value of their product: X, Y = E(XY) For square real matrix, A, B = tr(ab T ) Yunshu Liu (ASPITRG) Introduction to Information Geometry 23 / 79

24 Riemannian Metrics For unit sphere: x(ϕ, θ) = (sin ϕ cos θ, sin ϕ sin θ, cos ϕ), 0 < ϕ < π, 0 θ < 2π (5) we have: ϕ = (cos ϕ cos θ, cos ϕ sin θ, sin ϕ) θ = ( sin ϕ sin θ, sin ϕ cos θ, 0) g 11 = ϕ, ϕ, g 22 = θ, θ g 12 = g 21 = ϕ, θ = θ, ϕ If we define X, Y = YMX = 2 n i=1 x iy i, where M = ( ) g11 g (g i,j ) = 12 = g 21 g 22 ( 2 0 ) 0 2sin 2 ϕ ( ) 2 0, then 0 2 Yunshu Liu (ASPITRG) Introduction to Information Geometry 24 / 79 (6)

25 Figure : Translation of a tangent vector along a curve Yunshu Liu (ASPITRG) Introduction to Information Geometry 25 / 79 Affine connection Parallel translation along curves Let γ: [a, b] S be a curve in S, X(t) be a vector field mapping each point γ(t) to a tangent vector, if for all t [a, b] and the corresponding infinitesimal dt, the corresponding tangent vectors are linearly related, that is to say there exist a linear mapping Π p,p, such that X(t + dt) = Π p,p (X(t)) for t [a, b], we say X is parallel along γ, and call Π γ the parallel translation along γ. Linear mapping: additivity and scalar multiplication.

26 Affine connection Affine connection: relationships between tangent space at different points. Recall: Natural basis of the coordinate system [ξ i ]: ( i ) p = ( ξ i ) p : an operator which maps f ( f ξ i ) p for given function f : S R at p. Tangent space: T p (S) = { n i=1 c i ( ξ i ) p [c 1,, c n ] R n } Tangent vector(elements in Tangent space) can be represented as linear combinations of i. Tangent space T p Tangent vector X p Natural basis ( i ) p = ( ξ i ) p Yunshu Liu (ASPITRG) Introduction to Information Geometry 26 / 79

27 Affine connection If the difference between the coordinates of p and p are very small, that we can ignore the second-order infinitesimals (dξ i )(dξ j ), where dξ i = ξ i (p ) ξ i (p), then we can express difference between Π p,p (( j ) p ) and (( j ) p ) as a linear combination of {dξ 1,, dξ n }: Π p,p (( j ) p ) = ( j ) p i,k (dξ i (Γ k ij) p ( k ) p ) (7) where {(Γ k ij ) p; i, j, k = 1,, n} are n 3 numbers which depend on the point p. From X(t) = n i=1 Xi (t)( i ) p and X(t + dt) = n i=1 (Xi (t + dt)( i ) p ), we have Π p,p (X(t)) = ({X k (t) dt γ i (t)x j (t)(γ k ij) p }( k ) p ) (8) i,j,k Yunshu Liu (ASPITRG) Introduction to Information Geometry 27 / 79

28 Affine connection Π p,p (( j ) p ) = ( j ) p i,k (dξ i (Γ k ij) p ( k ) p ) Yunshu Liu (ASPITRG) Introduction to Information Geometry 28 / 79

29 Connection coefficients(christoffel s symbols): (Γ k ij ) p Given a connection on the manifold S, the value of (Γ k ij ) p are different for different coordinate systems, it shows how tangent vectors changes on a manifold, thus shows how basis vectors changes. In Π p,p (( j ) p ) = ( j ) p (dξ i (Γ k ij) p ( k ) p ) i,k if we let Γ k ij = 0 for i, j, k = x, y, we will have Π p,p (( j ) p ) = ( j ) p Yunshu Liu (ASPITRG) Introduction to Information Geometry 29 / 79

30 Connection coefficients(christoffel s symbols): (Γ k ij ) p Given a connection on a manifold S, Γ k i,j depend on coordinate system. Define a connection which makes Γ k i,j to be zero in one coordinate system, we will get non-zero connection coefficients in some other coordinate systems. Example: If it is desired to let the connection coefficients for Cartesian Coordinates of a 2D flat plane to be zero, Γ k ij = 0 for i, j, k = x, y, we can calculate the connection coefficients for Polar Coordinates: Γ ϕ rϕ = Γ ϕ ϕr = 1 r, Γ r ϕϕ = r, and Γ k ij = 0 for all others. Yunshu Liu (ASPITRG) Introduction to Information Geometry 30 / 79

31 Connection coefficients(christoffel s symbols): (Γ k ij ) p Example(cont.): Now if we want to let the connection coefficients for Polar Coordinates to be zero, Γ k ij = 0 for i, j, k = r, ϕ, we can calculate the connection coefficients for Polar Coordinates: Γ x xx = sin2 ϕ cos ϕ r, Γ y xx = sin ϕ(1+cos2 ϕ) r, Γ x xy = Γ x yx = sin3 ϕ r Γ y yy = sin ϕ cos2 ϕ r., Γ y xy = Γ y yx = cos3 ϕ r, Γ x yy = cos ϕ(1+sin2 ϕ) r, and Yunshu Liu (ASPITRG) Introduction to Information Geometry 31 / 79

32 Affine connection Covariant derivative along curves Derivative: dx(t) X(t+dt) X(t) dt = lim dt 0 dt, what if X(t) and X(t + dt) lie in different tangent spaces? X t (t + dt) = Π γ(t+dt),γ(t) (X(t + dt)) δx(t) = X t (t + dt) X(t) = Π γ(t+dt),γ(t) (X(t + dt)) X(t) Yunshu Liu (ASPITRG) Introduction to Information Geometry 32 / 79

33 Affine connection Covariant derivative along curves We call δx(t) dt the covariant derivative of X(t): δx(t) dt X t (t + dt) X(t) = lim = Π γ(t+dt),γ(t)(x(t + dt)) X(t) dt 0 dt dt Π γ(t+dt) (X(t + dt)) = ({X k (t + dt) + dt γ i (t)x j (t)(γ k ij) γ(t) }( k ) γ(t) (10) ) i,j,k (9) δx(t) dt = i,j,k ({Ẋ k (t) + γ i (t)x j (t)(γ k ij) γ(t) }( k ) γ(t) ) (11) Yunshu Liu (ASPITRG) Introduction to Information Geometry 33 / 79

34 Affine connection Covariant derivative of any two tangent vector Covariant derivative of Y w.r.t. X, where X = n i=1 (Xi i ) and Y = n i=1 (Yi i ): X Y = i,j,k (X i { i Y k + Y j Γ k ij} k ) (12) i j = Note: ( X Y) p = Xp Y T p (S) n Γ k ij k (13) k=1 Yunshu Liu (ASPITRG) Introduction to Information Geometry 34 / 79

35 Examples of Affine connection metric connection Definition: If for all vector fields X, Y, Z T (S), Z X, Y = Z X, Y + X, Z Y. where Z X, Y denotes the derivative of the function X, Y along this vector field Z, we say that is a metric connection w.r.t. g. Equivalent condition: for all basis i, j, k T (S), k i, j = k i, j + i, k i. Property: parallel translation on a metric connection preserves inner products, which means parallel transport is an isometry. Π γ (D 1 ), Π γ (D 2 ) q = D 1, D 2 p. Yunshu Liu (ASPITRG) Introduction to Information Geometry 35 / 79

36 Examples of Affine connection Levi-Civita connection For a given connection, when Γ k ij = Γk ji hold for all i, j and k, we call it a symmetric connection or torsion-free connection. From i j = n k=1 Γk ij k, we know for a symmetric connection: i j = j i If a connection is both metric and symmetric, we call it the Riemannian connection or the Levi-Civita connection w.r.t. g. Yunshu Liu (ASPITRG) Introduction to Information Geometry 36 / 79

37 Basic concepts in differential geometry Basic concepts Manifold and Submanifold Tangent vector, Tangent space and Vector field Riemannian metric and Affine connection Flatness and autoparallel Yunshu Liu (ASPITRG) Introduction to Information Geometry 37 / 79

38 Flatness Affine coordinate system Let {ξ i } be a coordinate system for S, we call {ξ i } an affine coordinate system for the connection if the n basis vector fields i = are all parallel on S. ξ i Equivalent conditions for a coordinate system to be an affine coordinate system: i j = n k=1 (Γk ij k) = 0 for all i and j (14) Γ k ij = 0 for all i, j and k (15) Flatness S is flat w.r.t the connection : an affine coordinate system exist for the connection. Yunshu Liu (ASPITRG) Introduction to Information Geometry 38 / 79

39 Flatness Examples: i j = n k=1 (Γk ij k) = 0 for all i and j Γ k ij = 0 for all i, j and k Yunshu Liu (ASPITRG) Introduction to Information Geometry 39 / 79

40 Flatness Curvature R and torsion T of a connection R( i, j ) k = l (R l ijk l) and T( i, j ) = k (T k ij k ) (16) where R l ijk and Tk ij can be computed in the following way: R l ijk = iγ l jk jγ l ik + Γl ih Γh jk Γl jh Γh ik (17) T k ij = Γ k ij Γ k ji (18) If a connection is flat, then T=R=0; If T= 0, Γ k ij = 0 for all i, j and k, we get the symmetry connnection. Yunshu Liu (ASPITRG) Introduction to Information Geometry 40 / 79

41 Flatness Curvature Curvature R = 0 iff parallel translation does not depend on curve choice. Curvature is independent of coordinate system, under Riemannian connection, we can calculate: Curvature of 2 dimensional plane: R = 0; Curvature of 3 dimensional sphere: R = 2 r 2. Yunshu Liu (ASPITRG) Introduction to Information Geometry 41 / 79

42 Autoparallel submanifold Equivalent condition for a submanifold M of S to be autoparallel X Y T (M) for X, Y T (M) (19) a b T (M) for all a and b (20) a b = (Γ c ab c) (21) c where a = u and a b = are the basis for submanifold M w.r.t. u b coordinate system {u i }. Examples of autoparallel submanifold: Open subsets of manifold S are autoparallel; A curve with the properity that all the tangent vector are parallel Yunshu Liu (ASPITRG) Introduction to Information Geometry 42 / 79

43 Autoparallel submanifold Geodesics Geodesics(autoparallel curves): A curve with tangent vector transported by parallel translation. Examples under Riemannian connection: 2 dimensional flat plane: straight line 3 dimensional sphere: great circle Yunshu Liu (ASPITRG) Introduction to Information Geometry 43 / 79

44 Autoparallel submanifold Geodesics The geodesics with respect to the Riemannian connection are known to coincide with the shortest curve joining two points. Shortest curve: curve with the shortest length. Length of a curve γ : [a, b] S: γ = b a dγ dt dt = b a g ij γ i γ j dt (22) Yunshu Liu (ASPITRG) Introduction to Information Geometry 44 / 79

45 Part II Geometric structure of statistical models and statistical inference Yunshu Liu (ASPITRG) Introduction to Information Geometry 45 / 79

46 Motivation Motivation Consider the set of probability distributions as a manifold. Analysis the relationship between the geometric structure of the manifold and statistical estimation. Introduce concepts like metric, affine connection on statistical models and studying quantities such as distance, the tangent space (which provides linear approximations), geodesics and the curvature of a manifold. Yunshu Liu (ASPITRG) Introduction to Information Geometry 46 / 79

47 Statistical models Statistical models P(X ) = {p : X R p(x) > 0 ( x X ), p(x)dx = 1} (23) Example Normal Distribution: X = R, n = 2, ξ = [µ, σ], Ξ = {[µ, σ] < µ <, 0 < σ < } p(x, ξ) = 1 (x µ)2 exp 2πσ 2σ 2 Yunshu Liu (ASPITRG) Introduction to Information Geometry 47 / 79

48 Basic concepts The Fisher metric and α-connection Exponential family Divergence and Geometric statistical inference Yunshu Liu (ASPITRG) Introduction to Information Geometry 48 / 79

49 The Fisher information matrix Fisher information matrix G(ξ) = [g i,j (ξ)], and g i,j (ξ) = E ξ [ i l ξ j l ξ ] = i l(x; ξ) j l(x; ξ)p(x; ξ)dx where l ξ = l(x; ξ) = log p(x; ξ) and E ξ denotes the expectation w.r.t. the distribution p ξ. Motivation: Sufficient statistic and Cramér-Rao bound Yunshu Liu (ASPITRG) Introduction to Information Geometry 49 / 79

50 The Fisher information matrix Sufficient statistic Sufficient statistic: for Y = F(X), given the distribution p(x; ξ) of X, we have p(x; ξ) = q(f(x); ξ)r(x; ξ), if r(x; ξ) does not depend on ξ for all x, we say that F is a sufficient statistic for the model S. Then we can write p(x; ξ) = q(y; ξ)r(x). A sufficient statistic is a function whose value contains all the information needed to compute any estimate of the parameter (e.g. a maximum likelihood estimate). Fisher information matrix and sufficient statistic Let G(ξ) be the Fisher information matrix of S = p(x; ξ), and G F (ξ) be the Fisher information matrix of the induced model S F = q(y; ξ), then we have G F (ξ) G(ξ) in the sense that G(ξ) = G F (ξ) G(ξ) is positive semidefinite. G(ξ) = 0 iff. F is a sufficient statistic for S. Yunshu Liu (ASPITRG) Introduction to Information Geometry 50 / 79

51 Cramér-Rao inequality Cramér-Rao inequality The variance of any unbiased estimator is at least as high as the inverse of the Fisher information. Unbiased estimator ˆξ: E ξ [ˆξ(X)] = ξ The variance-covariance matrix V ξ [ˆξ] = [v ij ξ ] where v ij ξ = E ξ[(ˆξ i (X) ξ i )(ˆξ j (X) ξ j )] Thus Cramér-Rao inequality state that V ξ [ˆξ] G(ξ) 1, and an unbiased estimator ˆξ satisfying V ξ [ˆξ] = G(ξ) 1 is called an efficient estimator. Yunshu Liu (ASPITRG) Introduction to Information Geometry 51 / 79

52 α-connection α-connection Let S = {p ξ } be an n-dimensional model, and consider the function Γ (α) ij,k which maps each point ξ to the following value: (Γ (α) ij,k ) ξ = E ξ [( i j l ξ + 1 α i l ξ j l ξ )( k l ξ )] (24) 2 where α is an arbitrary real number. We defined an affine connection (α) which satisfy: (α) i j, k = Γ (α) ij,k (25) where g =, is the Fisher metric. We call (α) the α-connection Yunshu Liu (ASPITRG) Introduction to Information Geometry 52 / 79

53 α-connection Properties of α-connection α-connection is a symmetric connection Relationship between α-connection and β-connection: Γ (β) ij,k = Γ(α) ij,k + α β E[ i l ξ j l ξ k l ξ ] 2 The 0-connection is the Riemannian connection with respect to the Fisher metric. Γ (β) ij,k = Γ(0) ij,k + β 2 E[ il ξ j l ξ k l ξ ] (α) = 1 + α 2 (1) + 1 α ( 1) 2 Yunshu Liu (ASPITRG) Introduction to Information Geometry 53 / 79

54 Basic concepts The Fisher metric and α-connection Exponential family Divergence and Geometric statistical inference Yunshu Liu (ASPITRG) Introduction to Information Geometry 54 / 79

55 Exponential family Exponential family p(x; θ) = exp[c(x) + n θ i F i (x) ψ(θ)] [θ i ] are called the natural parameters(coordinates), and ψ is the potential function for [θ i ], which can be calculated as ψ(θ) = log i=1 exp[c(x) + n θ i F i (x)]dx The exponential families include many of the most common distributions, including the normal, exponential, gamma, beta, Dirichlet, Bernoulli, binomial, multinomial, Poisson, and so on. i=1 Yunshu Liu (ASPITRG) Introduction to Information Geometry 55 / 79

56 Exponential family Exponential family Examples: Normal Distribution p(x; µ, σ) = 1 2πσ e (x µ)2 2σ 2 (26) where C(x) = 0, F 1 (x) = x, F 2 (x) = x 2, and θ 1 = µ σ 2, θ 2 = 1 2σ 2 are the natural parameters, the potential function is : ψ = (θ1 ) 2 4θ log( π θ 2 ) = µ2 2σ 2 + log( 2πσ) (27) Yunshu Liu (ASPITRG) Introduction to Information Geometry 56 / 79

57 Mixture family Mixture family p(x; θ) = C(x) + n θ i F i (x) In this case we say that S is a mixture family and [θ i ] are called the mixture parameters. e-connection and m-connection The natural parameters of exponential family form a 1-affine coordinate system(γ (1) ij,k = 0), which means the connection is 1-flat, we call the connection (1) the e-connection, and call exponential family e-flat. The mixture parameters of mixture family form a (-1)-affine coordinate = 0), which means the connection is (-1)-flat, and we call the system(γ ( 1) ij,k connection ( 1) the m-connection and call mixture family m-flat. i=1 Yunshu Liu (ASPITRG) Introduction to Information Geometry 57 / 79

58 Dual connection Dual connection Definition: Let S be a manifold on which there is given a Riemannian metric g and two affine connection and. If for all vector fields X, Y, Z T (S), Z < X, Y >=< Z X, Y > + < X, ZY > (28) hold, we say that and are duals of each other w.r.t. g and call one the dual connection of the other. Additional, we call the triple (g,, ) a dualistic structure on S. Yunshu Liu (ASPITRG) Introduction to Information Geometry 58 / 79

59 Dual connection Properties For any statistical model, the α-connection and the ( α)-connection are dual with respect to the Fisher metric. Π γ (D 1 ), Π γ(d 2 ) q = D 1, D 2 p. where Π γ and Π γ are parallel translation along γ w.r.t. and. R = 0 R = 0 where R and R are the curvature tensors of and. Yunshu Liu (ASPITRG) Introduction to Information Geometry 59 / 79

60 Dually flat spaces and dual coordinate system Dually flat spaces Let (g,, ) be a dualistic structure on a manifold S, then we have R = 0 R = 0, and if the connnection and are both symmetric(t = T = 0), then we see that -flatness and -flatness are equivalent. We call (S, g,, ) a dually flat space if both duals and are flat. Examples: Since α-connections and α-connections are dual w.r.t. Fisher metric and α-connections are symmetry, we have for any statistical model S and for any real number α S is α flat S is ( α) flat (29) Yunshu Liu (ASPITRG) Introduction to Information Geometry 60 / 79

61 Dually flat spaces and dual coordinate system Dual coordinate system For a particular -affine coordinate system [θ i ], if we choose a corresponding -affine coordinate system [η j ] such that g = i, j = δ j i where i = and j = θ i η j. Then we say the two coordinate systems mutually dual w.r.t. metric g, and call one the dual coordinate system of the other. Existence of dual coordinate system A pair of dual coordinate system exist if and only if (S, g,, ) is a dually flat space. Yunshu Liu (ASPITRG) Introduction to Information Geometry 61 / 79

62 Legendre transformations Consider mutually dual coordinate system [θ i ] and [η i ] with functions ψ : S R and ϕ : S R satisfy the following equations: i ψ = η i i ϕ = θ i g i,j = i η j = j η i = i j ψ ϕ(η) = max θ {θ i η i ψ(θ)} ψ(θ) = max η {θ i η i ϕ(η)} Yunshu Liu (ASPITRG) Introduction to Information Geometry 62 / 79

63 Legendre transformations Geometric interpretation for f (p) = max x (px f (x)): A convex function f (x) is shown in red, and the tangent line at point (x 0, f (x 0 )) is shown in blue. The tangent line intersects the vertical axis at (0, f ) and f is the value of the Legendre transform f (p 0 ), where p 0 = ḟ (x 0). Note that for any other point on the red curve, a line drawn through that point with the same slope as the blue line will have a y-intercept above the point (0, f ), showing that is indeed a maximum. Yunshu Liu (ASPITRG) Introduction to Information Geometry 63 / 79

64 Examples of Legendre transformations Examples: The Legendre transform of f (x) = 1 p x p (where 1 < p < ) is f (x ) = 1 q x q (where 1 < q < ), The Legendre transform of f (x) = e x is f (x ) = x ln x x (where x > 0), The Legendre transform of f (x) = 1 2 xt Ax is f (x ) = 1 2 x T A 1 x, The Legendre transform of f (x) = x is f (x ) = 0 if x 1, and f (x ) = if x > 1. Yunshu Liu (ASPITRG) Introduction to Information Geometry 64 / 79

65 The natural parameter and dual parameter of Exponential family For distribution p(x; θ) = exp[c(x) + n i=1 θi F i (x) ψ(θ)], [θ i ] are called the natural parameters If we define η i = E θ [F i ] = F i (x)p(x; θ)dx, we can verify [η i ] is a (-1)-affine coordinate system dual to [θ i ], we call this [η i ] the expectation parameters or the dual parameters. Yunshu Liu (ASPITRG) Introduction to Information Geometry 65 / 79

66 The natural parameter and dual parameter of Exponential family Recall: Normal Distribution p(x; µ, σ) = 1 2πσ e (x µ)2 2σ 2 (30) where C(x) = 0, F 1 (x) = x, F 2 (x) = x 2, and θ 1 = µ σ 2, θ 2 = 1 2σ 2 are the natural parameters, the potential function is : ψ = (θ1 ) 2 4θ log( π θ 2 ) = µ2 2σ 2 + log( 2πσ) (31) The dual parameter are calculated as η 1 = ψ θ 1 = µ = θ1 2θ 2, η 2 = ψ θ 2 = µ 2 + σ 2 = (θ1 ) 2 2θ 2 4(θ 2 ) 2, It has potential function: ϕ = 1 2 (1 + log( π θ 2 )) = 1 (1 + log(2π)) + 2logσ) (32) 2 Yunshu Liu (ASPITRG) Introduction to Information Geometry 66 / 79

67 Basic concepts The Fisher metric and α-connection Exponential family Divergence and Geometric statistical inference Yunshu Liu (ASPITRG) Introduction to Information Geometry 67 / 79

68 Divergences Let S be a manifold and suppose that we are given a smooth function D = D( ) : S S R satisfying for any p, q S: D(p q) 0 with equality iff. p = q) (33) Then we introduce a distance-like measure of the separation between two points. Yunshu Liu (ASPITRG) Introduction to Information Geometry 68 / 79

69 Divergence, semimetrics and metrics A distance satisfying positive-definiteness, symmetry and triangle inequality is called a metric; A distance satisfying positive-definiteness and symmetry is called semimetrics; A distance satisfying only positive-definiteness is called a divergence. Yunshu Liu (ASPITRG) Introduction to Information Geometry 69 / 79

70 Generally, we use Kullback-Leibler divergence to measurethe difference between two probability distributions p and q. KL measures the expected number of extra bits required to code samples from p when using a code based on q, rather than using a code based on p. Typically p represents the true distribution of data, observations, or a precisely calculated theoretical distribution. The measure q typically represents a theory, model, description, or approximation of p. Yunshu Liu (ASPITRG) Introduction to Information Geometry 70 / 79 Kullback-Leibler divergence Discrete random variables p and q: D KL (p q) = i p(x)log p(x) q(x) (34) Continuous random variables p and q: D KL (p q) = p(x)log p(x) dx (35) q(x)

71 Bregman divergence Bregman divergence associated with F for points p, q is : B F (x y) = F(y) F(x) (y x), F(x), where F(x) is a convex function defined on a closed convex set. Examples: F(x) = x 2, then B F (x y) = x y 2. More generally, if F(x) = 1 2 xt Ax, then B F (x y) = 1 2 (x y)t A(x y). KL divergence:if F = i x logx x, we get KullbackLeibler divergence. Yunshu Liu (ASPITRG) Introduction to Information Geometry 71 / 79

72 Canonical divergence Canonical divergence(a divergence for dually flat space) Let (S, g,, ) be a dually flat space, and {[θ i ], [η j ]} be mutually dual affine coordinate systems with potentials {ψ, ϕ}, then the canonical divergence((g, ) divergence) is defined as: D(p q) = ψ(p) + ϕ(q) θ i (p)η j (q) (36) Yunshu Liu (ASPITRG) Introduction to Information Geometry 72 / 79

73 Canonical divergence Properties: Relation between (g, ) divergence and (g, ) divergence: D (p q) = D(q p) If M is a autoparallel submanifold w.r.t. either or, then the (g M, M )-divergence D M = D M M is given by D M (p q) = D(p q) If is a Riemannian connection( = ) which is flat on S, there exist a coordinate system which is self-dual(θ i = η i ), then ϕ = ψ = 1 2 i (θi ) 2, then the canonical divergence is D(p q) = 1 {d(p, q)}2 2 where d(p, q) = i {θi (p) θ i (q)} 2 Yunshu Liu (ASPITRG) Introduction to Information Geometry 73 / 79

74 Canonical divergence Triangular relation Let {[θ i ], [η i ]} be mutually dual affine coordinate systems of a dually flat space (S, g,, ), and let D be a divergence on S. Then a necessary and sufficient condition for D to be the (g, )-divergence is that for all p, q, r S the following triangular relation holds: D(p q) + D(q r) D(p r) = {θ i (p) θ i (q)}{η i (p) η i (q)} (37) Yunshu Liu (ASPITRG) Introduction to Information Geometry 74 / 79

75 Canonical divergence Pythagorean relation Let p, q, and r be three points in S. Let γ 1 be the -geodesic connecting p and q, and let γ 2 be the -geodesic connecting q and r. If at the intersection q the curve γ 1 and γ 2 are orthogonal(with respect to the inner product g), then we have the following Pythagorean relation. D(p r) = D(p q) + D(q r) (38) Yunshu Liu (ASPITRG) Introduction to Information Geometry 75 / 79

76 Canonical divergence Projection theorem Let p be a point in S and let M be a submanifold of S which is -autoparallel. Then a necessary and sufficient condition for a point q in M to satisfy D(p q) = min r M D(p r) (39) is for the -geodesic connecting p and q to be orthogonal to M at q. Yunshu Liu (ASPITRG) Introduction to Information Geometry 76 / 79

77 Canonical divergence Examples From the definition of exponential family and mixture family, the product of exponential family are still exponential family, the sum of mixture family are still mixture family. e-flat submanifold: set of all product distributions: E 0 = {p X p X (x 1,, x N ) = N i=1 p X i (x i )} m-flat submanifold: set of joint distributions with given marginals: M 0 = {p X X\i p X(x) = q i (x i ) i {1,, N}} Yunshu Liu (ASPITRG) Introduction to Information Geometry 77 / 79

78 Canonical divergence Examples Yunshu Liu (ASPITRG) Introduction to Information Geometry 78 / 79

79 Thanks! Thanks! Question? Yunshu Liu (ASPITRG) Introduction to Information Geometry 79 / 79

June 21, Peking University. Dual Connections. Zhengchao Wan. Overview. Duality of connections. Divergence: general contrast functions

June 21, Peking University. Dual Connections. Zhengchao Wan. Overview. Duality of connections. Divergence: general contrast functions Dual Peking University June 21, 2016 Divergences: Riemannian connection Let M be a manifold on which there is given a Riemannian metric g =,. A connection satisfying Z X, Y = Z X, Y + X, Z Y (1) for all

More information

Information Geometric view of Belief Propagation

Information Geometric view of Belief Propagation Information Geometric view of Belief Propagation Yunshu Liu 2013-10-17 References: [1]. Shiro Ikeda, Toshiyuki Tanaka and Shun-ichi Amari, Stochastic reasoning, Free energy and Information Geometry, Neural

More information

Chapter 2 Exponential Families and Mixture Families of Probability Distributions

Chapter 2 Exponential Families and Mixture Families of Probability Distributions Chapter 2 Exponential Families and Mixture Families of Probability Distributions The present chapter studies the geometry of the exponential family of probability distributions. It is not only a typical

More information

AN AFFINE EMBEDDING OF THE GAMMA MANIFOLD

AN AFFINE EMBEDDING OF THE GAMMA MANIFOLD AN AFFINE EMBEDDING OF THE GAMMA MANIFOLD C.T.J. DODSON AND HIROSHI MATSUZOE Abstract. For the space of gamma distributions with Fisher metric and exponential connections, natural coordinate systems, potential

More information

Chapter 3. Riemannian Manifolds - I. The subject of this thesis is to extend the combinatorial curve reconstruction approach to curves

Chapter 3. Riemannian Manifolds - I. The subject of this thesis is to extend the combinatorial curve reconstruction approach to curves Chapter 3 Riemannian Manifolds - I The subject of this thesis is to extend the combinatorial curve reconstruction approach to curves embedded in Riemannian manifolds. A Riemannian manifold is an abstraction

More information

DIFFERENTIAL GEOMETRY, LECTURE 16-17, JULY 14-17

DIFFERENTIAL GEOMETRY, LECTURE 16-17, JULY 14-17 DIFFERENTIAL GEOMETRY, LECTURE 16-17, JULY 14-17 6. Geodesics A parametrized line γ : [a, b] R n in R n is straight (and the parametrization is uniform) if the vector γ (t) does not depend on t. Thus,

More information

CALCULUS ON MANIFOLDS. 1. Riemannian manifolds Recall that for any smooth manifold M, dim M = n, the union T M =

CALCULUS ON MANIFOLDS. 1. Riemannian manifolds Recall that for any smooth manifold M, dim M = n, the union T M = CALCULUS ON MANIFOLDS 1. Riemannian manifolds Recall that for any smooth manifold M, dim M = n, the union T M = a M T am, called the tangent bundle, is itself a smooth manifold, dim T M = 2n. Example 1.

More information

F -Geometry and Amari s α Geometry on a Statistical Manifold

F -Geometry and Amari s α Geometry on a Statistical Manifold Entropy 014, 16, 47-487; doi:10.3390/e160547 OPEN ACCESS entropy ISSN 1099-4300 www.mdpi.com/journal/entropy Article F -Geometry and Amari s α Geometry on a Statistical Manifold Harsha K. V. * and Subrahamanian

More information

Information geometry for bivariate distribution control

Information geometry for bivariate distribution control Information geometry for bivariate distribution control C.T.J.Dodson + Hong Wang Mathematics + Control Systems Centre, University of Manchester Institute of Science and Technology Optimal control of stochastic

More information

Information geometry of Bayesian statistics

Information geometry of Bayesian statistics Information geometry of Bayesian statistics Hiroshi Matsuzoe Department of Computer Science and Engineering, Graduate School of Engineering, Nagoya Institute of Technology, Nagoya 466-8555, Japan Abstract.

More information

5.2 The Levi-Civita Connection on Surfaces. 1 Parallel transport of vector fields on a surface M

5.2 The Levi-Civita Connection on Surfaces. 1 Parallel transport of vector fields on a surface M 5.2 The Levi-Civita Connection on Surfaces In this section, we define the parallel transport of vector fields on a surface M, and then we introduce the concept of the Levi-Civita connection, which is also

More information

Information geometry of the power inverse Gaussian distribution

Information geometry of the power inverse Gaussian distribution Information geometry of the power inverse Gaussian distribution Zhenning Zhang, Huafei Sun and Fengwei Zhong Abstract. The power inverse Gaussian distribution is a common distribution in reliability analysis

More information

Math 6455 Nov 1, Differential Geometry I Fall 2006, Georgia Tech

Math 6455 Nov 1, Differential Geometry I Fall 2006, Georgia Tech Math 6455 Nov 1, 26 1 Differential Geometry I Fall 26, Georgia Tech Lecture Notes 14 Connections Suppose that we have a vector field X on a Riemannian manifold M. How can we measure how much X is changing

More information

UNIVERSITY OF DUBLIN

UNIVERSITY OF DUBLIN UNIVERSITY OF DUBLIN TRINITY COLLEGE JS & SS Mathematics SS Theoretical Physics SS TSM Mathematics Faculty of Engineering, Mathematics and Science school of mathematics Trinity Term 2015 Module MA3429

More information

THEODORE VORONOV DIFFERENTIAL GEOMETRY. Spring 2009

THEODORE VORONOV DIFFERENTIAL GEOMETRY. Spring 2009 [under construction] 8 Parallel transport 8.1 Equation of parallel transport Consider a vector bundle E B. We would like to compare vectors belonging to fibers over different points. Recall that this was

More information

Characterizing the Region of Entropic Vectors via Information Geometry

Characterizing the Region of Entropic Vectors via Information Geometry Characterizing the Region of Entropic Vectors via Information Geometry John MacLaren Walsh Department of Electrical and Computer Engineering Drexel University Philadelphia, PA jwalsh@ece.drexel.edu Thanks

More information

arxiv: v1 [cs.lg] 17 Aug 2018

arxiv: v1 [cs.lg] 17 Aug 2018 An elementary introduction to information geometry Frank Nielsen Sony Computer Science Laboratories Inc, Japan arxiv:1808.08271v1 [cs.lg] 17 Aug 2018 Abstract We describe the fundamental differential-geometric

More information

Symplectic and Kähler Structures on Statistical Manifolds Induced from Divergence Functions

Symplectic and Kähler Structures on Statistical Manifolds Induced from Divergence Functions Symplectic and Kähler Structures on Statistical Manifolds Induced from Divergence Functions Jun Zhang 1, and Fubo Li 1 University of Michigan, Ann Arbor, Michigan, USA Sichuan University, Chengdu, China

More information

LECTURE 10: THE PARALLEL TRANSPORT

LECTURE 10: THE PARALLEL TRANSPORT LECTURE 10: THE PARALLEL TRANSPORT 1. The parallel transport We shall start with the geometric meaning of linear connections. Suppose M is a smooth manifold with a linear connection. Let γ : [a, b] M be

More information

Applications of Information Geometry to Hypothesis Testing and Signal Detection

Applications of Information Geometry to Hypothesis Testing and Signal Detection CMCAA 2016 Applications of Information Geometry to Hypothesis Testing and Signal Detection Yongqiang Cheng National University of Defense Technology July 2016 Outline 1. Principles of Information Geometry

More information

4.7 The Levi-Civita connection and parallel transport

4.7 The Levi-Civita connection and parallel transport Classnotes: Geometry & Control of Dynamical Systems, M. Kawski. April 21, 2009 138 4.7 The Levi-Civita connection and parallel transport In the earlier investigation, characterizing the shortest curves

More information

Neighbourhoods of Randomness and Independence

Neighbourhoods of Randomness and Independence Neighbourhoods of Randomness and Independence C.T.J. Dodson School of Mathematics, Manchester University Augment information geometric measures in spaces of distributions, via explicit geometric representations

More information

LECTURE 15: COMPLETENESS AND CONVEXITY

LECTURE 15: COMPLETENESS AND CONVEXITY LECTURE 15: COMPLETENESS AND CONVEXITY 1. The Hopf-Rinow Theorem Recall that a Riemannian manifold (M, g) is called geodesically complete if the maximal defining interval of any geodesic is R. On the other

More information

Derivatives in General Relativity

Derivatives in General Relativity Derivatives in General Relativity One of the problems with curved space is in dealing with vectors how do you add a vector at one point in the surface of a sphere to a vector at a different point, and

More information

DIFFERENTIAL GEOMETRY. LECTURE 12-13,

DIFFERENTIAL GEOMETRY. LECTURE 12-13, DIFFERENTIAL GEOMETRY. LECTURE 12-13, 3.07.08 5. Riemannian metrics. Examples. Connections 5.1. Length of a curve. Let γ : [a, b] R n be a parametried curve. Its length can be calculated as the limit of

More information

Information Geometry of Positive Measures and Positive-Definite Matrices: Decomposable Dually Flat Structure

Information Geometry of Positive Measures and Positive-Definite Matrices: Decomposable Dually Flat Structure Entropy 014, 16, 131-145; doi:10.3390/e1604131 OPEN ACCESS entropy ISSN 1099-4300 www.mdpi.com/journal/entropy Article Information Geometry of Positive Measures and Positive-Definite Matrices: Decomposable

More information

Information geometry of mirror descent

Information geometry of mirror descent Information geometry of mirror descent Geometric Science of Information Anthea Monod Department of Statistical Science Duke University Information Initiative at Duke G. Raskutti (UW Madison) and S. Mukherjee

More information

Transport Continuity Property

Transport Continuity Property On Riemannian manifolds satisfying the Transport Continuity Property Université de Nice - Sophia Antipolis (Joint work with A. Figalli and C. Villani) I. Statement of the problem Optimal transport on Riemannian

More information

1. Geometry of the unit tangent bundle

1. Geometry of the unit tangent bundle 1 1. Geometry of the unit tangent bundle The main reference for this section is [8]. In the following, we consider (M, g) an n-dimensional smooth manifold endowed with a Riemannian metric g. 1.1. Notations

More information

+ dxk. dt 2. dt Γi km dxm. . Its equations of motion are second order differential equations. with intitial conditions

+ dxk. dt 2. dt Γi km dxm. . Its equations of motion are second order differential equations. with intitial conditions Homework 7. Solutions 1 Show that great circles are geodesics on sphere. Do it a) using the fact that for geodesic, acceleration is orthogonal to the surface. b ) using straightforwardl equations for geodesics

More information

Mean-field equations for higher-order quantum statistical models : an information geometric approach

Mean-field equations for higher-order quantum statistical models : an information geometric approach Mean-field equations for higher-order quantum statistical models : an information geometric approach N Yapage Department of Mathematics University of Ruhuna, Matara Sri Lanka. arxiv:1202.5726v1 [quant-ph]

More information

Inference. Data. Model. Variates

Inference. Data. Model. Variates Data Inference Variates Model ˆθ (,..., ) mˆθn(d) m θ2 M m θ1 (,, ) (,,, ) (,, ) α = :=: (, ) F( ) = = {(, ),, } F( ) X( ) = Γ( ) = Σ = ( ) = ( ) ( ) = { = } :=: (U, ) , = { = } = { = } x 2 e i, e j

More information

Information Geometric Structure on Positive Definite Matrices and its Applications

Information Geometric Structure on Positive Definite Matrices and its Applications Information Geometric Structure on Positive Definite Matrices and its Applications Atsumi Ohara Osaka University 2010 Feb. 21 at Osaka City University 大阪市立大学数学研究所情報幾何関連分野研究会 2010 情報工学への幾何学的アプローチ 1 Outline

More information

Advanced Machine Learning & Perception

Advanced Machine Learning & Perception Advanced Machine Learning & Perception Instructor: Tony Jebara Topic 6 Standard Kernels Unusual Input Spaces for Kernels String Kernels Probabilistic Kernels Fisher Kernels Probability Product Kernels

More information

Information geometry in optimization, machine learning and statistical inference

Information geometry in optimization, machine learning and statistical inference Front. Electr. Electron. Eng. China 2010, 5(3): 241 260 DOI 10.1007/s11460-010-0101-3 Shun-ichi AMARI Information geometry in optimization, machine learning and statistical inference c Higher Education

More information

HOMEWORK 2 - RIEMANNIAN GEOMETRY. 1. Problems In what follows (M, g) will always denote a Riemannian manifold with a Levi-Civita connection.

HOMEWORK 2 - RIEMANNIAN GEOMETRY. 1. Problems In what follows (M, g) will always denote a Riemannian manifold with a Levi-Civita connection. HOMEWORK 2 - RIEMANNIAN GEOMETRY ANDRÉ NEVES 1. Problems In what follows (M, g will always denote a Riemannian manifold with a Levi-Civita connection. 1 Let X, Y, Z be vector fields on M so that X(p Z(p

More information

Course Summary Math 211

Course Summary Math 211 Course Summary Math 211 table of contents I. Functions of several variables. II. R n. III. Derivatives. IV. Taylor s Theorem. V. Differential Geometry. VI. Applications. 1. Best affine approximations.

More information

CS Lecture 19. Exponential Families & Expectation Propagation

CS Lecture 19. Exponential Families & Expectation Propagation CS 6347 Lecture 19 Exponential Families & Expectation Propagation Discrete State Spaces We have been focusing on the case of MRFs over discrete state spaces Probability distributions over discrete spaces

More information

Bounding the Entropic Region via Information Geometry

Bounding the Entropic Region via Information Geometry Bounding the ntropic Region via Information Geometry Yunshu Liu John MacLaren Walsh Dept. of C, Drexel University, Philadelphia, PA 19104, USA yunshu.liu@drexel.edu jwalsh@coe.drexel.edu Abstract This

More information

Legendre transformation and information geometry

Legendre transformation and information geometry Legendre transformation and information geometry CIG-MEMO #2, v1 Frank Nielsen École Polytechnique Sony Computer Science Laboratorie, Inc http://www.informationgeometry.org September 2010 Abstract We explain

More information

Lecture 8. Connections

Lecture 8. Connections Lecture 8. Connections This lecture introduces connections, which are the machinery required to allow differentiation of vector fields. 8.1 Differentiating vector fields. The idea of differentiating vector

More information

Invariant Nonholonomic Riemannian Structures on Three-Dimensional Lie Groups

Invariant Nonholonomic Riemannian Structures on Three-Dimensional Lie Groups Invariant Nonholonomic Riemannian Structures on Three-Dimensional Lie Groups Dennis I. Barrett Geometry, Graphs and Control (GGC) Research Group Department of Mathematics, Rhodes University Grahamstown,

More information

Manifolds and tangent bundles. Vector fields and flows. 1 Differential manifolds, smooth maps, submanifolds

Manifolds and tangent bundles. Vector fields and flows. 1 Differential manifolds, smooth maps, submanifolds MA 755 Fall 05. Notes #1. I. Kogan. Manifolds and tangent bundles. Vector fields and flows. 1 Differential manifolds, smooth maps, submanifolds Definition 1 An n-dimensional C k -differentiable manifold

More information

Stratification of 3 3 Matrices

Stratification of 3 3 Matrices Stratification of 3 3 Matrices Meesue Yoo & Clay Shonkwiler March 2, 2006 1 Warmup with 2 2 Matrices { Best matrices of rank 2} = O(2) S 3 ( 2) { Best matrices of rank 1} S 3 (1) 1.1 Viewing O(2) S 3 (

More information

Isometries, Local Isometries, Riemannian Coverings and Submersions, Killing Vector Fields

Isometries, Local Isometries, Riemannian Coverings and Submersions, Killing Vector Fields Chapter 15 Isometries, Local Isometries, Riemannian Coverings and Submersions, Killing Vector Fields The goal of this chapter is to understand the behavior of isometries and local isometries, in particular

More information

William P. Thurston. The Geometry and Topology of Three-Manifolds

William P. Thurston. The Geometry and Topology of Three-Manifolds William P. Thurston The Geometry and Topology of Three-Manifolds Electronic version 1.1 - March 00 http://www.msri.org/publications/books/gt3m/ This is an electronic edition of the 1980 notes distributed

More information

BASIC DIFFERENTIAL GEOMETRY: CONNECTIONS AND GEODESICS

BASIC DIFFERENTIAL GEOMETRY: CONNECTIONS AND GEODESICS BASIC DIFFERENTIAL GEOMETRY: CONNECTIONS AND GEODESICS WERNER BALLMANN Introduction I discuss basic features of connections on manifolds: torsion and curvature tensor, geodesics and exponential maps, and

More information

Vectors, metric and the connection

Vectors, metric and the connection Vectors, metric and the connection 1 Contravariant and covariant vectors 1.1 Contravariant vectors Imagine a particle moving along some path in the 2-dimensional flat x y plane. Let its trajectory be given

More information

Chap. 1. Some Differential Geometric Tools

Chap. 1. Some Differential Geometric Tools Chap. 1. Some Differential Geometric Tools 1. Manifold, Diffeomorphism 1.1. The Implicit Function Theorem ϕ : U R n R n p (0 p < n), of class C k (k 1) x 0 U such that ϕ(x 0 ) = 0 rank Dϕ(x) = n p x U

More information

Introduction to Differential Geometry

Introduction to Differential Geometry More about Introduction to Differential Geometry Lecture 7 of 10: Dominic Joyce, Oxford University October 2018 EPSRC CDT in Partial Differential Equations foundation module. These slides available at

More information

Tensors, and differential forms - Lecture 2

Tensors, and differential forms - Lecture 2 Tensors, and differential forms - Lecture 2 1 Introduction The concept of a tensor is derived from considering the properties of a function under a transformation of the coordinate system. A description

More information

CUT LOCI AND DISTANCE FUNCTIONS

CUT LOCI AND DISTANCE FUNCTIONS Math. J. Okayama Univ. 49 (2007), 65 92 CUT LOCI AND DISTANCE FUNCTIONS Jin-ichi ITOH and Takashi SAKAI 1. Introduction Let (M, g) be a compact Riemannian manifold and d(p, q) the distance between p, q

More information

Welcome to Copenhagen!

Welcome to Copenhagen! Welcome to Copenhagen! Schedule: Monday Tuesday Wednesday Thursday Friday 8 Registration and welcome 9 Crash course on Crash course on Introduction to Differential and Differential and Information Geometry

More information

Hyperbolic Geometry on Geometric Surfaces

Hyperbolic Geometry on Geometric Surfaces Mathematics Seminar, 15 September 2010 Outline Introduction Hyperbolic geometry Abstract surfaces The hemisphere model as a geometric surface The Poincaré disk model as a geometric surface Conclusion Introduction

More information

MATH20411 PDEs and Vector Calculus B

MATH20411 PDEs and Vector Calculus B MATH2411 PDEs and Vector Calculus B Dr Stefan Güttel Acknowledgement The lecture notes and other course materials are based on notes provided by Dr Catherine Powell. SECTION 1: Introctory Material MATH2411

More information

Smooth Dynamics 2. Problem Set Nr. 1. Instructor: Submitted by: Prof. Wilkinson Clark Butler. University of Chicago Winter 2013

Smooth Dynamics 2. Problem Set Nr. 1. Instructor: Submitted by: Prof. Wilkinson Clark Butler. University of Chicago Winter 2013 Smooth Dynamics 2 Problem Set Nr. 1 University of Chicago Winter 2013 Instructor: Submitted by: Prof. Wilkinson Clark Butler Problem 1 Let M be a Riemannian manifold with metric, and Levi-Civita connection.

More information

Notes on the Riemannian Geometry of Lie Groups

Notes on the Riemannian Geometry of Lie Groups Rose- Hulman Undergraduate Mathematics Journal Notes on the Riemannian Geometry of Lie Groups Michael L. Geis a Volume, Sponsored by Rose-Hulman Institute of Technology Department of Mathematics Terre

More information

Week 6: Differential geometry I

Week 6: Differential geometry I Week 6: Differential geometry I Tensor algebra Covariant and contravariant tensors Consider two n dimensional coordinate systems x and x and assume that we can express the x i as functions of the x i,

More information

A brief introduction to Semi-Riemannian geometry and general relativity. Hans Ringström

A brief introduction to Semi-Riemannian geometry and general relativity. Hans Ringström A brief introduction to Semi-Riemannian geometry and general relativity Hans Ringström May 5, 2015 2 Contents 1 Scalar product spaces 1 1.1 Scalar products...................................... 1 1.2 Orthonormal

More information

Warped Riemannian metrics for location-scale models

Warped Riemannian metrics for location-scale models 1 Warped Riemannian metrics for location-scale models Salem Said, Lionel Bombrun, Yannick Berthoumieu arxiv:1707.07163v1 [math.st] 22 Jul 2017 Abstract The present paper shows that warped Riemannian metrics,

More information

1 First and second variational formulas for area

1 First and second variational formulas for area 1 First and second variational formulas for area In this chapter, we will derive the first and second variational formulas for the area of a submanifold. This will be useful in our later discussion on

More information

Part IB GEOMETRY (Lent 2016): Example Sheet 1

Part IB GEOMETRY (Lent 2016): Example Sheet 1 Part IB GEOMETRY (Lent 2016): Example Sheet 1 (a.g.kovalev@dpmms.cam.ac.uk) 1. Suppose that H is a hyperplane in Euclidean n-space R n defined by u x = c for some unit vector u and constant c. The reflection

More information

Chapter 12. Connections on Manifolds

Chapter 12. Connections on Manifolds Chapter 12 Connections on Manifolds 12.1 Connections on Manifolds Given a manifold, M, in general, for any two points, p, q 2 M, thereisno natural isomorphismbetween the tangent spaces T p M and T q M.

More information

Multiple Integrals and Vector Calculus (Oxford Physics) Synopsis and Problem Sets; Hilary 2015

Multiple Integrals and Vector Calculus (Oxford Physics) Synopsis and Problem Sets; Hilary 2015 Multiple Integrals and Vector Calculus (Oxford Physics) Ramin Golestanian Synopsis and Problem Sets; Hilary 215 The outline of the material, which will be covered in 14 lectures, is as follows: 1. Introduction

More information

Elements of differential geometry

Elements of differential geometry Elements of differential geometry R.Beig (Univ. Vienna) ESI-EMS-IAMP School on Mathematical GR, 28.7. - 1.8. 2014 1. tensor algebra 2. manifolds, vector and covector fields 3. actions under diffeos and

More information

Differential Geometry II Lecture 1: Introduction and Motivation

Differential Geometry II Lecture 1: Introduction and Motivation Differential Geometry II Lecture 1: Introduction and Motivation Robert Haslhofer 1 Content of this lecture This course is on Riemannian geometry and the geometry of submanifol. The goal of this first lecture

More information

Exercises in Geometry II University of Bonn, Summer semester 2015 Professor: Prof. Christian Blohmann Assistant: Saskia Voss Sheet 1

Exercises in Geometry II University of Bonn, Summer semester 2015 Professor: Prof. Christian Blohmann Assistant: Saskia Voss Sheet 1 Assistant: Saskia Voss Sheet 1 1. Conformal change of Riemannian metrics [3 points] Let (M, g) be a Riemannian manifold. A conformal change is a nonnegative function λ : M (0, ). Such a function defines

More information

Curvature homogeneity of type (1, 3) in pseudo-riemannian manifolds

Curvature homogeneity of type (1, 3) in pseudo-riemannian manifolds Curvature homogeneity of type (1, 3) in pseudo-riemannian manifolds Cullen McDonald August, 013 Abstract We construct two new families of pseudo-riemannian manifolds which are curvature homegeneous of

More information

Syllabus. May 3, Special relativity 1. 2 Differential geometry 3

Syllabus. May 3, Special relativity 1. 2 Differential geometry 3 Syllabus May 3, 2017 Contents 1 Special relativity 1 2 Differential geometry 3 3 General Relativity 13 3.1 Physical Principles.......................................... 13 3.2 Einstein s Equation..........................................

More information

Notes for MATH 434 Geometry and Transformations. Francis Bonahon Fall 2015

Notes for MATH 434 Geometry and Transformations. Francis Bonahon Fall 2015 Notes for MATH 434 Geometry and Transformations Francis Bonahon Fall 2015 Version: October 21, 2016 Department of Mathematics University of Southern California Los Angeles, CA 90089-2532, U.S.A. E-mail

More information

Maxima and Minima. (a, b) of R if

Maxima and Minima. (a, b) of R if Maxima and Minima Definition Let R be any region on the xy-plane, a function f (x, y) attains its absolute or global, maximum value M on R at the point (a, b) of R if (i) f (x, y) M for all points (x,

More information

PHYS 4390: GENERAL RELATIVITY NON-COORDINATE BASIS APPROACH

PHYS 4390: GENERAL RELATIVITY NON-COORDINATE BASIS APPROACH PHYS 4390: GENERAL RELATIVITY NON-COORDINATE BASIS APPROACH 1. Differential Forms To start our discussion, we will define a special class of type (0,r) tensors: Definition 1.1. A differential form of order

More information

Classical differential geometry of two-dimensional surfaces

Classical differential geometry of two-dimensional surfaces Classical differential geometry of two-dimensional surfaces 1 Basic definitions This section gives an overview of the basic notions of differential geometry for twodimensional surfaces. It follows mainly

More information

Math review. Math review

Math review. Math review Math review 1 Math review 3 1 series approximations 3 Taylor s Theorem 3 Binomial approximation 3 sin(x), for x in radians and x close to zero 4 cos(x), for x in radians and x close to zero 5 2 some geometry

More information

Errata for Vector and Geometric Calculus Printings 1-4

Errata for Vector and Geometric Calculus Printings 1-4 October 21, 2017 Errata for Vector and Geometric Calculus Printings 1-4 Note: p. m (n) refers to page m of Printing 4 and page n of Printings 1-3. p. 31 (29), just before Theorem 3.10. f x(h) = [f x][h]

More information

Sophus Lie s Approach to Differential Equations

Sophus Lie s Approach to Differential Equations Sophus Lie s Approach to Differential Equations IAP lecture 2006 (S. Helgason) 1 Groups Let X be a set. A one-to-one mapping ϕ of X onto X is called a bijection. Let B(X) denote the set of all bijections

More information

Chapter 16. Manifolds and Geodesics Manifold Theory. Reading: Osserman [7] Pg , 55, 63-65, Do Carmo [2] Pg ,

Chapter 16. Manifolds and Geodesics Manifold Theory. Reading: Osserman [7] Pg , 55, 63-65, Do Carmo [2] Pg , Chapter 16 Manifolds and Geodesics Reading: Osserman [7] Pg. 43-52, 55, 63-65, Do Carmo [2] Pg. 238-247, 325-335. 16.1 Manifold Theory Let us recall the definition of differentiable manifolds Definition

More information

WARPED PRODUCT METRICS ON R 2. = f(x) 1. The Levi-Civita Connection We did this calculation in class. To quickly recap, by metric compatibility,

WARPED PRODUCT METRICS ON R 2. = f(x) 1. The Levi-Civita Connection We did this calculation in class. To quickly recap, by metric compatibility, WARPED PRODUCT METRICS ON R 2 KEVIN WHYTE Let M be R 2 equipped with the metric : x = 1 = f(x) y and < x, y >= 0 1. The Levi-Civita Connection We did this calculation in class. To quickly recap, by metric

More information

2.10 Saddles, Nodes, Foci and Centers

2.10 Saddles, Nodes, Foci and Centers 2.10 Saddles, Nodes, Foci and Centers In Section 1.5, a linear system (1 where x R 2 was said to have a saddle, node, focus or center at the origin if its phase portrait was linearly equivalent to one

More information

Differential Geometry Exercises

Differential Geometry Exercises Differential Geometry Exercises Isaac Chavel Spring 2006 Jordan curve theorem We think of a regular C 2 simply closed path in the plane as a C 2 imbedding of the circle ω : S 1 R 2. Theorem. Given the

More information

Metrisability of Painleve equations and Hamiltonian systems of hydrodynamic type

Metrisability of Painleve equations and Hamiltonian systems of hydrodynamic type Metrisability of Painleve equations and Hamiltonian systems of hydrodynamic type Felipe Contatto Department of Applied Mathematics and Theoretical Physics University of Cambridge felipe.contatto@damtp.cam.ac.uk

More information

Topics in Information Geometry. Dodson, CTJ. MIMS EPrint: Manchester Institute for Mathematical Sciences School of Mathematics

Topics in Information Geometry. Dodson, CTJ. MIMS EPrint: Manchester Institute for Mathematical Sciences School of Mathematics Topics in Information Geometry Dodson, CTJ 005 MIMS EPrint: 005.49 Manchester Institute for Mathematical Sciences School of Mathematics The University of Manchester Reports available from: And by contacting:

More information

Dually Flat Geometries in the State Space of Statistical Models

Dually Flat Geometries in the State Space of Statistical Models 1/ 12 Dually Flat Geometries in the State Space of Statistical Models Jan Naudts Universiteit Antwerpen ECEA, November 2016 J. Naudts, Dually Flat Geometries in the State Space of Statistical Models. In

More information

0, otherwise. Find each of the following limits, or explain that the limit does not exist.

0, otherwise. Find each of the following limits, or explain that the limit does not exist. Midterm Solutions 1, y x 4 1. Let f(x, y) = 1, y 0 0, otherwise. Find each of the following limits, or explain that the limit does not exist. (a) (b) (c) lim f(x, y) (x,y) (0,1) lim f(x, y) (x,y) (2,3)

More information

Part IB Geometry. Theorems. Based on lectures by A. G. Kovalev Notes taken by Dexter Chua. Lent 2016

Part IB Geometry. Theorems. Based on lectures by A. G. Kovalev Notes taken by Dexter Chua. Lent 2016 Part IB Geometry Theorems Based on lectures by A. G. Kovalev Notes taken by Dexter Chua Lent 2016 These notes are not endorsed by the lecturers, and I have modified them (often significantly) after lectures.

More information

Lecture 35: December The fundamental statistical distances

Lecture 35: December The fundamental statistical distances 36-705: Intermediate Statistics Fall 207 Lecturer: Siva Balakrishnan Lecture 35: December 4 Today we will discuss distances and metrics between distributions that are useful in statistics. I will be lose

More information

Journée Interdisciplinaire Mathématiques Musique

Journée Interdisciplinaire Mathématiques Musique Journée Interdisciplinaire Mathématiques Musique Music Information Geometry Arnaud Dessein 1,2 and Arshia Cont 1 1 Institute for Research and Coordination of Acoustics and Music, Paris, France 2 Japanese-French

More information

Distances, volumes, and integration

Distances, volumes, and integration Distances, volumes, and integration Scribe: Aric Bartle 1 Local Shape of a Surface A question that we may ask ourselves is what significance does the second fundamental form play in the geometric characteristics

More information

LECTURE 16: CONJUGATE AND CUT POINTS

LECTURE 16: CONJUGATE AND CUT POINTS LECTURE 16: CONJUGATE AND CUT POINTS 1. Conjugate Points Let (M, g) be Riemannian and γ : [a, b] M a geodesic. Then by definition, exp p ((t a) γ(a)) = γ(t). We know that exp p is a diffeomorphism near

More information

Intro Vectors 2D implicit curves 2D parametric curves. Graphics 2011/2012, 4th quarter. Lecture 2: vectors, curves, and surfaces

Intro Vectors 2D implicit curves 2D parametric curves. Graphics 2011/2012, 4th quarter. Lecture 2: vectors, curves, and surfaces Lecture 2, curves, and surfaces Organizational remarks Tutorials: Tutorial 1 will be online later today TA sessions for questions start next week Practicals: Exams: Make sure to find a team partner very

More information

Geometric inequalities for black holes

Geometric inequalities for black holes Geometric inequalities for black holes Sergio Dain FaMAF-Universidad Nacional de Córdoba, CONICET, Argentina. 3 August, 2012 Einstein equations (vacuum) The spacetime is a four dimensional manifold M with

More information

Information Geometry

Information Geometry 2015 Workshop on High-Dimensional Statistical Analysis Dec.11 (Friday) ~15 (Tuesday) Humanities and Social Sciences Center, Academia Sinica, Taiwan Information Geometry and Spontaneous Data Learning Shinto

More information

Two dimensional manifolds

Two dimensional manifolds Two dimensional manifolds We are given a real two-dimensional manifold, M. A point of M is denoted X and local coordinates are X (x, y) R 2. If we use different local coordinates, (x, y ), we have x f(x,

More information

Exact Solutions of the Einstein Equations

Exact Solutions of the Einstein Equations Notes from phz 6607, Special and General Relativity University of Florida, Fall 2004, Detweiler Exact Solutions of the Einstein Equations These notes are not a substitute in any manner for class lectures.

More information

Analysis-3 lecture schemes

Analysis-3 lecture schemes Analysis-3 lecture schemes (with Homeworks) 1 Csörgő István November, 2015 1 A jegyzet az ELTE Informatikai Kar 2015. évi Jegyzetpályázatának támogatásával készült Contents 1. Lesson 1 4 1.1. The Space

More information

DIFFERENTIAL GEOMETRY HW 12

DIFFERENTIAL GEOMETRY HW 12 DIFFERENTIAL GEOMETRY HW 1 CLAY SHONKWILER 3 Find the Lie algebra so(n) of the special orthogonal group SO(n), and the explicit formula for the Lie bracket there. Proof. Since SO(n) is a subgroup of GL(n),

More information

Lecture 1: August 28

Lecture 1: August 28 36-705: Intermediate Statistics Fall 2017 Lecturer: Siva Balakrishnan Lecture 1: August 28 Our broad goal for the first few lectures is to try to understand the behaviour of sums of independent random

More information

General Relativity I

General Relativity I General Relativity I presented by John T. Whelan The University of Texas at Brownsville whelan@phys.utb.edu LIGO Livingston SURF Lecture 2002 July 5 General Relativity Lectures I. Today (JTW): Special

More information

Linear model A linear model assumes Y X N(µ(X),σ 2 I), And IE(Y X) = µ(x) = X β, 2/52

Linear model A linear model assumes Y X N(µ(X),σ 2 I), And IE(Y X) = µ(x) = X β, 2/52 Statistics for Applications Chapter 10: Generalized Linear Models (GLMs) 1/52 Linear model A linear model assumes Y X N(µ(X),σ 2 I), And IE(Y X) = µ(x) = X β, 2/52 Components of a linear model The two

More information

Open Questions in Quantum Information Geometry

Open Questions in Quantum Information Geometry Open Questions in Quantum Information Geometry M. R. Grasselli Department of Mathematics and Statistics McMaster University Imperial College, June 15, 2005 1. Classical Parametric Information Geometry

More information