Adjoint Orbits, Principal Components, and Neural Nets

Similar documents
SYMPLECTIC MANIFOLDS, GEOMETRIC QUANTIZATION, AND UNITARY REPRESENTATIONS OF LIE GROUPS. 1. Introduction

Symmetric Spaces Toolkit

DIFFERENTIAL GEOMETRY HW 12

Clifford Algebras and Spin Groups

Topics in Representation Theory: Lie Groups, Lie Algebras and the Exponential Map

Topics in Representation Theory: Roots and Weights

LECTURE 25-26: CARTAN S THEOREM OF MAXIMAL TORI. 1. Maximal Tori

Matrix Lie groups. and their Lie algebras. Mahmood Alaghmandan. A project in fulfillment of the requirement for the Lie algebra course

Tangent bundles, vector fields

SEMISIMPLE LIE GROUPS

THE EULER CHARACTERISTIC OF A LIE GROUP

LECTURE 3 MATH 261A. Office hours are now settled to be after class on Thursdays from 12 : 30 2 in Evans 815, or still by appointment.

A LITTLE TASTE OF SYMPLECTIC GEOMETRY: THE SCHUR-HORN THEOREM CONTENTS

Symmetries, Fields and Particles. Examples 1.

Quantum Field Theory III

GROUP THEORY PRIMER. New terms: so(2n), so(2n+1), symplectic algebra sp(2n)

Topics in Representation Theory: Cultural Background

These notes are incomplete they will be updated regularly.

Compound matrices and some classical inequalities

ERRATA FOR INTRODUCTION TO SYMPLECTIC TOPOLOGY

A little taste of symplectic geometry

Methods in Computer Vision: Introduction to Matrix Lie Groups

REPRESENTATION THEORY WEEK 7

1 Smooth manifolds and Lie groups

Poincaré Duality Angles on Riemannian Manifolds with Boundary

MATH 423 Linear Algebra II Lecture 33: Diagonalization of normal operators.

CHARACTERISTIC CLASSES

Lie Groups, Lie Algebras and the Exponential Map

Part III Symmetries, Fields and Particles

II. DIFFERENTIABLE MANIFOLDS. Washington Mio CENTER FOR APPLIED VISION AND IMAGING SCIENCES

1: Lie groups Matix groups, Lie algebras

Hermitian Symmetric Spaces

Discrete Rigid Body Dynamics and Optimal Control

QUATERNIONS AND ROTATIONS

LECTURE 10: THE ATIYAH-GUILLEMIN-STERNBERG CONVEXITY THEOREM

Notes on Lie Algebras

INSTANTON MODULI AND COMPACTIFICATION MATTHEW MAHOWALD

Quantum Mechanics for Mathematicians: The Heisenberg group and the Schrödinger Representation

Terse Notes on Riemannian Geometry

CLASSICAL GROUPS DAVID VOGAN

LECTURE 26: THE CHERN-WEIL THEORY

Hamiltonian flows, cotangent lifts, and momentum maps

Jean Gallier Department of Computer and Information Science University of Pennsylvania Philadelphia, PA 19104, USA

Versal deformations in generalized flag manifolds

Exercises in Geometry II University of Bonn, Summer semester 2015 Professor: Prof. Christian Blohmann Assistant: Saskia Voss Sheet 1

Tutorial 5 Clifford Algebra and so(n)

Problems in Linear Algebra and Representation Theory

Symmetries, Fields and Particles 2013 Solutions

Subgroups of Lie groups. Definition 0.7. A Lie subgroup of a Lie group G is a subgroup which is also a submanifold.

Choice of Riemannian Metrics for Rigid Body Kinematics

Topics in Representation Theory: SU(n), Weyl Chambers and the Diagram of a Group

1 Linear Algebra Problems

Representations of Matrix Lie Algebras

SPHERICAL UNITARY REPRESENTATIONS FOR REDUCTIVE GROUPS

1 Hermitian symmetric spaces: examples and basic properties

Bredon, Introduction to compact transformation groups, Academic Press

33 JINWEI ZHOU acts on the left of u =(u ::: u N ) 2 F N Let G(N) be the group acting on the right of F N which preserving the standard inner product

EQUIVARIANT COHOMOLOGY. p : E B such that there exist a countable open covering {U i } i I of B and homeomorphisms

Geodesic Equivalence in sub-riemannian Geometry

LINEAR ALGEBRA BOOT CAMP WEEK 4: THE SPECTRAL THEOREM

Killing Vector Fields of Constant Length on Riemannian Normal Homogeneous Spaces

A Field Extension as a Vector Space

Chapter 2. Introduction to Manifolds and Classical Lie Groups

Stratification of 3 3 Matrices

Math 350 Fall 2011 Notes about inner product spaces. In this notes we state and prove some important properties of inner product spaces.

Lecture 5 : Projections

REGULAR TRIPLETS IN COMPACT SYMMETRIC SPACES

Bott Periodicity. Anthony Bosman Senior Honors Thesis Department of Mathematics, Stanford University Adviser: Eleny Ionel

Application of Numerical Algebraic Geometry to Geometric Data Analysis

arxiv:math-ph/ v1 31 May 2000

Quantum Theory and Group Representations

L 2 Geometry of the Symplectomorphism Group

Hodge Structures. October 8, A few examples of symmetric spaces

SYMPLECTIC GEOMETRY: LECTURE 5

Mathematics. EC / EE / IN / ME / CE. for

Positive definite preserving linear transformations on symmetric matrix spaces

An introduction to the geometry of homogeneous spaces

The Solvability Conditions for the Inverse Eigenvalue Problem of Hermitian and Generalized Skew-Hamiltonian Matrices and Its Approximation

Branching rules of unitary representations: Examples and applications to automorphic forms.

Symmetries, Groups, and Conservation Laws

Vortex Equations on Riemannian Surfaces

C*-Algebras and Group Representations

On the algebraic structure of Dyson s Circular Ensembles. Jonathan D.H. SMITH. January 29, 2012

The Spinor Representation

Optimisation on Manifolds

Reproducing formulas associated with symbols

(Kac Moody) Chevalley groups and Lie algebras with built in structure constants Lecture 1. Lisa Carbone Rutgers University

arxiv: v1 [math.gr] 8 Nov 2008

Mathematical foundations - linear algebra

GEOMETRIC QUANTIZATION

MATRIX LIE GROUPS. Claudiu C Remsing. Dept of Mathematics (Pure and Applied) Rhodes University Grahamstown September Maths Seminar 2007

Linear Algebra Review. Vectors

Differential Topology Final Exam With Solutions

Differential Geometry, Lie Groups, and Symmetric Spaces

Cayley-Hamilton Theorem

LECTURE: KOBORDISMENTHEORIE, WINTER TERM 2011/12; SUMMARY AND LITERATURE

Matrices A brief introduction

Left-invariant Einstein metrics

LECTURE 16: LIE GROUPS AND THEIR LIE ALGEBRAS. 1. Lie groups

Symmetries, Fields and Particles 2013 Solutions

Transcription:

Adjoint Orbits, Principal Components, and Neural Nets Some facts about Lie groups and examples Examples of adjoint orbits and a distance measure Descent equations on adjoint orbits Properties of the double bracket equation Smoothed versions of the double bracket equation The principal component extractor The performance of subspace filters Variations on a theme

Where We Are 9:30-10:45 Part 1. Examples and Mathematical Background 10:45-11:15 Coffee break 11:15-12:30 Part 2. Principal components, Neural Nets, and Automata 12:30-14:30 Lunch 14:30-15:45 Part 3. Precise and Approximate Representation of Numbers 15:45-16:15 Coffee break 16:15-17:30 Part 4. Quantum Computation

The Adjoint Orbit Theory and Some Applications 1. Some facts about Lie groups and examples Examples of adjoint orbits and a distance measure Descent equations on adjoint orbits Properties of the double bracket equation Smoothed versions of the double bracket equation Loops and deck transformations

Some Background By a Lie Group G we understand a group with a topology such that multiplication and inversion are continuous. (In this setting continuous implies differentiable.) We say that a group acts on a differentiable manifold X via φ if φ : G X M is differentiable and φ(g 2 G 1,x)=φ(G 2,φ(G 1,x)). The group of orthogonal matrices So(n) acts on the n 1-dimensional sphere via the action φ(θ,x)=θx

More Mathematics Background Associated with every Lie group is a Lie algebra L which may be thought of as describing how G looks in a small set around the identity. Abstractly, a Lie algebra is a vector space with a bilinear mapping φ : L L L such that [L 1,L 2 ]= [L 2,L 1 ] [L 1, [L 2,L 3 ]] + [L 2, [L 3,L 1 ]] + [L 3, [L 1,L 2 ]] = 0 The Lie algebra associated with the real orthogonal group is the set of skew- symmetric matrices of the same dimension. The bilinear operation is given by [Ω 1, Ω 2 ]=Ω 1 Ω 2 Ω 2 Ω 1.

A Little More Mathematics Background Let Θ be an orthogonal matrix and let Q be a symmetric matrix with eigenvalues λ 1,λ 2,..., λ n. The formula Θ T QΘ defines a group action on Sym(λ 1,λ 2,..., λ n ). The set of orthogonal matrices is of dimension n(n 1)/2 and the space Sym(Λ) is of dimension n(n +1)/2. This action is basic to a lot Matlab! The action of the group of unitary matrices on the space of skew-hermitian matrices via (U, H) U HU can be thought of as generalizing this action. It is an example of a group acting on its own Lie algebra. This is an adjoint action.

Still More Mathematics Background Consider Lie algebras whose elements are n by n matrices and Lie groups whose elements are nonsingular n by n matrices. The mapping exp : L e L sends the Lie algebra into the group of invertible matrices. The identity P 1 e L P = e P 1 LP defines the adjoint action. If φ : G X X is a group action then there is an equivalence relation on X defined by x y if y = φ(g 1,x) for some G 1 G. Sets of equivalent points are called orbits. The subset of H G such that φ(h, x 0 ) = x 0 forms a subgroup called the isotropy group t x 0.

The Last for now, Mathematics Background Any L 1 L defines via [L 1, ] : L L, a linear transformation on a finite dimensional space. It is often written ad L1 ( ). ad L1 (ad L2 ( )) =[L 1, [L 2, ]] defines a linear transformation on L as well. The sum of the eigenvalues of this map defines what is called the Killing form κ(l 1,L 2 ), on L. For semisimple compact groups such as the orthogonal or special unitary group, the Killing form is negative definite and proportional to the more familiar tr(ω 1 Ω 2 ). The Killing form on G defines a metric on the adjoint orbit called the normal metric.

Getting a Feel for the Normal Metric Explanation: Consider perturbing Θ via Θ Θ(I + Ω). Linearizing the equation we get Θ T QΘ=H HΩ+Ω T H =[H, Ω] = dh Thus Ω=ad 1 H (dh) If H is diagonal then ω ij = dh ij λ i λ j

Steepest Descent on an Adjoint Orbit Let Q = Q T and N = N T be symmetric matrices and let Θ be orthogonal. Consider the function trθ T QΘN thought of as a function on the orthogonal matrices. Relative to the Killing metric on the orthogonal group, the gradient descent flow for minimizing this function is Θ =[Θ T QΘ,N]Θ If we let Θ T QΘ=H then the derivative of H can be expressed as Ḣ =[H, [H, N]]

A Descent Equation on an Adjoint Orbit Let Q = Q T and N = N T be symmetric matrices and let ψ(h) be a real valued function on Sym(Λ). What is the gradient of ψ(h)? The gradient on a Riemannian space is G 1 dψ. On Sym(Λ) the inverse of the Riemannian metric is given by [H, [H, ]]. and so the descent equation is Ḣ = [H, [H, dψ(h)]] Thus for ψ(h) =tr(hn)wehaveḣ = [H, [H, N]]. If N is diagonal then trhn achieves its minimum when H is diagonal and similarly ordered with N.

A Descent Equation with Multiple Equilibria If ψ(h) = tr(diag(h)h) then Ḣ = [H, [H, 2diag(H)]] Let Q = Q T and N = N T be diagonal matrices with distinct eigenvalues. The descent equation is Ḣ = [H, [H, dψ(h)]] Thus for ψ(h) =tr(hn)wehaveḣ =[H, [H, N]] If ψ(h) =diagh then Ḣ =2[H, [H, diag(h)]]

A Descent Equation with Smoothing Added Consider replacing the system with Ḣ =[H, [H, N]] Ḣ =[H, q(d)p ] ; p(d)p =[H, N] Here D = d/dt. This smooths the signals but does not alter the equilibrium points. Stability is un affected if q/p is a positive real function.

The Double Bracket Flow for Analog Computation Principal Components in R n Learning without a teacher is sometimes approached by finding principal components. Ẇ = x(t)x T (t) - forgetting term Θ T (t)w (t)θ(t) = diag(λ 1..., λ n ) Columns of Θ are components The principal components are assembled in a hidden layer

x x x x n w w w v 1 1 1 2 3 v v v 2 3 m m m y y y y 2 3 n

Adaptive Subspace Filtering t Filter power frequency

Some Equations Let u be a vector of inputs, and let Λ be a diagonal editing matrix that selects energy levels that are desirable. An adaptive subspace filter with input u and output y can be realized by implementing the equations dq T = uu ( 1 tr( Q)) Q dt dθ T = [ Θ Q Θ, N] Θ dt y = ΘΛΘ T u

Neural Nets as Flows on Grassmann Manifolds Denote by G(n, k) the space of k-planes in n- space. This space is a differentiable manifold that can be parameterized by the set of all k by n matrices of rank k. It is a manifold. Adaptive subspace filters steer the weights so as to define a particular element of this space. Thus ΛΘ, defines such a point if Λ looks like Λ= 1 0... 0 0 1... 0............ 0 0... 1

Summary of Part 2 1. We have given some mathematical background necessary to work with flows on adjoint orbits and indicated some applications. 2. We have defined flows that will stabilize at invariant subspaces corresponding to the principal components of a vector process. These flows can be interpreted as flows that learn without a teacher. 3. We have argued that in spite of its limitations, steepest descent is usually the first choice in algorithm design. 4. We have interpreted a basic neural network algorithm as a flow in a Grassmann manifold generated by a steepest descent tracking algorithm.

A Few References M. W. Berry et al., Matrices, Vector Spaces, and Information Retrieval SIAM Review, vol. 41, No. 2, 1999. R. W. Brockett, Dynamical Systems That Learn Subspaces in Mathematical System Theory: The Influence of R. E. Kalman, (A.C. Antoulas, ed.) Springer -Verlag, Berlin. 1991. pp. 579--592. R. W. Brockett An Estimation Theoretic Basis for the Design of Sorting and Classification Networks, in Neural Networks, (R. Mammone and Y. Zeevi, eds.) Academic Press, 1991, pp. 23-41.