A mystery of transpose of matrix multiplication Why (AB) T = B T A T?

Similar documents
Dot Products, Transposes, and Orthogonal Projections

Getting Started with Communications Engineering. Rows first, columns second. Remember that. R then C. 1

Determinants in detail

Polynomials; Add/Subtract

Vector, Matrix, and Tensor Derivatives

Matrix Multiplication

Solving Systems of Linear Equations

Mon Feb Matrix algebra and matrix inverses. Announcements: Warm-up Exercise:

Practical Linear Algebra: A Geometry Toolbox

A Primer on Three Vectors

Sometimes the domains X and Z will be the same, so this might be written:

Linear Algebra Done Wrong. Sergei Treil. Department of Mathematics, Brown University

Topic 15 Notes Jeremy Orloff

Matrix Arithmetic. j=1

Page 52. Lecture 3: Inner Product Spaces Dual Spaces, Dirac Notation, and Adjoints Date Revised: 2008/10/03 Date Given: 2008/10/03

Lecture 3 Linear Algebra Background

Numerical Methods Lecture 2 Simultaneous Equations

Basic Linear Algebra in MATLAB

An Intuitive Introduction to Motivic Homotopy Theory Vladimir Voevodsky

Matrix Operations. Linear Combination Vector Algebra Angle Between Vectors Projections and Reflections Equality of matrices, Augmented Matrix

M. Matrices and Linear Algebra

Matrices. Chapter Definitions and Notations

START: Read 1 Guide for Repeated Interactive Read-Alouds

Physics 110. Electricity and Magnetism. Professor Dine. Spring, Handout: Vectors and Tensors: Everything You Need to Know

Matrices: 2.1 Operations with Matrices

Mathematical Principles for Scientific Computing and Visualization

Matrices. 1 a a2 1 b b 2 1 c c π

Vectors To begin, let us describe an element of the state space as a point with numerical coordinates, that is x 1. x 2. x =

Group Representations

. =. a i1 x 1 + a i2 x 2 + a in x n = b i. a 11 a 12 a 1n a 21 a 22 a 1n. i1 a i2 a in

Elementary maths for GMT

MATH 320, WEEK 7: Matrices, Matrix Operations

P (E) = P (A 1 )P (A 2 )... P (A n ).

CS 246 Review of Linear Algebra 01/17/19

Kevin James. MTHSC 3110 Section 2.1 Matrix Operations

Section 1.6. M N = [a ij b ij ], (1.6.2)

What is A + B? What is A B? What is AB? What is BA? What is A 2? and B = QUESTION 2. What is the reduced row echelon matrix of A =

Absolute motion versus relative motion in Special Relativity is not dealt with properly

Getting Started with Communications Engineering

Math 3108: Linear Algebra

Linear Algebra (MATH ) Spring 2011 Final Exam Practice Problem Solutions

The student solutions shown below highlight the most commonly used approaches and also some that feature nice use of algebraic polynomial formulas.

Introduction - Motivation. Many phenomena (physical, chemical, biological, etc.) are model by differential equations. f f(x + h) f(x) (x) = lim

Linear Algebra (part 1) : Matrices and Systems of Linear Equations (by Evan Dummit, 2016, v. 2.02)

MATH 315 Linear Algebra Homework #1 Assigned: August 20, 2018

A Review of Linear Algebra

MATH10212 Linear Algebra B Homework Week 5

MITOCW ocw-18_02-f07-lec02_220k

Linear Algebra (Review) Volker Tresp 2018

Now, if you see that x and y are present in both equations, you may write:

Some Notes on Linear Algebra

The Sleeping Beauty 3

Mathematics for Graphics and Vision

MITOCW ocw f99-lec30_300k

MITOCW ocw nov2005-pt1-220k_512kb.mp4

Matrix Algebra & Elementary Matrices

Part 1: You are given the following system of two equations: x + 2y = 16 3x 4y = 2

Solution: By inspection, the standard matrix of T is: A = Where, Ae 1 = 3. , and Ae 3 = 4. , Ae 2 =

Practical Linear Algebra: A Geometry Toolbox

Final Review Sheet. B = (1, 1 + 3x, 1 + x 2 ) then 2 + 3x + 6x 2

Mechanics, Heat, Oscillations and Waves Prof. V. Balakrishnan Department of Physics Indian Institute of Technology, Madras

Math 123, Week 2: Matrix Operations, Inverses

The Quaternions & Octonions: A Basic introduction to Their Algebras. By: Kyle McAllister. Boise State University

Linear Equations and Matrix

Math 308 Midterm Answers and Comments July 18, Part A. Short answer questions

4 Linear Algebra Review

Explorers 4 Teacher s notes for the Comprehension Test: The Snow Queen

x n -2.5 Definition A list is a list of objects, where multiplicity is allowed, and order matters. For example, as lists

NOTES ON VECTORS, PLANES, AND LINES

Hypercomplex numbers

MITOCW ocw f99-lec01_300k

Properties of Arithmetic

The PROMYS Math Circle Problem of the Week #3 February 3, 2017

ME751 Advanced Computational Multibody Dynamics

ELEMENTARY LINEAR ALGEBRA

Module 03 Lecture 14 Inferential Statistics ANOVA and TOI

Ratios, Proportions, Unit Conversions, and the Factor-Label Method

Special Theory of Relativity Prof. Shiva Prasad Department of Physics Indian Institute of Technology, Bombay. Lecture - 15 Momentum Energy Four Vector

Note: Please use the actual date you accessed this material in your citation.

Answers in blue. If you have questions or spot an error, let me know. 1. Find all matrices that commute with A =. 4 3

Introduction to Algebra: The First Week

Special Theory of Relativity Prof. Shiva Prasad Department of Physics Indian Institute of Technology, Bombay

MATH Mathematics for Agriculture II

b 1 b 2.. b = b m A = [a 1,a 2,...,a n ] where a 1,j a 2,j a j = a m,j Let A R m n and x 1 x 2 x = x n

Direct Proofs. the product of two consecutive integers plus the larger of the two integers

Math 308 Midterm November 6, 2009

DS-GA 1002 Lecture notes 0 Fall Linear Algebra. These notes provide a review of basic concepts in linear algebra.

Lecture 6: Lies, Inner Product Spaces, and Symmetric Matrices

Machine Learning for Large-Scale Data Analysis and Decision Making A. Week #1

Algorithmic Game Theory and Applications. Lecture 4: 2-player zero-sum games, and the Minimax Theorem

Topic 1: Propositional logic

THE EVIL QUEEN AND THE RED CACTUS By Dilmehr Kaur. The. Story Analysis BY JIHAN KAMILA AULIRAHAM. And The

. a m1 a mn. a 1 a 2 a = a n

Modern Algebra Prof. Manindra Agrawal Department of Computer Science and Engineering Indian Institute of Technology, Kanpur

1 Notes and Directions on Dirac Notation

13. Systems of Linear Equations 1

Math 290-2: Linear Algebra & Multivariable Calculus Northwestern University, Lecture Notes

Logarithms for analog circuits

Logarithms for analog circuits

Transcription:

A mystery of transpose of matrix multiplication Why (AB) T = B T A T? Yamauchi, Hitoshi Nichiyou kenkyu aikouka Berlin, Germany 2009-5-24(Sun) Contents Introduction 2 Dot product of vectors 3 Matrix Multiplication 3 4 Transpose of Matrix Multiplication 4 Abstract Why does the transpose of matrix multiplication is (AB) T = B T A T? I was told this is the rule when I was a university student But, there must be something we could understand this First, I would like to show you that the relationship of dot product of vectors and its transpose is = vu T Then I will point out to you that a matrix is a representation of transform rather a representation of simultaneous equations This point of view gives us that a matrix multiplication includes dot products Combining these two point of views and one more, a vector is a special case of a matrix, we could understand the first equation Introduction This article talks about the transpose of matrix multiplication (AB) T = B T A T () When I saw this relationship, I wonder why this happens For me, transpose is an operator, it looks like this is a special distribution law If multiply something (for instance, -), ( ) (a + b) = ( a) + ( b) There is no place exchange of a and b, it does not become b a But in the case of transpose, (AB) T = B } T {{ A T } (2) The order AB becomes BA Why this happens? If I compare the each of element, I could see this should be, however, I feel something magical and could not feel to understand it Recently, I read a book and felt a bit better So I would like to introduce the explanation Before we start to talk about the matrix multiplication, I would like to start from dot product of vectors since vector is a special case of matrix Then we will generalize back this idea to matrix multiplication Because a simpler form is usually easier to understand, we will start a simple one and then go further 2 Dot product of vectors Let s think about two vectors u, v Most of the linear algebra textbooks omit the elements of vector since it is too cumbersome, however, I will put the elements here When we wrote elements like u = u u n (3) as a general vector, this is also cumbersome So, I will start with three dimensional vector Then we could extend this to general dimensional vectors Here the main actor of this story is transpose T, this is an operator to exchange the row

and column of a matrix or a vector u T = u T u (4) I assume you know what is a dot product of vectors (dot product is also called as an inner product) The dot product (uv) is (uv) = u T v = ( u ) v = u v + + (5) Let s think about one more step Why I want to go one more step? Because this article is for explaining what is transpose Therefore I would like to go one more step Then some of you want to know why transpose matters for me This is because I matter it There is no way to prove this is interesting subject in a mathematical way I just feel it is intriguing There are many good mathematics book which has deep insight of mathematics I have nothing to add to these books regarding mathematical insight However, the emotion of my interest is mine and this is I can explain Many of mathematical book tries to extract the beauty of mathematics in the mathematical way I totally agree this is the way to see the beauty of mathematics But this beauty is a queen beauty who has high natural pride It is almost impossible to be near by for me I would like to understand this beauty in more familiar way Therefore, I am writing this with my feelings to the queen Of course this method has a dangerous aspect since I sometimes can see only one view of the beauty, sometimes I lost other view points Now, Let s think about one more step and think about transpose of this (u v + + ) T =? (6) A dot product produces a scalar value, therefore transpose does not change the result Therefore, it should be (uv) T = (u v + + ) T = (u v + + ) = (uv) (7) We could think about the transpose operation further using this clue transpose does not affect the result of vector dot product If the distribution law is kept also for transpose, = u T] T v] T = uv T (8) (Where we use transpose of transpose produces original form (u T T = u)) However, (uv) T = = ( ) u v = = ( u ) T u v ( v ) T T = u v u u v v (uv) (9) This is even not a scalar, so something is wrong (Some of the reader wonder why a vector multiplication becomes a matrix Equation 9 is a Tensor product I could not explain this in this article, but, you could look up this with keyword Tensor product ) From Equation 9, u T T v T (0) To make transposed and original dot product the same, (uv) T v u () is necessary This means we need to exchange the vector position of u and v This means (uv) T = ( ) v u = v T u = (v T T u) = (vu) (2) 2

to extends the idea I wonder, how many students realize a matrix multiplication includes dot products? I realized it quite later Here, y is a dot product of a n] and x n] y = a x + a 2 x 2 + + a n x n x x 2 a a 2 a n (5) x n Figure : Anatomy of dot product vector transpose The in Equation 2 is required There is no objective reason, I think this is correct At the end, = v T u = v T u T] T (3) Figure shows that: exchange the vector position, 2 distribute the transpose operator You might not like this since here is only a sufficient condition But from the one transpose for each element and two vectors combination, we could find only this combination is the possibility So, I believe this is sufficient explanation Now the remaining problem is what is the relationship between matrix multiplication and vector dot products You might see the rest of the story This explanation is based on Farin and Hansford s book ] I recommend this book since this book explains these basic ideas quite well Some knows the Farin s book is difficult I also had such an impression and hesitated to look into this book But, this book is easy to read I enjoy the book a lot 3 Matrix Multiplication My calculum of mathematics introduced matrix as a representation of simultaneous equation y = a x + a 2 x 2 + + a n x n y 2 = a 2 x + a 22 x 2 + + a 2n x n y n = a n x + a n2 x 2 + + a nn x n (4) This is a natural introduction, but when I thought matrix multiplication, I needed a jump Another my favorite book 2] shows that matrix has an aspect of a representation of transformation According to this idea, matrix is composed of coordinate vectors By any ideas or aspects, these are all property of matrix The formal operations of matrix are all the same in any way to see matrix One powerful mathematical idea is when the forms are the same, they are the same We could capture the same aspect of different things If one aspect is the same, then we could expect these different two things have a common behavior For example, every person has personality, there is no exact the same person But, some of them share their hobby, they could have similar property People who share the hobby might buy the same things To extends this idea further, in totally different country, totally different generation people could buy the same thing because of the sharing aspects Found a similar aspect in the different things is the one basic mathematical thinking Here, we could find the same form in coordinate transformation and simultaneous equations When we see a matrix as a coordinate system, such matrix is composed of coordinate axis vectors Here, we think three dimensional coordinates only a = a a 2 a 2 = a 3 = a 3 a 2 a 22 a 32 a 3 a 23 a 33 (6) 3

Figure 2: A three dimensional coordinate system Figure 4: Two coordinate systems ten to express dot product form explicitly: Figure 3: Coordinate system and projection In any coordinate system, the coordinate value itself is given by projection = dot products We could write a matrix A as A a a 2 a 3 = a a 2 a 3 a 2 a 22 a 32 (7) a 3 a 23 a 33 Figure 2 shows an example coordinate system in this construction A coordinate of a point x is the following: x = a x + a 2 x 2 + a 3 x 3 = a a 2 x + a 2 a 22 x 2 + a 3 a 23 a 3 a 32 x 3 (8) a 33 Where x, x 2, x 3 are projected length of x onto a, a 2, a 3 axis, respectively Now we said projected This is dot product Let s see in two dimensional case in Figure 3 since I think a three dimensional case is still a bit cumbersome You can see each coordinate value is dot product to each coodinate axis Equation 8 can be rewrit- x = a x + a 2 x 2 + a 3 x 3 a a 2 a 3 x x 2 (9) x 3 x is only one element of coordinate system axis In three dimensional case, there are three axes, x, x 2, x 3 Each coordinate axis is projected to the other coordinate axes to transform the coordinate system Figure 4 shows two coordinate systems a, a 2, a 3 and x, x 2, x 3 AX ( ) a a 2 a 3 x x 2 x 3 = a a 2 a 3 a 2 a 22 a 32 x 2 x 3 x 2 x 22 x 32 a 3 a 23 a 33 x 3 x 23 x 33 (20) I hope you now see the dot product inside the matrix 4 Transpose of Matrix Multiplication A transpose of matrix multiplication has vector dot products, therefore, it should be (AB) T = B T A T This article s discussion is based on the difference between dot product and tensor product, however, I was suggested the standard proof is still simple and understandable, so, I will show 4

the proof (B T A T ) ij = (b T ) ik (a T ) kj = k = k b ki a jk a jk b ki (2) = (AB) ji = (AB) T I talked about the reason of Equation 2 in this article The proof is indeed simple and beautiful It is just too simple for me and could not think about the behind when I was a student I enjoyed to find the behind of Equation 2 I hope you can also enjoy the behind of this simple proof Acknowledgments Thanks to C Rössl for explaining me about coordinate transformation Thanks to L Gruenschloss to make a suggestion to add the last proof References ] Gerald Farin Dianne Hansford Practical Linear Algebra; A Geometry Toolbox A K Peters, Ltd, 2005 2] Koukichi Sugihara Mathematical Theory of Graphics (Gurafikusu no Suuri) Kyouritu Shuppan, 995 5