Math Camp. Justin Grimmer. Associate Professor Department of Political Science Stanford University. September 9th, 2016

Similar documents
Review: Linear and Vector Algebra

Math Camp II. Basic Linear Algebra. Yiqing Xu. Aug 26, 2014 MIT

Linear Algebra V = T = ( 4 3 ).

Knowledge Discovery and Data Mining 1 (VO) ( )

Matrix Arithmetic. j=1

Matrix Operations. Linear Combination Vector Algebra Angle Between Vectors Projections and Reflections Equality of matrices, Augmented Matrix

Chapter 3: Theory Review: Solutions Math 308 F Spring 2015

Review of Basic Concepts in Linear Algebra

M. Matrices and Linear Algebra

1 Matrices and matrix algebra

MATRICES. a m,1 a m,n A =

Chapter 1: Systems of linear equations and matrices. Section 1.1: Introduction to systems of linear equations

Math Linear Algebra Final Exam Review Sheet

22A-2 SUMMER 2014 LECTURE Agenda

Vectors and Matrices Statistics with Vectors and Matrices

Linear Algebra Review. Vectors

Chapter 2. Matrix Arithmetic. Chapter 2

Math 360 Linear Algebra Fall Class Notes. a a a a a a. a a a

We could express the left side as a sum of vectors and obtain the Vector Form of a Linear System: a 12 a x n. a m2

Things we can already do with matrices. Unit II - Matrix arithmetic. Defining the matrix product. Things that fail in matrix arithmetic

Properties of Linear Transformations from R n to R m

Linear Algebra March 16, 2019

A primer on matrices

Math Camp Lecture 4: Linear Algebra. Xiao Yu Wang. Aug 2010 MIT. Xiao Yu Wang (MIT) Math Camp /10 1 / 88

x 1 x 2. x 1, x 2,..., x n R. x n

Applied Matrix Algebra Lecture Notes Section 2.2. Gerald Höhn Department of Mathematics, Kansas State University

Linear Algebra. The analysis of many models in the social sciences reduces to the study of systems of equations.

MATH.2720 Introduction to Programming with MATLAB Vector and Matrix Algebra

Phys 201. Matrices and Determinants

A FIRST COURSE IN LINEAR ALGEBRA. An Open Text by Ken Kuttler. Matrix Arithmetic

Elementary maths for GMT

1 Multiply Eq. E i by λ 0: (λe i ) (E i ) 2 Multiply Eq. E j by λ and add to Eq. E i : (E i + λe j ) (E i )

Matrices and Vectors. Definition of Matrix. An MxN matrix A is a two-dimensional array of numbers A =

Basic Linear Algebra in MATLAB

Introduction to Matrix Algebra

Linear Algebra Homework and Study Guide

4 Linear Algebra Review

CS123 INTRODUCTION TO COMPUTER GRAPHICS. Linear Algebra 1/33

Kevin James. MTHSC 3110 Section 2.1 Matrix Operations

. a m1 a mn. a 1 a 2 a = a n

Introduction - Motivation. Many phenomena (physical, chemical, biological, etc.) are model by differential equations. f f(x + h) f(x) (x) = lim

Overview. Motivation for the inner product. Question. Definition

Linear Algebra (Review) Volker Tresp 2018

Applied Linear Algebra in Geoscience Using MATLAB

Matrix Algebra for Engineers Jeffrey R. Chasnov

CS 246 Review of Linear Algebra 01/17/19

Matrix operations Linear Algebra with Computer Science Application

7.5 Operations with Matrices. Copyright Cengage Learning. All rights reserved.

Stat 206: Linear algebra

Review of linear algebra

Basic Concepts in Linear Algebra

Linear Algebra Section 2.6 : LU Decomposition Section 2.7 : Permutations and transposes Wednesday, February 13th Math 301 Week #4

Math Bootcamp An p-dimensional vector is p numbers put together. Written as. x 1 x =. x p

An Introduction to Matrix Algebra

Linear Algebra and Matrix Inversion

A matrix over a field F is a rectangular array of elements from F. The symbol

Properties of Matrices and Operations on Matrices

[POLS 8500] Review of Linear Algebra, Probability and Information Theory

Lecture 3 Linear Algebra Background

A Introduction to Matrix Algebra and the Multivariate Normal Distribution

MAC Module 2 Systems of Linear Equations and Matrices II. Learning Objectives. Upon completing this module, you should be able to :

MATH 22A: LINEAR ALGEBRA Chapter 4

Lecture 23: 6.1 Inner Products

Matrices MA1S1. Tristan McLoughlin. November 9, Anton & Rorres: Ch

Dot Products, Transposes, and Orthogonal Projections

CS123 INTRODUCTION TO COMPUTER GRAPHICS. Linear Algebra /34

Fundamentals of Engineering Analysis (650163)

Linear Algebra. Preliminary Lecture Notes

LS.1 Review of Linear Algebra

Pre-sessional Mathematics for Big Data MSc Class 2: Linear Algebra

Unit 3: Matrices. Juan Luis Melero and Eduardo Eyras. September 2018

Matrices. Chapter Definitions and Notations

Matrix representation of a linear map

Lecture 3: Linear Algebra Review, Part II

Linear Algebra. James Je Heon Kim

Vector Spaces, Orthogonality, and Linear Least Squares

4 Linear Algebra Review

Linear Equations and Matrix

Matrix Arithmetic. a 11 a. A + B = + a m1 a mn. + b. a 11 + b 11 a 1n + b 1n = a m1. b m1 b mn. and scalar multiplication for matrices via.

MATH 167: APPLIED LINEAR ALGEBRA Least-Squares

Machine Learning for Large-Scale Data Analysis and Decision Making A. Week #1

Lecture 6: Geometry of OLS Estimation of Linear Regession

Linear Algebra. Preliminary Lecture Notes

Linear Models Review

Linear Algebra. Matrices Operations. Consider, for example, a system of equations such as x + 2y z + 4w = 0, 3x 4y + 2z 6w = 0, x 3y 2z + w = 0.

MATH 106 LINEAR ALGEBRA LECTURE NOTES

Systems of Linear Equations

3 (Maths) Linear Algebra

Linear Algebra (Review) Volker Tresp 2017

Lecture 3: Vectors. In Song Kim. September 1, 2011

ECS130 Scientific Computing. Lecture 1: Introduction. Monday, January 7, 10:00 10:50 am

ME751 Advanced Computational Multibody Dynamics

Matrix & Linear Algebra

2. Matrix Algebra and Random Vectors

Matrix Arithmetic. (Multidimensional Math) Institute for Advanced Analytics. Shaina Race, PhD

MATH Topics in Applied Mathematics Lecture 12: Evaluation of determinants. Cross product.

Lecture 7. Econ August 18

I&C 6N. Computational Linear Algebra

Vectors and their uses

Row Space, Column Space, and Nullspace

Transcription:

Math Camp Justin Grimmer Associate Professor Department of Political Science Stanford University September 9th, 2016 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 1 / 61

Where We ve Been, Where We re Going Calculus: Analyze behavior of functions on real line - Convergence - Differentiation - Integration Linear Algebra - Data stored in matrices - Higher dimensional spaces - complex world, condition on many factors - flood of big data, store in many dimensions - Linear Algebra: - Algebra of matrices - Geometry of high dimensional space - Calculus (multivariable) in many dimensions Very important for regression(!!!!) Justin Grimmer (Stanford University) Methodology I September 9th, 2016 2 / 61

Points + Vectors. - A point in R 1-1 - π - e - An ordered pair in R 2 = R R - (1, 2) - (0, 0) - (π, e) - An ordered triple in R 3 = R R R - (3.1, 4.5, 6.11132) - An ordered n-tuple in R n = R R... R - (a 1, a 2,..., a n ) Justin Grimmer (Stanford University) Methodology I September 9th, 2016 3 / 61

Points and Vectors Definition A point x R n is an ordered n-tuple, (x 1, x 2,..., x n ). The vector x R n is the arrow pointing from the origin (0, 0,..., 0) to x. Justin Grimmer (Stanford University) Methodology I September 9th, 2016 4 / 61

One Dimensional Example 1 0.5 0 0.5 1 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 5 / 61

One Dimensional Example 1 0.5 0 0.5 1 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 5 / 61

One Dimensional Example 1 0.5 0 0.5 1 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 5 / 61

Two Dimensional Example 1 0.5 1 0.5 0 0 0.5 1 0.5 1 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 6 / 61

Two Dimensional Example 1 0.5 1 0.5 0 0 0.5 1 0.5 1 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 6 / 61

Two Dimensional Example 1 0.5 1 0.5 0 0 0.5 1 0.5 1 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 6 / 61

Three Dimensional Example - (Latitude, Longitude, Elevation) - (1, 2, 3) - (0, 1, 0) Justin Grimmer (Stanford University) Methodology I September 9th, 2016 7 / 61

N-Dimensional Example - Individual campaign donation records x = (1000, 0, 10, 50, 15, 4, 0, 0, 0,..., 2400000000) - Counties have proportion of vote for Obama y = (0.8, 0.5, 0.6,..., 0.2) - Run experiment, assess feeling thermometer of elected official t = (0, 100, 50, 70, 80,..., 100) Justin Grimmer (Stanford University) Methodology I September 9th, 2016 8 / 61

Arithmetic with Vectors Definition Suppose u and v are vectors u, v R n, u = (u 1, u 2, u 3,..., u n ) v = (v 1, v 2, v 3,..., v n ) We will say u = v if u 1 = v 1, u 2 = v 2,..., u n = v n Define the sum of u + v as u + v = (u 1 + v 1, u 2 + v 2, u 3 + v 3,..., u n + v n ) Suppose k R. We will call k a scalar. Define ku as the scalar product ku = (ku 1, ku 2,..., ku n ) Justin Grimmer (Stanford University) Methodology I September 9th, 2016 9 / 61

Examples Suppose: u = (1, 2, 3, 4, 5) v = (1, 1, 1, 1, 1) k = 2 Then, u + v = (1 + 1, 2 + 1, 3 + 1, 4 + 1, 5 + 1) = (2, 3, 4, 5, 6) ku = (2 1, 2 2, 2 3, 2 4, 2 5) = (2, 4, 6, 8, 10) kv = (2 1, 2 1, 2 1, 2 1, 2 1) = (2, 2, 2, 2, 2) Justin Grimmer (Stanford University) Methodology I September 9th, 2016 10 / 61

Properties of Arithmetic Challenge Proofs we can do these! Theorem Suppose that u, v, w R n and k and l are scalars. a) u + v = v + u Justin Grimmer (Stanford University) Methodology I September 9th, 2016 11 / 61

Properties of Arithmetic Challenge Proofs we can do these! Theorem Suppose that u, v, w R n and k and l are scalars. a) u + v = v + u Proof. u + v = (u 1 + v 1, u 2 + v 2,..., u n + v n ) = (v 1 + u 1, v 2 + u 2,..., v n + u n ) = v + u Justin Grimmer (Stanford University) Methodology I September 9th, 2016 11 / 61

Properties of Arithmetic Challenge Proofs we can do these! Theorem Suppose that u, v, w R n and k and l are scalars. b) u + 0 = 0 + u = u Justin Grimmer (Stanford University) Methodology I September 9th, 2016 12 / 61

Properties of Arithmetic Challenge Proofs we can do these! Theorem Suppose that u, v, w R n and k and l are scalars. b) u + 0 = 0 + u = u Proof. u + 0 = (u 1 + 0, u 2 + 0,..., u n + 0) = (0 + u 1, 0 + u 2,..., 0 + u n ) = 0 + u = (u 1, u 2,..., u n ) = u Justin Grimmer (Stanford University) Methodology I September 9th, 2016 12 / 61

Properties of Arithmetic Challenge Proofs we can do these! Theorem Suppose that u, v, w R n and k and l are scalars. c) (l + k)u = l(u) + k(u) Proof. How can we show this? Justin Grimmer (Stanford University) Methodology I September 9th, 2016 13 / 61

Challenge Proofs - Show that 1u = u - Show that u + 1u = 0 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 14 / 61

Inner Product Definition Suppose u R n and v R n then define u v, u v = u 1 v 1 + u 2 v 2 +... + u n v n N = u i v i i=1 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 15 / 61

Examples Suppose u = (1, 2, 3) and v = (2, 3, 1). Then, u v = 1 2 + 2 3 + 3 1 = 2 + 6 + 3 = 11 Suppose y = (y 1, y 2,..., y N ) and 1 = (1, 1, 1,..., 1). Then, y 1 = y 1 + y 2 +... + y n n = i=1 y i Justin Grimmer (Stanford University) Methodology I September 9th, 2016 16 / 61

R Code Create a vector in R Justin Grimmer (Stanford University) Methodology I September 9th, 2016 17 / 61

R Code Create a vector in R vec <- c(1, 2, 3, 4, 5) Justin Grimmer (Stanford University) Methodology I September 9th, 2016 17 / 61

R Code Create a vector in R vec <- c(1, 2, 3, 4, 5) vec<- c() Justin Grimmer (Stanford University) Methodology I September 9th, 2016 17 / 61

R Code Create a vector in R vec <- c(1, 2, 3, 4, 5) vec<- c() vec[1]<- 1 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 17 / 61

R Code Create a vector in R vec <- c(1, 2, 3, 4, 5) vec<- c() vec[1]<- 1 vec[2]<- 2 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 17 / 61

R Code Create a vector in R vec <- c(1, 2, 3, 4, 5) vec<- c() vec[1]<- 1 vec[2]<- 2 vec[3]<- 3 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 17 / 61

R Code Create a vector in R vec <- c(1, 2, 3, 4, 5) vec<- c() vec[1]<- 1 vec[2]<- 2 vec[3]<- 3 vec[4]<- 4 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 17 / 61

R Code Create a vector in R vec <- c(1, 2, 3, 4, 5) vec<- c() vec[1]<- 1 vec[2]<- 2 vec[3]<- 3 vec[4]<- 4 vec[5]<- 5 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 17 / 61

R Code Create a vector in R vec <- c(1, 2, 3, 4, 5) vec<- c() vec[1]<- 1 vec[2]<- 2 vec[3]<- 3 vec[4]<- 4 vec[5]<- 5 x1<- c(2, 2, 3, 2) Justin Grimmer (Stanford University) Methodology I September 9th, 2016 17 / 61

R Code Create a vector in R vec <- c(1, 2, 3, 4, 5) vec<- c() vec[1]<- 1 vec[2]<- 2 vec[3]<- 3 vec[4]<- 4 vec[5]<- 5 x1<- c(2, 2, 3, 2) x2<- c(5, 3, 1, 3) Justin Grimmer (Stanford University) Methodology I September 9th, 2016 17 / 61

R Code Create a vector in R vec <- c(1, 2, 3, 4, 5) vec<- c() vec[1]<- 1 vec[2]<- 2 vec[3]<- 3 vec[4]<- 4 vec[5]<- 5 x1<- c(2, 2, 3, 2) x2<- c(5, 3, 1, 3) add <- x1 + x2 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 17 / 61

R Code Create a vector in R vec <- c(1, 2, 3, 4, 5) vec<- c() vec[1]<- 1 vec[2]<- 2 vec[3]<- 3 vec[4]<- 4 vec[5]<- 5 x1<- c(2, 2, 3, 2) x2<- c(5, 3, 1, 3) add <- x1 + x2 add [1] 7 5 4 5 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 17 / 61

R Code Create a vector in R vec <- c(1, 2, 3, 4, 5) vec<- c() vec[1]<- 1 vec[2]<- 2 vec[3]<- 3 vec[4]<- 4 vec[5]<- 5 x1<- c(2, 2, 3, 2) x2<- c(5, 3, 1, 3) add <- x1 + x2 add [1] 7 5 4 5 scalar<- 10 *x1 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 17 / 61

R Code Create a vector in R vec <- c(1, 2, 3, 4, 5) vec<- c() vec[1]<- 1 vec[2]<- 2 vec[3]<- 3 vec[4]<- 4 vec[5]<- 5 x1<- c(2, 2, 3, 2) x2<- c(5, 3, 1, 3) add <- x1 + x2 add [1] 7 5 4 5 scalar<- 10 *x1 scalar [1] 20 20 30 20 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 17 / 61

R Code Create a vector in R vec <- c(1, 2, 3, 4, 5) vec<- c() vec[1]<- 1 vec[2]<- 2 vec[3]<- 3 vec[4]<- 4 vec[5]<- 5 x1<- c(2, 2, 3, 2) x2<- c(5, 3, 1, 3) add <- x1 + x2 add [1] 7 5 4 5 scalar<- 10 *x1 scalar [1] 20 20 30 20 output<- x1 %*% x2 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 17 / 61

R Code Create a vector in R vec <- c(1, 2, 3, 4, 5) vec<- c() vec[1]<- 1 vec[2]<- 2 vec[3]<- 3 vec[4]<- 4 vec[5]<- 5 x1<- c(2, 2, 3, 2) x2<- c(5, 3, 1, 3) add <- x1 + x2 add [1] 7 5 4 5 scalar<- 10 *x1 scalar [1] 20 20 30 20 output<- x1 %*% x2 output [,1] [1,] 25 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 17 / 61

Challenge Problems - Suppose v = (1, 4, 1, 4) and w = (4, 1, 4, 1). Calculate: v w - Prove v w = w v - Super hard: prove v v 0 and v v = 0 if and only if v = 0. Justin Grimmer (Stanford University) Methodology I September 9th, 2016 18 / 61

Vector Length x_2 x_1 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 19 / 61

Vector Length x_2 - Pythogorean Theorem: Side with length a a x_1 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 19 / 61

Vector Length x_2 - Pythogorean Theorem: Side with length a - Side with length b and right triangle b a x_1 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 19 / 61

Vector Length x_2 c = (a^2 + b^2)^(1/2) b - Pythogorean Theorem: Side with length a - Side with length b and right triangle - c = a 2 + b 2 a x_1 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 19 / 61

Vector Length x_2 c = (a^2 + b^2)^(1/2) b - Pythogorean Theorem: Side with length a - Side with length b and right triangle - c = a 2 + b 2 - This is generally true a x_1 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 19 / 61

Vector Length Definition Suppose v R n. Then, we will define its length as v = (v v) 1/2 = (v 2 1 + v 2 2 + v 2 3 +... + v 2 n ) 1/2 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 20 / 61

Calculating a Length Example 1: suppose x = (1, 1, 1). x = (x x) 1/2 = (1 + 1 + 1) 1/2 = 3 Example 2: R code for length function len.vec<- function(x) { p1< sqrt(x% %x) return(p1) } x <- c(1,1,1) len.vec(x) [,1] [1,] 1.732051 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 21 / 61

Coding Problem Let s calculate the length of some vectors - Write a function to assess the length of a vector. - Use it to calculate the length of: - y<- c(10, 20, 30, 40) - x<- seq(1, 1000*pi, len=1000) Justin Grimmer (Stanford University) Methodology I September 9th, 2016 22 / 61

Texts in Space Justin Grimmer (Stanford University) Methodology I September 9th, 2016 23 / 61

Texts in Space Doc1 = (1, 1, 3,..., 5) Justin Grimmer (Stanford University) Methodology I September 9th, 2016 23 / 61

Texts in Space Doc1 = (1, 1, 3,..., 5) Doc2 = (2, 0, 0,..., 1) Justin Grimmer (Stanford University) Methodology I September 9th, 2016 23 / 61

Texts in Space Doc1 = (1, 1, 3,..., 5) Doc2 = (2, 0, 0,..., 1) Doc1, Doc2 R M Justin Grimmer (Stanford University) Methodology I September 9th, 2016 23 / 61

Texts in Space Doc1 = (1, 1, 3,..., 5) Doc2 = (2, 0, 0,..., 1) Doc1, Doc2 R M Provides many operations that will be useful Justin Grimmer (Stanford University) Methodology I September 9th, 2016 23 / 61

Texts in Space Doc1 = (1, 1, 3,..., 5) Doc2 = (2, 0, 0,..., 1) Doc1, Doc2 R M Provides many operations that will be useful Inner Product between documents: Justin Grimmer (Stanford University) Methodology I September 9th, 2016 23 / 61

Texts in Space Doc1 = (1, 1, 3,..., 5) Doc2 = (2, 0, 0,..., 1) Doc1, Doc2 R M Provides many operations that will be useful Inner Product between documents: Doc1 Doc2 = (1, 1, 3,..., 5) (2, 0, 0,..., 1) Justin Grimmer (Stanford University) Methodology I September 9th, 2016 23 / 61

Texts in Space Doc1 = (1, 1, 3,..., 5) Doc2 = (2, 0, 0,..., 1) Doc1, Doc2 R M Provides many operations that will be useful Inner Product between documents: Doc1 Doc2 = (1, 1, 3,..., 5) (2, 0, 0,..., 1) = 1 2 + 1 0 + 3 0 +... + 5 1 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 23 / 61

Texts in Space Doc1 = (1, 1, 3,..., 5) Doc2 = (2, 0, 0,..., 1) Doc1, Doc2 R M Provides many operations that will be useful Inner Product between documents: Doc1 Doc2 = (1, 1, 3,..., 5) (2, 0, 0,..., 1) = 1 2 + 1 0 + 3 0 +... + 5 1 = 7 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 23 / 61

Length of document: Justin Grimmer (Stanford University) Methodology I September 9th, 2016 24 / 61

Length of document: Doc1 Doc1 Doc1 = (1, 1, 3,..., 5) (1, 1, 3,..., 5) = 1 2 + 1 2 + 3 2 + 5 2 = 6 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 24 / 61

Length of document: Doc1 Doc1 Doc1 = (1, 1, 3,..., 5) (1, 1, 3,..., 5) = 1 2 + 1 2 + 3 2 + 5 2 = 6 Cosine of the angle between documents: Justin Grimmer (Stanford University) Methodology I September 9th, 2016 24 / 61

Length of document: Doc1 Doc1 Doc1 = (1, 1, 3,..., 5) (1, 1, 3,..., 5) = 1 2 + 1 2 + 3 2 + 5 2 = 6 Cosine of the angle between documents: cos θ ( Doc1 Doc1 = 7 6 2.24 = 0.52 ) ( ) Doc2 Doc2 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 24 / 61

Measuring Similarity Documents in space measure similarity/dissimilarity Justin Grimmer (Stanford University) Methodology I September 9th, 2016 25 / 61

Measuring Similarity Documents in space measure similarity/dissimilarity What properties should similarity measure have? Justin Grimmer (Stanford University) Methodology I September 9th, 2016 25 / 61

Measuring Similarity Documents in space measure similarity/dissimilarity What properties should similarity measure have? - Maximum: document with itself Justin Grimmer (Stanford University) Methodology I September 9th, 2016 25 / 61

Measuring Similarity Documents in space measure similarity/dissimilarity What properties should similarity measure have? - Maximum: document with itself - Minimum: documents have no words in common (orthogonal ) Justin Grimmer (Stanford University) Methodology I September 9th, 2016 25 / 61

Measuring Similarity Documents in space measure similarity/dissimilarity What properties should similarity measure have? - Maximum: document with itself - Minimum: documents have no words in common (orthogonal ) - Increasing when more of same words used Justin Grimmer (Stanford University) Methodology I September 9th, 2016 25 / 61

Measuring Similarity Documents in space measure similarity/dissimilarity What properties should similarity measure have? - Maximum: document with itself - Minimum: documents have no words in common (orthogonal ) - Increasing when more of same words used -? s(a, b) = s(b, a). Justin Grimmer (Stanford University) Methodology I September 9th, 2016 25 / 61

Measuring Similarity Word 2 0 1 2 3 4 0 1 2 3 4 Word 1 Measure 1: Inner product Justin Grimmer (Stanford University) Methodology I September 9th, 2016 26 / 61

Measuring Similarity Word 2 0 1 2 3 4 0 1 2 3 4 Word 1 Measure 1: Inner product (2, 1) (1, 4) = 6 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 26 / 61

Word 2 0 1 2 3 4 0 1 2 3 4 Word 1 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 27 / 61

Word 2 0 1 2 3 4 0 1 2 3 4 Word 1 Problem(?): length dependent Justin Grimmer (Stanford University) Methodology I September 9th, 2016 27 / 61

Word 2 0 1 2 3 4 0 1 2 3 4 Word 1 Problem(?): length dependent (4, 2) (1, 4) = 12 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 27 / 61

Word 2 0 1 2 3 4 θ 0 1 2 3 4 Word 1 Problem(?): length dependent (4, 2) (1, 4) = 12 a b = a b cos θ Justin Grimmer (Stanford University) Methodology I September 9th, 2016 27 / 61

Cosine Similarity cos θ: removes document length from similarity measure Justin Grimmer (Stanford University) Methodology I September 9th, 2016 28 / 61

Cosine Similarity cos θ: removes document length from similarity measure cos θ = ( ) ( ) a b a b Justin Grimmer (Stanford University) Methodology I September 9th, 2016 28 / 61

Cosine Similarity cos θ: removes document length from similarity measure cos θ = (4, 2) (4, 2) ( ) ( ) a b a b = (0.89, 0.45) Justin Grimmer (Stanford University) Methodology I September 9th, 2016 28 / 61

Cosine Similarity cos θ: removes document length from similarity measure cos θ = (4, 2) (4, 2) (2, 1) (2, 1) ( ) ( ) a b a b = (0.89, 0.45) = (0.89, 0.45) Justin Grimmer (Stanford University) Methodology I September 9th, 2016 28 / 61

Cosine Similarity cos θ: removes document length from similarity measure cos θ = (4, 2) (4, 2) (2, 1) (2, 1) (1, 4) (1, 4) ( ) ( ) a b a b = (0.89, 0.45) = (0.89, 0.45) = (0.24, 0.97) Justin Grimmer (Stanford University) Methodology I September 9th, 2016 28 / 61

Cosine Similarity cos θ: removes document length from similarity measure cos θ = (4, 2) (4, 2) (2, 1) (2, 1) (1, 4) (1, 4) ( ) ( ) a b a b = (0.89, 0.45) = (0.89, 0.45) = (0.24, 0.97) (0.89, 0.45) (0.24, 0.97) = 0.65 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 28 / 61

Cosine Similarity Word 2 0 1 2 3 4 θ 0 1 2 3 4 Word 1 cos θ: removes document length from similarity measure Justin Grimmer (Stanford University) Methodology I September 9th, 2016 28 / 61

Cosine Similarity Word 2 0 1 2 3 4 θ 0 1 2 3 4 Word 1 cos θ: removes document length from similarity measure Project onto Hypersphere Justin Grimmer (Stanford University) Methodology I September 9th, 2016 28 / 61

Cosine Similarity Word 2 0 1 2 3 4 θ 0 1 2 3 4 Word 1 cos θ: removes document length from similarity measure Project onto Hypersphere cos θ Inverse distance on Hypersphere Justin Grimmer (Stanford University) Methodology I September 9th, 2016 28 / 61

Cosine Similarity Word 2 0 1 2 3 4 θ 0 1 2 3 4 Word 1 cos θ: removes document length from similarity measure Project onto Hypersphere cos θ Inverse distance on Hypersphere von Mises Fisher distribution : distribution on sphere surface Justin Grimmer (Stanford University) Methodology I September 9th, 2016 28 / 61

Matrices Definition A Matrix is a rectangular array of numbers A = a 11 a 12... a 1n a 21 a 22... a 2n...... a m1 a m2... a mn If A has m rows n columns we will say that A is an m n matrix. Suppose X and Y are m n matrices. Then X = Y if x ij = y ij for all i and j Justin Grimmer (Stanford University) Methodology I September 9th, 2016 29 / 61

Simple Examples I = 1 0 0... 0 0 1 0... 0 0 0 1... 0....... 0 0 0... 1 If I is an n n matrix we will call an identity matrix. Justin Grimmer (Stanford University) Methodology I September 9th, 2016 30 / 61

Simple Examples X = ( 1 2 ) 3 2 1 4 X is an 2 3 matrix Justin Grimmer (Stanford University) Methodology I September 9th, 2016 31 / 61

Matrix Algebra Definition Suppose X and Y are m n matrices. Then define x 11 x 12... x 1n y 11 y 12... y 1n x 21 x 22... x 2n X + Y =...... + y 21 y 22... y 2n...... x m1 x m2... x mn y m1 y m2... y mn x 11 + y 11 x 12 + y 12... x 1n + y 1n x 21 + y 21 x 22 + y 22... x 2n + y 2n =...... x m1 + y m1 x m2 + y m2... x mn + y mn Justin Grimmer (Stanford University) Methodology I September 9th, 2016 32 / 61

Matrix Algebra Definition Suppose X is an m n matrix and k R. Then, kx 11 kx 12... kx 1n kx 21 kx 22... kx 2n kx =...... kx m1 kx m2... kx mn Prove theorems about this tonight Justin Grimmer (Stanford University) Methodology I September 9th, 2016 33 / 61

R Code Using matrix command mat1<- matrix(na, nrow=3, ncol=2) ## Creating matrix mat1[1,1]<- 1 mat1[1,2]<- 2 mat1[2,1]<- 1 mat1[2,2]<- 4 mat1[3,1]<- exp(1) mat1[3,2]<- 4 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 34 / 61

R Code Using rbind r1<- c(1, 2) r2<- c(1, 4) r3<- c(exp(1), 4) mat1<- rbind(r1, r2, r3) Justin Grimmer (Stanford University) Methodology I September 9th, 2016 34 / 61

R Code Using cbind c1<- c(1, 1, exp(1) ) c2<- c(2, 4, 4) Justin Grimmer (Stanford University) Methodology I September 9th, 2016 34 / 61

R Code dim(mat1) [1] 3 2 mat1 + mat1 [,1] [,2] [1,] 2.000000 4 [2,] 2.000000 8 [3,] 5.436564 8 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 34 / 61

R Code What if the matrices are of different dimension mat1<- matrix(1, nrow=3, ncol=2) mat2<- matrix(2, nrow=10, ncol=3) mat1 + mat2 Error in mat1 + mat2 : non-conformable arrays Justin Grimmer (Stanford University) Methodology I September 9th, 2016 35 / 61

Matrix Transpose We will use matrix transpose to flip the dimensionality of a matrix Justin Grimmer (Stanford University) Methodology I September 9th, 2016 36 / 61

Matrix Transpose We will use matrix transpose to flip the dimensionality of a matrix X = x 11 x 12... x 1n x 21 x 22... x 2n...... x m1 x m2... x mn Justin Grimmer (Stanford University) Methodology I September 9th, 2016 36 / 61

Matrix Transpose We will use matrix transpose to flip the dimensionality of a matrix X = x 11 x 12... x 1n x 21 x 22... x 2n...... x m1 x m2... x mn X = x 11 x 12. x 1n Justin Grimmer (Stanford University) Methodology I September 9th, 2016 36 / 61

Matrix Transpose We will use matrix transpose to flip the dimensionality of a matrix X = x 11 x 12... x 1n x 21 x 22... x 2n...... x m1 x m2... x mn X = x 11 x 21 x 12 x 22.. x 1n x 2n Justin Grimmer (Stanford University) Methodology I September 9th, 2016 36 / 61

Matrix Transpose We will use matrix transpose to flip the dimensionality of a matrix X = x 11 x 12... x 1n x 21 x 22... x 2n...... x m1 x m2... x mn X = x 11 x 21... x 12 x 22..... x 1n x 2n... Justin Grimmer (Stanford University) Methodology I September 9th, 2016 36 / 61

Matrix Transpose We will use matrix transpose to flip the dimensionality of a matrix X = X = x 11 x 12... x 1n x 21 x 22... x 2n...... x m1 x m2... x mn x 11 x 21... x m1 x 12 x 22... x m2... x 1n x 2n... x mn Justin Grimmer (Stanford University) Methodology I September 9th, 2016 36 / 61

Matrix Transpose We will use matrix transpose to flip the dimensionality of a matrix X = X = x 11 x 12... x 1n x 21 x 22... x 2n...... x m1 x m2... x mn x 11 x 21... x m1 x 12 x 22... x m2... x 1n x 2n... x mn If X is an m n then X is n m. Justin Grimmer (Stanford University) Methodology I September 9th, 2016 36 / 61

Matrix Transpose We will use matrix transpose to flip the dimensionality of a matrix X = X = x 11 x 12... x 1n x 21 x 22... x 2n...... x m1 x m2... x mn x 11 x 21... x m1 x 12 x 22... x m2... x 1n x 2n... x mn If X is an m n then X is n m. If X = X then we say X is symmetric. Justin Grimmer (Stanford University) Methodology I September 9th, 2016 36 / 61

Matrix Transpose Example 1: X = ( ) 4 1 4 1 2 then X = 1 2 1 2 3 2 3 In R mat1<- matrix(c(1, 2, 3), nrow=3, ncol=2) mat2<- t(mat1) dim(mat1) 3 2 dim(mat2) 2 3 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 37 / 61

Matrix Multiplication How do we multiply matrices? Justin Grimmer (Stanford University) Methodology I September 9th, 2016 38 / 61

Matrix Multiplication How do we multiply matrices? Because we want to use matrix multiplication to solve equations we won t use an intuitive definition Justin Grimmer (Stanford University) Methodology I September 9th, 2016 38 / 61

Matrix Multiplication How do we multiply matrices? Because we want to use matrix multiplication to solve equations we won t use an intuitive definition Suppose we have two matrices Justin Grimmer (Stanford University) Methodology I September 9th, 2016 38 / 61

Matrix Multiplication How do we multiply matrices? Because we want to use matrix multiplication to solve equations we won t use an intuitive definition Suppose ( we) have two ( matrices ) 1 1 1 2 X =, Y = 1 1 3 4 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 38 / 61

Matrix Multiplication How do we multiply matrices? Because we want to use matrix multiplication to solve equations we won t use an intuitive definition Suppose ( we) have two ( matrices ) 1 1 1 2 X =, Y = 1 1 3 4 We will create a new matrix A by matrix multiplication: Justin Grimmer (Stanford University) Methodology I September 9th, 2016 38 / 61

Matrix Multiplication How do we multiply matrices? Because we want to use matrix multiplication to solve equations we won t use an intuitive definition Suppose ( we) have two ( matrices ) 1 1 1 2 X =, Y = 1 1 3 4 We will create a new matrix A by matrix multiplication: A = X Y Justin Grimmer (Stanford University) Methodology I September 9th, 2016 38 / 61

Matrix Multiplication How do we multiply matrices? Because we want to use matrix multiplication to solve equations we won t use an intuitive definition Suppose ( we) have two ( matrices ) 1 1 1 2 X =, Y = 1 1 3 4 We will create a new matrix A by matrix multiplication: A = X ( Y ) ( ) 1 1 1 2 = 1 1 3 4 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 38 / 61

Matrix Multiplication How do we multiply matrices? Because we want to use matrix multiplication to solve equations we won t use an intuitive definition Suppose ( we) have two ( matrices ) 1 1 1 2 X =, Y = 1 1 3 4 We will create a new matrix A by matrix multiplication: A = X ( Y ) ( ) 1 1 1 2 = 1 1 3 4 1 1 + 1 3 = Justin Grimmer (Stanford University) Methodology I September 9th, 2016 38 / 61

Matrix Multiplication How do we multiply matrices? Because we want to use matrix multiplication to solve equations we won t use an intuitive definition Suppose ( we) have two ( matrices ) 1 1 1 2 X =, Y = 1 1 3 4 We will create a new matrix A by matrix multiplication: A = X ( Y ) ( ) 1 1 1 2 = 1 1 3 4 = 1 1 + 1 3 1 2 + 1 4 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 38 / 61

Matrix Multiplication How do we multiply matrices? Because we want to use matrix multiplication to solve equations we won t use an intuitive definition Suppose ( we) have two ( matrices ) 1 1 1 2 X =, Y = 1 1 3 4 We will create a new matrix A by matrix multiplication: A = X ( Y ) ( ) 1 1 1 2 = 1 1 3 4 1 1 + 1 3 1 2 + 1 4 = 1 1 + 1 3 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 38 / 61

Matrix Multiplication How do we multiply matrices? Because we want to use matrix multiplication to solve equations we won t use an intuitive definition Suppose ( we) have two ( matrices ) 1 1 1 2 X =, Y = 1 1 3 4 We will create a new matrix A by matrix multiplication: A = X ( Y ) ( ) 1 1 1 2 = 1 1 3 4 1 1 + 1 3 1 2 + 1 4 = 1 1 + 1 3 1 2 + 1 4 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 38 / 61

Matrix Multiplication How do we multiply matrices? Because we want to use matrix multiplication to solve equations we won t use an intuitive definition Suppose ( we) have two ( matrices ) 1 1 1 2 X =, Y = 1 1 3 4 We will create a new matrix A by matrix multiplication: A = X ( Y ) ( ) 1 1 1 2 = 1 1 3 4 1 1 + 1 3 1 2 + 1 4 = 1 1 + 1 3 1 2 + 1 4 = ( ) 4 6 4 6 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 38 / 61

Matrix Multiplication Definition Suppose X is an m n matrix and Y is an n k matrix. Then define the matrix A an m k matrix that obtains from multiplying X and Y as, A = X Y x 11 x 12... x 1n y 11 y 12... y 1k x 21 x 22... x 2n y 21 y 22... y 2k =............ x m1 x m2... x mn y n1 y n2... y nk x 11 y 11 + x 12 y 21 +... + x 1n y n1... x 11 y 1k + x 12 y 2k +... + x 1n y nk =..... x m1 y 11 + x m2 y 21 +... + x mn y n1... x m1 y 11 + x m2 y 12 +... + x mn y nk Justin Grimmer (Stanford University) Methodology I September 9th, 2016 39 / 61

Definition Suppose X is an m n matrix and Y is an n k matrix. Write the row x 1 x 2 vectors of X =. and Y as column vector Y = ( ) y 1 y 2... y k. x m Then the m k matrix A = X Y can be written as x 1 y 1 x 1 y 2... x 1 y k x 2 y 1 x 2 y 2... x 2 y k A =...... x m y 1 x m y 2... x m y k Justin Grimmer (Stanford University) Methodology I September 9th, 2016 40 / 61

Matrix Multiplication Let s ( work on an) example together! 1 4 5 X = 10 2 3 2 3 Y = 1 5 What is X Y? 3 5 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 41 / 61

Matrix Multiplication Let s ( work on an) example together! 1 4 5 X = 10 2 3 2 3 Y = 1 5 What is X Y? 3 5 Not all matrices can be multiplied. Matrix AB exists only if the number of columns in A = number of rows in B. If AB exists we will say the matrices are conformable Justin Grimmer (Stanford University) Methodology I September 9th, 2016 41 / 61

Matrix Multiplication with a Vector 2 3 4 5 Suppose X = 1 5 1 2 a 3 4 matrix and that v = 3 5 3 4 matrix (or a column vector) what is X v? What is X v? 3 3 4 10 a 4 1 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 42 / 61

Algebraic Properties Suppose X is an m n matrix and Y is an n k matrix. Suppose that 1 0... 0 0 1... 0 I =..... as the identity matrix and that k Re.. 0 0... 1 - X Y Y X in general!!!! (but it could) - X I = X (let s talk it out!) - (X ) = X - (X Y ) = Y X - (kx ) = kx - (X + Y ) = X + Y Justin Grimmer (Stanford University) Methodology I September 9th, 2016 43 / 61

Examples, Implenting in R R and matrix multiplication X<- matrix(na, nrow=2, ncol=3) Y<- matrix(na, nrow=3, ncol=2) X[1,]<- c(1, 4, 5) X[2,]<- c(10, 2, 3) Y[1,]<- c(2, 3) Y[2,]<- c(1, 5) Y[3,]<- c(3, 5) A<- X%*%Y > A [,1] [,2] [1,] 21 48 [2,] 31 55 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 44 / 61

Matrix Inversion Big topic: suppose X is an n n matrix. We want to find the matrix X 1 such that where I is the n n identity matrix. Why? - Regression - Solving systems of equations X 1 X = X X 1 = I - Will provide intuition about colinearity, fixed effects, treatment designs and what we can learn as social scientists Calculate Properties of Inverses when do inverses exist Application to regression analysis Justin Grimmer (Stanford University) Methodology I September 9th, 2016 45 / 61

Some Motivating Examples Consider the following equations: x 1 + x 2 + x 3 = 0 0x 1 + 0x 2 + x 3 = 5 (0.1) Justin Grimmer (Stanford University) Methodology I September 9th, 2016 46 / 61

Some Motivating Examples Consider the following equations: x 1 + x 2 + x 3 = 0 x 1 + x 2 + 0x 3 = 0 0x 1 + x 2 + x 3 = 0 x 1 + 0x 2 + x 3 = 0 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 46 / 61

Some Motivating Examples Consider the following equations: x 1 + x 2 + x 3 = 0 0x 1 + 5x 2 + 0x 3 = 5 0x 1 + 0x 2 + 3x 3 = 6 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 46 / 61

Some Motivating Examples Consider the following equations: 1 1 1 A = 0 5 0 0 0 3 x 1 + x 2 + x 3 = 0 0x 1 + 5x 2 + 0x 3 = 5 0x 1 + 0x 2 + 3x 3 = 6 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 46 / 61

Some Motivating Examples Consider the following equations: 1 1 1 A = 0 5 0 0 0 3 x = (x 1, x 2, x 3 ) x 1 + x 2 + x 3 = 0 0x 1 + 5x 2 + 0x 3 = 5 0x 1 + 0x 2 + 3x 3 = 6 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 46 / 61

Some Motivating Examples Consider the following equations: 1 1 1 A = 0 5 0 0 0 3 x = (x 1, x 2, x 3 ) b = (0, 5, 6) x 1 + x 2 + x 3 = 0 0x 1 + 5x 2 + 0x 3 = 5 0x 1 + 0x 2 + 3x 3 = 6 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 46 / 61

Some Motivating Examples Consider the following equations: x 1 + x 2 + x 3 = 0 0x 1 + 5x 2 + 0x 3 = 5 0x 1 + 0x 2 + 3x 3 = 6 1 1 1 A = 0 5 0 0 0 3 x = (x 1, x 2, x 3 ) b = (0, 5, 6) The system of equations are now, Justin Grimmer (Stanford University) Methodology I September 9th, 2016 46 / 61

Some Motivating Examples Consider the following equations: x 1 + x 2 + x 3 = 0 0x 1 + 5x 2 + 0x 3 = 5 0x 1 + 0x 2 + 3x 3 = 6 1 1 1 A = 0 5 0 0 0 3 x = (x 1, x 2, x 3 ) b = (0, 5, 6) The system of equations are now, Ax = b Justin Grimmer (Stanford University) Methodology I September 9th, 2016 46 / 61

Some Motivating Examples Consider the following equations: x 1 + x 2 + x 3 = 0 0x 1 + 5x 2 + 0x 3 = 5 0x 1 + 0x 2 + 3x 3 = 6 1 1 1 A = 0 5 0 0 0 3 x = (x 1, x 2, x 3 ) b = (0, 5, 6) The system of equations are now, Ax = b A 1 exists if and only if Ax = b has only one solution. Justin Grimmer (Stanford University) Methodology I September 9th, 2016 46 / 61

Matrix Inversion, Definition Definition Suppose X is an n n matrix. We will call X 1 the inverse of X if X 1 X = X X 1 = I If X 1 exists then X is invertible. If X 1 does not exist, then we will say X is singular. Justin Grimmer (Stanford University) Methodology I September 9th, 2016 47 / 61

Matrix Inversion You ll never invert a matrix by hand. We re going to use R X<- matrix(na, nrow=3, ncol=3) X[1,]<- c(2, 3, 4) X[2,]<- c(3, 1, 3) X[3,]<- c(2, 4, 2) X.inv<- solve(x) > X.inv [,1] [,2] [,3] [1,] -0.5 0.5 0.25 [2,] 0.0-0.2 0.30 [3,] 0.5-0.1-0.35 X.inv%*%X [,1] [,2] [,3] [1,] 1 0.000000e+00-2.220446e-16 [2,] 0 1.000000e+00 0.000000e+00 [3,] 0-2.220446e-16 1.000000e+00 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 48 / 61

Matrix Inversion 1) Calculate Inverses 2) Properties of Inverses Justin Grimmer (Stanford University) Methodology I September 9th, 2016 49 / 61

Matrix Inversion 1) Calculate Inverses 2) Properties of Inverses Theorem The inverse of matrix X, X 1, is unique Justin Grimmer (Stanford University) Methodology I September 9th, 2016 49 / 61

Matrix Inversion 1) Calculate Inverses 2) Properties of Inverses Theorem The inverse of matrix X, X 1, is unique Proof. By way of contradiction, suppose not. Then there are at least two matrices A and C such that AX = I and CX = I This implies that, AXC = (AX )C = I C = C Justin Grimmer (Stanford University) Methodology I September 9th, 2016 49 / 61

Matrix Inversion But it also implies that AXC = A(X C) = A(I ) = A So C = AXC = A or C = A but this contradicts our assumption that there are two unique inverses. Justin Grimmer (Stanford University) Methodology I September 9th, 2016 50 / 61

Matrix Inversion Theorem Suppose A has inverse A 1 and B has inverse B 1. Then, (AB) 1 = B 1 A 1 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 51 / 61

Matrix Inversion Theorem Suppose A has inverse A 1 and B has inverse B 1. Then, (AB) 1 = B 1 A 1 Proof. We need to show that (B 1 A 1 )(AB) = (AB)(B 1 A 1 ) = I. (B 1 A 1 )(AB) = B 1 (A 1 A)B = B 1 I B = B 1 B = I Justin Grimmer (Stanford University) Methodology I September 9th, 2016 51 / 61

Matrix Inversion (AB)(B 1 A 1 ) = A(BB 1 )A 1 = AI A 1 = AA 1 = I So AB is invertible and (AB) 1 = B 1 A 1. Justin Grimmer (Stanford University) Methodology I September 9th, 2016 52 / 61

Challenge Inversion Proofs - Show that (A 1 ) 1 = A. - Show that (ka) 1 = 1 k A 1 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 53 / 61

Matrix Inversion 1) How to Calculate an Inverse 2) Inversion properties 3) When do inverses exist? Linear Independence: not repeated information in matrix will be the key (for both inversion and regressions) Justin Grimmer (Stanford University) Methodology I September 9th, 2016 54 / 61

Matrix Inversion: Existence Definition Suppose we have a set of vectors S = {v 1, v 2,..., v r } And consider the system of equations k 1 v 1 + k 2 v 2 +... + k r v r = 0 If the only solution is k 1 = 0, k 2 = 0, k 3 = 0,..., k r = 0 then we say that the set is linearly independent. If there are other solutions, then the set is linearly dependent. Justin Grimmer (Stanford University) Methodology I September 9th, 2016 55 / 61

Matrix Inversion: Existence Consider v 1 = (1, 0, 0), v 2 = (0, 1, 0), v 3 = (0, 0, 1) Can we write this as a combination of other vectors? Justin Grimmer (Stanford University) Methodology I September 9th, 2016 56 / 61

Matrix Inversion: Existence Consider v 1 = (1, 0, 0), v 2 = (0, 1, 0), v 3 = (0, 0, 1) Can we write this as a combination of other vectors? no! Justin Grimmer (Stanford University) Methodology I September 9th, 2016 56 / 61

Matrix Inversion: Existence Consider v 1 = (1, 0, 0), v 2 = (0, 1, 0), v 3 = (0, 0, 1) Can we write this as a combination of other vectors? no! Consider v 1 = (1, 0, 0), v 2 = (0, 1, 0), v 3 = (0, 0, 1), v 4 = (1, 2, 3). Can we write this as a combination of other vectors? Justin Grimmer (Stanford University) Methodology I September 9th, 2016 56 / 61

Matrix Inversion: Existence Consider v 1 = (1, 0, 0), v 2 = (0, 1, 0), v 3 = (0, 0, 1) Can we write this as a combination of other vectors? no! Consider v 1 = (1, 0, 0), v 2 = (0, 1, 0), v 3 = (0, 0, 1), v 4 = (1, 2, 3). Can we write this as a combination of other vectors? v 4 = v 1 + 2v 2 + 3v 3 Justin Grimmer (Stanford University) Methodology I September 9th, 2016 56 / 61

Matrix Inversion: Existence Theorem Suppose v 1, v 2,..., v K R n. If K > n then the set is linearly dependent Justin Grimmer (Stanford University) Methodology I September 9th, 2016 57 / 61

Matrix Inversion: Existence Theorem Suppose v 1, v 2,..., v K R n. If K > n then the set is linearly dependent - v 1 = (v 11, v 21,..., v n1 ) Justin Grimmer (Stanford University) Methodology I September 9th, 2016 57 / 61

Matrix Inversion: Existence Theorem Suppose v 1, v 2,..., v K R n. If K > n then the set is linearly dependent - v 1 = (v 11, v 21,..., v n1 ) - Says that if there are more vectors in the set than elements in each vector, one must be linearly dependent Justin Grimmer (Stanford University) Methodology I September 9th, 2016 57 / 61

Matrix Inversion: Existence We care because of the following theorem Theorem x 2 Suppose X is an n n matrix. Recall we can write this matrix as.. Then X has an inverse if and only if S = {x 1, x 2,..., x n } is linearly independent If this is true, we say X has full rank x 1 x n Justin Grimmer (Stanford University) Methodology I September 9th, 2016 58 / 61

Linear Regression Justin Grimmer (Stanford University) Methodology I September 9th, 2016 59 / 61

Linear Regression In 350a you learn about linear regression. For each i (individual) we observe covariates x i1, x i2,..., x ik and independent variable Y i. Then, Justin Grimmer (Stanford University) Methodology I September 9th, 2016 59 / 61

Linear Regression In 350a you learn about linear regression. For each i (individual) we observe covariates x i1, x i2,..., x ik and independent variable Y i. Then, Y i = β 0 + β 1 x i1 + β 2 x i2 +... + β k x ik Justin Grimmer (Stanford University) Methodology I September 9th, 2016 59 / 61

Linear Regression In 350a you learn about linear regression. For each i (individual) we observe covariates x i1, x i2,..., x ik and independent variable Y i. Then, Y 1 = β 0 + β 1 x 11 + β 2 x 12 +... + β k x 1k Y 2 = β 0 + β 1 x 21 + β 2 x 22 +... + β k x 2k... Y i = β 0 + β 1 x i1 + β 2 x i2 +... + β k x ik... Y n = β 0 + β 1 x n1 + β 2 x n2 +... + β k x nk Justin Grimmer (Stanford University) Methodology I September 9th, 2016 59 / 61

Linear Regression - Define x i = (1, x i1, x i2,..., x ik ) x 1 x 2 - Define X =. x n - Define β = (β 0, β 1,..., β k ) - Define Y = (Y 1, Y 2,..., Y n ). Then we can write Y = X β Justin Grimmer (Stanford University) Methodology I September 9th, 2016 60 / 61

Linear Regression Y = X β X Y = X X β (X X ) 1 X Y = (X X ) 1 X X β (X X ) 1 X Y = β Big question: is (X X ) 1 invertible? We ll investigate in homework! Justin Grimmer (Stanford University) Methodology I September 9th, 2016 61 / 61