LECTURE 6 Systems of Linear Equations You may recall that in Math 303, matrices were first introduced as a means of encapsulating the essential data underlying a system of linear equations; that is to say, given a set of n linear equations in m variables one could arrange coefficients a ij of the variables x j into a rectangular array with n rows and m columns a a m A = a n a nm arrange the numbers on the right hand side as an array with one column b b = In fact, once matrix multiplication was been defined it was possible to replace the original set of linear with a single matrix equation Ax = b where x is m matrix holding the variables x,, x m x = b n x x m But you ll note, in this course, I have thus far avoided any connections to systems of linear equations My reason for this is to constantly stress the abstract vector space point of view However, now having beat that horse to death, I think we can safely look at a principal application of linear algebra without abandoning our abstract perspective Terminology 6 Let F be a field An n m linear system over F is a collection of n linear equations in m unknowns where each coefficient a ij F and each b i F Such a system will be called solvable if there exists a choice of field elements x,, x n F such that each equation is satisfied The field elements a ij,
6 SYSTEMS OF LINEAR EQUATIONS 2 i n, j m can be arranged in an n m matrix with entries in F: a a m A = a n a nm This matrix is called the coefficient matrix of the linear system The field elements b,, b n can be arranged in a n column vector b b = This column vector (for lack of a better standard terminology) will be referred to as the inhomogeneous part of the system If it so happens that each b i = 0 F, i n, then we say that the linear system is homogeneous Theorem 62 Consider a n m linear system with coefficient matrix A and inhomogenous part b F n For each i between and n, let c i denote the element of F n formed by writing the entries in the i th column of A in order (from top to bottom) Then the linear system has a solution if and only if either of the following two conditions is satisfied (i) b span (c,, c m ) (ii) dim span (c,, c m ) = dim span (c,, c m, b) b n Proof Suppose b lies in the span of c,, c m then there exists elements λ,, λ m F such that b = λ c + + λ m c m Both sides of this equation are vectors in F n Writing this vector equation component by component b = (b) = λ (c ) + + λ m (c m ) = λ a + λ m a m b n = (b) n = λ (c ) n + + λ m (c m ) n = λ a n + + λ m a nm we see x = λ,, x m = λ m furnishes us with a solution of the linear system On the other hand, if x,, x m is a solution to the linear system, we can reverse the above argument and regard the original linear system as stipulating a relationship between the entries in the columns of A, the numbers x,, x m and a vector b F n Interpreted this way, the relationship simply express the vector b as a linear combination of the column vectors of A To prove the equivalence of (i) and (ii) we note first that (i) immediately imples (ii) To see that (ii) also implies (i), Suppose that dim span (c,, c m ) = dim span (c,, c m, b) By Theorem 53, pick a subset {c,, c k } of the column vectors {c,, c m } that forms a basis for span (c,, c m ) Then b span (c,, c k ) if and only if b span (c,, c m ) Now if b / span (c,, c k ), then the can be no dependence relation amongst the vectors {c,, c k, b}, hence {c,, c k, b} would be a linearly independent set, hence k = dim span {c,, c k} = dim span {c,, c m } but k + = dim span (c,, c k, b) = dim span (c,, c m, b) which contradicts our hypothesis Example 63 Determine if the linear system has a solution 2x x 2 + x 3 = x x 2 + 2x 3 = 0 x + x 3 = 3
6 SYSTEMS OF LINEAR EQUATIONS 3 We have A = 2 2 0, b = By the preceding theorem, we just to determine if b lies in the span of c = 2, c 2 = 0, c 3 = 2 First we use row reduction to obtain a basis for span (c, c 2, c 3 ) The coefficient matrix of the vectors c, c 2, c 3 with respect to the standard basis of R 3 is 2 0 2 This matrix can be row reduced to 2 0 0 0 0 and so [, 2, ] and [0,, ] provide a basis for span (c, c 2, c 3 ) So dim span (c, c 2, c 3 ) = 2 Let us similarly compute the dimension of span (c, c 2, c 3, b) 2 0 0 2 0 row reduction 0 0 0 0 0 3 0 0 0 and so dim span (c, c 2, c 3, b) = 3 Since dim span (c, c 2, c 3 ) = 2 3 = dim span (c, c 2, c 3, b), we conclude that the given linear system has no solution 0 3 Let me now introduce some standard terminology that allows a more succinct expression of the result of Theorem 72 Definition 64 The rank of a linear system (or of its associated coefficient matrix) is the dimension of the column space of its coefficient matrix Definition 65 Let A is the coefficient matrix for a n m linear system and let b is the inhomogeneous part of the same system The augmented matrix for this system is the n (m + ) matrix a a m b [A b] a n a nm b n Theorem 66 A linear system has a solution if and only if rank of its coefficient matrix equals the rank of its augmented matrix The column space of a matrix is, of course, just the span of its column vectors
6 SYSTEMS OF LINEAR EQUATIONS 4 Thus far, we have only developed methods for checking as to whether or not a given linear system has solutions Before actually working out some methods of solution, let me first give one nice structural theorem about the solutions of such a system Recall that a linear system is called homogeneous if each b i, i n is equal to 0 F ; otherwise, it is called non-homogeneous Homogeneous linear systems have a very useful property: Theorem 67 Let S (A, 0) be a homogeneous n m linear system Then the solution set of S (A, 0) is a subspace of F m Let [c,, c m ] be the columns of A By the proof of Theorem 62, If x, y F m are solutions of S (A, 0), then x c + + x m c m = 0 y c + + y m c m = 0 Now consider a linear combination of x and y, αx + βy We have (αx + βy) c + + (αx + βy) m c m = α (x c + + x m c m ) + β (y c + + y m c m ) = α 0 + β 0 = 0 And so every linear combination of x and y is also a solution of S (A, 0) Hence, the solution set of S (A, 0) is a subspace of F m Before discussing the solutions of non-homogeneous linear systems, let me introduce a simple geometric construction Definition 68 Let p 0 be an element of a vector space V and let S be subspace of V The hyperplane through p 0 generated by S is the set H p0,s = {v V v = p 0 + s ; s S} Remark 69 A special case is when S = span (d) In this case, H p0,s is called the line through p 0 in the direction of d Remark 60 Hyperplanes are subsets of V, but they usually are not subspaces prove H p0,s is a subspace p 0 S In fact, later we shall Back to linear systems If we are given a non-homogenous linear system, we can always attach to it a corresponding homogeneous system by considering the homogeneous linear system with exactly the same coefficient matrix as the original linear system Theorem 6 Suppose S (A, b) is a n m linear system with coefficient matrix A and inhomogeneous part b and S (A, 0) is the corresponding homogeneous linear system Then if x = [x,, x m ] F m is a solution to S (A, b) then any other solution x of S (A, b) can be expressed as x = x + x 0
6 SYSTEMS OF LINEAR EQUATIONS 5 where x 0 is a solution to S (A, 0) Proof Let c,, c m be the column vectors of A Suppose x and x are two solutions of S (A, b) From the proof of Theorem 72, this means x c + x 2 c 2 + + x m c m = b x c + x 2c 2 + + x mc m = b If we subtract the first equation from the second we get (x x ) c + (x 2 x 2 ) c 2 + + (x m x m ) c m = 0 F n which says that x x is a solution of S (A, 0) If we denote this solution by x 0 we have x x = x 0 x = x + x 0 I ll now rephrase this result in a manner similar to the way the solutions of a linear differential equation are typically described Corollary 62 Let x be a solution of an n m linear system S (A, b), and let S be the solution set of the corresponding homogeneous linear system S (A, 0) Then the solution set of S (A, b) coincides with the hyperplane through x generated by S Proof Let x be given as above and let y be an arbitrary solution of S (A, b) By the preceding theorem, y x = s for some s S (A, 0) y = x + s for some s S y H x,s On the other hand, it is easy to see that if y H x,s then y is a solution of S (A, b) For y c + + y m c m = (x + s) c + + (x + s) m c m = (x c + x 2 c 2 + + x m c m ) + (s c + s 2 c 2 + + s m c m ) = b + 0 = b