Footnotes to Linear Algebra (MA 540 fall 2013), T. Goodwillie, Bases
November 18, 2013

1 Spanning and linear independence

I will outline a slightly different approach to the material in Chapter 2 of Axler's text.

1.1 Spanning

If $V$ is a vector space and $S$ is a finite subset of $V$, then the span of $S$ may be defined as the set of all vectors $v \in V$ such that $v$ can be written as a linear combination of elements of $S$:
$$v = \sum_{s \in S} c_s s.$$
The span of $S$ is always a subspace of $V$. The span of $S$ contains $S$. Every subspace of $V$ that contains $S$ must contain the span of $S$, and of course every subspace of $V$ that contains the span of $S$ must contain $S$; the span of $S$ is the smallest subspace that contains $S$.

We say that $S$ is a spanning set for $V$ if the span of $S$ is $V$. Thus if $S$ is a spanning set for $V$, then for every $v \in V$ there is at least one way to choose scalars $c_s$ such that $v = \sum_{s \in S} c_s s$.

1.2 Independence

We say that $S$ is a linearly independent set if for every $v \in V$ there is at most one way to choose scalars $c_s$ such that $v = \sum_{s \in S} c_s s$. An equivalent condition on the set $S$ is that whenever $0 = \sum_{s \in S} c_s s$, the scalars $c_s$ must all be zero.
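Although the notes stay abstract, both definitions can be tried out numerically. The following Python sketch is my own addition, not part of the notes: it represents vectors in $R^n$ as numpy arrays and reduces span membership to a least-squares solve and linear independence to a rank computation. The function names and the numerical tolerance are assumptions made for illustration.

```python
import numpy as np

def in_span(S, v, tol=1e-10):
    """Is v a linear combination of the vectors in the list S?"""
    A = np.column_stack(S)                     # vectors of S as columns
    c = np.linalg.lstsq(A, v, rcond=None)[0]   # best coefficients c_s
    return np.linalg.norm(A @ c - v) < tol     # exact fit means v is in the span

def is_independent(S, tol=1e-10):
    """S is linearly independent iff its matrix has full column rank."""
    A = np.column_stack(S)
    return np.linalg.matrix_rank(A, tol=tol) == A.shape[1]

S = [np.array([1.0, 0.0, 1.0]), np.array([0.0, 1.0, 1.0])]
print(in_span(S, np.array([2.0, 3.0, 5.0])))            # True: 2*s1 + 3*s2
print(is_independent(S))                                 # True
print(is_independent(S + [np.array([1.0, 1.0, 2.0])]))   # False: s1 + s2
```

The rank test expresses exactly the "all coefficients zero" condition: the columns admit a nontrivial relation precisely when the rank is smaller than the number of columns.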
Let us show why this condition is equivalent to $S$ being linearly independent. First assume that $S$ is linearly independent; we show that $S$ satisfies the second condition. If scalars $c_s$ are such that $0 = \sum_{s \in S} c_s s$, then, since also $0 = \sum_{s \in S} 0 \cdot s$, for every $s \in S$ the scalar $c_s$ (the coefficient of $s$ in the first expression) must equal $0$ (the coefficient of $s$ in the second expression). On the other hand, what if we assume that $S$ is not linearly independent? Then for some vector $v \in V$ there are two different collections of scalars both giving $v$:
$$v = \sum_{s \in S} c_s s = \sum_{s \in S} d_s s.$$
But then we have
$$0 = v - v = \sum_{s \in S} c_s s - \sum_{s \in S} d_s s = \sum_{s \in S} (c_s - d_s) s,$$
so that $0$ has been written as a linear combination of the elements of $S$ using coefficients $c_s - d_s$ that are not all zero. So in this case $S$ does not satisfy the second condition.

1.3 Bases

A subset of $V$ is called a basis for $V$ if it is both a spanning set and linearly independent. Thus if $B$ is a basis for $V$, then for every $v \in V$ there is exactly one way to choose scalars $c_b$ such that $v = \sum_{b \in B} c_b b$.

1.4 Bases and subspaces

It is clear that any subset of a linearly independent set must be linearly independent (just as it is clear that any subset of $V$ that contains a spanning set must be a spanning set). And every linearly independent set in $V$ is a basis for a subspace (its span). In particular, whenever we take a basis $B$ for $V$ and write it as the union of two sets $\{u_1, \dots, u_m\}$ and $\{w_1, \dots, w_n\}$ with empty intersection, these sets are bases for subspaces $U$ and $W$. In fact, $V$ is the direct sum of $U$ and $W$. Why? Clearly $U + W$ is $V$, because it is a subspace of $V$ that contains the basis. To see that $U \cap W$ is $\{0\}$, suppose that $v \in U \cap W$. Then there are scalars $c_i$ and $d_j$ such that
$$c_1 u_1 + \dots + c_m u_m = v = d_1 w_1 + \dots + d_n w_n.$$
The equation $c_1 u_1 + \dots + c_m u_m - d_1 w_1 - \dots - d_n w_n = 0$ implies that all $c_i$ and $d_j$ are zero, since $B$ is a basis. Thus $v = 0$, as we wished to show.

Conversely, if $V = U \oplus W$ (i.e. if $V = U + W$ and $U \cap W = \{0\}$), then the union of a basis for $U$ and a basis for $W$ is always a basis for $V$. To check this carefully, suppose that $u_1, \dots, u_m$ and $w_1, \dots, w_n$ are bases for $U$ and $W$. The span of $u_1, \dots, w_n$ is a subspace of $V$ that contains both $U$ and $W$, so it must be $V$. For linear independence, suppose that
$$(c_1 u_1 + \dots + c_m u_m) + (d_1 w_1 + \dots + d_n w_n) = 0.$$
The vector $c_1 u_1 + \dots + c_m u_m = -(d_1 w_1 + \dots + d_n w_n)$ is in both $U$ and $W$, so it is zero. But this implies that all $c_i$ are zero because $u_1, \dots, u_m$ are linearly independent, and it implies that all $d_j$ are zero because $w_1, \dots, w_n$ are linearly independent.

1.5 A remark

We have not yet shown that $V$ has a basis. That's because we have not used the fact that our scalars form a field. It is time to start dividing by non-zero scalars.

1.6 Two key observations

(1) If $\sum_{s \in S} c_s s = 0$ and the coefficient $c_{s_0}$ of some particular $s_0 \in S$ is not zero, then $s_0$ can be expressed as a linear combination of the vectors $s \in S$ different from $s_0$. In fact, $s_0$ is equal to $\sum_{s \neq s_0} \left(-\frac{c_s}{c_{s_0}}\right) s$.

(2) Assume that $I$ is a linearly independent set and that $s$ is not in the span of $I$. Then $I \cup \{s\}$ is again linearly independent. In fact, if there were a nontrivial linear relation among the elements of $I \cup \{s\}$, then the coefficient of $s$ could not be zero, since that would give a nontrivial linear relation among just the elements of $I$, contradicting the first assumption. And if the coefficient of $s$ is not zero, then (1) applies and shows that $s$ is in the span of $I$, contradicting the second assumption.

1.7 Existence of a basis, one approach

To show that $V$ has a basis, we can proceed as follows. (This works if $V$ has a finite spanning set.) Start with a finite spanning set $S$. If it is not a basis for $V$ (not linearly independent), then there is a nontrivial linear relation among the elements of $S$, so that (1) above says that there is some element of $S$ such that when we remove it we still have a spanning set. Remove such an element of $S$. Repeat until the spanning set becomes a basis.
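The pruning procedure of 1.7 can be sketched in a few lines of Python for vectors in $R^n$. This is my own finite numerical stand-in, not the notes' method verbatim: a vector is redundant exactly when removing it does not lower the rank, which is the computational form of observation (1).

```python
import numpy as np

def prune_to_basis(S, tol=1e-10):
    """Repeatedly discard a vector that lies in the span of the others,
    until the remaining vectors are linearly independent (1.7)."""
    S = list(S)
    i = 0
    while i < len(S):
        rest = S[:i] + S[i + 1:]
        r_rest = np.linalg.matrix_rank(np.column_stack(rest), tol=tol) if rest else 0
        r_all = np.linalg.matrix_rank(np.column_stack(S), tol=tol)
        if r_rest == r_all:
            del S[i]      # S[i] is in the span of the rest: still a spanning set
        else:
            i += 1        # S[i] is essential; keep it and move on
    return S              # linearly independent, hence a basis for the span

spanning = [np.array([1.0, 0.0]), np.array([0.0, 1.0]), np.array([1.0, 1.0])]
basis = prune_to_basis(spanning)
print(len(basis))  # 2
```

Each pass either removes a redundant vector or advances, so the loop terminates, and at the end every vector raises the rank, which is exactly linear independence.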
1.8 Existence of a basis, an opposite approach

This produces a basis for $V$ as long as $V$ does not have arbitrarily large linearly independent sets. Maybe the empty set is a basis (i.e. maybe $0$ is the only vector in $V$). If not, then choose a vector $v_1 \neq 0$. Maybe the linearly independent set consisting of $v_1$ alone is a basis. If not, then that set does not span $V$, so we can choose a vector $v_2$ not in the span of $v_1$. By (2) above, the set consisting of the two vectors $v_1$ and $v_2$ is linearly independent. Maybe this set is a basis. If not, then that set does not span $V$, so we can choose a vector $v_3$ not in the span of $v_1$ and $v_2$. By (2) above, the set consisting of these three vectors is linearly independent. Maybe it is a basis. And so on. This must yield a basis eventually (unless it goes on forever, producing arbitrarily large linearly independent sets).

1.9 An aside: finite-dimensional and infinite-dimensional

It turns out that there are only two possibilities for a vector space $V$:

1.9.1 The finite-dimensional case

By finite-dimensional we mean that there exists a finite set spanning $V$. In this case, as proved below: $V$ has a finite basis (we showed this in 1.7); every basis has the same number of elements as every other basis, and we call this number $n$; every spanning set has at least $n$ elements; every linearly independent set has at most $n$ elements. We call $n$ the dimension of $V$ and denote it by $\dim(V)$.

1.9.2 The infinite-dimensional case

By infinite-dimensional we mean that there exists no finite set spanning $V$. In this case, for every $n$ there is a linearly independent set having $n$ elements. To see this, follow the procedure described in 1.8: the process never ends. In fact, although we will not do this in the course, the concepts of spanning set, linearly independent set, and basis can be extended beyond the finite-dimensional case. It can even be shown that $V$ still has a basis if $V$ is infinite-dimensional; the basis will now be an infinite set.

1.10 Back to business

All of the assertions above about the finite-dimensional case follow immediately from the next lemma.

1.11 The fundamental inequality

Lemma: If $S$ spans $V$ and $I$ is a linearly independent set in $V$, then $S$ has at least as many elements as $I$.

Proof: We first show that if $S$ spans $V$ and $I$ is a linearly independent set in $V$ such that $S$ does not contain $I$, then there exists another spanning set $S'$ that has the same number of elements as $S$ but has more elements of $I$ in it than $S$ does. To do this, we choose an element $i \in I$ such that $i \notin S$. There are scalars $c_s$ such that $i = \sum_{s \in S} c_s s$, because $S$ spans $V$. The scalars $c_s$ for $s \notin I$ cannot all be zero, because an element of $I$ cannot be a linear combination of other elements of $I$, since $I$ is linearly independent. Choose some $s \in S$ such that $c_s \neq 0$ and $s \notin I$. If $S'$ is defined to be $S$ with $i$ adjoined and $s$ removed, then $S'$ has the same number of elements as $S$ and has more elements of $I$ in it than $S$ does. Also, it still spans $V$, because the deleted element $s$ is in its span, by (1) above.

To complete the proof of the Lemma, suppose that some spanning set for $V$ has $n$ elements and that the set $I$ is linearly independent and has $m$ elements. Choose a spanning set having $n$ elements and having as many elements of $I$ as possible. It must have all of $I$ in it, because otherwise the previous paragraph shows how to make a new spanning set of the same size with more elements of $I$ in it. Therefore $n \geq m$.

1.12 Dimension and sum and intersection

At this point we know (see Bases and subspaces above) that $\dim(U \oplus W) = \dim(U) + \dim(W)$. We want to show that more generally $\dim(U + W) = \dim(U) + \dim(W) - \dim(W \cap U)$. If $V$ is a vector space and $W$ is a subspace of $V$, let us say that a subspace $C$ of $V$ is complementary to $W$ in $V$ if $V = W \oplus C$, i.e. if $W + C = V$ and $W \cap C = \{0\}$. For every subspace of $V$ there is at least one complementary subspace (if $V$ is finite-dimensional).
We can obtain this by starting with a basis of $W$, adjoining new vectors to it to get a basis for $V$, and referring again to Bases and subspaces above: the new vectors must necessarily form a basis for some subspace complementary to $W$.
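This extension step can be imitated numerically. The sketch below is my own illustration (the function name and the choice of standard basis vectors as candidates are assumptions): starting from a basis of $W \subseteq R^n$, it greedily adjoins standard basis vectors that are not yet in the span, which by observation (2) preserves independence; the adjoined vectors form a basis for a complement of $W$, of dimension $\dim(V) - \dim(W)$.

```python
import numpy as np

def extend_to_basis(W_basis, n, tol=1e-10):
    """Adjoin standard basis vectors of R^n not in the current span;
    return the new vectors, a basis for a complement of span(W_basis)."""
    current = list(W_basis)
    new = []
    for k in range(n):
        e = np.zeros(n)
        e[k] = 1.0
        trial = current + new + [e]
        # full column rank means e was outside the span: safe to adjoin
        if np.linalg.matrix_rank(np.column_stack(trial), tol=tol) == len(trial):
            new.append(e)
    return new

W_basis = [np.array([1.0, 1.0, 0.0])]
C_basis = extend_to_basis(W_basis, 3)
print(len(C_basis))  # dim(V) - dim(W) = 3 - 1 = 2
```

Since the span only grows and every $e_k$ ends up in it, the combined list spans $R^n$, so it really is a basis of $V$ extending the given basis of $W$.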
Note that for any given $W \subseteq V$ there are generally many complementary subspaces. But we know they all have the same dimension, namely $\dim(V) - \dim(W)$.

Now suppose that some finite-dimensional vector space $V$ is the sum of two subspaces, $V = W + U$, but do not assume the sum is a direct sum. That is, do not assume that $W \cap U = \{0\}$. Choose a subspace of $W$ that is complementary to $W \cap U$ (in $W$). Call it $C$. I claim that, in addition to being complementary to $W \cap U$ in $W$, $C$ is also complementary to $U$ in $V$. To prove this we have to show two things:

1. $C + U = V$. To see this, observe that
$$V = W + U = (C + (W \cap U)) + U = C + ((W \cap U) + U) = C + U,$$
where the last step uses that $W \cap U \subseteq U$.

2. $C \cap U = \{0\}$. To see this, observe that
$$C \cap U = (C \cap W) \cap U = C \cap (W \cap U) = \{0\},$$
where the first step uses that $C = C \cap W$.

It follows that $\dim(C)$ is equal to both $\dim(V) - \dim(U)$ and $\dim(W) - \dim(W \cap U)$. Since $V = U + W$, equating these gives $\dim(U + W) = \dim(U) + \dim(W) - \dim(W \cap U)$, as promised.

2 Infinite bases

This section is an entirely optional supplement to the course. The course is mainly about finite-dimensional vector spaces, and much of what we will learn only applies in the finite-dimensional case. But the notions of spanning and linear independence can be extended to the general case in a good way. Instead of finite lists or finite sets of vectors, we use arbitrary sets of vectors. If $S$ is a subset of $V$, not necessarily finite, then call $S$ a spanning set for $V$ if every element of $V$ may be expressed as a (finite) linear combination $v = \sum_s c_s s$, where the sum is over all $s$ in some finite subset of $S$ and for each such $s$, $c_s$ is a scalar. We can also write this as an infinite sum, with the understanding that $c_s = 0$ for all but finitely many $s \in S$. More generally, we can speak of $S$ spanning a subspace of $V$. Every subset of $V$ spans some subspace. Call $S$ a linearly independent set if whenever $\sum_s c_s s = 0$ then $c_s = 0$ for every $s \in S$. Call $S$ a basis if it spans $V$ and is also linearly independent. This means that every vector can be uniquely expressed as a linear combination of the vectors in $S$.
The following statements can be proved without assuming that $V$ has a finite spanning set, but the proof depends on the Axiom of Choice:

Every vector space has a basis. More generally, every spanning set in $V$ has a subset that is a basis, and for every linearly independent set in $V$ there is some basis that contains it. Even more generally, we can prove that, given any spanning set $S$ and any linearly independent set $I$ such that $I \subseteq S$, there is a basis $B$ such that $I \subseteq B \subseteq S$.

The key point is observation (2) above, which is still valid if the set $I$ is infinite. Let us use this observation to prove the desired statement. In the case of a countable spanning set we do not need Choice. Take the elements of $S \setminus I$ and list them in some order $v_1, v_2, \dots$. Let $W_n$ be the subspace of $V$ spanned by $I \cup \{v_1, \dots, v_n\}$. Note that $W_{n+1}$ is spanned by $W_n$ and $v_{n+1}$. The set $I$ is a basis for $W_0$. Inductively make a basis $B_n$ for $W_n$. Let $B_0$ be $I$. Having made $B_n$, make $B_{n+1}$ by either adjoining the vector $v_{n+1}$ to $B_n$ or not. If $v_{n+1} \in W_n$, then let $B_{n+1} = B_n$: in this case $W_{n+1} = W_n$, so that $B_n$ is a basis for $W_{n+1}$. If $v_{n+1} \notin W_n$, then let $B_{n+1} = B_n \cup \{v_{n+1}\}$. This is linearly independent by the observation above, and it spans $W_{n+1}$. At the end of this infinite process, we have what we want: the union of the sets $B_0 \subseteq B_1 \subseteq \dots$ is linearly independent and it spans $V$. (To extend this argument to the uncountable case we need the Axiom of Choice in some form. I won't go into that here.)

One consequence of this is that for every subspace of $V$ there is a complementary subspace. This way of making a basis generalizes 1.8 above. There does not appear to be a version of 1.7 in the infinite-dimensional case.
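For a finite stand-in, the inductive construction of the $B_n$ amounts to a single left-to-right pass over the listed vectors. The following Python sketch is my own illustration (names and the rank test are assumptions, and of course it only handles finitely many vectors in $R^n$): starting from an independent set $I$, it adjoins $v_{n+1}$ exactly when $v_{n+1} \notin W_n$, producing a basis $B$ with $I \subseteq B \subseteq S$.

```python
import numpy as np

def basis_between(I, rest):
    """Scan v_1, v_2, ... (the elements of S \\ I) and adjoin each one
    that is not already in the span of what has been kept so far."""
    B = list(I)
    for v in rest:
        # rank increases exactly when v is outside the current span,
        # so adjoining it keeps B linearly independent (observation (2))
        if np.linalg.matrix_rank(np.column_stack(B + [v])) == len(B) + 1:
            B.append(v)
    return B

e1, e2, e3 = np.eye(3)
B = basis_between(I=[e1 + e2], rest=[e1, e2, e3])
print(len(B))  # 3
```

Here $e_2$ is skipped because it already lies in the span of $e_1 + e_2$ and $e_1$, so the result is a basis of $R^3$ containing the given independent vector.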