TRACE AND NORM

KEITH CONRAD


1. Introduction

Let $L/K$ be a finite extension of fields, with $n = [L : K]$. We will associate to this extension two important functions $L \to K$, called the trace and the norm. They are related to the trace and determinant of matrices and have many important applications in the study of fields (and rings). Among elementary applications, the trace can be used to show certain numbers are not in certain fields and the norm can be used to show some number in $L$ is not a perfect power in $L$. The math behind the definitions of the trace and norm also leads to a systematic way of finding the minimal polynomial over $K$ of any element of $L$.

2. Basic Definitions

Our starting point is the process by which an abstract linear transformation $\varphi \colon V \to V$ from an $n$-dimensional $K$-vector space $V$ to itself is turned into a matrix: for a basis $\{e_1, \dots, e_n\}$ of $V$, write $\varphi(e_j) = \sum_{i=1}^n a_{ij}e_i$, with $a_{ij} \in K$. (Watch the indices! It is not $\varphi(e_i) = \sum_{j=1}^n a_{ij}e_j$. The point is that $a_{ij}$ appears in the formula for $\varphi(e_j)$, not $\varphi(e_i)$.) We declare the matrix of $\varphi$ with respect to that basis, also called the matrix representation of $\varphi$, to be $[\varphi] := (a_{ij})$. This definition is motivated by the way entries of a square matrix $A = (a_{ij})$ can be recovered from the effect of $A$ on the standard basis $e_1, \dots, e_n$ of $K^n$: $Ae_j = \sum_{i=1}^n a_{ij}e_i$ (the $j$th column of $A$).

Example 2.1. Let $\varphi \colon \mathbf{C} \to \mathbf{C}$ be complex conjugation: $\varphi(z) = \overline{z}$. This is $\mathbf{R}$-linear: $\overline{z + z'} = \overline{z} + \overline{z'}$ and $\overline{cz} = c\overline{z}$ for $c \in \mathbf{R}$. Using the basis $\{1, i\}$, we compute the complex conjugate of each number in the basis and write the answer in terms of the basis:
\[ \varphi(1) = 1 = 1 \cdot 1 + 0 \cdot i, \qquad \varphi(i) = -i = 0 \cdot 1 + (-1) \cdot i. \]
From the coefficients in the first equation, the first column of $[\varphi]$ is $\binom{1}{0}$, and from the coefficients in the second equation, the second column is $\binom{0}{-1}$, so $[\varphi]$ is
\[ \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}. \]
If instead we use the basis $\{1 + i, 1 + 2i\}$, then
\[ \varphi(1 + i) = 1 - i = 3 \cdot (1 + i) - 2 \cdot (1 + 2i), \qquad \varphi(1 + 2i) = 1 - 2i = 4 \cdot (1 + i) + (-3) \cdot (1 + 2i). \]
Now the first column of $[\varphi]$ is $\binom{3}{-2}$ and the second column is $\binom{4}{-3}$, so $[\varphi]$ is
\[ \begin{pmatrix} 3 & 4 \\ -2 & -3 \end{pmatrix}. \]
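
The two matrices in Example 2.1 can be recomputed mechanically. The following is a minimal Python sketch, not part of the original notes: complex numbers are stored as pairs (re, im), and the helper names `coords`, `conjugate`, and `matrix_of` are hypothetical names chosen for illustration. It solves for the coordinates of each $\varphi(e_j)$ in the chosen basis and places them in column $j$.

```python
from fractions import Fraction

def coords(z, basis):
    """Coordinates (x, y) with z = x*e1 + y*e2; complex numbers are (re, im) pairs."""
    (p, q), (r, s) = basis
    det = p * s - q * r                      # nonzero exactly when e1, e2 form an R-basis
    x = Fraction(z[0] * s - z[1] * r, det)   # Cramer's rule for the 2x2 system
    y = Fraction(p * z[1] - q * z[0], det)
    return (x, y)

def conjugate(z):
    """Complex conjugation on (re, im) pairs."""
    return (z[0], -z[1])

def matrix_of(phi, basis):
    """Matrix [phi]: column j holds the coordinates of phi(e_j)."""
    cols = [coords(phi(e), basis) for e in basis]
    return [[cols[j][i] for j in range(2)] for i in range(2)]

standard = matrix_of(conjugate, [(1, 0), (0, 1)])   # basis {1, i}
skewed = matrix_of(conjugate, [(1, 1), (1, 2)])     # basis {1 + i, 1 + 2i}
```

Here `standard` and `skewed` are the two matrices of the example, stored as lists of rows. Both have trace 0 and determinant $-1$.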

If the basis of $V$ changes, or even the order of the terms in the basis changes, then the matrix usually changes, but it will be a conjugate of the first matrix. (Two square matrices $M$ and $N$ are called conjugate if $N = UMU^{-1}$ for an invertible matrix $U$.) Conjugate matrices have the same trace (trace = sum of main diagonal entries) and determinant, so we declare $\mathrm{Tr}(\varphi) = \mathrm{Tr}([\varphi])$ and $\det(\varphi) = \det([\varphi])$, using any matrix representation of $\varphi$. For instance, both matrix representations we computed for complex conjugation on $\mathbf{C}$, treated as an $\mathbf{R}$-linear map, have trace 0 and determinant $-1$.

We turn now to field extensions. For a finite extension of fields $L/K$, we associate to each element $\alpha$ of $L$ the $K$-linear transformation $m_\alpha \colon L \to L$, where $m_\alpha$ is multiplication by $\alpha$: $m_\alpha(x) = \alpha x$ for $x \in L$. Each $m_\alpha$ is a $K$-linear function from $L$ to $L$:
\[ m_\alpha(x + y) = \alpha(x + y) = \alpha x + \alpha y = m_\alpha(x) + m_\alpha(y), \qquad m_\alpha(cx) = \alpha(cx) = c(\alpha x) = c\,m_\alpha(x) \]
for $x$ and $y$ in $L$ and $c \in K$. By choosing a $K$-basis of $L$ we can create a matrix representation for $m_\alpha$, which is denoted $[m_\alpha]$. (We need to put an ordering on the basis to get a matrix, but we will often just refer to picking a basis and listing it in a definite way instead of saying "pick an ordered basis.")

Example 2.2. Let $K = \mathbf{R}$, $L = \mathbf{C}$, and use basis $\{1, i\}$. For $\alpha = a + bi$ with real $a$ and $b$, when we multiply the basis $1$ and $i$ by $\alpha$ and write the answer in terms of the basis we have
\[ \alpha \cdot 1 = a \cdot 1 + bi, \qquad \alpha \cdot i = -b \cdot 1 + ai. \]
Therefore the first column of $[m_\alpha]$ is $\binom{a}{b}$ and the second column is $\binom{-b}{a}$: $[m_\alpha]$ equals
\[ \begin{pmatrix} a & -b \\ b & a \end{pmatrix}. \]
Using the basis $\{i, 1\}$, which is the same basis listed in the opposite order, we compute
\[ \alpha \cdot i = ai + (-b) \cdot 1, \qquad \alpha \cdot 1 = bi + a \cdot 1 \]
and $[m_\alpha]$ changes to
\[ \begin{pmatrix} a & b \\ -b & a \end{pmatrix}, \]
which serves as a reminder that $[m_\alpha]$ depends on the ordering of the $K$-basis of $L$.

Example 2.3. Let $K = \mathbf{Q}$ and $L = \mathbf{Q}(\sqrt{2})$. Use the basis $\{1, \sqrt{2}\}$. For $\alpha = a + b\sqrt{2}$ with rational $a$ and $b$, we multiply it by $1$ and $\sqrt{2}$:
\[ \alpha \cdot 1 = a \cdot 1 + b\sqrt{2}, \qquad \alpha \cdot \sqrt{2} = 2b \cdot 1 + a\sqrt{2}. \]
Therefore $[m_\alpha]$ equals
\[ \begin{pmatrix} a & 2b \\ b & a \end{pmatrix}. \]

Example 2.4. Let $K = \mathbf{Q}$ and $L = \mathbf{Q}(\sqrt[3]{2})$. Using the basis $\{1, \sqrt[3]{2}, \sqrt[3]{4}\}$, the matrix representation for multiplication by $\alpha = a + b\sqrt[3]{2} + c\sqrt[3]{4}$ on $L$, where $a$, $b$, and $c$ are rational, is obtained by multiplying $\alpha$ by $1$, $\sqrt[3]{2}$, and $\sqrt[3]{4}$:
\[ \alpha \cdot 1 = a + b\sqrt[3]{2} + c\sqrt[3]{4}, \qquad \alpha \cdot \sqrt[3]{2} = 2c + a\sqrt[3]{2} + b\sqrt[3]{4}, \qquad \alpha \cdot \sqrt[3]{4} = 2b + 2c\sqrt[3]{2} + a\sqrt[3]{4}. \]
From these calculations, $[m_\alpha]$ is
\[ \begin{pmatrix} a & 2c & 2b \\ b & a & 2c \\ c & b & a \end{pmatrix}. \]

Example 2.5. Let $K = \mathbf{Q}$ and $L = \mathbf{Q}(\gamma)$ for $\gamma$ a root of $X^3 - X - 1$. Then $\gamma^3 = 1 + \gamma$. Use the basis $\{1, \gamma, \gamma^2\}$. For $\alpha = a + b\gamma + c\gamma^2$ with rational $a$, $b$, and $c$, multiply $\alpha$ by $1$, $\gamma$, and $\gamma^2$:
\[ \begin{aligned} \alpha \cdot 1 &= a + b\gamma + c\gamma^2, \\ \alpha \cdot \gamma &= a\gamma + b\gamma^2 + c\gamma^3 = c + (a + c)\gamma + b\gamma^2, \\ \alpha \cdot \gamma^2 &= c\gamma + (a + c)\gamma^2 + b\gamma^3 = b + (b + c)\gamma + (a + c)\gamma^2. \end{aligned} \]
Therefore $[m_\alpha]$ equals
\[ \begin{pmatrix} a & c & b \\ b & a + c & b + c \\ c & b & a + c \end{pmatrix}. \]

Example 2.6. Let $K = \mathbf{Q}$ and $L = \mathbf{Q}(\gamma)$ for $\gamma$ a root of $X^4 - X - 1$ (which is irreducible over $\mathbf{Q}$ since it's irreducible mod 2). Use the basis $\{1, \gamma, \gamma^2, \gamma^3\}$. For $\alpha = a + b\gamma + c\gamma^2 + d\gamma^3$ with rational $a$, $b$, $c$, and $d$, verify that $[m_\alpha]$ equals
\[ \begin{pmatrix} a & d & c & b \\ b & a + d & c + d & b + c \\ c & b & a + d & c + d \\ d & c & b & a + d \end{pmatrix}. \]

Example 2.7. If $c \in K$, then with respect to any $K$-basis of $L$, $[m_c]$ is the scalar diagonal matrix $cI_n$, where $n$ is the dimension of $L$ over $K$: if $\{e_1, \dots, e_n\}$ is a $K$-basis then $m_c(e_j) = ce_j$, so $[m_c]$ has $j$th column with a $c$ in the $j$th row and 0 elsewhere.

Here, finally, are the trace and norm mappings that we want to study.

Definition 2.8. The trace and norm of $\alpha$ from $L$ to $K$ are the trace and determinant of any matrix representation for $m_\alpha$ as a $K$-linear map:
\[ \mathrm{Tr}_{L/K}(\alpha) = \mathrm{Tr}([m_\alpha]) \in K, \qquad \mathrm{N}_{L/K}(\alpha) = \det([m_\alpha]) \in K. \]

Let's use matrices in our previous examples to calculate some trace and norm formulas. By Example 2.2,
\[ \mathrm{Tr}_{\mathbf{C}/\mathbf{R}}(a + bi) = 2a, \qquad \mathrm{N}_{\mathbf{C}/\mathbf{R}}(a + bi) = a^2 + b^2. \]
By Example 2.3,
\[ \mathrm{Tr}_{\mathbf{Q}(\sqrt{2})/\mathbf{Q}}(a + b\sqrt{2}) = 2a, \qquad \mathrm{N}_{\mathbf{Q}(\sqrt{2})/\mathbf{Q}}(a + b\sqrt{2}) = a^2 - 2b^2. \]
By Example 2.4,
\[ \mathrm{Tr}_{\mathbf{Q}(\sqrt[3]{2})/\mathbf{Q}}(a + b\sqrt[3]{2} + c\sqrt[3]{4}) = 3a, \qquad \mathrm{N}_{\mathbf{Q}(\sqrt[3]{2})/\mathbf{Q}}(a + b\sqrt[3]{2} + c\sqrt[3]{4}) = a^3 + 2b^3 + 4c^3 - 6abc. \]
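
The formulas obtained so far can be spot-checked numerically. The sketch below is an illustration rather than anything from the original notes. It assumes the field is presented as $K[X]/(f)$ for a monic $f$ with the power basis $1, x, \dots, x^{n-1}$ (which covers Examples 2.3 through 2.6), and the names `mult_matrix`, `trace`, and `det` are hypothetical helpers.

```python
from fractions import Fraction

def mult_matrix(alpha, f):
    """Matrix of multiplication by alpha on K[X]/(f) in the basis 1, x, ..., x^(n-1).

    alpha: coefficients [a_0, ..., a_(n-1)] of alpha in that basis.
    f: low-order coefficients [f_0, ..., f_(n-1)] of the monic polynomial
       f(X) = X^n + f_(n-1) X^(n-1) + ... + f_0.
    """
    n = len(f)
    col = [Fraction(a) for a in alpha]
    cols = []
    for _ in range(n):
        cols.append(col)
        # Multiply by x: shift up one degree, then substitute
        # x^n = -(f_0 + f_1 x + ... + f_(n-1) x^(n-1)).
        top = col[-1]
        shifted = [Fraction(0)] + col[:-1]
        col = [s - top * fi for s, fi in zip(shifted, f)]
    # cols[j] is the j-th column; return the matrix as a list of rows.
    return [[cols[j][i] for j in range(n)] for i in range(n)]

def trace(M):
    """Sum of the main diagonal entries."""
    return sum(M[i][i] for i in range(len(M)))

def det(M):
    """Determinant by cofactor expansion along the first row (fine for small n)."""
    if len(M) == 1:
        return M[0][0]
    total = Fraction(0)
    for j, entry in enumerate(M[0]):
        minor = [row[:j] + row[j + 1:] for row in M[1:]]
        total += (-1) ** j * entry * det(minor)
    return total

# alpha = 1 + 2*cbrt(2) + 3*cbrt(4) in Q(cbrt(2)), i.e. f = X^3 - 2.
M = mult_matrix([1, 2, 3], [-2, 0, 0])
```

With $a = 1$, $b = 2$, $c = 3$, the rows of `M` are $(1, 6, 4)$, $(2, 1, 6)$, $(3, 2, 1)$, matching the matrix of Example 2.4. Its trace is $3a = 3$ and its determinant is $a^3 + 2b^3 + 4c^3 - 6abc = 89$.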

By Example 2.5,
\[ \mathrm{Tr}_{\mathbf{Q}(\gamma)/\mathbf{Q}}(a + b\gamma + c\gamma^2) = 3a + 2c \]
and
\[ \mathrm{N}_{\mathbf{Q}(\gamma)/\mathbf{Q}}(a + b\gamma + c\gamma^2) = a^3 + b^3 + c^3 - ab^2 + ac^2 - bc^2 + 2a^2c - 3abc. \]
By Example 2.6,
\[ \mathrm{Tr}_{\mathbf{Q}(\gamma)/\mathbf{Q}}(a + b\gamma + c\gamma^2 + d\gamma^3) = 4a + 3d \]
and
\[ \begin{aligned} \mathrm{N}_{\mathbf{Q}(\gamma)/\mathbf{Q}}(a + b\gamma + c\gamma^2 + d\gamma^3) ={}& a^4 - b^4 + c^4 - d^4 + 3a^3d - 2a^2c^2 + 3a^2d^2 + ab^3 + ac^3 \\ &+ ad^3 + 2b^2d^2 - bc^3 - bd^3 + cd^3 - 3a^2bc + 4ab^2c - 4a^2bd \\ &- 5abd^2 + ac^2d + 4acd^2 + 3b^2cd - 4bc^2d - 3abcd. \end{aligned} \]
By Example 2.7, for any $c \in K$ the matrix $[m_c]$ is $cI_n$, where $n = [L : K]$, so
\[ \mathrm{Tr}_{L/K}(c) = nc, \qquad \mathrm{N}_{L/K}(c) = c^n. \]
In particular, $\mathrm{Tr}_{L/K}(1) = [L : K]$ and $\mathrm{N}_{L/K}(1) = 1$.

Remark 2.9. In the literature you might see S or Sp used in place of Tr since "Spur" is the German word for trace (not because S is the first letter in the word "sum").

Remark 2.10. The word norm has another meaning in algebra besides the one above: a method of measuring the size of elements in a vector space is called a norm. For instance, when $v = (a_1, \dots, a_n)$ is in $\mathbf{R}^n$, its squared length $\|v\|^2 = v \cdot v = a_1^2 + \cdots + a_n^2$ is called the norm of $v$. This concept, suitably axiomatized, leads to normed vector spaces, which occur throughout analysis, but vector space norms have essentially nothing to do with the field norm we are using.

Remark 2.11. There is an alternate definition of $\mathrm{Tr}_{L/K}(\alpha)$ and $\mathrm{N}_{L/K}(\alpha)$ in terms of a sum and product running over the field embeddings of $L$ into an algebraic closure of $K$. See, for instance, [1, Chap. VI, Sec. 5]. That alternate definition is a bit clunky to work with when $L/K$ is not separable.

3. Initial Properties of the Trace and Norm

The most basic properties of the trace and norm follow from the way $m_\alpha$ depends on $\alpha$.

Lemma 3.1. Let $\alpha$ and $\beta$ belong to $L$.
1) If $\alpha \neq \beta$ then $m_\alpha \neq m_\beta$.
2) As functions $L \to L$, $m_{\alpha+\beta} = m_\alpha + m_\beta$ and $m_{\alpha\beta} = m_\alpha \circ m_\beta$, and $m_1$ is the identity map $L \to L$.

Concretely, this says the matrices in the previous examples are embeddings of $L$ into the $n \times n$ matrices over $K$, where $n = [L : K]$.
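For instance, from Example 2.2 the $2 \times 2$ real matrices of the special form $\begin{pmatrix} a & -b \\ b & a \end{pmatrix}$ add and multiply in the same way as complex numbers $a + bi$ add and multiply. Compare multiplication:
\[ (a + bi)(c + di) = (ac - bd) + (ad + bc)i, \qquad \begin{pmatrix} a & -b \\ b & a \end{pmatrix} \begin{pmatrix} c & -d \\ d & c \end{pmatrix} = \begin{pmatrix} ac - bd & -(ad + bc) \\ ad + bc & ac - bd \end{pmatrix}. \]

Proof. Since $m_\alpha(1) = \alpha$, we can recover the number $\alpha$ from the mapping $m_\alpha$, so $\alpha \mapsto m_\alpha$ is injective. For $\alpha$, $\beta$, and $x$ in $L$,
\[ m_{\alpha+\beta}(x) = (\alpha + \beta)x = \alpha x + \beta x = m_\alpha(x) + m_\beta(x) = (m_\alpha + m_\beta)(x) \]
and
\[ (m_\alpha \circ m_\beta)(x) = m_\alpha(\beta x) = \alpha(\beta x) = (\alpha\beta)x = m_{\alpha\beta}(x), \]
so $m_{\alpha+\beta} = m_\alpha + m_\beta$ and $m_{\alpha\beta} = m_\alpha \circ m_\beta$. Easily $m_1$ is the identity map on $L$: $m_1(x) = 1 \cdot x = x$ for all $x \in L$.

Theorem 3.2. The trace $\mathrm{Tr}_{L/K} \colon L \to K$ is $K$-linear and the norm $\mathrm{N}_{L/K} \colon L \to K$ is multiplicative. Moreover, $\mathrm{N}_{L/K}(L^\times) \subset K^\times$.

Proof. We have equations $m_{\alpha+\beta} = m_\alpha + m_\beta$ and $m_{\alpha\beta} = m_\alpha \circ m_\beta$. Picking a basis of $L/K$ and passing to matrix representations in these equations, $[m_{\alpha+\beta}] = [m_\alpha + m_\beta] = [m_\alpha] + [m_\beta]$ and $[m_{\alpha\beta}] = [m_\alpha \circ m_\beta] = [m_\alpha][m_\beta]$. Therefore
\[ \mathrm{Tr}_{L/K}(\alpha + \beta) = \mathrm{Tr}([m_{\alpha+\beta}]) = \mathrm{Tr}([m_\alpha] + [m_\beta]) = \mathrm{Tr}([m_\alpha]) + \mathrm{Tr}([m_\beta]) = \mathrm{Tr}_{L/K}(\alpha) + \mathrm{Tr}_{L/K}(\beta) \]
and
\[ \mathrm{N}_{L/K}(\alpha\beta) = \det([m_{\alpha\beta}]) = \det([m_\alpha][m_\beta]) = \det([m_\alpha])\det([m_\beta]) = \mathrm{N}_{L/K}(\alpha)\mathrm{N}_{L/K}(\beta). \]
So $\mathrm{Tr}_{L/K} \colon L \to K$ is additive and $\mathrm{N}_{L/K} \colon L \to K$ is multiplicative. To show $\mathrm{Tr}_{L/K}$ is $K$-linear, not just additive, for $c \in K$ and $\alpha \in L$ we have $m_{c\alpha} = c\,m_\alpha$ as mappings $L \to L$ (check that), so $[m_{c\alpha}] = [c\,m_\alpha] = c[m_\alpha]$. Therefore
\[ \mathrm{Tr}_{L/K}(c\alpha) = \mathrm{Tr}([m_{c\alpha}]) = \mathrm{Tr}(c[m_\alpha]) = c\,\mathrm{Tr}([m_\alpha]) = c\,\mathrm{Tr}_{L/K}(\alpha). \]
Since $\mathrm{N}_{L/K}(1) = \det([m_1]) = 1$, for nonzero $\alpha$ in $L$ taking norms of both sides of $\alpha \cdot (1/\alpha) = 1$ implies $\mathrm{N}_{L/K}(\alpha)\mathrm{N}_{L/K}(1/\alpha) = 1$, so $\mathrm{N}_{L/K}(\alpha) \neq 0$.

The linearity of $\mathrm{Tr}_{L/K}$ means calculating it on all elements is reduced to finding its values on a basis.

Example 3.3. Consider the extension $\mathbf{Q}(\gamma)/\mathbf{Q}$ where $\gamma^3 - \gamma - 1 = 0$. For $a, b, c \in \mathbf{Q}$, we saw before that $\mathrm{Tr}_{\mathbf{Q}(\gamma)/\mathbf{Q}}(a + b\gamma + c\gamma^2) = 3a + 2c$ by using the matrix for multiplication by $a + b\gamma + c\gamma^2$ from Example 2.5. Because the trace is $\mathbf{Q}$-linear, we can calculate this trace in another way:
\[ \mathrm{Tr}(a + b\gamma + c\gamma^2) = a\,\mathrm{Tr}(1) + b\,\mathrm{Tr}(\gamma) + c\,\mathrm{Tr}(\gamma^2), \]
where $\mathrm{Tr} = \mathrm{Tr}_{\mathbf{Q}(\gamma)/\mathbf{Q}}$. Therefore finding the trace down to $\mathbf{Q}$ of a general element of $\mathbf{Q}(\gamma)$ is reduced to finding the traces of just three numbers: $1$, $\gamma$, and $\gamma^2$.

First, $\mathrm{Tr}(1) = [\mathbf{Q}(\gamma) : \mathbf{Q}] = 3$. To compute $\mathrm{Tr}(\gamma)$ we will compute $[m_\gamma]$ using the basis $\{1, \gamma, \gamma^2\}$: $\gamma \cdot 1 = \gamma$, $\gamma \cdot \gamma = \gamma^2$, and $\gamma \cdot \gamma^2 = \gamma^3 = 1 + \gamma$, so
\[ [m_\gamma] = \begin{pmatrix} 0 & 0 & 1 \\ 1 & 0 & 1 \\ 0 & 1 & 0 \end{pmatrix}. \]
Therefore $\mathrm{Tr}(\gamma) = 0$.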
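To compute $\mathrm{Tr}(\gamma^2)$ we want to find the matrix $[m_{\gamma^2}]$, which can be found either directly by multiplying $\gamma^2$ on the basis $\{1, \gamma, \gamma^2\}$ or by squaring the matrix $[m_\gamma]$ just above, since $[m_{\gamma^2}] = [m_\gamma \circ m_\gamma] = [m_\gamma]^2$. Either way,
\[ [m_{\gamma^2}] = \begin{pmatrix} 0 & 1 & 0 \\ 0 & 1 & 1 \\ 1 & 0 & 1 \end{pmatrix}, \tag{3.1} \]
so $\mathrm{Tr}(\gamma^2) = 2$.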

Therefore
\[ \mathrm{Tr}(a + b\gamma + c\gamma^2) = a\,\mathrm{Tr}(1) + b\,\mathrm{Tr}(\gamma) + c\,\mathrm{Tr}(\gamma^2) = 3a + 2c, \]
which agrees with our calculation of this trace earlier.

4. Applications of the Trace and Norm

Here are two negative results about $\mathbf{Q}(\sqrt[3]{2})$ that will be settled with the trace and norm.

Task 1: Show $\sqrt[3]{3} \notin \mathbf{Q}(\sqrt[3]{2})$.

Task 2: Show $1 + \sqrt[3]{2}$ is not a perfect square in $\mathbf{Q}(\sqrt[3]{2})$.

Solution to Task 1: We argue by contradiction: assume $\sqrt[3]{3} \in \mathbf{Q}(\sqrt[3]{2})$, so
\[ \sqrt[3]{3} = a + b\sqrt[3]{2} + c\sqrt[3]{4} \tag{4.1} \]
for some rational $a$, $b$, and $c$. Our goal is to show such an equation is impossible. The awful way to do this is to cube both sides to get
\[ 3 = (a + b\sqrt[3]{2} + c\sqrt[3]{4})^3 = (a^3 + 2b^3 + 4c^3 + 12abc) + (3a^2b + 6ac^2 + 6b^2c)\sqrt[3]{2} + (3a^2c + 3ab^2 + 6bc^2)\sqrt[3]{4} \]
and then set the rational term on the right to be 3 and the coefficients of $\sqrt[3]{2}$ and $\sqrt[3]{4}$ to be 0. This is a system of 3 cubic equations in $a$, $b$, and $c$, and we want to show it has no rational solution. What a mess.

A better approach is to tackle the problem linearly using traces. From the assumption that $\sqrt[3]{3}$ is in the cubic field $\mathbf{Q}(\sqrt[3]{2})$ we have $\mathbf{Q}(\sqrt[3]{2}) = \mathbf{Q}(\sqrt[3]{3})$. Call this common cubic field $K$. The two primitive elements $\sqrt[3]{2}$ and $\sqrt[3]{3}$ for $K/\mathbf{Q}$ suggest different bases over $\mathbf{Q}$: $\{1, \sqrt[3]{2}, \sqrt[3]{4}\}$ and $\{1, \sqrt[3]{3}, \sqrt[3]{9}\}$. Using the first basis, the matrix for multiplication by $\sqrt[3]{2}$ is
\[ \begin{pmatrix} 0 & 0 & 2 \\ 1 & 0 & 0 \\ 0 & 1 & 0 \end{pmatrix}, \]
so $\mathrm{Tr}_{K/\mathbf{Q}}(\sqrt[3]{2}) = 0$. In a similar way we get $\mathrm{Tr}_{K/\mathbf{Q}}(\sqrt[3]{3}) = 0$ using the second basis. The matrix for multiplication by $\sqrt[3]{4}$ with respect to the basis $\{1, \sqrt[3]{2}, \sqrt[3]{4}\}$ is
\[ \begin{pmatrix} 0 & 2 & 0 \\ 0 & 0 & 2 \\ 1 & 0 & 0 \end{pmatrix}, \]
so $\mathrm{Tr}_{K/\mathbf{Q}}(\sqrt[3]{4}) = 0$. Applying $\mathrm{Tr}_{K/\mathbf{Q}}$ to both sides of (4.1), we get
\[ 0 = a\,\mathrm{Tr}_{K/\mathbf{Q}}(1) + b\,\mathrm{Tr}_{K/\mathbf{Q}}(\sqrt[3]{2}) + c\,\mathrm{Tr}_{K/\mathbf{Q}}(\sqrt[3]{4}) = 3a, \]
so $a = 0$. Now multiply both sides of (4.1) by $\sqrt[3]{2}$:
\[ \sqrt[3]{6} = a\sqrt[3]{2} + b\sqrt[3]{4} + 2c. \tag{4.2} \]
The number $\sqrt[3]{6}$ has degree 3 over $\mathbf{Q}$, so it has to be a primitive element of $K$, and using the basis $\{1, \sqrt[3]{6}, \sqrt[3]{36}\}$ for $K/\mathbf{Q}$ gives us $\mathrm{Tr}_{K/\mathbf{Q}}(\sqrt[3]{6}) = 0$. Therefore if we apply $\mathrm{Tr}_{K/\mathbf{Q}}$ to both sides of (4.2) we get $0 = 6c$, so $c = 0$.
Returning to (4.1), it now reads $\sqrt[3]{3} = b\sqrt[3]{2}$, so $3/2 = b^3$, which clearly has no rational solution. We have a contradiction, so $\sqrt[3]{3} \notin \mathbf{Q}(\sqrt[3]{2})$.
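
Both ingredients of this solution, the cube expansion in the "awful way" and the vanishing traces, can be checked by exact arithmetic on coordinate triples. The sketch below is an illustration only. It represents $a_0 + a_1\sqrt[3]{2} + a_2\sqrt[3]{4}$ as the tuple `(a0, a1, a2)`, and the function names `mul`, `trace`, and `cube_expansion` are hypothetical.

```python
def mul(x, y):
    """Multiply a0 + a1*t + a2*t^2 by b0 + b1*t + b2*t^2, where t^3 = 2."""
    a0, a1, a2 = x
    b0, b1, b2 = y
    return (a0 * b0 + 2 * (a1 * b2 + a2 * b1),   # t * t^2 = 2
            a0 * b1 + a1 * b0 + 2 * a2 * b2,     # t^2 * t^2 = 2t
            a0 * b2 + a1 * b1 + a2 * b0)

def trace(x):
    """Trace of multiplication by x: the sum of the diagonal entries of [m_x]."""
    basis = [(1, 0, 0), (0, 1, 0), (0, 0, 1)]
    # Column j of [m_x] is the coordinate vector of x * e_j.
    return sum(mul(x, e)[j] for j, e in enumerate(basis))

def cube_expansion(a, b, c):
    """Coefficients of (a + b*t + c*t^2)^3 as displayed in the text."""
    return (a**3 + 2 * b**3 + 4 * c**3 + 12 * a * b * c,
            3 * a**2 * b + 6 * a * c**2 + 6 * b**2 * c,
            3 * a**2 * c + 3 * a * b**2 + 6 * b * c**2)

# Compare the displayed expansion with direct multiplication on a small grid.
grid = range(-3, 4)
expansion_ok = all(
    mul((a, b, c), mul((a, b, c), (a, b, c))) == cube_expansion(a, b, c)
    for a in grid for b in grid for c in grid
)
```

The flag `expansion_ok` comes out `True`, and `trace((a, b, c))` returns `3 * a`, so the traces of $\sqrt[3]{2}$ and $\sqrt[3]{4}$ vanish, as used above. This only confirms the bookkeeping. The impossibility argument itself is the trace computation in the text.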

Solution to Task 2: If $1 + \sqrt[3]{2} = \alpha^2$ for $\alpha$ in $K = \mathbf{Q}(\sqrt[3]{2})$, then $\mathrm{N}_{K/\mathbf{Q}}(1 + \sqrt[3]{2}) = (\mathrm{N}_{K/\mathbf{Q}}(\alpha))^2$ in $\mathbf{Q}$. To compute $\mathrm{N}_{K/\mathbf{Q}}(1 + \sqrt[3]{2})$, use the basis $\{1, \sqrt[3]{2}, \sqrt[3]{4}\}$ for $K/\mathbf{Q}$: the matrix for multiplication by $1 + \sqrt[3]{2}$ is
\[ \begin{pmatrix} 1 & 0 & 2 \\ 1 & 1 & 0 \\ 0 & 1 & 1 \end{pmatrix}, \]
whose determinant is 3, so $\mathrm{N}_{K/\mathbf{Q}}(1 + \sqrt[3]{2}) = 3$. Therefore $3 = (\mathrm{N}_{K/\mathbf{Q}}(\alpha))^2$ in $\mathbf{Q}$, but this is impossible because $\mathrm{N}_{K/\mathbf{Q}}(\alpha)$ would be a rational square root of 3.

This idea suffices to prove a number in a field is not a square (or other power) by showing its norm down to a subfield we know better is not a square there. However, if the norm turns out to be a square in the smaller field, that does not usually mean the original number is a square in the larger field.

Another application of the trace and norm is an algebraic analogue of volume for any basis of a field extension. In $\mathbf{R}^n$, if $v_1, \dots, v_n$ is a basis then the parallelepiped spanned by the $v_i$'s is the set $\{\sum_{i=1}^n c_iv_i : 0 \leq c_i \leq 1\}$ and its volume in $\mathbf{R}^n$ is $\sqrt{|\det(v_i \cdot v_j)|}$, so its squared volume is $|\det(v_i \cdot v_j)|$. The trace product $\mathrm{Tr}_{L/K}(\alpha\beta)$ for $\alpha$ and $\beta$ in $L$ is the right analogue of the dot product $v \cdot w$ for $v$ and $w$ in $\mathbf{R}^n$, so if we drop the absolute value (which makes no sense in general fields) and replace the dot product with the trace product, then we are led to consider the determinant $\det(\mathrm{Tr}_{L/K}(e_ie_j)) \in K$ for any basis $e_1, \dots, e_n$ of $L/K$ to be something like a squared volume. This is called the discriminant of the basis and is denoted $\operatorname{disc}_{L/K}(e_1, \dots, e_n)$. The next example explains the name.

Example 4.1. Let $[L : K] = 2$ and pick $\alpha \in L - K$, so $L = K(\alpha)$. Let the minimal polynomial of $\alpha$ over $K$ be $X^2 + bX + c$. Then $\mathrm{Tr}_{L/K}(1) = 2$, $\mathrm{Tr}_{L/K}(\alpha) = -b$, and $\mathrm{Tr}_{L/K}(\alpha^2) = \mathrm{Tr}_{L/K}(-b\alpha - c) = b^2 - 2c$. Therefore $\operatorname{disc}_{L/K}(1, \alpha)$ equals
\[ \det\begin{pmatrix} \mathrm{Tr}_{L/K}(1 \cdot 1) & \mathrm{Tr}_{L/K}(1 \cdot \alpha) \\ \mathrm{Tr}_{L/K}(\alpha \cdot 1) & \mathrm{Tr}_{L/K}(\alpha \cdot \alpha) \end{pmatrix} = \det\begin{pmatrix} 2 & -b \\ -b & b^2 - 2c \end{pmatrix} = 2b^2 - 4c - b^2 = b^2 - 4c, \]
which is the familiar discriminant of $X^2 + bX + c$.
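
The discriminant of a power basis can be computed directly from its definition as a determinant of traces. The following Python sketch is an illustration under stated assumptions: the field is presented as $K[X]/(f)$ with $f$ monic and irreducible, `f` lists the low-order coefficients $f_0, \dots, f_{n-1}$, and all function names are hypothetical helpers.

```python
from fractions import Fraction

def times_x(v, f):
    """Multiply v (coordinates in 1, x, ..., x^(n-1)) by x modulo the monic f."""
    top = v[-1]
    shifted = [Fraction(0)] + v[:-1]
    return [s - top * fi for s, fi in zip(shifted, f)]

def trace_of_power(k, f):
    """Trace of multiplication by x^k on K[X]/(f)."""
    n = len(f)
    total = Fraction(0)
    for j in range(n):
        v = [Fraction(int(i == j)) for i in range(n)]   # basis vector x^j
        for _ in range(k):
            v = times_x(v, f)
        total += v[j]                                   # diagonal entry (j, j)
    return total

def det(M):
    """Determinant by cofactor expansion along the first row (fine for small n)."""
    if len(M) == 1:
        return M[0][0]
    total = Fraction(0)
    for j, entry in enumerate(M[0]):
        minor = [row[:j] + row[j + 1:] for row in M[1:]]
        total += (-1) ** j * entry * det(minor)
    return total

def power_basis_discriminant(f):
    """det(Tr(x^i * x^j)) for the basis 1, x, ..., x^(n-1)."""
    n = len(f)
    gram = [[trace_of_power(i + j, f) for j in range(n)] for i in range(n)]
    return det(gram)

quadratic = power_basis_discriminant([5, 3])     # X^2 + 3X + 5
cubic = power_basis_discriminant([-1, -1, 0])    # X^3 - X - 1
```

For $X^2 + 3X + 5$ this gives $3^2 - 4 \cdot 5 = -11$, in line with Example 4.1. For $X^3 - X - 1$ the trace matrix has rows $(3, 0, 2)$, $(0, 2, 3)$, $(2, 3, 2)$ and determinant $-23$.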
When $L = K(\alpha)$, there are two formulas for the discriminant of the basis $\{1, \alpha, \dots, \alpha^{n-1}\}$ of $K(\alpha)/K$ in terms of the minimal polynomial $f(X)$ for $\alpha$ over $K$, which we state here without proof. (Proofs are given in the second handout on trace and norm.)

If $f(X) = (X - \alpha_1) \cdots (X - \alpha_n)$ over a splitting field, then
\[ \operatorname{disc}_{K(\alpha)/K}(1, \alpha, \dots, \alpha^{n-1}) = \prod_{i<j} (\alpha_j - \alpha_i)^2. \]
The discriminant is the norm of a derivative, up to a sign:
\[ \operatorname{disc}_{K(\alpha)/K}(1, \alpha, \dots, \alpha^{n-1}) = (-1)^{n(n-1)/2}\,\mathrm{N}_{K(\alpha)/K}(f'(\alpha)). \]

Example 4.2. In the notation of Example 4.1, $f(X) = X^2 + bX + c$. Factoring $f(X)$ over a splitting field as $(X - \alpha_1)(X - \alpha_2)$, we have $\alpha_1 + \alpha_2 = -b$ and $\alpha_1\alpha_2 = c$ by equating coefficients, so
\[ (\alpha_2 - \alpha_1)^2 = (\alpha_1 + \alpha_2)^2 - 4\alpha_1\alpha_2 = b^2 - 4c. \]

Alternatively, $(-1)^{n(n-1)/2}\,\mathrm{N}_{K(\alpha)/K}(f'(\alpha)) = -\mathrm{N}_{K(\alpha)/K}(2\alpha + b)$. The matrix for multiplication by $2\alpha + b$ in the basis $\{1, \alpha\}$ is $\begin{pmatrix} b & -2c \\ 2 & -b \end{pmatrix}$, whose determinant is $-b^2 + 4c$, so $-\mathrm{N}_{K(\alpha)/K}(2\alpha + b) = b^2 - 4c$.

5. The Trace and Norm in terms of Roots

The numbers $\mathrm{Tr}_{L/K}(\alpha)$ and $\mathrm{N}_{L/K}(\alpha)$ can be expressed in terms of the coefficients or the roots of the minimal polynomial of $\alpha$ over $K$. To explain this we need another polynomial in $K[X]$ related to $\alpha$ besides its minimal polynomial over $K$, inspired by more linear algebra.

Definition 5.1. For $\alpha \in L$, its characteristic polynomial relative to the extension $L/K$ is the characteristic polynomial of a matrix representation $[m_\alpha]$:
\[ \chi_{\alpha,L/K}(X) = \det(X \cdot I_n - [m_\alpha]) \in K[X], \]
where $n = [L : K]$.

Note that we are defining the characteristic polynomial of a matrix $A$ to be $\det(XI_n - A)$, not $\det(A - XI_n)$. This is because we like our polynomials to be monic; if we used the second way then the leading coefficient would be $(-1)^n$.

Example 5.2. For the extension $\mathbf{C}/\mathbf{R}$, the characteristic polynomial of the matrix in Example 2.2 is
\[ \chi_{a+bi,\mathbf{C}/\mathbf{R}}(X) = X^2 - 2aX + a^2 + b^2. \]

Example 5.3. For the extension $\mathbf{Q}(\sqrt{2})/\mathbf{Q}$, the characteristic polynomial of the matrix in Example 2.3 is
\[ \chi_{a+b\sqrt{2},\mathbf{Q}(\sqrt{2})/\mathbf{Q}}(X) = X^2 - 2aX + a^2 - 2b^2. \]

Example 5.4. For the extension $\mathbf{Q}(\sqrt[3]{2})/\mathbf{Q}$, from Example 2.4
\[ \chi_{a+b\sqrt[3]{2}+c\sqrt[3]{4},\mathbf{Q}(\sqrt[3]{2})/\mathbf{Q}}(X) = X^3 - 3aX^2 + (3a^2 - 6bc)X - (a^3 + 2b^3 + 4c^3 - 6abc). \]

Example 5.5. For $c \in K$, $m_c \colon L \to L$ has matrix representation $cI_n$, so
\[ \chi_{c,L/K}(X) = \det(XI_n - cI_n) = (X - c)^n = X^n - ncX^{n-1} + \cdots + (-1)^nc^n. \]

For any $n \times n$ square matrix $A$, its trace and determinant appear up to sign as coefficients in its characteristic polynomial:
\[ \det(XI_n - A) = X^n - \mathrm{Tr}(A)X^{n-1} + \cdots + (-1)^n\det A, \]
so
\[ \chi_{\alpha,L/K}(X) = X^n - \mathrm{Tr}_{L/K}(\alpha)X^{n-1} + \cdots + (-1)^n\,\mathrm{N}_{L/K}(\alpha). \tag{5.1} \]
This tells us the trace and norm of $\alpha$ are, up to sign, coefficients of the characteristic polynomial of $\alpha$, which can be seen in the above examples.
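
Formula (5.1) can be checked on a concrete matrix by computing all coefficients of $\det(XI_n - A)$ at once. The sketch below uses the Faddeev-LeVerrier recursion, which is a substitute technique chosen here for convenience and not a method used in the notes; the names `matmul` and `charpoly` are hypothetical.

```python
from fractions import Fraction

def matmul(A, B):
    """Product of two square matrices given as lists of rows."""
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def charpoly(A):
    """Coefficients [1, c_(n-1), ..., c_0] of det(X*I - A), highest degree first.

    Faddeev-LeVerrier recursion: M_1 = I, c_(n-k) = -tr(A*M_k)/k,
    M_(k+1) = A*M_k + c_(n-k)*I.
    """
    n = len(A)
    A = [[Fraction(x) for x in row] for row in A]
    M = [[Fraction(0)] * n for _ in range(n)]   # M_0 = 0, so that M_1 = I below
    coeffs = [Fraction(1)]
    for k in range(1, n + 1):
        AM = matmul(A, M)
        M = [[AM[i][j] + (coeffs[-1] if i == j else 0) for j in range(n)]
             for i in range(n)]
        AMk = matmul(A, M)
        coeffs.append(-sum(AMk[i][i] for i in range(n)) / k)
    return coeffs

# [m_alpha] from Example 2.4 with a = 1, b = 2, c = 3.
chi = charpoly([[1, 6, 4], [2, 1, 6], [3, 2, 1]])
```

Here `chi` is `[1, -3, -33, -89]`, that is $X^3 - 3X^2 - 33X - 89$, which agrees with Example 5.4 at $a = 1$, $b = 2$, $c = 3$: the $X^2$ coefficient is minus the trace and the constant term is $(-1)^3$ times the norm. The division by $k$ means this recursion is only suitable in characteristic 0 (or characteristic larger than $n$).
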
Unlike the minimal polynomial of $\alpha$ over $K$, whose degree $[K(\alpha) : K]$ varies with $\alpha$, the degree of $\chi_{\alpha,L/K}(X)$ is always $n$, which is independent of the choice of $\alpha$ in $L$.

Theorem 5.6. Every $\alpha$ in $L$ is a root of its characteristic polynomial $\chi_{\alpha,L/K}(X)$.

Proof. This will be a consequence of the Cayley-Hamilton theorem in linear algebra, which says any linear operator is killed by its characteristic polynomial: if a square matrix $A$ has characteristic polynomial $\chi(X) = \det(XI_n - A) \in K[X]$, then $\chi(A) = O$.

For any polynomial $f(X) = X^n + c_{n-1}X^{n-1} + \cdots + c_1X + c_0$ in $K[X]$,
\[ \begin{aligned} f([m_\alpha]) &= [m_\alpha]^n + c_{n-1}[m_\alpha]^{n-1} + \cdots + c_1[m_\alpha] + c_0I_n \\ &= [m_{\alpha^n}] + c_{n-1}[m_{\alpha^{n-1}}] + \cdots + c_1[m_\alpha] + c_0[m_1] && \text{since } [m_x][m_y] = [m_{xy}] \\ &= [m_{\alpha^n}] + [m_{c_{n-1}\alpha^{n-1}}] + \cdots + [m_{c_1\alpha}] + [m_{c_0}] && \text{since } c[m_x] = [m_{cx}] \\ &= [m_{\alpha^n} + m_{c_{n-1}\alpha^{n-1}} + \cdots + m_{c_1\alpha} + m_{c_0}] && \text{since } [m_x] + [m_y] = [m_x + m_y] \\ &= [m_{\alpha^n + c_{n-1}\alpha^{n-1} + \cdots + c_1\alpha + c_0}] && \text{since } m_x + m_y = m_{x+y} \\ &= [m_{f(\alpha)}]. \end{aligned} \]
In words, the value of $f(X)$ at a matrix for multiplication by $\alpha$ on $L$ is a matrix for multiplication by $f(\alpha)$ on $L$. Taking for $f(X)$ the polynomial $\chi_{\alpha,L/K}(X)$, we have $\chi_{\alpha,L/K}([m_\alpha]) = O$ by Cayley-Hamilton, so $[m_{\chi_{\alpha,L/K}(\alpha)}] = O$. The only $\beta$ in $L$ such that $[m_\beta] = O$ is 0, so $\chi_{\alpha,L/K}(\alpha) = 0$.

Example 5.7. The complex number $a + bi$ is a root of $\chi_{a+bi,\mathbf{C}/\mathbf{R}}(X) = X^2 - 2aX + a^2 + b^2$, which can be seen by direct substitution of $a + bi$ into this polynomial.

Example 5.8. Let $\gamma^3 - \gamma - 1 = 0$ over $\mathbf{Q}$. Since $[\mathbf{Q}(\gamma) : \mathbf{Q}] = 3$ and $\gamma^2$ is irrational (otherwise $\gamma$ can't have degree 3 over $\mathbf{Q}$), $\mathbf{Q}(\gamma^2) = \mathbf{Q}(\gamma)$, so $\gamma^2$ has a minimal polynomial of degree 3 over $\mathbf{Q}$. What is that minimal polynomial? A tedious way to find it is to solve
\[ (\gamma^2)^3 + a(\gamma^2)^2 + b\gamma^2 + c = 0 \]
for unknown rational coefficients $a$, $b$, $c$ by expressing $\gamma^6$ and $\gamma^4$ as $\mathbf{Q}$-linear combinations of $1$, $\gamma$, $\gamma^2$ to get a system of 3 linear equations in the three unknowns $a$, $b$, $c$, and solve for $a$, $b$, $c$. Ugh. A more direct method is to calculate the characteristic polynomial of $[m_{\gamma^2}]$ in (3.1), which is $X^3 - 2X^2 + X - 1$ (check!). This has $\gamma^2$ as a root by Theorem 5.6 and it has the right degree, so $X^3 - 2X^2 + X - 1$ is the minimal polynomial of $\gamma^2$ over $\mathbf{Q}$. Isn't that nice?

If $L = K(\alpha)$, then $\chi_{\alpha,L/K}(X)$ is the minimal polynomial of $\alpha$ over $K$ since this polynomial is monic in $K[X]$, has $\alpha$ as a root, and its degree is $[L : K] = [K(\alpha) : K]$.
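
The identity $f([m_\alpha]) = [m_{f(\alpha)}]$ from the proof of Theorem 5.6, and the "(check!)" in Example 5.8, can be illustrated by evaluating polynomials at the matrices $[m_\gamma]$ and $[m_{\gamma^2}]$. The following is a small Python sketch; the helper names `matmul` and `poly_at_matrix` are hypothetical.

```python
def matmul(A, B):
    """Product of two square matrices given as lists of rows."""
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def poly_at_matrix(coeffs, A):
    """Evaluate c_d X^d + ... + c_0 at the square matrix A by Horner's rule.

    coeffs lists c_d, ..., c_0 (highest degree first).
    """
    n = len(A)
    result = [[0] * n for _ in range(n)]
    for c in coeffs:
        result = matmul(result, A)     # multiply the running value by A
        for i in range(n):
            result[i][i] += c          # then add c times the identity
    return result

m_gamma = [[0, 0, 1], [1, 0, 1], [0, 1, 0]]   # [m_gamma] in the basis 1, gamma, gamma^2
m_gamma_sq = matmul(m_gamma, m_gamma)         # [m_gamma]^2 = [m_(gamma^2)]
zero = [[0] * 3 for _ in range(3)]

kills_gamma = poly_at_matrix([1, 0, -1, -1], m_gamma) == zero         # X^3 - X - 1
kills_gamma_sq = poly_at_matrix([1, -2, 1, -1], m_gamma_sq) == zero   # X^3 - 2X^2 + X - 1
```

Both flags come out `True`, and `m_gamma_sq` reproduces the matrix in (3.1). Since a matrix $[m_\beta]$ is zero only for $\beta = 0$, the second check says $\gamma^2$ is a root of $X^3 - 2X^2 + X - 1$.
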
If $\alpha$ doesn't generate $L$ over $K$ then $\chi_{\alpha,L/K}(X)$ is not the minimal polynomial of $\alpha$ in $K[X]$ since the degree of $\chi_{\alpha,L/K}(X)$ is $n = [L : K]$, which is too big. However, the characteristic polynomial tells us the minimal polynomial in principle, since it is a power of the minimal polynomial.

Theorem 5.9. For $\alpha \in L$, let $\pi_{\alpha,K}(X)$ be the minimal polynomial of $\alpha$ in $K[X]$. If $L = K(\alpha)$ then $\chi_{\alpha,L/K}(X) = \pi_{\alpha,K}(X)$. More generally, $\chi_{\alpha,L/K}(X) = \pi_{\alpha,K}(X)^{n/d}$ where $d = \deg \pi_{\alpha,K}(X) = [K(\alpha) : K]$.

In words, $\chi_{\alpha,L/K}(X)$ is the power of the minimal polynomial of $\alpha$ over $K$ that has degree $n$. For example, if $c \in K$ its minimal polynomial in $K[X]$ is $X - c$ while its characteristic polynomial for $L/K$ is $(X - c)^n$.

Proof. A $K$-basis of $K(\alpha)$ is $\{1, \alpha, \dots, \alpha^{d-1}\}$. Let $m = [L : K(\alpha)]$ and $\beta_1, \dots, \beta_m$ be a $K(\alpha)$-basis of $L$. Then a $K$-basis of $L$ is
\[ \{\beta_1, \alpha\beta_1, \dots, \alpha^{d-1}\beta_1, \dots, \beta_m, \alpha\beta_m, \dots, \alpha^{d-1}\beta_m\}. \]
Let $\alpha \cdot \alpha^j = \sum_{i=0}^{d-1} c_{ij}\alpha^i$ for $0 \leq j \leq d - 1$, where $c_{ij} \in K$. The matrix for multiplication by $\alpha$ on $K(\alpha)$ with respect to $\{1, \alpha, \dots, \alpha^{d-1}\}$ is $(c_{ij})$, so $\chi_{\alpha,K(\alpha)/K}(X) = \det(X \cdot I_d - (c_{ij}))$. This polynomial has $\alpha$ as a root by Theorem 5.6 (using $K(\alpha)$ in place of $L$) and it has degree $d$.

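Therefore $\det(X \cdot I_d - (c_{ij})) = \pi_{\alpha,K}(X)$ because $\pi_{\alpha,K}(X)$ is the only monic polynomial in $K[X]$ of degree $d = [K(\alpha) : K]$ with $\alpha$ as a root. Since $\alpha \cdot \alpha^j\beta_k = \sum_{i=0}^{d-1} c_{ij}\alpha^i\beta_k$, with respect to the above $K$-basis of $L$ the matrix for $m_\alpha$ is a block diagonal matrix with $m$ repeated $d \times d$ diagonal blocks $(c_{ij})$, so
\[ \chi_{\alpha,L/K}(X) = \det(X \cdot I_d - (c_{ij}))^m = \pi_{\alpha,K}(X)^m = \pi_{\alpha,K}(X)^{n/d}. \]
If $L = K(\alpha)$ then $n/d = 1$, so the characteristic and minimal polynomials of $\alpha$ coincide.

Corollary 5.10. Let the minimal polynomial for $\alpha$ over $K$ be $X^d + c_{d-1}X^{d-1} + \cdots + c_1X + c_0$. Then
\[ \mathrm{Tr}_{K(\alpha)/K}(\alpha) = -c_{d-1}, \qquad \mathrm{N}_{K(\alpha)/K}(\alpha) = (-1)^dc_0, \]
and more generally
\[ \mathrm{Tr}_{L/K}(\alpha) = -\frac{n}{d}c_{d-1}, \qquad \mathrm{N}_{L/K}(\alpha) = (-1)^nc_0^{n/d}. \]

Proof. We will prove the general formulas at the end of the statement. The special case $L = K(\alpha)$ then arises from setting $n = d$. Write $\pi_{\alpha,K}(X)$ for the minimal polynomial of $\alpha$ over $K$. By Theorem 5.9,
\[ \chi_{\alpha,L/K}(X) = \pi_{\alpha,K}(X)^{n/d} = (X^d + c_{d-1}X^{d-1} + \cdots + c_1X + c_0)^{n/d} = X^n + \frac{n}{d}c_{d-1}X^{n-1} + \cdots + c_0^{n/d}. \]
From this formula and (5.1), $\mathrm{Tr}_{L/K}(\alpha) = -\frac{n}{d}c_{d-1}$ and $\mathrm{N}_{L/K}(\alpha) = (-1)^nc_0^{n/d}$.

Example 5.11. If $\gamma$ is a root of $X^3 - X - 1$, then
\[ \mathrm{Tr}_{\mathbf{Q}(\gamma)/\mathbf{Q}}(\gamma) = 0, \qquad \mathrm{N}_{\mathbf{Q}(\gamma)/\mathbf{Q}}(\gamma) = 1. \]

Example 5.12. If $\gamma$ is a root of $X^4 - X - 1$, then
\[ \mathrm{Tr}_{\mathbf{Q}(\gamma)/\mathbf{Q}}(\gamma) = 0, \qquad \mathrm{N}_{\mathbf{Q}(\gamma)/\mathbf{Q}}(\gamma) = -1. \]

Example 5.13. If $\gamma$ is a root of $X^5 + 6X^4 + X^3 + 5$ (which is irreducible over $\mathbf{Q}$, since it's irreducible mod 2), then
\[ \mathrm{Tr}_{\mathbf{Q}(\gamma)/\mathbf{Q}}(\gamma) = -6, \qquad \mathrm{N}_{\mathbf{Q}(\gamma)/\mathbf{Q}}(\gamma) = -5. \]

In the notation of Corollary 5.10, $n/d$ equals $[L : K(\alpha)]$, so we can rewrite the formulas for $\mathrm{Tr}_{L/K}(\alpha)$ and $\mathrm{N}_{L/K}(\alpha)$ at the end of Corollary 5.10 in terms of $\mathrm{Tr}_{K(\alpha)/K}(\alpha)$ and $\mathrm{N}_{K(\alpha)/K}(\alpha)$, as follows:
\[ \mathrm{Tr}_{L/K}(\alpha) = [L : K(\alpha)]\,\mathrm{Tr}_{K(\alpha)/K}(\alpha), \qquad \mathrm{N}_{L/K}(\alpha) = \mathrm{N}_{K(\alpha)/K}(\alpha)^{[L:K(\alpha)]}. \tag{5.2} \]

Example 5.14. If $\gamma$ is a root of $X^3 + X^2 + 7X + 2$, which is irreducible over $\mathbf{Q}$, then
\[ \mathrm{Tr}_{\mathbf{Q}(\gamma)/\mathbf{Q}}(\gamma) = -1, \qquad \mathrm{N}_{\mathbf{Q}(\gamma)/\mathbf{Q}}(\gamma) = -2. \]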

If $L$ is an extension of $\mathbf{Q}$ containing $\gamma$ such that $[L : \mathbf{Q}] = 12$, then we make a field diagram $L \supset \mathbf{Q}(\gamma) \supset \mathbf{Q}$, with $[L : \mathbf{Q}(\gamma)] = 4$, and see that
\[ \mathrm{Tr}_{L/\mathbf{Q}}(\gamma) = 4(-1) = -4, \qquad \mathrm{N}_{L/\mathbf{Q}}(\gamma) = (-2)^4 = 16. \]

Corollary 5.15. Let the minimal polynomial for $\alpha$ over $K$ split completely over a large enough field extension of $K$ as $(X - \alpha_1) \cdots (X - \alpha_d)$. Then
\[ \mathrm{Tr}_{K(\alpha)/K}(\alpha) = \alpha_1 + \cdots + \alpha_d, \qquad \mathrm{N}_{K(\alpha)/K}(\alpha) = \alpha_1 \cdots \alpha_d, \]
and more generally
\[ \mathrm{Tr}_{L/K}(\alpha) = \frac{n}{d}(\alpha_1 + \cdots + \alpha_d), \qquad \mathrm{N}_{L/K}(\alpha) = (\alpha_1 \cdots \alpha_d)^{n/d}, \]
where $[L : K] = n$ and $d = [K(\alpha) : K]$.

Proof. The factorization of the minimal polynomial $X^d + c_{d-1}X^{d-1} + \cdots + c_1X + c_0$ into linear factors implies $c_{d-1} = -\sum_{i=1}^d \alpha_i$ and $c_0 = (-1)^d\prod_{i=1}^d \alpha_i$. Substituting these into the formulas from Corollary 5.10 expresses the trace and norm of $\alpha$ in terms of the roots of the minimal polynomial.

The roots $\alpha_1, \dots, \alpha_d$ of the minimal polynomial of $\alpha$ over $K$ do not have to be in $L$ for the trace and norm formulas in Corollary 5.15 to work: those formulas use numbers that may be outside of $L$. When $L \neq K(\alpha)$, the formulas say that the $\alpha_i$'s have to be repeated $n/d$ times in the sum and product, which makes a total of $n$ terms in the sum and product. This is already evident in the special case of elements in $K$: for $c \in K$, $\mathrm{Tr}_{L/K}(c) = nc$ and $\mathrm{N}_{L/K}(c) = c^n$.

Example 5.16. If $c \in K$ then $\chi_{\alpha+c,L/K}(X) = \chi_{\alpha,L/K}(X - c)$ from the definition of the characteristic polynomial. Since $\mathrm{N}_{L/K}(\alpha) = (-1)^n\chi_{\alpha,L/K}(0)$, replacing $\alpha$ with $\alpha + c$ tells us that
\[ \mathrm{N}_{L/K}(\alpha + c) = (-1)^n\chi_{\alpha,L/K}(-c). \]
If we use $-c$ in place of $c$, then
\[ \mathrm{N}_{L/K}(\alpha - c) = (-1)^n\chi_{\alpha,L/K}(c). \]
For instance, in $\mathbf{Q}(\sqrt[3]{2})/\mathbf{Q}$ the number $\sqrt[3]{2}$ has characteristic polynomial $X^3 - 2$, so the characteristic polynomial of $c + \sqrt[3]{2}$ for $\mathbf{Q}(\sqrt[3]{2})/\mathbf{Q}$, where $c \in \mathbf{Q}$, is $(X - c)^3 - 2$. Thus
\[ \mathrm{N}_{\mathbf{Q}(\sqrt[3]{2})/\mathbf{Q}}(c + \sqrt[3]{2}) = -((-c)^3 - 2) = c^3 + 2. \]
The norm is not $c^3 - 2$. And this is only for $c \in \mathbf{Q}$. To find $\mathrm{N}_{\mathbf{Q}(\sqrt[3]{2})/\mathbf{Q}}(1 + \sqrt[3]{4} + \sqrt[3]{2})$ you can't use the above formula with $c = 1 + \sqrt[3]{4}$: the number wouldn't even be rational.

References

[1] S. Lang, Algebra, 3rd ed., Addison-Wesley, New York, 1993.