SYMMETRIC POLYNOMIALS

Similar documents
Polynomials, Ideals, and Gröbner Bases

POLYNOMIAL DIVISION AND GRÖBNER BASES. Samira Zeada

Rings If R is a commutative ring, a zero divisor is a nonzero element x such that xy = 0 for some nonzero element y R.

a + bi by sending α = a + bi to a 2 + b 2. To see properties (1) and (2), it helps to think of complex numbers in polar coordinates:

5.1 Monomials. Algebra 2

review To find the coefficient of all the terms in 15ab + 60bc 17ca: Coefficient of ab = 15 Coefficient of bc = 60 Coefficient of ca = -17

Problem Set 1 Solutions

Permutations and Polynomials Sarah Kitchen February 7, 2006

FACTORING IN QUADRATIC FIELDS. 1. Introduction

Handout #6 INTRODUCTION TO ALGEBRAIC STRUCTURES: Prof. Moseley AN ALGEBRAIC FIELD

Lecture 1. (i,j) N 2 kx i y j, and this makes k[x, y]

DIHEDRAL GROUPS II KEITH CONRAD

where c R and the content of f is one. 1

THE DIVISION THEOREM IN Z AND F [T ]

THE DIVISION THEOREM IN Z AND R[T ]

Chapter 2: Real solutions to univariate polynomials

Algebra 1. Standard 1: Operations With Real Numbers Students simplify and compare expressions. They use rational exponents and simplify square roots.

OSTROWSKI S THEOREM FOR Q(i)

MIT Algebraic techniques and semidefinite optimization February 16, Lecture 4

GALOIS GROUPS AS PERMUTATION GROUPS

disc f R 3 (X) in K[X] G f in K irreducible S 4 = in K irreducible A 4 in K reducible D 4 or Z/4Z = in K reducible V Table 1

4. Noether normalisation

Algebra Review. Terrametra Resources. Lynn Patten

UNIVERSAL IDENTITIES. b = det

GENERATING SETS KEITH CONRAD

MATH 326: RINGS AND MODULES STEFAN GILLE

GALOIS GROUPS OF CUBICS AND QUARTICS (NOT IN CHARACTERISTIC 2)

Lecture 7: Schwartz-Zippel Lemma, Perfect Matching. 1.1 Polynomial Identity Testing and Schwartz-Zippel Lemma

CHAPTER 3 Further properties of splines and B-splines

Linear Algebra March 16, 2019

g(x) = 1 1 x = 1 + x + x2 + x 3 + is not a polynomial, since it doesn t have finite degree. g(x) is an example of a power series.

MATH 433 Applied Algebra Lecture 22: Semigroups. Rings.

Polynomial Rings. i=0. i=0. n+m. i=0. k=0

Appendix C Vector and matrix algebra

Formal Groups. Niki Myrto Mavraki

SUPPLEMENTARY NOTES: CHAPTER 1

* 8 Groups, with Appendix containing Rings and Fields.

Commutative Rings and Fields

Gröbner Bases over a Dual Valuation Domain

Introduction to finite fields

EXTERIOR POWERS KEITH CONRAD

Systems of Linear Equations and Matrices

PUTNAM TRAINING POLYNOMIALS. Exercises 1. Find a polynomial with integral coefficients whose zeros include

PYTHAGOREAN TRIPLES KEITH CONRAD

Systems of Linear Equations and Matrices

A Generalization of Wilson s Theorem

15. Polynomial rings Definition-Lemma Let R be a ring and let x be an indeterminate.

CPSC 536N: Randomized Algorithms Term 2. Lecture 9

0 Sets and Induction. Sets

GAUSSIAN ELIMINATION AND LU DECOMPOSITION (SUPPLEMENT FOR MA511)

Definition 2.3. We define addition and multiplication of matrices as follows.

Revised Section 9.4. Brian C. Hall. December 29, 2015

MA106 Linear Algebra lecture notes

Linear Equations in Linear Algebra

Numbers, proof and all that jazz.

Properties of Real Numbers

Course 311: Michaelmas Term 2005 Part III: Topics in Commutative Algebra

EXAMPLES OF PROOFS BY INDUCTION

THE ALGEBRAIC GEOMETRY DICTIONARY FOR BEGINNERS. Contents

Groups. 3.1 Definition of a Group. Introduction. Definition 3.1 Group

Definitions, Theorems and Exercises. Abstract Algebra Math 332. Ethan D. Bloch

Linear Algebra Lecture Notes

ON SUM OF SQUARES DECOMPOSITION FOR A BIQUADRATIC MATRIX FUNCTION

ARITHMETIC PROGRESSIONS IN CYCLES OF QUADRATIC POLYNOMIALS

Integral Extensions. Chapter Integral Elements Definitions and Comments Lemma

SIMPLE RADICAL EXTENSIONS

8. Limit Laws. lim(f g)(x) = lim f(x) lim g(x), (x) = lim x a f(x) g lim x a g(x)

The planar resolution algorithm

Pyramids and monomial blowing-ups

Mathematical Olympiad Training Polynomials

THE MINIMAL POLYNOMIAL AND SOME APPLICATIONS

be any ring homomorphism and let s S be any element of S. Then there is a unique ring homomorphism

Algebra Review 2. 1 Fields. A field is an extension of the concept of a group.

Study Guide for Math 095

Algebra Homework, Edition 2 9 September 2010

Unit 13: Polynomials and Exponents

REFLECTIONS IN A EUCLIDEAN SPACE

THE GAUSSIAN INTEGERS

IDEAL CLASSES AND RELATIVE INTEGERS

Something that can have different values at different times. A variable is usually represented by a letter in algebraic expressions.

ALGEBRA. 1. Some elementary number theory 1.1. Primes and divisibility. We denote the collection of integers

TROPICAL SCHEME THEORY

MATH 2400 LECTURE NOTES: POLYNOMIAL AND RATIONAL FUNCTIONS. Contents 1. Polynomial Functions 1 2. Rational Functions 6

THE FUNDAMENTAL THEOREM OF ALGEBRA VIA PROPER MAPS

4 Elementary matrices, continued

Solving Linear Systems Using Gaussian Elimination

Chapter 5 Rational Expressions

SEPARABILITY II KEITH CONRAD

and this makes M into an R-module by (1.2). 2

THE DIAMOND LEMMA FOR RING THEORY

SOLVING SOLVABLE QUINTICS. D. S. Dummit

Real Numbers. Real numbers are divided into two types, rational numbers and irrational numbers

THE DIFFERENT IDEAL. Then R n = V V, V = V, and V 1 V 2 V KEITH CONRAD 2 V

The Integers. Math 3040: Spring Contents 1. The Basic Construction 1 2. Adding integers 4 3. Ordering integers Multiplying integers 12

ISOMORPHISMS KEITH CONRAD

2. Two binary operations (addition, denoted + and multiplication, denoted

6 Lecture 6: More constructions with Huber rings

Lesson 3 Algebraic expression: - the result obtained by applying operations (+, -,, ) to a collection of numbers and/or variables o

Finite Fields: An introduction through exercises Jonathan Buss Spring 2014

GROUPS. Chapter-1 EXAMPLES 1.1. INTRODUCTION 1.2. BINARY OPERATION

Transcription:

SYMMETRIC POLYNOMIALS KEITH CONRAD Let F be a field. A polynomial f(x 1,..., X n ) F [X 1,..., X n ] is called symmetric if it is unchanged by any permutation of its variables: for every permutation σ of {1,..., n}. f(x 1,..., X n ) = f(x σ(1),..., X σ(n) ) Example 1. The sum X 1 + +X n and product X 1 X n are symmetric, as are the power sums X r 1 + + Xr n for any r 1. Example 2. Let f(x 1, X 2, X 3 ) = X 5 1 + X 2X 3. This polynomial is unchanged if we interchange X 2 and X 3, but if we interchange X 1 and X 3 then f becomes X 5 3 + X 2X 1, which is not f. This polynomial is only partially symmetric. An important collection of symmetric polynomials occurs as the coefficients in the polynomial (1) (T X 1 )(T X 2 ) (T X n ) = T n s 1 T n 1 + s 2 T n 2 + ( 1) n s n. Here s 1 is the sum of the X i s, s n is their product, and more generally s k = X i1 X ik 1 i 1 < <i k n is the sum of the products of the X i s taken k terms at a time. The s k s are all symmetric in X 1,..., X n and are called the elementary symmetric polynomials or elementary symmetric functions in the X i s Example 3. Let α = 3+ 5 2 and β = 3 5 2. Although α and β are not rational, their elementary symmetric polynomials are: s 1 = α + β = 3 and s 2 = αβ = 1. Example 4. Let α, β, and γ be the three roots of T 3 T 1, so T 3 T 1 = (T α)(t β)(t γ). Multiplying out the right side and equating coefficients on both sides, the elementary symmetric functions of α, β, and γ are s 1 = α + β + γ = 0, s 2 = αβ + αγ + βγ = 1, and s 3 = αβγ = 1. Theorem 5. The set of symmetric polynomials in F [X 1,..., X n ] is F [s 1,..., s n ]. That is, every symmetric polynomial in n variables is a polynomial in the elementary symmetric functions of those n variables. Example 6. In two variables, the polynomial X 3 + Y 3 is symmetric in X and Y. As a polynomial in X + Y and XY, X 3 + Y 3 = (X + Y ) 3 3XY (X + Y ) = s 3 1 3s 1 s 2. 1

2 KEITH CONRAD Our proof of Theorem 5 will proceed by induction on the multidegree of a polynomial in several variables, which is defined in terms of a certain ordering on multivariable polynomials, as follows. Definition 7. For two vectors a = (a 1,..., a n ) and b = (b 1,..., b n ) in N n, set a < b if, for the first i such that a i b i, we have a i < b i. Example 8. In N 4, (3, 0, 2, 4) < (5, 1, 1, 3) and (3, 0, 2, 4) < (3, 0, 3, 1). For any two n-tuples a and b in N n, either a = b, a < b, or b < a, so N n is totally ordered under <. (For example, (0, 0,..., 0) < a for all a (0, 0,..., 0).) This way of ordering n-tuples is called the lexicographic (i.e., dictionary) ordering since it resembles the way words are ordered in the dictionary: first order by the first letter, and for words with the same first letter order by the second letter, and so on. It is simple to check that for i, j, and k in N n, (2) i < j = i + k < j + k. A polynomial f F [X 1,..., X n ] can be written in the form f(x 1,..., X n ) = c i1,...,i n X i1 1 Xin i 1,...,i n We will abbreviate this in multi-index form to f = i c it i, where T i := X i 1 1 Xin n i = (i 1,..., i n ). Note T i T j = T i+j. Definition 9. For a nonzero polynomial f F [X 1,..., X n ], write f = i c it i. Set the multidegree of f to be mdeg f = max{i : c i 0} N n. The multidegree of the zero polynomial is not defined. If mdeg f = a, we call c a T a the leading term of f and c a the leading coefficient of f, written c a = lead f. Example 10. mdeg(7x 1 X 5 2 + 3X 2) = (1, 5) and lead(7x 1 X 5 2 + 3X 2) = 7. Example 11. mdeg(x 1 ) = (1, 0,..., 0) and mdeg(x n ) = (0, 0,..., 1). Example 12. The multidegrees of the elementary symmetric polynomials are mdeg(s 1 ) = (1, 0, 0,..., 0), mdeg(s 2 ) = (1, 1, 0,..., 0),..., and mdeg(s n ) = (1, 1, 1,..., 1). For k = 1,..., n, the leading term of s k is X 1 X k, so the leading coefficient of s k is 1. Example 13. Polynomials with multidegree (0, 0,..., 0) are the nonzero constants. Remark 14. There is a simpler notion of degree of a multivariable polynomial: the largest sum of exponents of a nonzero monomial in the polynomial, e.g., X 1 X 3 2 + X2 1 has degree 4. This degree has values in N rather than N n. We won t be using it; the multidegree is more convenient for our purposes. Our definition of multidegree is specific to calling X 1 the first variable and X n the last variable. Despite its ad hoc nature (there is nothing intrinsic about making X 1 the first variable), the multidegree is useful since it permits us to prove theorems about all multivariable polynomials by induction on the multidegree. The following lemma shows that a number of standard properties of the degree of polynomials in one variable carry over to multidegrees of multivariable polynomials. n. for

SYMMETRIC POLYNOMIALS 3 Lemma 15. For nonzero f and g in F [X 1,..., X n ], mdeg(fg) = mdeg(f) + mdeg(g) in N n and lead(fg) = (lead f)(lead g). For f and g in F [X 1,..., X n ], mdeg(f + g) max(mdeg f, mdeg g) and if mdeg f < mdeg g then mdeg(f + g) = mdeg g. Proof. We will prove the first result and leave the second to the reader. Let mdeg f = a and mdeg g = b, say f = c a T a + i<a c it i with c a 0 and g = c b T b + j<b c j T j with c b 0. This amounts to pulling out the top multidegree terms of f and g. Then fg has a nonzero term c a c b T a+b and every other term has multidegree a + j, b + i, or i + j where i < a and j < b. By (2), all these other multidegrees are less than a + b, so mdeg(fg) = a + b = mdeg f + mdeg g and lead(fg) = c a c b = (lead f)(lead g). Now we are ready to prove Theorem 5. Proof. We want to show every symmetric polynomial in F [X 1,..., X n ] is a polynomial in F [s 1,..., s n ]. We can ignore the zero polynomial. Our argument is by induction on the multidegree. Multidegrees are totally ordered, so it makes sense to give a proof using induction on them. A polynomial in F [X 1,..., X n ] with multidegree (0, 0,..., 0) is in F, and F F [s 1,..., s n ]. Now pick an d (0, 0,..., 0) in N n and suppose the theorem is proved for all symmetric polynomials with multidegree less than d. Write d = (d 1,..., d n ). Pick any symmetric polynomial f with multidegree d. (If there aren t any symmetric polynomials with multidegree d, then there is nothing to do and move on the next n-tuple in the total ordering on N n.) Pull out the leading term of f: (3) f = c d X d 1 1 Xdn n + i<d c i T i, where c d 0. We will find a polynomial in s 1,..., s n with the same leading term as f. Its difference with f will then be symmetric with smaller multidegree than d, so by induction we ll be done. By Example 12 and Lemma 15, for any nonnegative integers a 1,..., a n, mdeg(s a 1 1 sa 2 2 san n ) = (a 1 + a 2 + + a n, a 2 + + a n,..., a n ). The ith coordinate here is a i + a i+1 + + a n. To make this multidegree equal to d, we must set (4) a 1 = d 1 d 2, a 2 = d 2 d 3,..., a n 1 = d n 1 d n, a n = d n. But does this make sense? That is, do we know that d 1 d 2, d 2 d 3,..., d n 1 d n, d n are all nonnegative? If that isn t true then we have a problem. So we need to show the coordinates in d satisfy (5) d 1 d 2 d n 0. In other words, an n-tuple which is the multidegree of a symmetric polynomial has to satisfy (5). To appreciate this issue, consider f = X 1 X 5 2 + 3X 2. The multidegree of f is (1, 5), so the exponents don t satisfy (5). But this f is not symmetric, and that is the key point. If we took f = X 1 X 5 2 + X5 1 X 2 then f is symmetric and mdeg f = (5, 1) does satisfy (5). The verification of (5) will depend crucially on f being symmetric.

4 KEITH CONRAD Since (d 1,..., d n ) is the multidegree of a nonzero monomial in f, and f is symmetric, every vector with the d i s permuted is also a multidegree of a nonzero monomial in f. (Here is where the symmetry of f in the X i s is used: under any permutation of the X i s, f stays unchanged.) Since (d 1,..., d n ) is the largest multidegree of all the monomials in f, (d 1,..., d n ) must be larger in N n than any of its nontrivial permutations 1, which means d 1 d 2 d n 0. That shows the definition of a 1,..., a n in (4) has nonnegative values, so s a 1 1 san n is a polynomial. Its multidegree is the same as that of f by (4). Moreover, by Lemma 15, lead(s a 1 1 san n ) = (lead s 1 ) a1 (lead s n ) an = 1. Therefore f and c d s a 1 1 san n, where c d = lead f, have the same leading term, namely c d X d 1 1 Xdn n. If f = c d s a 1 1 san n then we re done. If f c d s a 1 1 san n then the difference f c d s a 1 1 san n is nonzero with mdeg(f c d s a 1 1 san n ) < (d 1,..., d n ). The polynomial f c d s a 1 1 san n is symmetric since both terms in the difference are symmetric. By induction on the multidegree, f c d s a 1 1 san n F [s 1,..., s n ], so f F [s 1,..., s n ]. Let s summarize the recursive step: if f is a symmetric polynomial in X 1,..., X n then leading term of f is c d X d 1 1 Xd n 1 n 1 Xdn n = mdeg(f c d s d 1 d 2 1 s d n 1 d n n 1 s dn n ) < mdeg(f). Example 16. In three variables, let f(x, Y, Z) = X 4 + Y 4 + Z 4. We want to write this as a polynomial in the elementary symmetric polynomials in X, Y, and Z, which are s 1 = X + Y + Z, s 2 = XY + XZ + Y Z, s 3 = XY Z. Treating X, Y, Z as X 1, X 2, X 3, the multidegree of s a 1 sb 2 sc 3 is (a + b + c, b + c, c). The leading term of f is X 4, with multidegree (4, 0, 0). This is the multidegree of s 4 1 = (X + Y + Z) 4, which has leading term X 4. So we subtract: f s 4 1 = 4x 3 y 4x 3 z + 6x 2 y 2 12x 2 yz 6x 2 z 2 4xy 3 12xy 2 z 12xyz 2 4xz 3 4y 3 z 6y 2 z 2 4yz 3. This has leading term 4x 3 y, with multidegree (3, 1, 0). This is (a + b + c, b + c, c) when c = 0, b = 1, a = 2. So we add 4s a 1 sb 2 sc 3 = 4s2 1 s 2 to f s 4 1 to cancel the leading term: f s 4 1 + 4s 2 1s 2 = 2x 2 y 2 + 8x 2 yz + 2x 2 z 2 + 8xy 2 z + 8xyz 2 + 2y 2 z 2, whose leading term is 2x 2 y 2 with multidegree (2, 2, 0). This is (a+b+c, b+c, c) when c = 0, b = 2, a = 0. So we subtract 2s 2 2 : f s 4 1 + 4s 2 1s 2 2s 2 2 = 4x 2 yz + 4xy 2 z + 4xyz 2. The leading term is 4x 2 yz, which has multidegree (2, 1, 1). This is (a + b + c, b + c, c) for c = 1, b = 0, and a = 1, so we subtract 4s 1 s 3 : Thus f s 4 1 + 4s 2 1s 2 2s 2 2 4s 1 s 3 = 0. (6) X 4 + Y 4 + Z 4 = s 4 1 4s 2 1s 2 + 2s 2 2 + 4s 1 s 3. 1 A trivial permutation is one that exchanges equal coordinates, like (2, 2, 1) and (2, 2, 1).

SYMMETRIC POLYNOMIALS 5 Remark 17. The proof we have given here is based on [2, Sect. 7.1], where there is an additional argument that shows the representation of a symmetric polynomial as a polynomial in the elementary symmetric polynomials is unique. (For example, the only expression of X 4 + Y 4 + Z 4 as a polynomial in s 1, s 2, and s 3 is the one appearing in (6).) For a different proof of Theorem 5, which uses the more usual notion of degree of a multivariable polynomial described in Remark 14, see [1, Sect. 16.1] (there is a gap in that proof, but the basic ideas are there). Corollary 18. Let L/K be a field extension and f(t ) K[T ] factor as in L[T ]. Then for all positive integers r, (T α 1 )(T α 2 ) (T α n ) (T α r 1)(T α r 2) (T α r n) K[T ]. Proof. The coefficients of (T α r 1 )(T αr 2 ) (T αr n) are symmetric polynomials in α 1,..., α n with coefficients in K, so these coefficients are polynomials in the elementary symmetric polynomials in the α i s with coefficients in K. The elementary symmetric polynomials in the α i s are (up to sign) the coefficients of f(t ), so they lie in K. Therefore any polynomial in the elementary symmetric functions of the α i s with coefficients in K lies in K. Example 19. Let f(t ) = T 2 + 5T + 2 = (T α)(t β) where α = ( 5 + 17)/2 and β = ( 5 17)/2. Although α and β are not rational, their elementary symmetric functions are rational: s 1 = α + β = 5 and s 2 = αβ = 1. Therefore any symmetric polynomial in α and β with rational coefficients is rational (since it is a polynomial in α + β and αβ with rational coefficients). In particular, (T α r )(T β r ) Q[T ] for all r 1. Taking r = 2, 3, and 4, we have (T α 2 )(T β 2 ) = T 2 21T + 4, (T α 3 )(T β 3 ) = T 2 + 95T + 8, (T α 4 )(T β 4 ) = T 2 433T + 16. Example 20. Let α, β, and γ be the three roots of T 3 T 1, so T 3 T 1 = (T α)(t β)(t γ). The elementary symmetric functions of α, β, and γ are all rational, so for every positive integer r, (T α r )(T β r )(T γ r ) has rational coefficients. As explicit examples, (T α 2 )(T β 2 )(T γ 2 ) = T 3 2T 2 + T 1, (T α 3 )(T β 3 )(T γ 3 ) = T 3 3T 2 + 2T 1. In the proof of Theorem 5, the fact that the coefficients come from a field F is not important; we never had to divide in F. The same proof shows for any commutative ring R that the symmetric polynomials in R[X 1,..., X n ] are R[s 1,..., s n ]. (Actually, there is a slight hitch: if R is not a domain then the formula mdeg(fg) = mdeg f + mdeg g is true only as long as the leading coefficients of f and g are both not zero-divisors in R, and that is true for the relevant case of elementary symmetric polynomials s 1,..., s n, whose leading coefficients equal 1.)

6 KEITH CONRAD Example 21. Taking α and β as in Example 19, their elementary symmetric functions are both integers, so any symmetric polynomial in α and β with integral coefficients is an integral polynomial in α + β and αβ with integral coefficients, and thus is an integer. This implies (T α r )(T β r ), whose coefficients are α r + β r and α r β r, has integral coefficients and not just rational coefficients. Examples of this for small r are seen in Example 19. References [1] M. Artin, Algebra, 2nd ed., Prentice-Hall, 2010. [2] D. Cox, J. Little, D. O Shea, Ideals, Varieties, and Algorithms: An Introduction to Computational Algebraic Geometry and Commutative Algebra, Springer-Verlag, New York, 1992.