DOUBLE ZETA VALUES AND MODULAR FORMS

Similar documents
A class of non-holomorphic modular forms

Multiple Eisenstein series

THE RAMANUJAN-SERRE DIFFERENTIAL OPERATORS AND CERTAIN ELLIPTIC CURVES. (q = e 2πiτ, τ H : the upper-half plane) ( d 5) q n

Mathematische Annalen

Theorem 5.3. Let E/F, E = F (u), be a simple field extension. Then u is algebraic if and only if E/F is finite. In this case, [E : F ] = deg f u.

Bare-bones outline of eigenvalue theory and the Jordan canonical form

AN EXACT SEQUENCE FOR THE BROADHURST-KREIMER CONJECTURE

FOUNDATIONS OF ALGEBRAIC GEOMETRY CLASS 27

1 Basic Combinatorics

On the zeros of certain modular forms

10. Smooth Varieties. 82 Andreas Gathmann

1 Fields and vector spaces

Multiple divisor functions, their algebraic structure and the relation to multiple zeta values

Course 311: Michaelmas Term 2005 Part III: Topics in Commutative Algebra

FORMAL GROUPS OF CERTAIN Q-CURVES OVER QUADRATIC FIELDS

arxiv: v1 [math.gr] 8 Nov 2008

Math 121 Homework 5: Notes on Selected Problems

The algebra of multiple divisor functions and applications to multiple zeta values

We simply compute: for v = x i e i, bilinearity of B implies that Q B (v) = B(v, v) is given by xi x j B(e i, e j ) =

Chapter 2 Linear Transformations

Topics in linear algebra

October 25, 2013 INNER PRODUCT SPACES

Algebraic structures I

Mock modular forms and their shadows

Linear Algebra Notes. Lecture Notes, University of Toronto, Fall 2016

1. A Little Set Theory

Foundations of Mathematics MATH 220 FALL 2017 Lecture Notes

Linear Algebra. Min Yan

Exercises for algebraic curves

FOURIER COEFFICIENTS OF VECTOR-VALUED MODULAR FORMS OF DIMENSION 2

Transcendental Numbers and Hopf Algebras

Chapter 3. Rings. The basic commutative rings in mathematics are the integers Z, the. Examples

( files chap2 to chap

Infinite-Dimensional Triangularization

SYMMETRIC SUBGROUP ACTIONS ON ISOTROPIC GRASSMANNIANS

NONCOMMUTATIVE POLYNOMIAL EQUATIONS. Edward S. Letzter. Introduction

THE GROUP OF UNITS OF SOME FINITE LOCAL RINGS I

Math 210B:Algebra, Homework 2

Introduction to Group Theory

2 Generating Functions

φ(xy) = (xy) n = x n y n = φ(x)φ(y)

Rings and groups. Ya. Sysak

20 The modular equation

5 Dedekind extensions

MATH 101B: ALGEBRA II PART A: HOMOLOGICAL ALGEBRA

Irrationality exponent and rational approximations with prescribed growth

IDEAL CLASSES AND RELATIVE INTEGERS

ELEMENTARY LINEAR ALGEBRA

NOTES ON LINEAR ALGEBRA OVER INTEGRAL DOMAINS. Contents. 1. Introduction 1 2. Rank and basis 1 3. The set of linear maps 4. 1.

CHAPTER 2. Ordered vector spaces. 2.1 Ordered rings and fields

Some notes on Coxeter groups

Integral Extensions. Chapter Integral Elements Definitions and Comments Lemma

The Symmetric Groups

Introduction to Arithmetic Geometry Fall 2013 Lecture #23 11/26/2013

20 The modular equation

LINEAR ALGEBRA BOOT CAMP WEEK 1: THE BASICS

MATH PRACTICE EXAM 1 SOLUTIONS

6. Duals of L p spaces

ARCS IN FINITE PROJECTIVE SPACES. Basic objects and definitions

Introduction to modules

MATH 8253 ALGEBRAIC GEOMETRY WEEK 12

Chapter 1 Vector Spaces

DIVISORS ON NONSINGULAR CURVES

RIEMANN SURFACES. max(0, deg x f)x.

The 4-periodic spiral determinant

k k would be reducible. But the zero locus of f in A n+1

Ramanujan s first letter to Hardy: 5 + = 1 + e 2π 1 + e 4π 1 +

LIMITING CASES OF BOARDMAN S FIVE HALVES THEOREM

INTRODUCTION TO REAL ANALYTIC GEOMETRY

Algebraic Number Theory

13:00 13:50 poly-euler L

Representation theory and quantum mechanics tutorial Spin and the hydrogen atom

be any ring homomorphism and let s S be any element of S. Then there is a unique ring homomorphism

ELEMENTARY SUBALGEBRAS OF RESTRICTED LIE ALGEBRAS

A finite universal SAGBI basis for the kernel of a derivation. Osaka Journal of Mathematics. 41(4) P.759-P.792

The Maximal Rank Conjecture

TOPOLOGICAL COMPLEXITY OF 2-TORSION LENS SPACES AND ku-(co)homology

Class Number Type Relations for Fourier Coefficients of Mock Modular Forms

a 11 x 1 + a 12 x a 1n x n = b 1 a 21 x 1 + a 22 x a 2n x n = b 2.

MATH FINAL EXAM REVIEW HINTS

How many units can a commutative ring have?

Curves on P 1 P 1. Peter Bruin 16 November 2005

SPECTRAL THEORY EVAN JENKINS

August 23, 2017 Let us measure everything that is measurable, and make measurable everything that is not yet so. Galileo Galilei. 1.

ne varieties (continued)

ALGEBRA II: RINGS AND MODULES OVER LITTLE RINGS.

Mathematics Course 111: Algebra I Part I: Algebraic Structures, Sets and Permutations

Math 210B. Artin Rees and completions

Elementary linear algebra

(τ) = q (1 q n ) 24. E 4 (τ) = q q q 3 + = (1 q) 240 (1 q 2 ) (1 q 3 ) (1.1)

Measure and integration

q-series Michael Gri for the partition function, he developed the basic idea of the q-exponential. From

fy (X(g)) Y (f)x(g) gy (X(f)) Y (g)x(f)) = fx(y (g)) + gx(y (f)) fy (X(g)) gy (X(f))

Comparison of Virginia s College and Career Ready Mathematics Performance Expectations with the Common Core State Standards for Mathematics

Chapter 1. Sets and Numbers

Irredundant Families of Subcubes

1. Let r, s, t, v be the homogeneous relations defined on the set M = {2, 3, 4, 5, 6} by

Mock Modular Forms and Class Number Relations

Class groups and Galois representations

Transcription:

DOUBLE ZETA VALUES AND MODULAR FORMS HERBERT GANGL, MASANOBU KANEKO, AND DON ZAGIER Dedicated to the memory of Tsuneo Araawa. Introduction and main results The double zeta values, which are defined for integers r, s, by ζ(r, s = m r n s, ( m>n>0 are subject to numerous relations. Already Euler found that when the weight = r + s is odd the double zeta values can be reduced to products of usual zeta values. Furthermore, he gave the sum formula ζ(r, r = ζ( ( >. ( r= The aims of the present paper are: to give other interesting relations among double zeta values, to show that the structure of the Q-vector space of all relations among double zeta values of weight is connected in many different ways with the structure of the space of modular forms M of weight on the full modular group Γ = PSL(, Z, and to introduce and study both transcendental and combinatorial double Eisenstein series which explain the relation between double zeta values and modular forms and provide new realizations of the space of double zeta relations. Double zeta values are a special case of multiple zeta values, defined by sums lie ( but with longer decreasing sequences of integers, which are nown to satisfy a collection of relations called the double shuffle relations (cf., e.g., [3], [5], []. The specialization of these relations to the double zeta case is given by the following two sets of easily proved relations (see Section : ζ(r, s + ζ(s, r = ζ(r ζ(s ζ( (r + s = ; r, s, [( ( ] r r + ζ(r, r = ζ(j ζ( j ( j j j. (3 r= We wish to study the relations which can be deduced from (3. Since we want to do this algebraically, it is useful to wor, not with the double zeta values themselves, which for all we now may satisfy other relations than (3 (it is not even nown that any ζ(r, s/π r+s is irrational, but with the formal double zeta space D, generated by formal symbols Z r,s, P r,s and Z subject to the relations (3, with Z r,s, P r,s and Z taing the role of ζ(r, s, ζ(rζ(s and ζ(, respectively, and where r and s are allowed to assume the value. In D we can prove a number of explicit relations. In particular, Euler s result that all Z r,s are rational linear combinations of the P r,s when the weight is odd holds in the formal double zeta space D, so that we can (and usually will assume that is even. Similarly, the formal analogue of Euler s sum formula ( holds in

HERBERT GANGL, MASANOBU KANEKO, AND DON ZAGIER D, and in fact (for even has a refinement giving the sums of the even- and oddargument double zeta values of weight separately. Surprisingly, they are always in the ratio 3:, independently of : Theorem. For even >, one has r= r even Z r, r = 3 4 Z, r= r odd Z r, r = 4 Z. (4 As an example of a more complicated identity, we show that, for m, n odd, m + n = >, n ( m B ν Z n ν,m+ν = ( s λ m,n (r, sp r,s, (5 ν ν=0 ν=0 where B ν is the νth Bernoulli number and n ( ( m + ν r λ m,n (r, s = B ν (6 ν n ν (which despite appearances is symmetric in r and s. Since B ν = 0 for all odd ν except ν =, this implies that any Z ev,ev can be written in terms of Z od,od s and P r,s s. But in fact only Z od,od s are required: Theorem. Let > be even. Then the Z r, r with 0 < r < odd are a basis of D. There are explicit representations of the elements of various bases of D as linear combinations of the Z od,od s. Theorem will be proved in Section 4 by rewriting the defining relations (3 of D algebraically in terms of the action of the group ring Z[Γ ] on a space of polynomials. This leads to both a simple proof of the first statement and to several concrete versions of the second. One of these, a variant of (5, is Z m+, m + Z λ 0 m, m(r, s ( Z r,s + Z (7 = m r, s odd for m =, 3,..., 3, where ( s r ( l λ 0 m,n(r, s = λ m,n (r, s B s m = m m l=0 ( r l B n l (with B ν = 0 for ν < 0. Since Z equals 4 r> odd Z r, r by Theorem, this expresses all even-argument double zeta values in terms of odd-argument ones. Theorem is false for double zeta values. Instead we have the following result, which gives the first connection with modular forms: Theorem 3. (Rough statement. The values ζ(od, od of weight satisfy at least dim S linearly independent relations, where S denotes the space of cusp forms of weight on Γ. Example. For = and = 6, the first weights for which there are non-zero cusp forms on Γ, we have the identities 8 ζ(9, 3 + 50 ζ(7, 5 + 68 ζ(5, 7 = 597 ζ( (8 69 66 ζ(3, 3 + 375 ζ(, 5 + 686 ζ(9, 7 + 675 ζ(7, 9 + 396 ζ(5, = 78967 367 ζ(6,

DOUBLE ZETA VALUES AND MODULAR FORMS 3 which can be written in terms only of ζ(od, od s using Theorem. Conjecturally (and numerically, these are the only relations over Q among odd-argument double zeta values up to weight 6, and more generally we expect that there are no further relations among the ζ(od, od except the ones predicted by Theorem 3. Although Theorem 3 holds for the true double zeta world and is false in the formal one, it is in fact a consequence of a result in the formal space. In fact, it follows from two different though complementary results. Both of them involve period polynomials. We recall the definition of these polynomials. (A more detailed review will be given in Section 5. For each even we consider the space V of homogeneous polynomials of degree in two variables and the subspace W V of polynomials satisfying the relations P (X, Y + P ( Y, X = 0, P (X, Y + P (X Y, X + P (Y, Y X = 0. It splits as the direct sum of subspaces W + and W of polynomials which are symmetric and antisymmetric with respect to X Y, with the former being odd and the latter even with respect to X X. The Eichler-Shimura-Manin theory tells us that there are canonical isomorphisms over C between S and W + and between M and W. The full statement of Theorem 3, given in Section 5, associates to any polynomial in W, in an injective way, an explicit relation among the numbers Z od,od and P ev,ev (and Z. For the above example (8, for instance, the polynomial X Y (X Y 3 in W leads to the relation 8 Z 9,3 + 50 Z 7,5 + 68 Z 5,7 = 8P 4,8 + 95 3 P 6,6 67 3 Z, (9 which by Euler s theorem agrees with (8 modulo Qπ, and similarly the complete version of the relation given above between odd double zeta values in weight 6 is 66 Z 3,3 + 375 Z,5 + 686 Z 9,7 + 675 Z 7,9 + 396 Z 5, = 66P 4, + 85P 6,0 + 364 3 P 8,8 08 3 Z 6. The other result about formal double zeta values which implies Theorem 3 involves the space W + rather than W. More precisely, it involves a certain - dimensional extension Ŵ + V + C (X Y + X Y (see Section 6 for details which is isomorphic to M rather than S : Theorem 4. If {Z r,s, P r,s, Z } is a collection of numbers satisfying the double shuffle relations in weight, then the polynomial P r,s X r Y s ( X Y + X Y r, s even Z belongs to Ŵ + (and to W + if Z = 0. Every element of Ŵ + arises in this way. From one point of view, this says that the subspace P ev of D spanned by the P r,s with r and s even is canonically dual to Ŵ +. From another, it says that there are /6 + O( relations among the P ev,ev, these relations being the same as the relations satisfied by the coefficients of period polynomials in W +. In fact, we will prove Theorem 4 in this form. It is this point of view which leads to the most direct connection with modular forms, because it is nown (as a consequence of the so-called Ranin-Selberg or unfolding method that the coefficients of (extended symmetric period polynomials satisfy the same linear relations as the products G r G s M (r + s =, where G r denotes the Eisenstein series of weight r on PSL (Z. (When r or s is equal to, the product G G must be modified

4 HERBERT GANGL, MASANOBU KANEKO, AND DON ZAGIER slightly by adding an appropriate multiple of G to compensate for the nonmodularity of G. Thus the proof of Theorem 4, combined with the nown facts that the products G r G s span M and, after dividing by π, have rational Fourier coefficients, also leads to the following, more intuitive, statement: Theorem 5. The space P ev is canonically isomorphic to M Q, by a map which sends P r,s to (πi G r G s (plus a multiple of G if r or s = and Z to (πi G. Theorem 5 tells us that there is a realization of the symmetric (P- part of the double shuffle relations given by products of Eisenstein series. This implies by linear algebra that there must be a realization (and in fact, infinitely many realizations of the full space D having these products as its symmetric part. It is then natural to as whether there is a natural choice of such a realization. In the last part of the paper we show, following an idea already adumbrated in [5], that there are in fact two such choices. More precisely, we show that one can extend the map M in two different ways to a map from D to a larger space of functions, by finding double Eisenstein series which are related to products of Eisenstein series in exactly the same way as double zeta values are related to products of Riemann zeta values. One of these ways is transcendental, in terms of holomorphic functions in the upper half plane, and the other combinatorial, in terms of formal power series in q with rational coefficients. Both ways are interesting, and they also turn out to be related: the Fourier expansion of the transcendental double Eisenstein series splits up into three terms, the most complicated of which is (a multiple of the combinatorial double Eisenstein series. We now explain this in more detail. P ev The transcendental version of the double Eisenstein series G r,s (τ is defined, in complete analogy with (, as G r,s (τ = m, n Zτ+Z m n 0 m r n s (τ H = upper half-plane, where n 0 means n = nτ + b with n > 0 or n = 0, b > 0 and m n means m n 0. The series converges absolutely for r 3, s, and also maes sense for s = if the sum over n (for m fixed is interpreted as a Cauchy principal value. The same combinatorial proof that establishes (3 shows that, at least in the convergent cases, the corresponding equations still hold with ζ(r, s replaced by G r,s (τ and with (each ζ( replaced by the function G (τ = m Zτ+Z m 0 m ( (again to be interpreted as a Cauchy principal value if =, which equals the previously mentioned Eisenstein series if is even. In other words, at least for the cases of absolute convergence, we have a realization of the double shuffle relations on the space of holomorphic functions in H given by Z r,s G r,s (τ, P r,s G r (τg s (τ, Z G (τ. The combinatorial/arithmetic aspect emerges when we study the Fourier expansions of the single and double Eisenstein series. The former are given by the well-nown formula (πi G (τ = ζ( + g (q, (0

DOUBLE ZETA VALUES AND MODULAR FORMS 5 where q = e πiτ, ζ( = (πi ζ(, and g (q = ( u q un (. ( (! u,n>0 The corresponding result for G r,s (τ is given by Theorem 6. The Fourier expansion of G r,s (τ for r 3, s is given by (πi r s G r,s (τ = ζ(r, s + Cr,s p g h(q ζ(p + g r,s (q ( with q = e πiτ, ζ(r, s = (πi r s ζ(r, s, ( p Cr,s p = δ s,p + ( s s and g r,s (q = ( r+s (r!(s! m>n>0 u, v>0 h+p=r+s h, p> ( p + ( p r r Z (3 u r v s q um+vn Q[[q]]. (4 We can reinterpret this theorem in the light of the following considerations. If is even, the case when G (τ is modular (or quasi-modular if =, then by Euler s theorem the number ζ( occurring on the right-hand side of (0 is the rational number B /!, which we denote by β. Hence this right-hand side can be replaced by the expression Z (q = g (q + β (, (5 which we call the combinatorial Eisenstein series because it is purely combinatorially defined as an element of Q[[q]] and is proportional to the usual Eisenstein series (and hence modular when is even and 4. In the same way, we define β r,s (q = Cr,s p β p g h (q (r, s (6 h+p=r+s and set Z r,s (q = g r,s (q + β r,s (q (r 3, s, (7 the combinatorial double Eisenstein series. Then the right-hand side of ( can be rewritten as ζ(r, s + Cr,s p ζ(p g h (q + Z r,s (q. (8 h+p=r+s h, p>, p odd The three pieces in (8 lie in three non-intersecting Q-subspaces of C[[q]] : the first term is in C (more precisely, in R or ir depending on the parity of r + s, the second term in ir[[q]] 0, and the third in Q[[q]] 0, where A[[q]] 0 = qa[[q]] denotes the space of power series without constant term with coefficients in a vector space A. The first term is our familiar double zeta value realization of the double shuffle relations. The second also fulfils the double shuffle relations, independently of the arithmetic natures of ζ(p and g h (q, because by a simple result which will be proved in Section (Corollary the numbers Cr,s p for any odd value of p less than r + s already satisfy these relations. The following theorem, which we will prove in Section 7, says that the combinatorial double Eisenstein series, suitably extended to the missing cases r =, and s =, also satisfies the double shuffle relations. Theorem 7. (Rough statement. There is a realization of the double shuffle relations in Q[[q]] 0 which in the region corresponding to absolute convergence agrees with (7 and (5 and sends P r,s to Z r (qz s (q β r β s for r, s >.

6 HERBERT GANGL, MASANOBU KANEKO, AND DON ZAGIER If we now use (8 with the extended definition of Z r,s (q to define the double Eisenstein series G r,s (τ in the previously undefined cases r =, r =, and s =, then we find that there is also a realization {Z r,s, P r,s, Z } of the double shuffle relations in the space of holomorphic functions in the upper half-plane which maps Z r,s to G r,s (τ for r 3, s, P r,s to G r (τg s (τ for r, s > and maps Z to G (τ for all >. Remar. Some of the ideas developed in this paper were already mentioned, in a very preliminary form, in [5] and [6]. The discovery that there are unexpected relations among ζ(od, od s starting in weight originated with a question posed by T. Terasoma about the linear independence, modulo π, of ζ(r, s with r, s > odd, r + s =. We also mention that there is a related phenomenon for the stable derivation algebra of Y. Ihara [4] inside the Lie algebra of derivations of the free Lie algebra on two generators. The recent paper of L. Schneps [4] should have a close connection to our present wor. Also related are several results of A.B. Goncharov, who defined a coproduct structure on (formal multiple zeta values in [6] and described relations between double zeta values and the cohomology of PSL (Z in [5]. Acnowledgements. H. G. and M. K. gratefully acnowledge financial support by the Collège de France (Paris and the Max-Planc-Institut (Bonn. M. K. was partially supported by the Ministry of Education, Science, Sports and Culture, Grant-in-Aid for Scientific Research (B, 534004, 003 005.. The formal double zeta space We begin by discussing the double shuffle relations (3. The first follows from the obvious decomposition of lattice points in N N into the three disjoint subsets {(m, n m > n}, {(m, n m < n} and {(m, n m = n}, giving the identity ( m>n + m<n + m=n m r n s = m m r n n s, which is precisely the first equation in (3. For the second, we can use the partial fraction expansion m i n j = [ ( r ( r ] i (m + n r n s + j (m + n r m s (i + j =. (9 (Proof: Compute the poles of both sides as rational functions of n, with m fixed. In the formal setting, it is convenient to extend the set of generators and relations in (3 slightly by including the case r = (in the case of double zeta values, this would give a non-convergent series: we introduce formal variables Z r,s, P r,s and Z and impose the relations Z r,s + Z s,r = P r,s Z (r + s =, [( ( ] r r + Z r,s = P i,j (i + j =. i j (From now on, whenever we write r + s = or i + j = without comment, it is assumed that the variables are integers. The formal double zeta space is now defined as the Q-vector space D = {Q-linear combinations of formal symbols Z r,s, P r,s, Z } relations (0. (0

DOUBLE ZETA VALUES AND MODULAR FORMS 7 Alternatively, since Eqs. (0 express the P i,j in terms of the Z r,s, we can define D as D = {Q-linear combinations of formal symbols Z r,s, Z } relation (, ( where relation ( is given by taing the difference of Eqs. (0: [( ( ] r r + Z r,s = Z i,j + Z j,i + Z (i + j =. ( i j Of course, since both sides of ( are symmetric in i and j, it is enough to tae ( for i j. We thus have (for even generators and / relations, so dim D ( even. (3 (We will see below that in fact equality holds. points D (A of D for any Q-vector space A by Finally, we define the A-valued D (A = Hom Q (D, A = {(Z r,s, Z A, satisfying (} ; this can also be represented as the set of ( -tuples (Z r,s, P r,s, Z satisfying (0, and we will use both forms. An element of D (A will be called a realization of the double zeta space in A. For example, with A = R and any κ R we have an R-realization of D (for > given by { ζ(r, s, if r >, Z r,s κ, if r =, { ζ(rζ(s, if r, s >, (4 P r,s κ + ζ(, + ζ(, if r = or s =, Z ζ(. Here we could also treat κ as a variable and consider this as a realization in R+Q κ or R[κ]. We now introduce two convenient ways to wor with D. The first is by generating functions. Let (Z r,s, P r,s, Z D (A be a realization of D in A. Then we can see easily that the identities (0 are equivalent to the relations Z (X, Y + Z (Y, X = P (X, Y Z X Y, X Y Z (X + Y, Y + Z (X + Y, X = P (X, Y for the generating functions Z (X, Y = Z r,s X r Y s, P (X, Y = P r,s X r Y s of the Z r,s and P r,s, respectively, in A[X, Y ]. (Equations (0 just express the equality of the coefficient of X r Y s in (5. Similarly, ( is equivalent to the single relation Z (X + Y, Y + Z (X + Y, X Z (X, Y Z (Y, X = Z X Y (6 X Y for the polynomial Z. As an example of the use of these equations, we will prove the first two identities mentioned in the Introduction, namely the fact that all Z r,s s are combinations of P r,s s and of Z if is odd, and the separate even and odd sum formulas as given in Theorem if is even. For the first, we can wor with (0 with the (5

8 HERBERT GANGL, MASANOBU KANEKO, AND DON ZAGIER right-hand sides both replaced by 0 (because we want to wor modulo all P r,s s and Z. Then (5 become simply Z (X, Y + Z (Y, X = 0 and Z (X + Y, Y + Z (X + Y, X = 0. Rewriting the latter equation as Z (X, Y + Z (X, X Y = 0, we see that Z is anti-invariant under the two involutions ε : (X, Y (Y, X and τ : (X, Y (X, X Y. Since (ετ 3 maps (X, Y to ( X, Y and Z is homogeneous of degree, these two relations imply Z (X, Y = ( Z (X, Y, so Z = 0 if is odd, proving the first identity. (One can refine this proof to give an explicit formula for Z (X, Y as A(X, Y A(X, X Y + A(Y, Y X, where A(X, Y = r P r,sx r Y s Z X Y X Y. For Theorem, it suffices to apply (6 with (X, Y = (, 0 and (,. This gives (for even Z (, Z (0, = Z, Z (, Z (0, = Z, and Theorem follows by adding and subtracting the equations. We remar that it is occasionally convenient to wor with the infinite product D = D consisting of collections of numbers { {Z r,s } r,s, {P r,s } r,s, {Z } } satisfying (0 for all. Then the corresponding generating functions Z(X, Y, P(X, Y and z(t = Z T satisfy z(x z(y Z(X, Y + Z(Y, X = P(X, Y, X Y Z(X + Y, Y + Z(X + Y, Y = P(X, Y, and similarly for (6. For example, the reader may want to verify that the function Z(X, Y Z(0, Y is equal to m>n>0 X/m(m X(n Y for the realization (4 and to use this to verify the -less version of (6 directly for this generating function. (The calculation which requires some wor gives the result only up to an additive constant, corresponding to the fact that (4 holds only for >. The following proposition, which will be used in Section 7, gives some easy solutions of relations (6 (with Z = 0. Proposition. Let A(X, Y V be a polynomial which is even with respect to Y. Then the function (7 Z (X, Y = A(X, Y A(X, X Y + A(Y, Y X (8 gives a realization of Equation (6 with Z = 0. Proof. One checs by direct calculation that if Z (X, Y is defined by (8 then both Z (X, Y +Z (Y, X and Z (X+Y, Y +Z (X+Y, X equal A(X, Y +A(Y, X. Note that the assertion of the proposition also holds if A(X, Y = A(Y, X or if A is anti-symmetric (with P 0 in the latter case. Corollary. Let 0 < p < be two integers with p odd. Then the numbers Z r,s = C p r,s (r + s = with C p r,s defined by Equation (3 satisfy ( with Z = 0. Proof. This is simply Proposition applied to A(X, Y = X p Y p. corresponding numbers P r,s in (0 are equal to δ r,p + δ s,p. The The second way of woring with D is by studying the relations among the Z r,s (or Z r,s, P r,s and Z. The following result gives a useful description of them. We introduce the notation V = X r Y s r + s =, V = r + s =. m r n s We define an isomorphism V V by F (X, Y = ( r f r,s X r Y s F (m, n = f r,s m r n s.

DOUBLE ZETA VALUES AND MODULAR FORMS 9 Then we have the following Lemma. Let F, G, H V and F, G, H the corresponding elements of V. Then the following two statements are equivalent: (i H (m, n = F (m + n, n + G (m, m + n, (ii F (X, Y = H(X, X + Y, G(X, Y = H(X + Y, Y. Proof. Equation (9 implies that any element h V can be decomposed as f(m + n, n + g(m, m + n for some f and g in V, and this decomposition is obviously unique since f(, x has poles only at x = 0 and g( x, only at x =. If f = F etc., then an inspection of (9 shows that the coefficients f r,s and g r,s of F and G are related to the coefficients h r,s of H by f r,s = ( r h i,j, g r,s = ( s h i,j. i j i+j= ( r i+j= Using the binomial coefficient identity ( ( r i = j we find that these formulas are equivalent to (ii. ( j s (r + s = i + j =, Proposition. Let a r,s and λ be rational numbers. Then the following three statements are equivalent: (i The relation holds in D. (ii The generating function A(X, Y = a r,s Z r,s = λz (9 ( a r,s X r Y s V (30 r can be written as H(X, X +Y H(X, Y for some symmetric homogeneous polynomial H Q[X, Y ] of degree, and (iii The generating function λ = A (m, n = 0 H(t, tdt. (3 a r,s m r n s V (3 can be written as f(m, n f(m + n, m f(m + n, n for some f V, and λ = f(, A (, = f(,. (33 Proof. If we choose the symmetric polynomial H(X, Y = X m Y n +X n Y m and use the binomial theorem to compute the a r,s in (30 and the beta integral to compute λ = (m!(n!/(!, then we find that (9 reduces to (. Since these H s span the space of symmetric polynomials in V, this proves the equivalence of the first two statements. The equivalence of (ii and (iii follows by applying the lemma with F = A + H, G(X, Y = F (Y, X and f = F, g(m, n = f(n, m. To chec that the values of λ in (ii and (iii agree, we again use the beta integral 0 tr ( t s dt = (r!(s! (! to get ( 0 H(t, tdt = h r,s = H (,. Remar. We can also write (3 as λ = hr,s, where H = ( r hr,s X r Y s.

0 HERBERT GANGL, MASANOBU KANEKO, AND DON ZAGIER The two approaches outlined above are equivalent by a duality which we will discuss below, but it is very convenient to have both. As an example of the use of the proposition, we give a second quic proof of Theorem from the Introduction. Taing H = X + Y in the proposition gives a r,s = (r, a, = 0, λ =, while taing H = (X Y gives a r,s = ( r (r, a, = 0, λ =. Again adding and subtracting the two relations thus obtained gives (4. As a second example, we observe that Eq. ( contains no Z,, so that Z, is a free variable (as we already saw in the realization (4. Thus a, must vanish in any relation of the form (9, and we can also see this in the proposition by setting X = 0. 3. Using the action of PGL (Z We have already repeatedly used the space V of homogeneous polynomials of degree in X and Y. We now mae this approach more systematic by exploiting two further structures on V : the action of the group Γ = PGL (Z and the Γ- invariant scalar product. The former ( is defined in the obvious way by (F γ(x, Y = a b F (ax + by, cx + dy for γ = (we suppose throughout that is even and c d the latter by X r Y s, X m Y n = ( r δ (r,s,(n,m (34 ( m for r, s, m, n, r + s = m + n =. The invariance property F γ, G γ = F, G is easily checed. We extend the action of Γ on V to an action of the group ring R = Z[Γ] by linearity. Then F ξ, G = F, G ξ, where ξ ξ is the antiautomorphism of R induced by γ γ. We occasionally wor with the model of V consisting of polynomials f(x of one variable of degree, corresponding to the homogeneous model via f(x = F (x,, F (X, Y = Y f(x/y. The group operation in this version taes the form (f γ(x = (cx + d f((ax + b/(cx + d. The group Γ contains distinguished elements. involutions ( 0 ε =, δ = 0 First there are the commuting, ( 0 0 sending F (X, Y to F (Y, X and F ( X, Y, respectively. The (±-eigenspaces of ε will be denoted by V ± and the (±-eigenspaces of δ by V ev and V od; we also write V +,ev for the space of even symmetric polynomials and similarly for the other three double eigenspaces of dimension /4 + O(. In PSL (Z, we have the elements ( ( ( ( 0 0 S =, U =, T = US =, T 0 0 0 = U S =, with the relations S = U 3 =, S = εδ and εuε = U, εt ε = T, δt δ = T, ST S = T. We will also consider various special elements of the group ring Z[Γ]. First, we have the projections π + = (ε +, πod = ( δ, π+,od = π + π od = π od π +, etc. onto V +, V od, V +,od etc. Next, we have the element = (T (ε + which by (6 essentially characterizes D (Q: the codimension subspace D 0 (Q of realizations with Z = 0 is identified precisely with Ker(, and the full space

DOUBLE ZETA VALUES AND MODULAR FORMS D (Q corresponds to the space of Z V such that Z Q X Y X Y. We can now interpret part of Proposition ( the equivalence of (i and (ii when Z = 0 as the dual statement of this with respect to the non-degenerate scalar product (34: a relation (9 with Z = 0 is reformulated equivalently as A S, Z = 0 (compare equations (9 and (30; the extra S comes from the interchange of r and s and the sign ( r in (34. So this holds for all Z Ker( if and only if A S Im( = V + (T, i.e., if and only if A V + (T S = V + (T (for the last step, use V + = V + S and ST S = T, and this is just (ii of Proposition. Finally, we have the element It is related to the above elements by as we see by the calculation Λ = εu + U Z[Γ]. (35 Λ = 4 π od π + Z[Γ], (36 Λ (T = ( εu + U (US = [ U(ε ] S + U (ε = δ + (δ US + U (ε, followed by multiplying both sides on the right by ε +. It follows from (36 that for any polynomial A V which is even or antisymmetric or S-invariant, the coefficients of A Λ give a realization of D with Z = 0 by taing Z = A Λ and P = π + (A. This is equivalent to Proposition and the remar in its proof. As an example for how to wor with the structures just introduced, we prove Eq. (5 from the Introduction. To do this, we define ( B m,n (X, Y = Y B n (X/Y (m + n =, (37 m where B ν (x = ν ( ν µ=0 µ Bµ x ν µ denotes the νth Bernoulli polynomial. The numbers λ m,n (r, s defined in (6 are the coefficients of the generating series ( λ m,n (r, s X r Y s = B m,n (X, X + Y. (38 r The symmetry λ m,n (r, s = ( m λ m,n (s, r mentioned for m odd in the Introduction follows from this formula together with the standard property B ν ( x = ( ν B ν (x of Bernoulli polynomials (cf., e.g., []. Set a r,s = ( s [( s m Bs m λ m,n (r, s ]. Then we see that the polynomial (30 has the form H(X, X + Y H(X, Y with H(X, Y = B m,n (X, X Y. The symmetry property just mentioned implies that H(X, Y = H(Y, X, so we can apply Proposition to get (9 with λ = ( s λ m,n (r, s. This is (5. 4. Representing even double zeta values in terms of odd ones In this section, we prove Theorem. Since we already now that dim D / (cf. (3, we have only to show that any Z ev,ev is a linear combination of Z od,od s. This means that any collection of numbers {a r,s r + s = ; r, s even} can be completed to a collection {a r,s (r + s =, λ} satisfying (9 in D. By Proposition, this is equivalent to showing that any polynomial F V od is the odd part of a polynomial of the form H (T with H V + (since then F ε is the odd part of H (T. Thus the result to be proved is:

HERBERT GANGL, MASANOBU KANEKO, AND DON ZAGIER Proposition 3. The space V ( > even has the decomposition V ev + V + (T = V. Proof. Here it is more convenient to use the -variable model. Let V T δ V be the fixed point set of the involution g(x g( x. Then we have the following commutative diagram with the top row exact: 0 Q V T δ +ε εu T V od 0 π od V + T V + (T To see the exactness, we first observe that if g V T δ then the polynomial f = g (T is odd because, from T δt = δ and g T δ = g, we deduce g T = g δ. Conversely, an odd polynomial f(x V has degree 3 (since is even and hence can be written as g(x+ g(x for some g V. But then g (T δ ( T = g (T ( + δ = f ( + δ = 0, so g (T δ is a constant and hence zero since it vanishes at x = /. (One can also argue that the map V T δ T /Q V od which is obviously injective, must be an isomorphism because both sides have dimension /. It is clear that the ernel of V T δ T V od is Q. Next, we have to show that h = g ( + ε εu is symmetric for g V T δ. This follows from εuε = U = T SU and thus g εuε = g T SU = g δsu = g εu. The commutativity of the square now follows from the calculation (ε εu(t = (εt ε( + δ + ( T δε( UT, which implies that (h g (T = g (ε εu(t = g (εt ε( + δ which vanishes under ( δ. It follows from the diagram that the map π od : V + od (T V is surjective which is equivalent to the statement of the theorem. The proposition and its proof give us an explicit way to realize the asserted decomposition by starting with any basis of V T δ. To obtain a relation (9 with prescribed values a r,s = f r,s for r and s even, we write the generating function f ε V od as g (T with g V T δ, then A = g (+ε εu(t ε = g (+ε εu(t has odd part f and belongs to V + (T, so that Proposition applies. To obtain explicit relations of this decomposition, we can choose any basis of the space of functions symmetric about x = /. In particular, from the three bases g(x = (x ν, (x x ν and B ν (x, where 0 ν ( /, we get three explicit collections of relations. For the first one, suitably normalized, we find that the coefficients of the associated relation (9 are a r,s = λ = ( r s ν ( s r + ( r ( α ν α α+β=ν ( [ ( 3t ν t ν dt ν 0 ( s β ], (ν + (r, s even (r, s odd,

DOUBLE ZETA VALUES AND MODULAR FORMS 3 Table : Relations among double zeta values of weight coming from Proposition 4. Each row of the table gives a relation among the (formal double zeta values displayed in the top line. The Z, column has been omitted, since all of its entries would be zero. ( 0 0 0 ( 8 0 ( 6 0 ( 4 ( 0 0 ( 0 8 ( 0 6 ( 0 4 ( 0 ( 0 0 ( 0 8 ( 0 6 ( 0 4 g(x Z,0 Z 4,8 Z 6,6 Z 8,4 Z 0, Z Z 3,9 Z 5,7 Z 7,5 Z 9,3 Z, (x 0 5 8 3 8 355 4 3 5 63 55 03 (x 8 0 384 30 68 7 4 99 43 387 43 45 (x 6 9 0 0 60 80 5 94 38 6 4 0 (x 4 5 0 0 0 56 68 6 4 4 0 0 ( (x 0 0 0 0 8 7 4 9 3 3 3 45 (x x 5 6 6 0 0 33 504 0 0 4 9 (x x 4 0 0 3 7 0 0 4 0 0 5 7 0 (x x 3 7 0 0 5 0 0 4 0 0 0 (x x 0 0 0 7 0 4 8 3 0 8 30 3 0 (x 9 x 0 0 0 0 9 4 8 4 4 6 0 B0 (x 0 0 0 0 767 5 55 33 4 65 3 3 4 65 6 33 B8 (x 0 3 0 0 0 6 3 5 3 3 B6 (x 0 0 5 0 0 4 6 3 0 8 5 3 0 3 B4 (x 0 0 0 7 0 6 4 4 6 4 3 4 ( 0 B (x 0 0 0 0 9 3 9 9 5 0 0 0 0 0 4 and for the second basis we find ( a r,s = r ( ν s ν ( ( ν ( ν r ν ν r ν (r, s even, (r, s odd (we omit the value of λ in this case. In both cases, the coefficients for r, s even form a triangular matrix. The third family g(x = B ν (x yields Eq. (7, as the reader can chec as an exercise by imitating the proof of Eq. (5 which was given at the end of Section 3. The following table gives these three collections of relations for the case =. 5. Double zeta values and period polynomials In this section we describe various connections between period polynomials and the (formal double zeta space, and prove Theorems 3 and 4. Since both of them involve period polynomials, we begin by reviewing these. The definition of period polynomials was already given briefly in the Introduction. The motivation comes from the connection with modular forms, which we will review in the next section. Here we discuss only the algebraic properties. The space W is defined as W = Ker( + S Ker( + U + U V, (39 i.e. as the intersection of the ( -eigenspace of the involution S and the sum of the ( ± 3 -eigenspaces of the element U of order 3. Since εsε = S and

4 HERBERT GANGL, MASANOBU KANEKO, AND DON ZAGIER εuε = U, the involution ε acts on W and splits it as the direct sum of subspaces W ± = W V ± of symmetric and antisymmetric polynomials. Since elements in W are also ( -eigenfunctions of S and since Sε = εs = δ, we also have W + = W od V +,od and W = W ev V,ev. Another important property of period polynomials is given by the following lemma. Lemma. Let > be even. Then and W = Ker ( T T, V W ± = Ker ( T T ε, V. Proof. It is equivalent for f V to be in Ker( T T or to satisfy f ( + S = f (+U +U, since ( T T S = (+S (+U +U. But a polynomial which is fixed by both S and U is fixed by the full modular group and thus vanishes. The second assertion of the lemma follows from the first, because if f is annihilated by T T = T εt ε and f ε = ±f then f is also annihilated by T T ε, and conversely if f is annihilated by T T ε then f = f T ( ± ε V ± and hence f ( T T = f ( T εt ε = 0. Remar. The operator L = T T plays a ey role in the discovery by J. Lewis that there are holomorphic functions annihilated by this operator which have the same relation to the so-called Maass wave forms as period polynomials have to holomorphic modular forms ([0], []. We call the equation f L = 0 the Lewis equation. As in the Introduction, we denote by P the subspace of D spanned by the P r,s (and Z, but it can be omitted by virtue of Theorem, and by P ev the subspace spanned by the P ev,ev. Note that P ev corresponds to generating functions in V od because of the shift by in the exponents of X and Y. Theorem 3. The spaces P ev and W are canonically isomorphic to each other. More precisely, to each p W we associate the coefficients p r,s and q r,s (r + s = which are defined by p(x, Y = ( r pr,s X r Y s and p(x + Y, Y = ( r qr,s X r Y s. Then q r,s q s,r = p r,s (in particular q r,s = q s,r for r, s even and q r,s Z r,s 3 q r,s Z r,s (mod Z, (40 r, s even r, s odd and conversely, an element r, s odd c r,sz r,s D c r,s = q r,s arising in this way. belongs to P ev if and only if Remars.. The equivalence of the first and last statements of the theorem follows from Theorem : since the Z od,od form a basis of D, it is equivalent to spea of elements of P ev or of relations of the form ( P ev,ev = ( Z od,od.. Since the double zeta realizations ζ(rζ(s and ζ( of P r,s (r, s even and Z are rational multiples of π, and since π is a Q-linear combination of Z od,od s by Theorem, Theorem 3 as stated here contains the rough statement given in the Introduction. (The number of relations drops from dim W = dim M to dim S = dim M because one relation gets used up to eliminate ζ(. Example. For every even >, the space W contains the polynomial p(x = x (in the inhomogeneous notation. Here p(x + = ( r r x r, i.e., q, = 0 and all other q r,s are equal to, and Theorem 3 reduces to a weaer version of Theorem.

DOUBLE ZETA VALUES AND MODULAR FORMS 5 Table : The coefficients p r,s and q r,s for p(x = x (x 3 W. r 3 4 5 6 7 8 9 0 s 0 9 8 7 6 5 4 3 60 p r,s 0 0 8 0 8 0 8 0 8 0 0 60 q r,s 0 0 0 84 68 90 50 84 8 0 0 Example. The space W is -dimensional, spanned by the two polynomials p(x = x 0 and x (x 3. For the latter, we have p(x+ = x 8 +8x 7 +5x 6 + 38x 5 +8x 4 +8x 3, so the p r,s and q r,s of the theorem are given (after multiplication by 60 by the table The q r,s with r and s even (underlined are symmetric and the relation (40, divided by 3, becomes 8Z 8,4 + 90 3 Z 6,6 + 8Z 4,8 8Z 9,3 + 50Z 7,5 + 68Z 5,7 (mod Z, in agreement with Eq. (9 of the Introduction. The example for = 6 given there arises in the same way from the polynomial p(x = x (x 3 (x 4 x + W6. Proof. The function q = p T satisfies q ( ε = p (T T ε = p (T + εt ε = p because p is antisymmetric and satisfies the Lewis equation. This shows that q r,s q s,r = p r,s and also means that if we decompose q in the obvious way as q = q ev,+ + q ev, + q od,+ + q od,, then q ev, = p and qod, = 0. Write [a, b, c] to denote aq ev,+ + bq ev, + cq od,+. Then and hence [0,, 0] T [,, ] T = p T = p εt ε = p T ε = q ε = [,, ], = q ST = p T ST = p S = p = [0,, 0], [, 0, ] (T = [, 3, ] [, 0, ] = [ 3, 3, ]. This says that q od 3q ev is the image under T of the symmetric polynomial ( q ev,+ q od,+, so Proposition implies Eq. (40. (The omitted coefficient of Z in (40 is easily determined, by computing the integral in (3, as ( r q r,s. This gives a map W Pev which is obviously injective since the antisymmetrization of the coefficients on the right-hand side of (40 are the coefficients of p itself. We omit the proof of surjectivity since it will follow from Theorem 4 below that dim W = dim Pev, so that injectivity suffices. Theorem 3 tells us that, given any relation of the form (9 in D with a r,s = a s,r for r and s even, there exists a unique element p W such that the a r,s with r, s odd are equal to the numbers q r,s in the theorem. On the other hand, given an element of P ev, its representation as a linear relation of the generators P ev,ev is not unique, because these generators are not linearly independent. The next proposition, which is a first form of Theorem 4 of the Introduction, describes the relations among them, i.e., all relations of the form (9 with a r,s = a s,r for all r and s. It turns out that in any such relation the odd-index a r,s all vanish (in accordance with the widely believed and numerically verified statement that there are no relations over Q among products of values of the Riemann zeta function at odd arguments, while the even-index a r,s are related to the space W +. Proposition 4. Let a r,s and µ be numbers with a s,r = a r,s. Then the following three statements are equivalent: (i The relation a r,s P r,s = µz (4 holds in D.

6 HERBERT GANGL, MASANOBU KANEKO, AND DON ZAGIER (ii The generating function A(X, Y = ( a r,s X r Y s V (4 r can be written as A = H ( S for some H V U V +, and (iii The generating function µ = H, X Y. (43 X Y A (m, n = a r,s m r n s V (44 can be written as f(m, n f(m + n, n f(m, m + n for some symmetric f V, and µ = f(,. (45 If these statements hold, then a r,s = 0 for odd r and s. Proof. Except for the assertions about the value of the constant µ, which we will leave to the reader, each part of this proposition is equivalent to the corresponding part of Proposition of Section with the extra condition that a r,s = a s,r. For (i this is obvious; for (iii it follows because f(m, n A (m, n = f(m + n, m + f(m + n, n in Proposition is always symmetric, so that A is symmetric if and only if f is; and for (ii it follows because if H V +, then the element H (T is symmetric if and only if H = H U (because H T (ε = H ( UT, in which case H (T = H (T = H (S. The last assertion of the proposition is then clear since A V + od ( S V. Proposition 5, without the statements concerning µ, says that the following are equivalent for symmetric A : (i a r,s P r,s 0 (mod Z ; (ii A V U,+ ( S ; (iii A V ( T T (in the obvious notation. Statement (ii in turn is equivalent to (iv A V od and A W + because for v V we have: with respect to the scalar product (34, v is orthogonal to (V U,+ ( S = V ( + U + U ( + ε( S v ( S( + ε( + U + U = 0 v ( S( + ε Ker( + U + U Ker( + S Ker( ε = W + v W + + V + V ev. On the other hand, (i is equivalent to the condition that a r,s P r,s = 0 for any realization {Z r,s, P r,s, Z } of D with Z = 0, while (iv says that all a od,od are zero and a r,s p r,s = 0 for any p = p r,s X r Y s W +. The equivalence of (i and (iv therefore says that any symmetric collection of numbers {P r,s (r, s odd} can be extended to a realization of D with Z = 0, while a symmetric collection of numbers {P r,s (r, s even} can be extended to a realization of D with Z = 0 if and only if the corresponding generating function belongs to W +. This is precisely the statement of Theorem 4 as given in the Introduction in the case Z = 0. (For the full statement we need the extended period polynomial space Ŵ +, which will be discussed in the next section in the context of modular forms. Before doing that, we give a slight improvement of the result just stated. This result says that,

DOUBLE ZETA VALUES AND MODULAR FORMS 7 if P is any polynomial belonging to V +,ev or to W + V +,od, then there exists a Z V with Z ( + ε = P, Z T ( + ε = P. (46 The following proposition maes this explicit: Proposition 5. (i Let P V +,ev. Then Z = P Λ with Λ as in (35 gives a solution of Eqs. (46. (ii Let P W +. Then Z = 3 P (T + gives a solution of Eqs. (46. Proof. (i The identities Λ( + ε = + ε and ΛT = S + T ( ε together with P ε = P δ = P immediately imply (46. (ii By Lemma 5, P satisfies the Lewis equation P ( T T = 0. Hence, using P ε = P and P δ = P, we find P ( + T ( + ε = P + P (T δt ε = 3P + P ( T T T = 3P and P (T + T ( + ε = P + P (T + T = 3P. 6. Double zeta values and modular forms In this section we reinterpret the results of Section 5 from the modular point of view. To do this, we begin by reviewing the theory of period polynomials of modular forms on PSL (Z, including various supplementary results which are less well-nown and which are needed here. The period polynomial associated to a cusp form f S can be defined by P f (X, Y = 0 (X Y τ f(τ dτ. (47 The identity (ax + by (cx + dy τ = (a cτ(x γ (τy together with the modularity of f shows that γ ( P f γ(x, Y = γ (0 ( a b for any γ = Γ c d. In particular, ( P f ( T T = 0 (X Y τ f(τ dτ 0 (X Y τ f(τ dτ = 0, so P f W by Lemma 5. (One can also verify that P f (+S = P f (+U +U = 0 directly by a similar calculation. This gives the basic connection between cusp forms and period polynomials. The complete Eichler-Shimura-Manin theory, of which summaries can be found in many places (e.g. [9], tells us that the maps assigning to f the symmetric (odd and antisymmetric (even parts P + f and P f of P f give isomorphisms from the space S of cusp forms of weight on Γ onto W + and a codimension subspace of W. (The latter was determined in [8]. The theory is also defined over Q in the sense that the even and odd period polynomials of a normalized Hece eigenform f are proportional to polynomials with coefficients in the number field generated by the Fourier coefficients of f, and transform properly under Gal(Q/Q. For instance, the even and odd period polynomials of the modular form S, in the inhomogeneous version, are multiples of 36 69 (x0 x (x 3 and x(x (x 4(4x, respectively.

8 HERBERT GANGL, MASANOBU KANEKO, AND DON ZAGIER For f M with f( = a 0 0 the integral (47 diverges, but the modified definition [7] τ0 P f (X, Y = (X Y τ ( f(τ a 0 τ dτ (X Y τ ( f(τ a 0 dτ + τ 0 + a ( 0 Y τ 0 X 0 (X Y τ 0 (any τ 0 H maes sense, since the integrals converge and the derivative of the right-hand side with respect to τ 0 vanishes. We call this function the extended period polynomial of f. For instance, the extended period polynomial of the Eisenstein series G is πi ζ( (X Y (πi r=0 B r r! B r ( r! Xr Y r, where B r denotes the rth Bernoulli number. The function P f (X, Y no longer lies in V, but in the larger space V = C X r Y s. On the other hand, the same calculation as before shows r,s 0 that it is again annihilated by + S, + U + U and T T. (The group Γ does not act on V, but it acts on the larger space C(X, Y of rational functions in two variables, so this statement maes sense. In other words, Pf belongs to the space Ŵ := Ker( + S, V Ker( + U + U, V = Ker( T T, V, which we call the space of extended period polynomials. This space fits into a short exact sequence 0 W Ŵ λ C 0, where λ associates to P Ŵ the coefficient of X /Y in P. By writing P = P + λ( P (X /Y + Y /X we can identify Ŵ with the space Ŵ = { (P, λ V C P ( + S = P ( + U + U + λφ = 0 }, where Φ V is the polynomial ( X Φ (X, Y = + Y ( + U + U Y X ( X (X Y = + Y + (X Y + X Y. Y X X Y Corresponding to the canonical splitting M = S C G of modular forms into cusp forms and Eisenstein series, we have the splittings Ŵ = W C Ê, Ŵ = W C (E, B, where Ê (X, Y = ( ( Y B r B s X r Y s = E (X, Y + B r X r, s 0 + X. Y The space Ŵ also splits into symmetric and antisymmetric parts Ŵ ±. Since Ê is symmetric, we have Ŵ + = W + C Ê and Ŵ = W. If f and g are two cusp forms, then the Petersson scalar product (f, g is proportional to the pairing P + f (T T, Pg ([7], [8]. (It is essential that odd period polynomials are always paired with even ones, because if f is a normalized Hece eigenform then the coefficients of P ± f belong to ω ±(f Q[X, Y ] with some constants ω ± (f whose product is essentially (f, f. But the straight pairing P + f, P g would vanish since P + f and Pg have opposite symmetry properties and also opposite parity. The effect of (T T is to change odd polynomials to even ones (48

DOUBLE ZETA VALUES AND MODULAR FORMS 9 and vice versa. This pairing is Hece invariant, but we do not explain this here since we have not discussed the action of Hece operators on period polynomials. It extends in a natural way to a pairing Ŵ + W C in such a way that the codimension subspace of W corresponding to period polynomials of cusp forms is precisely the space of polynomials whose pairing with Ê vanishes (cf. [8]. The result (Theorem 9 of [8] is { {P f f S ( } = r ar,s X r Y s } ( r κr,s a r,s = 0, where κ r,s = κ s,r = 0<j j even + ( j r [ ( r + ( j B j B j ( r ( r ( B r B s r ] B. Equation (49 says that κ r,s is essentially the coefficient of X r Y s in E T with E as in (48. We can now easily extend Proposition 5 (and hence the preliminary version of Theorem 4 mentioned in the previous section to a statement concerning Ŵ + rather than W +. Given any symmetric extended period polynomial P = p r,s X r Y s Ŵ + r,s 0 r, s even (so that λ( P = p,0 = p 0,, there is a realization of P with { p r,s (r, s even, P r,s Z p,0. 0 (r, s odd, To see this, instead of going through the whole proof of Proposition 5, eeping careful trac of the constant µ, it is sufficient (since W + has codimension in Ŵ + to chec this for one single extended polynomial P which belongs to Ŵ + but not to W +, and this is easy: the function Ê defined in (48 has coefficients P r,s = ( r Br B s = 4! β r β s, where β r = ζ(r/(πi r = B r /r! (r 0 even, and on the other hand the original double zeta realization of D has P r,s = ζ(rζ(s = (πi β r β s and Z = ζ( = (πi β. This completes the discussion of extended period polynomials and the proof of Theorem 4. We also mention the corresponding extension of Proposition 5: Supplement to Proposition 5: If P = P + λ(x Y + X Y, then we have a solution of (5 with P = P and Z = λ given by Z = 3 P (T + + λ X 6 Y U ( + ε(5 3U + Uε = 3 P (T + + λ X Y 6 X Y (5 3U + Uε. Proof. Just apply the calculation of the proof of part (ii of Proposition 5 to P with Ẑ = P 3 ( + T since P satisfies the same relations as P. If we apply this result to the special case P = 4! Ê then we find that, as well as the Euler realization of D in R with Z E r,s = ζ(r, s, P E r,s = ζ(rζ(s and Z E = ζ(, we also have a Bernoulli realization with Z B = β, P B r,s = β r β s (so (49

0 HERBERT GANGL, MASANOBU KANEKO, AND DON ZAGIER that Pr,s B = 0 if r and s are odd and >, and with Zr,s B equal to the double Bernoulli number Zr,s B = ( m β m β n + β rβ s β [ ( ( ] 5 + 3 3 r 3 r r m+n= = ( m β m β n β [ ( ( ] + + 3 r r r 3 (β rβ s β. m+n= m,n 0 These are almost exactly the same as the numbers (49 occurring in the result on the image of cusp forms in W cited above! Also note that the number ZB r,s is essentially the double zeta value ζ( r, s/(r!(s! at negative arguments, which has been studied in []. The Bernoulli realization has the same even-index p r,s as the Euler realization (up to a factor (πi, but 0 instead of the presumably transcendental values ζ(odζ(od/(πi ; the fact that both the original P r,s and the new ones can be realized in D is an illustration of the fact that the numbers P od,od in P are completely unconstrained (this corresponds to the vanishing of a od,od in Proposition 5. The results of Section 5 were formulated purely algebraically, but we can now easily relate them to the theory of modular forms. A result of Ranin ([3]; see also [8] says that, for a normalized Hece eigenform f S, one has (f, G r G s X r Y s = c f P + f (X, Y, r, s even where c f is essentially P f (, 0. In particular, the cusp forms G r G s βrβs β G satisfy the same linear relations as elements of W +, so they form the coefficients of an element G (τ; X, Y = ( G r (τg s (τ β rβ s G (τ X r Y s W + β S. r, s But then setting Ĝ = G B G (τê gives the much simpler statement that the polynomial Ĝ (τ; X, Y = G r (τg s (τx r Y s belongs to Ŵ + r, s 0 M. Theorem 5 follows immediately. 7. Double Eisenstein series At the end of the last section, we applied Ranin s result relating products of Eisenstein series to period polynomials of cusp forms to show that there is a realization of the double shuffle space D sending P r,s to G r (τg s (τ and Z to G (τ for all r, s and. This proof, however, is very indirect, and in view of the simplicity of the final statement one would expect that there should be a simpler and more natural argument. This is indeed the case, as we now explain. This alternative approach also leads directly to the double Eisenstein series which are the final topic of this paper. We present the argument in a more abstract form than we will use here. Let A be a discrete subgroup of C (or possibly some more general commutative topological field which is totally ordered, i.e. can be decomposed into a disjoint union A + {0} ( A + with A + closed under addition. For m, n A we write n 0 to mean n A + and m n to mean m n 0. Then we claim that, at least in the range

DOUBLE ZETA VALUES AND MODULAR FORMS for which the sums Z(r = m 0 m r converge, the numbers P r,s = Z(rZ(s and Z = Z( give a realization of P. Indeed, by (iii of Proposition we can write A (m, n = f(m, n f(m + n, n f(m, m + n for some f V C and a r,s Z(rZ(s = ( A (m, n = f(m, n m, n 0 = m = n 0 m, n 0 A (m, n = f(, Z(. m n 0 n m 0 Applying this concept to the special case where A = Zτ + Z for some number τ in the upper half plane H with the ordering described in the Introduction, we find that indeed the functions {G r (τ G s (τ, G (τ} give a realization of the {P r,s, Z }- part of D. The corresponding realization of the {Z r,s }-part is given by the double Eisenstein series G r,s as defined in the Introduction. In this section, we study these in more detail. A simple estimate shows that the series defining G converges absolutely if and only if > and the series defining G r,s if and only if s > and r >. Our first object is to compute the Fourier expansion of G r,s (Theorem 6. We begin by recalling the corresponding computation for G, which is of course well-nown. We define power series ϕ 0 (q and ϕ (q in Q[[q]] both actually polynomials of degree in q/( q by ϕ 0 (q = ( (! u= u q u, ϕ (q = δ, + ϕ 0 (q (. The Lipschitz formula says that (τ + a = (πi ϕ (q (50 a Z for τ H and all, where q = e πiτ as usual and where the sum on the lefthand side has to be interpreted as a Cauchy principal value if =. Applying this to G (τ with, where the summation in the non-absolutely convergent case = is to be carried out in the order, we find where G (τ = a>0 a + m>0 g (q = m>0 a Z (mτ + a ϕ 0 (q m = m, u>0 = ζ( + (πi g (q, ( u (! qmu. (5 The statement of Theorem 6, which we repeat here for convenience, was that the Fourier expansion of G r,s (τ in the convergent case is given by the analogous formula G r,s (τ = ζ(r, s + Cr,s p (πi h g h (qζ(p + (πi g r,s (q, (5 h+p= where = r + s, C p r,s is a simple numerical coefficient given by (3, and g r,s (q = m>n>0 ϕ 0 r(q m ϕ 0 s(q n = m>n>0 u, v>0 ( u r ( v s (r! (s! qmu+nv. (53 (The condition h > and p > in ( can be dropped, even though the definitions of ζ(p and g h (q are problematic in these cases, because Cr,s p vanishes when p = or p = r + s unless r =.