A Proof of Dirichlet s Theorem on Primes in Arithmetic Progressions

Similar documents
Dirichlet s Theorem. Martin Orr. August 21, The aim of this article is to prove Dirichlet s theorem on primes in arithmetic progressions:

Dirichlet s Theorem. Calvin Lin Zhiwei. August 18, 2007

1 Primes in arithmetic progressions

DIRICHLET S THEOREM ON PRIMES IN ARITHMETIC PROGRESSIONS. 1. Introduction

Math 259: Introduction to Analytic Number Theory Primes in arithmetic progressions: Dirichlet characters and L-functions

1, for s = σ + it where σ, t R and σ > 1

The Prime Number Theorem

Chapter 4. Characters and Gauss sums. 4.1 Characters on finite abelian groups

THE DENSITY OF PRIMES OF THE FORM a + km

Dirichlet s Theorem and Algebraic Number Fields. Pedro Sousa Vieira

18.785: Analytic Number Theory, MIT, spring 2006 (K.S. Kedlaya) Dirichlet series and arithmetic functions

Chapter 1. Introduction to prime number theory. 1.1 The Prime Number Theorem

ELEMENTARY PROOF OF DIRICHLET THEOREM

Primes in arithmetic progressions

NOTES ON DIOPHANTINE APPROXIMATION

The primitive root theorem

MATH-GA : Introduction to Number Theory I ALENA PIRUTKA

ANALYTIC NUMBER THEORY AND DIRICHLET S THEOREM

MATH 216T TOPICS IN NUMBER THEORY

VII.5. The Weierstrass Factorization Theorem

Theorem 1.1 (Prime Number Theorem, Hadamard, de la Vallée Poussin, 1896). let π(x) denote the number of primes x. Then x as x. log x.

The zeta function, L-functions, and irreducible polynomials

Chapter 8. P-adic numbers. 8.1 Absolute values

Lemma 1.1. The field K embeds as a subfield of Q(ζ D ).

Department of Mathematics, University of California, Berkeley. GRADUATE PRELIMINARY EXAMINATION, Part A Fall Semester 2016

Chapter 1. Introduction to prime number theory. 1.1 The Prime Number Theorem

Congruent Number Problem and Elliptic curves

Part II. Number Theory. Year

p-adic L-functions for Dirichlet characters

DIRICHLET S THEOREM ABOUT PRIMES IN ARITHMETIC PROGRESSIONS. Contents. 1. Dirichlet s theorem on arithmetic progressions

a = mq + r where 0 r m 1.

L-FUNCTIONS AND THE RIEMANN HYPOTHESIS. 1 n s

Class Field Theory. Anna Haensch. Spring 2012

Mathematics 324 Riemann Zeta Function August 5, 2005

TOPICS IN NUMBER THEORY - EXERCISE SHEET I. École Polytechnique Fédérale de Lausanne

MATH 216T TOPICS IN NUMBER THEORY

Considering our result for the sum and product of analytic functions, this means that for (a 0, a 1,..., a N ) C N+1, the polynomial.

Local Fields. Chapter Absolute Values and Discrete Valuations Definitions and Comments

Predictive criteria for the representation of primes by binary quadratic forms

Maximal Class Numbers of CM Number Fields

Definitions. Notations. Injective, Surjective and Bijective. Divides. Cartesian Product. Relations. Equivalence Relations

A PROBLEM ON THE CONJECTURE CONCERNING THE DISTRIBUTION OF GENERALIZED FERMAT PRIME NUMBERS (A NEW METHOD FOR THE SEARCH FOR LARGE PRIMES)

18 The analytic class number formula

LINNIK S THEOREM MATH 613E UBC

Prime Divisors of Palindromes

Balanced subgroups of the multiplicative group

A Curious Connection Between Fermat Numbers and Finite Groups

Standard forms for writing numbers

The Chinese Remainder Theorem

Bernoulli Numbers and their Applications

Summary Slides for MATH 342 June 25, 2018

CHARACTERS OF FINITE GROUPS.

Notes on Systems of Linear Congruences

Moreover this binary operation satisfies the following properties

Math 229: Introduction to Analytic Number Theory The product formula for ξ(s) and ζ(s); vertical distribution of zeros

ζ (s) = s 1 s {u} [u] ζ (s) = s 0 u 1+sdu, {u} Note how the integral runs from 0 and not 1.

We have been going places in the car of calculus for years, but this analysis course is about how the car actually works.

INDEX. Bolzano-Weierstrass theorem, for sequences, boundary points, bounded functions, 142 bounded sets, 42 43

PRIME NUMBER THEOREM

(1) A frac = b : a, b A, b 0. We can define addition and multiplication of fractions as we normally would. a b + c d

Part V. 17 Introduction: What are measures and why measurable sets. Lebesgue Integration Theory

Mathematics for Cryptography

NUMBER SYSTEMS. Number theory is the study of the integers. We denote the set of integers by Z:

MAT 570 REAL ANALYSIS LECTURE NOTES. Contents. 1. Sets Functions Countability Axiom of choice Equivalence relations 9

CYCLOTOMIC FIELDS CARL ERICKSON

Before giving the detailed proof, we outline our strategy. Define the functions. for Re s > 1.

ANALYTIC AND COMBINATORIAL NUMBER THEORY I (Lecture Notes) MARTIN KLAZAR

Converse theorems for modular L-functions

2 Asymptotic density and Dirichlet density

Contribution of Problems

KUMMER S LEMMA KEITH CONRAD

NONABELIAN GROUPS WITH PERFECT ORDER SUBSETS

Logical Connectives and Quantifiers

A proof of Freiman s Theorem, continued. An analogue of Freiman s Theorem in a bounded torsion group

Definitions, Theorems and Exercises. Abstract Algebra Math 332. Ethan D. Bloch

Real Analysis Math 131AH Rudin, Chapter #1. Dominique Abdi

= 1 2x. x 2 a ) 0 (mod p n ), (x 2 + 2a + a2. x a ) 2

SPRING 2006 PRELIMINARY EXAMINATION SOLUTIONS

Class Numbers, Continued Fractions, and the Hilbert Modular Group

CONTINUED FRACTIONS, PELL S EQUATION, AND TRANSCENDENTAL NUMBERS

How many units can a commutative ring have?

Chapter 5. Modular arithmetic. 5.1 The modular ring

We start with a simple result from Fourier analysis. Given a function f : [0, 1] C, we define the Fourier coefficients of f by

Smol Results on the Möbius Function

The Prime Number Theorem

Properties of Ramanujan Graphs

Congruences and Residue Class Rings

WEIERSTRASS THEOREMS AND RINGS OF HOLOMORPHIC FUNCTIONS

A talk given at the Institute of Mathematics (Beijing, June 29, 2008)

as x. Before giving the detailed proof, we outline our strategy. Define the functions for Re s > 1.

FORMAL GROUPS OF CERTAIN Q-CURVES OVER QUADRATIC FIELDS

Math 314 Course Notes: Brief description

INVERSE LIMITS AND PROFINITE GROUPS

Math 118: Advanced Number Theory. Samit Dasgupta and Gary Kirby

RIEMANN SURFACES. max(0, deg x f)x.

MASTERS EXAMINATION IN MATHEMATICS

LECTURES ON ANALYTIC NUMBER THEORY

Notes on Equidistribution

MOMENTS OF HYPERGEOMETRIC HURWITZ ZETA FUNCTIONS

CHARACTERS OF FINITE ABELIAN GROUPS (SHORT VERSION)

Transcription:

A Proof of Dirichlet s Theorem on Primes in Arithmetic Progressions Andrew Droll, School of Mathematics and Statistics, Carleton University (Dated: January 5, 2007) Electronic address: adroll@connect.carleton.ca

I. INTRODUCTION In this report we review general properties of Dirichlet characters and L-functions, and use these to prove Dirichlet s theorem that in any arithmetic progression (ba + m) b=0 with (a, m) = there are infinitely many primes. This result was first proved in 837 by J. Dirichlet. Euler had previously conjectured that there are infinitely many primes congruent to modulo any nonzero integer. Dirichlet s theorem itself was first conjectured by Gauss. We follow the general exposition in [, Ch.6], elaborating and reordering at every stage. Section II begins by reviewing the properties of complex characters on groups and Dirichlet characters modulo m. In this section we also define Dirichlet L-functions. Our proof of Dirichlet s theorem relies heavily on the properties of L-functions. This being the case, Section III examines elementary properties of Dirichlet L-functions in some detail. We prove the existence of Euler product decompositions, and specify a branch of the logarithm of particular use for our proof. Additionally, we introduce the Riemann zeta function and several of its properties. Dirichlet s theorem is stated without proof in Section IV, and the existence of analytic continuations for Dirichlet L-functions is proved. In Section V, we approach the most difficult part of the proof of Dirichlet s theorem, involving the boundedness of L-functions and their logarithms. We conclude our proof of Dirichlet s theorem in Section VI. Our presentation frequently takes advantage of elementary results from group theory, real analysis, number theory, and complex analysis. References on these topics can be found in the bibliography. Notation: Wherever we work with abstract groups, we write the group operation multiplicatively. C, R, Z, and N denote the complex numbers, real numbers, integers, and natural numbers (i.e. positive integers), respectively. T denotes the unit circle in the complex plane. For m Z, Z m denotes the integers modulo m. For a monoid M, U(M) denotes the group of units of M. For z C, z denotes the complex conjugate of z. For z C, R(z) denotes the real part of z, and I(z) denotes the imaginary part of z. 2

φ denotes the Euler phi function, i.e. φ(m) = U(Z m ) for all m Z\{0}. If A is a finite group and g A then g denotes the order of g. For a set S, S denotes the cardinality of S. Log denotes the principal branch of the complex natural logarithm, and log denotes the real natural logarithm. II. PRELIMINARIES We will introduce some fundamental definitions and properties in this section. These will be important in the proof of Dirichlet s theorem to follow. Throughout this section, A is a finite abelian group. Definition.. Any group homomorphism χ : A T is called a character on A. We denote the set of all characters on a group A by Â. Since A and T are abelian groups, by an elementary theorem in group thery, Â is also an abelian group under the multiplication defined by (χψ)(a) = χ(a)ψ(a) whenever χ, ψ Â, a A. Definition.2. Let χ 0 denote the trivial character on A, that is, χ 0 (a) = for all a A. We take note of the following theorem from group theory, which we will need to prove some properties of characters. A proof can be found in Chapter 7 of [2]. Theorem. subgroups. A finite abelian group can be expressed as a direct product of cyclic We can recast Theorem in the following way: There exist g,..., g n A such that every g A can be expressed in the form g f...g fn n for some unique f,..., f n Z with 3

0 f i < g i for each i. Lemma.. The group of characters  is isomorphic to A. Proof: Take g,..., g n A such that every g A can be expressed in the form g f...g fn n for some unique f,..., f n Z with 0 f i < g i, i n, and let e i = g i for each i. It is clear that any group homomorphism of A is determined completely by its effect on g,..., g n. Let χ Â. Since ge j j = e for each j (where e is the identity element of A), we get χ(g j ) e j =, so we must have χ(g j ) = e 2πih j/e j, where 0 h j < e j. For each j, j n, define a character χ j  by χ j(g j ) = e 2πi/e j and χ j (g q ) = when q j. The character χ j is well-defined; Indeed, if g a...g an n = a = b = g b...g bn n then we have ab = e, so that b k a k (mod e k ) for each k. Consequently, for each k we have some m k Z with a k = b k + m k e k, giving χ j (a) = e 2a jπi/e j = e 2(b j+m k e k )πi/e j = e 2b jπi/e j = χ j (b). By the discussion above, every character χ can be expressed as a product χ = χ m...χ mn, where for each j, m j N is unique and 0 m j < e j. Further, all such products are characters because  is a group. We now define f :  A by f(χ) = f(χ m...χ mn n ) = g m...gn mn. It is clear from the discussion above that f is a bijection. Furthermore, if χ = χ k...χ kn n, χ = χ m...χ mn n f(χ χ) = f(χ k +m...χn kn+mn ) = g k +m...gn kn+mn = (g k...gn kn )(g m...gn mn ) = f(χ )f(χ), so f is a homomorphism. This shows that  is isomorphic to A, as desired. n then Theorem 2. (a) If χ, ψ δ(χ, ψ) = 0 otherwise.  then a A χ(a)ψ(a) = A δ(χ, ψ), where δ(χ, ψ) = if ψ = χ and (b) If a, b A then χ  χ(a)χ(b) = A δ(a, b) where δ(a, b) = if a = b and δ(a, b) = 0 otherwise. Proof: Note that if χ Â, a A, then χ(a)χ(a) = since χ(a) is a root of unity, so that χ(a) = χ(a) = χ(a ) = χ (a). We first prove part (a) of the proposition. We can write a A χ(a)ψ(a) = a A χ(a)ψ (a) = a A (χψ )(a). If χ = ψ then χψ is the trivial character χ 0. Otherwise, χψ is some nontrivial character. Define χψ = η. If χ ψ, then there exists b A such that η(b). Since A is a group, 4

for every b, c A there exists some unique a A with ba = c, and thus η(a) = η(ab) = η(a)η(b) = η(b) η(a), a A a A a A a A which implies (η(b) ) a A η(a) = 0. However by assumption η(b), and we must have 0 = a A η(a) = a A χ(a)ψ(a). If ψ = χ then η(a) = a A so indeed, A = a A η(a) = a A χ(a)ψ(a). This proves (a). We now prove part (b). Write χ  χ(a)χ(b) = χ  χ(a)χ(b ) = χ  χ(ab ). If a b then c = ab e (the identity of A). As in the proof of Lemma., take g,..., g n A, e,..., e n N such that every g A can be expressed in the form g f...g fn n for some unique f,..., f n Z with 0 f i < e i i, i n. Further, define χ i for i n in the same way as in the proof of Lemma.. Because c e, we see that if c = g c...g cn n (with 0 c i < e i for each i), at least one of c,...c n is not zero, say c j 0. Then χ j (c) = χ j (g c...g cn n ) = χ j (g c j j ) = e2πic j/j. thus Since  is a group, for every ψ, η  there exists some unique χ  with ψχ = η, and which implies χ  χ(ab ) = χ  χ(c) = χ Â(χ j χ)(c) = χ  χ j (c)χ(c) = χ j (c) χ  χ(c), (χ j (c) ) χ  χ(c) = 0. By construction χ j (c), and we must have χ  χ(c) = 0. Thus for a b we have χ  χ(a)χ(b) = 0. On the other hand, if a = b then ab = e, and = χ(ab ) = χ(a)χ(b) for all χ Â. We get A =  = χ  χ(ab ) = χ  χ(a)χ(b) (using Lemma.). 5

These two arguments prove (b). Definition.3. Let m be a positive integer, and let χ be a character on U(Z m ). Define χ : N C by χ(n) = χ (n + mz) if (n, m) = and χ(n) = 0 otherwise. Any function χ constructed in this manner is referred to as a Dirichlet character modulo m, and χ is called the character associated with χ. We denote the set of all Dirichlet characters modulo m by m. For the remainder of this section, let m be a positive integer, and let χ m. Furthermore, let C > denote the set {s C R(s) > }. Lemma.2. Let χ, ψ m, and let χ, ψ be the characters associated with χ and ψ respectively. Then ψ = χ if and only if ψ = χ. Further, there is a bijection between m and the set of characters on U(Z m ). Proof: Suppose ψ = χ, so that whenever (a, m) = we have ψ (a + mz) = ψ(a) = χ(a) = χ (a + mz). By an elementary result, (a, m) = a + mz U(Z m ). This proves one implication. Conversely, suppose ψ = χ. By definition, when (a, m) we have ψ(a) = 0 = χ(a). When (a, m) = we find ψ(a) = ψ (a + mz) = χ (a + mz) = χ(a), and so ψ = χ. Finally, let Û denote the set of characters on U(Z m) and define f : Û m in the following way: for every χ Û, define f(χ ) to be the Dirichlet character χ m with χ(a) = χ (a + mz) whenever (a, m) =. By the arguments above, f is injective. From the definition of a Dirichlet character modulo m, f is clearly surjective. Corollary.2.. (a) If χ, ψ m, then m a=0 where δ is defined as in Proposition.. (b) If a, b Z with (a, m) = (b, m) = then χ(a)ψ(a) = φ(m)δ(χ, ψ), χ m χ(a)χ(b) = φ(m)δ (a, b), 6

where δ is defined by δ (a, b) = if a b (mod m) and δ (a, b) = 0 otherwise. Proof: For part (a), note that when (a, m), by definition χ(a) = ψ(a) = 0. When (a, m) =, again by definition, χ(a) = χ (a + mz) and ψ(a) = ψ (a + mz) for some characters χ and ψ on U(Z m ). By elementary results, (a, m) = a + mz U(Z m ). Thus we can rewrite the sum in (a) as a U(Z m) χ (a)ψ (a). By Proposition., this sum is equal to U(Z m ) δ(χ, ψ ) = U(Z m ) δ(χ, ψ), and since φ(m) = U(Z m ) by definition, this finishes the proof. For (b), note that since (a, m) = (b, m) =, for each Dirichlet character χ m, we have χ(a) = χ (a + mz) and χ(b) = χ (b + mz) for some character χ on U(Z m ). Using this in conjunction with Lemma.2, we see that we can write the sum over Dirichlet characters χ m χ(a)χ(b) as χ U(Z χ m) (a + mz)χ (b + mz). By Proposition.2, this sum is equal to U(Z m ) δ(a + mz, b + mz). Now (b) follows from the argument above if we recall the definition φ(m) = U(Z m ) and that a + mz = b + mz if and only if a b (mod m). Lemma.3. For all s, t N, χ(st) = χ(s)χ(t). Proof: First suppose (s, m) = (t, m) =. Clearly then (st, m) =, and so by definition we have χ(st) = χ (st + mz), where χ is some character on U(Z m ). Since χ is a homomorphism, this gives χ(st) = χ (st + mz) = χ ((s + mz)(t + mz)) = χ (s + mz)χ (t + mz) = χ(s)χ(t). Next, suppose that (s, m) or (t, m). By definition χ(s)χ(t) = 0. Clearly, we have (st, m), and so χ(st) = 0 as well. Lemma.4. For all n, k N, χ(n + km) = χ(n). Proof: Let χ be the character associated with χ, as in Definition.3. If (n, m) then by definition χ(n) = 0. Clearly, in this case we have (n + m, m), and so χ(n + m) = 0 as well. On the other hand, if (n, m) = then by definition χ(n) = χ (n+mz). In this case we have (n + m, m) =, so χ(n + m) = χ ((n + m) + mz). However, (n + m) + mz = n + mz, and we get χ(n + m) = χ ((n + m) + mz) = χ (n + mz) = χ(n). Thus, we have found χ(n + m) = χ(n) for all positive integers n. Suppose χ(n + km) = χ(n) for some positive integers k, n. Then n + km is a positive integer, and so by the result we have just proved, χ(n + (k + )m) = χ(n + km + m) = χ(n + km) = χ(n). By induction, we conclude that χ(n + km) = χ(n) for all positive integers n and k. 7

We are now ready to define Dirichlet L-functions. Before stating the definition, we will need to prove the convergence of series of a certain form. In the discussion that follows we use the principal branch of the complex logarithm, so that z x = e xlog(z) for z, x C. Note that if z is a positive real number then Log(z) R, from which it follows that z x = e (R(x)+iI(x))Log(z) = e R(x)Log(z) = z R(x). Proposition.. Let f : N C be any function with f(n) for every positive integer n. Then the series f(n)n s is absolutely convergent for all s C > and uniformly convergent on {s C R(s) > + δ} whenever δ > 0. Proof: Let s C with σ = R(s) >. Then by the comment above, f(n)n s = f(n) n s n s = n σ for all positive integers n. Now, n n σ is a monotone decreasing continuous positive function of n, and so by the integral test for series convergence (see [3, Ch.9]), n σ converges if and only if n σ dn converges. Since σ >, lim n n σ+ = 0, and so [ n n σ σ+ dn = σ + ] ( = lim n ) n σ+ σ + σ + = σ. The integral converges, so nσ converges as well. Thus f(n)n s converges absolutely whenever R(s) > as we wanted. Let δ > 0. Then n s < n (+δ) for all s with R(s) > + δ. Here n (+δ) converges by the argument above, and so n s converges uniformly for all s > + δ by the Weierstrass M-test [3, Ch.20]. Proposition. implies that χ(n)n s converges absolutely on C > and uniformly on {s C R(s) > + δ} whenever δ > 0. This invites the following definition, which will conclude the section. Definition.4. Let L χ : C > C be defined by L χ (s) = χ(n)n s. We refer to L χ as the the Dirichlet L-function associated with χ. By Proposition., the series for L χ converges everywhere on C >. 8

It is important to note that our notation for L-functions is not standard. In the literature, the L-function associated with a Dirichlet character χ is typically denoted by L(s, χ). This standard notation is not in accord with the way in which functions are handled in modern mathematics. Our modified notation eliminates the problem. III. PROPERTIES OF DIRICHLET L-FUNCTIONS Here we present some general results on L-functions and the Riemann zeta function which will be necessary for the proof of Dirichlet s theorem. Let m be a positive integer throughout this section, and χ be a Dirichlet character modulo m. Proposition 2.. L χ (s) is analytic for all s C >. Proof: By Proposition., L χ (s) = χ(n)n s is uniformly convergent on {s C R(s) > + δ} whenever δ > 0. Clearly χ(n)n s is analytic for s C >, so uniform convergence ensures that L χ is analytic as well (see [4, Ch.5]). Theorem 3 [Euler Decomposition]. Let f be a multiplicative function on the positive integers, i.e. for all m, n N, we have f(mn) = f(m)f(n). Assume also that f(n) for all n N. Let s C with R(s) >. Then ( f(p)p s ), f(n)n s = p where the product is over all primes p. Proof: By Proposition., f(n)n s converges absolutely. Let p be any prime. Since R(s) >, we have f(p)p s = f(p) p s p s = p R(s) and so using the usual expression for a geometric series we see that m=0 f(p)m p ms converges absolutely to ( f(p)p s ). Let N be a positive integer and consider ( ) ( ) ( ( f(p)p s ) = f(2) m 2 m s f(3) m 2 3 m 2s p N m =0 m 2 =0 m k =0 f(p k ) m p m ks k where here p i denotes the ith prime and p k is the largest prime less than or equal to N. Each factor on the right side of the equation above is an absolutely convergent series, and 9 ),

we can rewrite the product as a multiple summation, ( f(p)p s ) =... f(2) m 2 m s f(p k ) m k p m ks k p N For fixed m,, m k, n = 2 m p m k k = m =0 m =0... m k =0 m k =0 f(2 m... p m k k )(2m... p m k k ) s. is an integer. The set of prime factors of n is some subset of {2, 3,..., p k }. By choosing nonnegative integers m,..., m k appropriately we can represent any integer whose prime factors are a subset of {2, 3,..., p k } as 2 m... p m k k. Now, if j is a positive integer with j N then all of the prime factors p of j satisfy p N. Since {2, 3,..., p k } is the set of all primes less than N +, this means j = 2 m... p m k k for some integers m,..., m k with 0 m i for each i. From number theory, the choice of m,..., m k is unique (see [2, Ch.2]). This means there exists R N {j Z j > N} such that Thus, {2 m... p m k k m,..., m k N {0}} = {n Z n N} R N. ( f(p)p s ) = p N N f(n)n s + n R N f(n)n s. Since R N {j Z j > N}, we have n R N f(n)n s n>n f(n)n s. Now, because f(n)n s converges, which implies lim N N f(n)n s 0 = lim N n>n ( = lim f(n)n s ) f(n)n s N n>n = f(n)n s lim f(n)n s, f(n)n s lim N and so lim N n R N f(n)n s = 0 as well. This gives ( f(p)p s ) = lim p = 0 N p N N n>n n R N f(n)n s 0, ( f(p)p s ) f(n)n s.

Consider the trivial Dirichlet character modulo, I 0. Since every positive integer is relatively prime to, we have I 0 (n) = for all positive integers n. Note that by Proposition., I 0(n)n s = n s converges absolutely whenever R(s) >, converges uniformly on {s C R(s) > + δ} whenever δ > 0, and that L I0 (s) = n s is a Dirichlet L-function. By Theorem 3, the Euler product representation for ζ(s) is p ( I 0(p)p s ) = p ( p s ) whenever R(s) >. Definition 2.. For s C, R(s) >, define the Riemann zeta funtion by ζ(s) = n s. As noted above, ζ can be interpreted as the Dirichlet L-function associated with the trivial Dirichlet character modulo. Proposition 2.2. Consider ζ as a function of a real variable s. Then lim s +(s )ζ(s) =. Proof: Let s R, s >. We know t s is a monotone decreasing function of t when t > 0. This means for n, n Z, Now, because s >, (n + ) s < n+ n t s dt < n s. so in fact, [ t t s s+ dt = s ] = s, ζ(s) = This gives (n + ) s < n+ n t s dt = ζ(s) < s < ζ(s) = < s ζ(s) < 0 t s dt = s < n s = ζ(s).

= + s > ζ(s) > s = s > (s )ζ(s) > = = lim s lim s + s +(s )ζ(s). Therefore lim s +(s )ζ(s) =, as we wanted. Corollary. Consider ζ as a function of a real variable s. Then Proof: For s > we have lim s + log ζ(s) log [(s ) ] =. log ζ(s) = log [ (s ) (s )ζ(s) ] = log [ (s ) ] + log [(s )ζ(s)], from which it follows that log ζ(s) log [(s ) ] = + log [(s )ζ(s)] log [(s ) ], and so lim s + ( log ζ(s) log [(s ) ] = lim + s + ) log [(s )ζ(s)]. log [(s ) ] Now, by Proposition 2.2, lim s +(s )ζ(s) =, so lim s + log [(s )ζ(s)] = 0 and we get lim s + ζ(s) log [(s ) ] =. Proposition 2.3. p k= (/k)χ(p)k p ks converges absolutely whenever R(s) >, and converges uniformly on {s C R(s) > + δ} whenever δ > 0, where the outer ( ) sum is over all primes p. Furthermore, exp p k= (/k)χ(p)k p ks = L χ (s) whenever R(s) >. Proof: Consider p k= p ks = p k= ( p k ) s. Define η : N Z by η(n) = if n = p k for some prime p and positive integer k, and η(n) = 0 otherwise. By Proposition., since η(n) for all n, η(n)n s converges absolutely whenever 2

R(s) > and uniformly on {s C R(s) > + δ} whenever δ > 0. But η(n)n s is precisely p k= p ks, so the latter converges absolutely and uniformly under the same conditions. Since (/k)χ(p) k p ks p ks for all primes p and positive integers k, p k= (/k)χ(p)k p ks is also absolutely and unformly convergent under the same conditions, as we wanted. Next, recall that Log( x) has the power series representation k= xk /k absolutely convergent for x < and uniformly convergent on {x C x < δ} whenever δ > 0. Since χ(p)p s p s < whenever R(s) >, the series k= (/k)χ(pk )p ks converges to Log (( χ(p)p s )) whenever R(s) >, for all primes p. Thus, ( ) exp (/k)χ(p) k p ks = ( χ(p)p s ). k= Let N be some positive integer. If p i denotes the ith prime and p t is the largest prime less than or equal to N, then p N k= (/k)χ(p) k p ks = Log ( ( χ(2)2 ks ) ) Log ( ( χ(p t )p ks t ) ), which implies ( ) exp (/k)χ(p) k p ks = p N( χ(p)p s ). () p N k= By Theorem 3, L χ (s) = χ(n)n s = p ( χ(p)p s ) = lim N p N ( χ(p)p s ). Since the exponential function is continuous, taking the limit as N in () gives ( ) precisely exp p k= (/k)χ(p)k p ks = L χ (s), as we wanted. Corollary. Assume R(s) >. Then ζ(s) 0. Proof: By Proposition 2.3, p k= (/k)p ks converges absolutely, and ( ) exp (/k)p ks = ζ(s). p k= This implies that p k= (/k)p ks = Log(ζ(s)) + 2πiq for some integer q. In particular, Log(ζ(s)) is defined, from which it follows that ζ(s) 0. 3

Definition 2.2. Define G χ : {s R s > } C by G χ (s) = p k= (/k)χ(p)k p ks for every s > (where the outer sum is over all primes p). By Proposition 2.3, the series for G χ converges absolutely, and is uniformly convergent on {s R s > + δ} whenever δ > 0. We refer to G as the logarithm associated with L χ. Note that G χ (s) is not a priori equal to Log(L χ (s)). Instead, G χ (s) is the sum over all primes p of Log( χ(p)p s ). Each of these logarithms may have a nonzero imaginary part, and the sum over all primes of these imaginary parts gives I(G χ (s)). By definition, I(Log(z)) π for all z C\{0}. On the other hand, there is no guarantee that I(G χ (s)) π. We work with the branch of the logarithm specified by G χ in order to have an unambiguous, easily manipulable expression for the logarithm of L χ. Proposition 2.4 G χ is continuous on (, ). Proof: We have G χ (s) = p k= (/k)χ(p)k p ks uniformly convergent on {s R s > + δ} whenever δ > 0. For each positive integer k and prime p, s (/k)χ(p) k p ks is clearly continuous for s >. (, ) as well (see [3, Ch.20]). Uniform convergence then ensures G χ is continuous on We now express G χ in a new way, which will be necessary for the proof of Dirichlet s theorem. We separate the series for G χ into a sum of two terms, one of which is bounded as s +, and one of which will be investigated further in the next section. We will find eventually that in fact G χ is bounded as s + whenever χ is nontrivial. This will be the most difficult result necessary for our proof of Dirichlet s theorem. Proposition 2.5. Assume s >. Then G χ (s) = p m χ(p)p s + R χ (s) for some function R χ, where R χ remains bounded as s +. 4

Proof: By definition, G χ (s) = p k= (/k)χ(p)k p ks. By Proposition 2.3 this series is absolutely convergent for s >, so we can separate the terms with k =, and note that χ(p) = 0 if p m, to get G χ (s) = p m χ(p)p s + p (/k)χ(p) k p ks. k=2 Write R χ (s) = p k=2 (/k)χ(p)k p ks, so we have G χ (s) = p m χ(p)p s + R χ (s). Here, (/k)χ(p) k p ks p ks for all positive integers k, so R χ (s) p (/k)χ(p) k p ks p k=2 p ks. k=2 Now, p s < for p prime, so using the usual expression for geometric series we have p k=2 p ks = p p 2s ( p s ) ( 2 s ) p p 2s, since clearly ( p s ) ( 2 s ) for all primes p, s >. Now, for s > we have p 2s p 2 and ( 2 s ) ( /2) = 2, so in fact, p k=2 p ks ( 2 s ) p p 2s 2 p p 2 = 2ζ(2). This implies R χ (s) 2ζ(2) as well. Here p m χ(p)p s is clearly uniformly convergent on {s R s > + δ} whenever δ > 0 (by Proposition.), and χ(p)p s is clearly continuous. Thus both p m χ(p)p s and G χ (s) are continuous, and so R χ (s) is as well. This gives lim s + R χ (s) 2ζ(2), so R χ (s) is bounded as s +, as we wanted. IV. DIRICHLET S THEOREM We are now in a position to begin the proof of Dirichlet s theorem. In this section we will state the theorem and prove some preliminary results. The most difficult part of the proof will be reserved for its own section to follow. Again let m be a positive integer throughout, and let χ 0 denote the trivial Dirichlet character modulo m. Definition 3.. Let P be a set of primes. Define the Dirichlet density of P by p P d(p) = lim p s s + log [(s ) ], 5

if the limit exists. Lemma 3.. Let P be a set of primes. (a) If P is finite then d(p) = 0. (b) If P consists of all but finitely many primes then d(p) =. (c) If P = P P 2 where P and P 2 are disjoint and d(p ), d(p 2 ) exist, then d(p) = d(p ) + d(p 2 ). Proof: To prove (a), note that if P is finite then p P p s is bounded as s +. Since log [(s ) ] is clearly unbounded as s +, we are done. To prove (b) let I 0 be the trivial Dirichlet character modulo, and note that by Proposition 2.3, for real s > we have exp(g I0 (s)) = L I0 (s) = ζ(s). Since G I0 (s) = p k=(/k)i 0 (p) k p ks = p (/k)p ks k= is real for real s >, we must have log ζ(s) = G I0 (s) (recall that log denotes the real logarithm). If P consists of all but finitely many primes, we can write p s = p p P p s p F p s, where F is some finite set of primes. Using part (a), this gives d(p) = lim s + = lim s + if the limit exists. By Proposition 2.5, (p p s ) p F p s log [(s ) ] p p s log [(s ) ], log ζ(s) = G I0 (s) = p I 0 (p)p s + R(s) = p p s + R(s) where R(s) remains bounded as s +. Thus again by part (a), lim s + if the limit exists. In other words, log ζ(s) log [(s ) ] = lim s + p p s log [(s ) ], log ζ(s) d(p) = lim s + log [(s ) ] 6

as long as the limit on the right exists. The corollary to to Proposition 2.2 tells us that the limit on the right is equal to, and so d(p) = as we wanted. Finally, to prove (c), note that for s > we have p s = p s + p s (2) p P p P 2 p P (all of the series in (2) are bounded by ζ(s) and hence converge absolutely). By assumption, and both exist, so d(p ) = lim s + d(p 2 ) = lim s + p P p s log [(s ) ] p P 2 p s log [(s ) ] d(p) = lim s + = lim s + p P p s log [(s ) ] p P p s + p P 2 p s log [(s ) ] exists as well, and is equal to d(p ) + d(p 2 ) as we wanted. We now state Dirichlet s theorem. Several more results will be needed before a proof of the theorem can be given. Theorem 4 [Dirichlet s Theorem]. Suppose a and m are relatively prime integers, with m >. Let P(a; m) be the set of primes p with p a (mod m). Then d(p(a; m)) = /φ(m). In particular, P(a; m) is infinite. We will now show that for χ m, L χ can be analytically continued to the region {s C R(s) > 0} if χ is nontrivial and to {s C R(s) > 0, s } if χ = χ 0. The continuations can be constructed by representing the L-functions in integral form. We will first need a lemma regarding representations of series. Lemma 3.2. Let {a n } and {b n } be sequences of complex numbers, and suppose a nb n converges. Further suppose (a + a 2 +... + a n )b n 0 as n. 7

Then a n b n = (a +... + a n )(b n b n+ ). Proof: Let A 0 = 0, A n = a +... + a n for each n. Then A n A n = a n, so for each positive integer N, Since A 0 = 0, and we get N a n b n = N (A n A n )b n = N A n b n = N A n b n = n=2 N A n b n N A n b n+, N A n b n. which implies a n b n N a n b n = = lim N N = N A n b n N A n b n N N A n b n A n b n+ N = A N b N + A n b n N N = A N b N + A n (b n b n+ ), a n b n = lim N ( A N b N + A n b n+ ) N A n (b n b n+ ). Here A N b N = (a +... + a N )b N, so by our hypotheses A N b N 0 as N. Thus ( ) N a n b n = lim A N b N + A n (b n b n+ ) = A n (b n b n+ ), N or a nb n = (a +... + a n )(b n b n+ ), as we wanted. Lemma 3.3. Let s C, R(s) >. Then L χ0 (s) = p p m( s ) ζ(s). 8

Proof: By definition, L χ0 (s) = χ 0(n)p s. Since χ 0 is multiplicative (Lemma.3) and χ 0 (n) for all positive integers n, Theorem 3 gives L χ0 (s) = p ( χ 0(p)p s ), where the product is over all primes p. As χ 0 is the trivial Dirichlet character modulo m, for an integer n we have χ 0 (n) = 0 if (n, m) and χ 0 (n) = otherwise. In particular, for p prime, ( χ 0 (p)p s ) = if p m and ( χ 0 (p)p s ) = ( p s ) if p m. Thus L χ0 (s) = p ( χ 0 (p)p s ) = p m( p s ). Now recall that by definition ζ(s) = n s. Theorem 3 again gives a product decomposition, ζ(s) = p ( p s ). Therefore, we get L χ0 (s) = ( p s ) p m = ( ) p p m( s ) ( p s ) p = p p m( s ) ζ(s). Proposition 3.. ζ(s) (s ) can be analytically continued to the domain {s C R(s) > 0}. Proof: Assume R(s) >. Then, by definition, ζ(s) = n s, where the series is absolutely convergent (Proposition.). Write a n = and b n = n s, for each positive integer n. Then for each positive integer n, a + a 2 +... + a n = n. Since R(s) >, (a +... + a n )b n = nn s = n s+ 0 as n. Thus, by Lemma 3.2 ζ(s) = = = = n s a n b n (a +... + a n )(b n b n+ ) n(n s (n + ) s ). 9

Now, note that n s (n + ) s = s n+ n x s dx, so that ζ(s) = n(n s (n + ) s ) = s n n n+ x s dx. For a real number x, let [x] denote the largest integer smaller than or equal to x, and write the fractional part of x as x = x [x]. Obviously, 0 x < for all real x. We can rewrite our new expression for ζ(s) using this notation by noting that if n x n + where n is an integer, then n = [x]. We get ζ(s) = s n = s = s = s = s n n+ n n+ n n+ n n+ n n+ x s dx nx s dx [x]x s dx (x x )x s dx ( x s x x s ) dx. (3) Now, since R(s) >, and for R(s) > 0, since x <, [ x x s s+ dx = s x x s dx 20 < = ] = s x x s dx x s dx x R(s) dx ] [ x R(s) = R(s) = R(s).

Here we have used the fact that for real x and complex s, x s = x R(s), as demonstrated in Section I. In particular, this means that (x s x x s ) dx converges. Returning to (3), we proceed to see or ζ(s) = s = s = s n n+ = s s s ( x s x x s ) dx ( x s x x s ) dx x s dx s x x s dx, ζ(s) (s ) = s s s s = s x x s dx x x s dx. x x s dx Here, x x s is clearly analytic for all s C whenever x, and we have already shown x x s dx converges when R(s) > 0, so in fact this integral defines an analytic function of s for R(s) > 0 (see [5]). Moreover, Z(s) = x x s dx is the analytic continuation of ζ(s) (s ) to the domain {s C R(s) > 0}, as we wanted. Corollary. L χ0 can be analytically continued to the domain {s C R(s) > 0, s }. Furthermore, the analytic continuation has a simple pole at s =. Proof: Let Z(s) be the analytic continuation of ζ(s) (s ) to {s C R(s) > 0} as in the proof of Proposition 3., and let C = p m ( p s ). Then by Lemma 3.3, when R(s) > we have L χ0 (s) = Cζ(s) = C(Z(s) + (s ) ). Clearly (s ) is analytic whenever s, and by definition, Z(s) is as well. Thus L χ0 (s) = C(Z(s)+(s ) ) is the analytic continuation of L χ0 to {s C R(s) > 0, s }, as we wanted. Since Z(s) is analytic at s = and (s ) has a simple pole at s =, it is clear that L χ0 (s) has a simple pole at s =. 2

Lemma 3.4. Let χ be a nontrivial Dirichlet character modulo m. Then N n=0 χ(n) φ(m) for all positive integers N. Proof: By the division algorithm (see [2, Ch.2]), we can express N in the form N = qm+r, where q and r are integers and 0 r < m. Write N χ(n) = m n=0 χ(n) + m n=0 χ(n + m) +... + m n=0 χ(n + (q )m) + By Lemma.4, χ(n + km) = χ(n) for all positive integers n and k. This gives N χ(n) = m n=0 χ(n) +... + m = q χ(n) + n=0 m n=0 r χ(n). n=0 χ(n + (q )m) + r χ(n + qm). n=0 r χ(n + qm) Note that whenever χ(n) 0 we have χ 0 (n) =, so m n=0 χ(n) = m n=0 χ 0(n)χ(n). Since by assumption χ χ 0, part (a) of Corollary.2. gives m n=0 χ 0(n)χ(n) = 0, and so m n=0 χ(n) = 0 as well. Thus and, since r m, N χ(n) N χ(n) = m = q χ(n) + n=0 r χ(n) n=0 r χ(n) = n=0 r χ(n) n=0 n=0 r χ(n), n=0 m n=0 χ(n). By definition, χ(n) = when n+mz U(Z m ) and χ(n) = 0 otherwise. Also by definition U(Z m ) = φ(m), and clearly n + mz n 2 + mz when n n 2 and 0 n, n 2 < m. Together these give φ(m) m n=0 χ(n) N χ(n), as we wanted. Proposition 3.2. Let χ be a nontrivial Dirichlet character modulo m. Then L χ can be analytically continued to the domain {s C R(s) > 0}. Proof: Assume R(s) >. Then by definition, L χ (s) = χ(n)n s. Let a n = χ(n), b n = n s for each positive integer n. Then (a +...+a n ) a +...+ a n n for every positive integer n. Since R(s) >, this means (a +... + a n )b n n b n = n n s = nn R(s) = n R(s)+. 22

Since n Re(s)+ decreases to 0 as n increases to infinity, (a +... + b n )b n does as well. Furthermore, since L χ (s) = χ(n)n s = a nb n converges, by Lemma 3.2 we have L χ (s) = a n b n = = (a +... + a n )(b n b n+ ) ( n ) χ(k)(n s (n + ) s ). As in the proof of Proposition 3., write so that k= n s (n + ) s = s L χ (s) = s n+ n x s dx, ( n ) n+ χ(k) x s dx. k= Define S(x) = n x χ(n) for all positive real numbers x, where the sum is over all positive integers n less than or equal to x. Then for n x < n +, we have S(x) = S(n), so ( n ) n+ L χ (s) = s χ(k) x s dx = s = s S(n) k= n+ n n+ n n n x s dx S(x)x s dx. By Lemma 3.4, we have S(x) φ(m) for all x, so, for R(s) > 0, S(x)x s dx S(x)x s dx φ(m)x R(s) dx [ ] x R(s) = φ(m) R(s) = φ(m) R(s). In particular, we see that S(x)x s dx converges, and thus n+ L χ (s) = s S(x)x s dx = s S(x)x s dx. n 23

Clearly S(x)x s is analytic whenever R(s) > 0, and we have already shown that the integral on the right converges whenever R(s) > 0. Thus the integral defines an analytic function on {s C R(s) > 0} (see [5]). Here L χ (s) = s S(x)x s dx is the analytic continuation of L χ to the domain {s C R(s) > 0}, as we wanted. Henceforth, whenever we refer to L χ constructed above, unless otherwise specified. we actually refer to the analytic continuation Corollary. If χ m is nontrivial, then L χ (s) is bounded as s. V. THE BOUNDEDNESS OF G χ Once again let χ 0 denote the trivial Dirichlet character modulo a positive integer m, and let G χ denote the logarithm associated with L χ for a Dirichlet character χ modulo m, as in Definition 2.2. In order to show that G χ (s) is bounded as s +, we will require that L χ () 0. Otherwise, since G χ is a branch of the logarithm of L χ, it will diverge as s +. Lemma 4.. Let s > be real. Then χ m L χ (s) is real, and χ m L χ (s) >. Proof: Let χ m. Then by definition G χ (s) = (/k)χ(p k )p ks, p and the series is absolutely convergent for all χ m (Proposition 2.3). Thus k= G χ (s) = (/k)χ(p k )p ks χ m χ m p k= = ( (/k)p ) ks χ(p k ). p χ m k= Since χ() = we have χ(p k ) = χ m χ m χ(p k )χ() 24

for all primes p and positive integers k. If p k (mod m) then (p, m) = so (p k, m) =. By Corollary.2., the sum is then equal to φ(m). On the other hand, if p k (mod m) then we may have either (p k, m) = or (p k, m). If (p k, m) = then by Corollary.2., the sum is 0. If (p k, m) then by definition χ(p k ) = 0 so the sum is again 0. As a result, ( G χ (s) = (/k)p ) ks χ(p k ) = φ(m) (/k)p ks, χ m p χ m p,k k= where the sum is now over all primes p and positive integers k such that p k (mod m). Here φ(m)(/k)p ks is real and positive for all primes p and integers k, and clearly there exist a prime p and positive integer k such that p k (mod m). Therefore, χ(s) m G χ (s) is real and positive. In particular, ( ) exp G χ (s) >. χ m By Proposition 2.3, for each χ m we have exp(g χ (s)) = L χ (s). Thus ( ) < exp G χ (s) = exp(g χ (s)) = L χ (s), χ m χ m χ m as we wanted. Lemma 4.2. Let δ > 0, and let k be a nonnegative integer. Then (log n) k lim = 0. n n δ Proof: If k = 0, the result is obvious. If k = then we are looking at log n lim. n n δ Here we can use L Hopital s rule (see [3, Ch.4]) to find log n lim n n δ = lim n n = lim δnδ n δn δ = 0. Now suppose k >. Then ( (log n) k lim = lim n n δ n ) k log n, n δ/k 25

if the limit on the right exists. By the argument above the limit on the right is precisely 0, and so we find (log n) k lim n n δ = 0 k = 0. Proposition 4.. Let f be a nonnegative function defined on the positive integers and let s >. Assume that f(n)f(m) = f(nm) whenever (m, n) =, and that there is a real number c such that f(p k ) c for every prime p and positive integer k. Then f(n)n s converges and f(n)n s = p ( ) + f(p k )p ks, k= where the product is over all primes p. Proof: Here f(p k )p ks < cp ks for every prime p and positive integer k. Because s >, p s < and k= p ks is an (absolutely) convergent geometric series. Thus, by comparison, k= f(pk )p ks converges absolutely as well. For convenience, let a(p) = k= f(pk )p ks for each prime p. Since f is nonnegative by assumption, a(p) is as well. We have a(p) c k= p ks. The series on the right is geometric and is equal to p s ( p s ). For a prime p we have ( p s ) < ( 2 ) = 2, so in fact, a(p) < 2cp s. Note p p s converges since s >. Therefore we can let M = 2c p p s p N a(p), where the last inequality holds for all positive integers N. Recall that the Taylor series for exp(x) is k=0 x k k! = + x + x2 2 +..., so in particular, for x 0 we have + x exp(x). ( + a(p)) exp(a(p)), and in particular, + a(p)) p N( ( ) exp(a(p)) = exp a(p). p N p N Thus for each prime p, we have 26

Since exp is a monotone increasing function, this means that ( ) ( + a(p)) exp a(p) exp(m). p N p N Recall a(p) = k= f(pk )p ks. Note that (, ) =, so f() 2 = f()f() = f(), and thus f() = since f is nonnegative by assumption. Then + a(p) = k=0 f(pk )p ks. Let N be any positive integer. We have exp(m) p N( + a(p)) = ( ) f(p k )p ks p N k=0 ( ) ( ) ( = f(2 k )2 k s f(3 k 2 )3 k 2s... k =0 k 2 =0 k r=0 f(p kr r )p krs r where p r denotes the largest prime smaller than or equal to N. Recall that each series on the right is absolutely convergent, so we can rearrange the terms to get Here, 2 k,..., p kr r assumptions on f, As a result, exp(m) p N( + a(p)) =... k =0 k r=0 f(2 k )2 k s...f(p kr r )p krs r. are pairwise relatively prime for any positive integers k,..., k r. Thus by the f(2 k )...f(p kr r ) = f(2 k...p kr r ). ), exp(m) p N( + a(p)) = =... k =0... k =0 k r=0 k r=0 f(2 k )2 ks...f(p kr r )p krs r f(2 k...p kr r )(2 k...p kr r ) s, so by the same argument as in the proof of Theorem 3, there is some subset R N Z j > N} with lim N n R N f(n)n s = 0, of {j 27

and exp(m) p N( + a(p)) = for any N. N f(n)n s + Taking the limit as N in the last inequality then gives exp(m) ( + a(p)) = ( ) + f(p k )p ks = p p proving both of the proposition s assertions. k= n R N f(n)n s, f(n)n s, We are now ready to approach the most difficult part of the proof of Dirichlet s theorem. The next Proposition will allow an obvious proof of the boundedness of G χ (s) as s + whenever χ is nontrivial. Proposition 4.2. Let χ be a nontrivial Dirichlet character modulo a positive integer m. Then L χ () 0. Proof: First, we assume that χ(n) is not real for some positive integer n. Let χ be the character associated with χ, as in Definition.3. For every integer n, we have χ (n + mz)χ (n + mz) =. Defining χ by χ (n+mz) = χ (n + mz) for every n gives χ = (χ ). As noted in Definition., Û(Z m ) is a group, so in particular, χ Û(Z m). Let χ be the Dirichlet character modulo m with associated character χ. Then clearly, χ(n) = χ(n) for every positive integer n (in particular, χ is nontrivial). Since χ(n) is not real for some n, we have χ χ. By definition, for real s >, so that L χ (s) = L χ (s) = χ(n)n s, χ(n)n s = χ(n)n s = L χ (s). Suppose L χ () = 0. Since both L χ and L χ are analytic whenever R(s) > 0 (Proposition 3.2), L χ () = lim s + L χ(s) = 0, as well. Consider the product η m L η (s) as s +. By the corollary to Proposition 3., L χ0 has a simple pole at and is analytic elsewhere in some neighborhood of. By 28

Proposition 3.2, every other factor is analytic at s =. The argument above shows that the two factors L χ and L χ have zeros at s =. Overall, this gives exactly one factor with a simple pole at, and at least two factors with zeros at, so that lim L η (s) = 0. s + η m However, by Lemma 4., η m L η (s) > for all real s >. Since the product is analytic for s in a neighborhood of s =, this is a contradiction to the last equality. conclude L χ () 0. Thus we It remains to consider the case in which χ is real for all arguments. In this case, for every positive integer n, we have χ(n) = or χ(n) =. Assume L χ () = 0. Whenever R(s) > we have ζ(s) 0 (Corollary to Proposition 2.3). Then by Lemma 3.3, we also have L χ0 (s) 0 whenever R(s) >. By the corollary to Proposition 3., L χ0 (s) is analytic whenever R(s) > 0 and s, and has a simple pole at s =. By Proposition 3.2, L χ (s) is analytic whenever R(s) > 0. The assumed zero of L χ at then ensures that L χ0 (s)l χ (s) has a removable discontinuity at s =. Thus we can define a function ψ by ψ(s) = L χ(s)l χ0 (s), L χ0 (2s) analytic on {s C R(s) > /2}. Note that L χ (s)l χ0 (s) is analytic whenever s and R(s) > 0. By the corollary to Proposition 3., L χ0 (2s) has a simple pole at s = /2. Therefore, ψ(s) 0 as s /2 +. Furthermore, since ψ(s) is analytic whenever R(s) > /2, we have a power series representation valid whenever s 2 < 3 2. ψ(s) = b k (s 2) k, (4) k=0 We now express ψ in another form, using Proposition 4.. For now, let s > be real. By Theorem 3, we have the product expansions L χ (s) = p ( χ(p)p s ), and L χ0 (s) = p ( χ 0(p)p s ). Thus, ψ(s) = L χ(s)l χ0 (s) L χ0 (2s) ( ) ( ) p ( χ(p)p s ) p ( χ 0(p)p s ) = = p p ( χ 0(p)p 2s ) ( χ 0 (p)p 2s ) ( χ(p)p s )( χ 0 (p)p s ). 29

Recall that χ 0 (p) = whenever p m, and χ 0 (p) = χ(p) = 0 otherwise. Therefore, if p m and otherwise. As a result, ( χ 0 (p)p 2s ) ( χ(p)p s )( χ 0 (p)p s ) = ( p 2s ) ( χ(p)p s )( p s ) ψ(s) ( χ 0 (p)p 2s ) ( χ(p)p s )( χ 0 (p)p s ) = = ( χ 0 (p)p 2s ) ( χ(p)p s )( χ p 0 (p)p s ) = ( p 2s ) ( χ(p)p s )( p s ) p m = ( p s )( + p s ) ( χ(p)p s )( p s ) p m = p m + p s χ(p)p s. When χ(p) = we have + p s =. χ(p)p s On the other hand, when χ(p) = we have Thus, ψ(s) + p s + p s = χ(p)p s p. s = p m + p s χ(p)p = s χ(p)= + p s p s, where the product is now over all primes p such that χ(p) =. Since s >, p s < for every prime, and we can use the usual expression for the sum of a geometric series to get so that p = p ks, s k=0 + p s p s = ( + p s ) = p ks + k=0 = + 2 30 k=0 k=0 p ks. k= p ks p (k+)s

Thus we find ψ(s) = χ(p)= + p s p = s χ(p)= ( + ) 2p ks. k= We are now ready to apply Proposition 4.. Define a function f on the positive integers by the following: f(p k ) = 2 whenever p is a prime with χ(p) = and k is any positive integer f(p k ) = 0 whenever p is a prime with χ(p) and k is any positive integer f() = f(m)f(n) = f(mn) whenever (m, n) =. Note that since every positive integer n has a unique factorization into a product of prime powers, these conditions uniquely define f(n). Clearly, 0 f(p k ) 2 for every prime p and positive integer k. Furthermore, if p is a prime then whenever χ(p) = and + f(p k )p ks = + k= + k= f(p k )p ks = k= 2p ks whenever χ(p). Rewriting ψ in terms of f and using Proposition 4., we find ψ(s) = ( ) + 2p ks χ(p)= k= = ( ) + f(p k )p ks p k= = f(n)n s. Returning to (4), for real s > with s 2 < 3, we have 2 ψ(s) = f(n)n s = b k (s 2) k. k=0 3

By Taylor s theorem (see [3, Ch.2], [4, Ch.2]), for each k 0, b k = ψ(k) (2) k!. To find the kth derivative of ψ, we differentiate the series f(n)n s. Arguing inductively, for each pair of nonnegative integers k and n, we obtain d k ds k f(n)n s = f(n)( log n) k n s. Let δ > 0, and assume s > + 2δ. By Lemma 4.2, there is some integer M such that for any n M, f(n)( log n) k n s f(n)n s+δ f(n)n (+2δ)+δ = f(n)n δ. By Proposition 4., n=m f(n)n δ converges, so by the Weierstrass M-test, f(n)( log n) k n s converges absolutely for s > and uniformly on s {x R x > + 2δ}. By a result in real analysis (see 20.5 of [3]), we then have f(n)( log n) k n s = for all s > and nonnegative integers k. This implies that for each k 0. We can let so that In this way, b k = ψ(k) (2) k! c k = k! ψ(s) = = k! d k ds k f(n)n s = ψ (k) (s) f(n)( log n) k n 2 f(n)(log n) k n 2 0, b k = ( ) k c k. b k (s 2) k = k=0 c k (2 s) k whenever s 2 < 3/2. For /2 s 2, all of the terms in the series on the right are nonnegative. Furthermore, we have c 0 = 0! k=0 f(n)(log n) 0 n 2 f() =. 32

Thus, for /2 s 2, ψ(s) = c k (2 s) k c 0 (2 s) 0 = c 0. k=0 We have found that ψ(s) is differentiable for s > /2, that lim s /2 + ψ(s) = 0, and that ψ(s) for /2 s 2. This is a contradiction, and so we conclude that L χ () 0, as we wanted. We are now prepared to examine the behaviour of G χ (s) as s +. We have already determined the behaviour of L χ at. As G χ is essentially the logarithm of L χ, this will be sufficient. Proposition 4.3. lim s + G χ0 (s) log [(s ) ] =. Furthermore, if χ m is nontrivial then G χ (s) is bounded as s +. Proof: Assume s >. By definition, G χ0 (s) = p k=(/k)χ 0 (p k )p ks = p m (/k)p ks, so G χ0 (s) is clearly real. By Proposition 2.3, exp(g χ0 (s)) = L χ0 (s). Thus we must have G χ0 (s) = log(l χ0 (s)) (recall that log denotes the real natural logarithm). By Lemma 3.3, L χ0 (s) = p m( p s )ζ(s), k= so that G χ0 (s) = log(l χ0 (s)) = p m log( p s ) + log(ζ(s)), and G χ0 (s) log [(s ) ] = p m log( p s ) + log(ζ(s)). log [(s ) ] Clearly p m log( p s ) is bounded as s +. Observe that lim s + log [(s ) ] =. Thus we find that lim s + G χ0 (s) log [(s ) ] = lim s + p m log( p s ) + log(ζ(s)) log [(s ) ] log(ζ(s)) = lim s + log [(s ) ]. 33

By the corollary to Proposition 2.2, the last limit is equal to, and we find as we wanted. lim s + G χ0 (s) log [(s ) ] =, Now let χ m be nontrivial. By Proposition 2.3, exp(g χ (s)) = L χ (s). By Proposition 2.4, G χ is continuous on (, ), and by Proposition 3.2, L χ is analytic on {z C Rz > 0}. By Proposition 4.2, L χ () 0, so that Log(L χ ()) is defined. The exponential map is continuous, so we find that As a result, exp( lim s + G χ(s)) = lim s + exp(g χ(s)) = lim s + L χ(s) = L χ () 0. lim G χ(s) = Log(L χ ()) + 2πik s + for some integer k. In particular, G χ (s) is bounded as s +, as we wanted. VI. CONCLUSION TO THE PROOF OF DIRICHLET S THEOREM have Let m be a positive integer, and let s > be real. By Proposition 2.5, for χ m, we G χ (s) = p m χ(p)p s + R χ (s), where R χ (s) is bounded as s +. Let a be an integer with (a, m) =. Then for each Dirichlet character χ, χ(a)g χ (s) = p m χ(a)χ(p)p s + χ(a)r χ (s). Taking the sum over all Dirichlet characters, we find χ(a)g χ (s) = χ(a)χ(p)p s + χ(a)r χ (s). χ m p m χ m χ m 34

Here, p m χ(p)p s is bounded by n n s = ζ(s) and thus p m χ(p)p s is absolutely convergent. As a result, we can rearrange terms to find ( χ(a)g χ (s) = p ) s χ(p)χ(a) + χ(a)r χ (s). χ m χ m χ m p m Here (a, m) = by assumption. Thus by Corollary.2., if p m (so (p, m) = ), then χ m χ(p)χ(a) is equal to φ(m) when p a (mod m) and is equal to 0 otherwise. We use this to simplify our last equality to χ m χ(a)g χ (s) = φ(m) p P(a;m) p s + χ m χ(a)r χ (s), where we have used the definition of P(a; m) from the statement of Theorem 4 (P(a; m) is the set of all primes congruent to a modulo m). Let G a (s) = χ χ(a)g χ(s), where the sum is over all nontrivial Dirichlet characters, and let R a (s) = χ m χ(a)r χ (s). We rearrange our previous equality to find χ 0 (a)g χ0 (s) + G a (s) = φ(m) p P(a;m) p s + R a (s). Dividing by log [(s ) ] and taking the limit as s +, we find χ 0 (a)g χ0 (s) + G a (s) φ(m) p P(a;m) lim = lim p s + R a (s), (5) s + log [(s ) ] s + log [(s ) ] if the limits exist. By Proposition 4.3, lim s + χ 0 (a)g χ0 (s) log [(s ) ] = χ 0(a). Again by Proposition 4.3, G χ (s) is bounded as s + whenever χ is nontrivial, so that G a (s) is bounded as s +. Recall that R χ (s) is bounded as s + for each character χ, so that R a (s) is as well. Finally, observe that lim s + log [(s ) ] =. Thus lim s + Using these to simplify (5), we find G a (s) log [(s ) ] = lim s + R a (s) log [(s ) ] = 0. χ 0 (a) χ 0 (a)g χ0 (s) + G a (s) = lim s + log [(s ) ] φ(m) p P(a;m) = lim p s + R a (s) s + = φ(m) lim s + log [(s ) ] p P(a;m) p s log [(s ) ], 35

or χ 0 (a) p P(a;m) φ(m) = lim p s s + log [(s ) ]. Here we have (a, m) = by assumption, so by definition, χ 0 (a) =. Furthermore, from Definition 3., if the limit exists. Thus lim s + p P(a;m) p s log [(s ) ] = d(p(a; m)), φ(m) = χ(a) p P(a;m) φ(m) = lim p s s + log [(s ) ] = d(p(a; m)), as asserted in the statement of Theorem 4 (Dirichlet s theorem). In particular, since P(a; m) has nonzero Dirichlet density, by Lemma 3., it is infinite. [] Kenneth Ireland, Michael Rosen. A Classical Introduction to Modern Number Theory, Second Edition. Harrisonburg, Virginia, USA. Springer-Verlag, 990. ISBN 0-387-97329-X. [2] W. Keith Nicholson. Introduction to Abstract Algebra, Second Edition. USA. Wiley- Interscience, 999. ISBN 0-47-3309-0. [3] Angus E. Taylor, W. Robert Mann. Advanced Calculus, Third Edition. USA. Wiley, 983. ISBN 0-47-02566-6. [4] Serge Lang. Complex Analysis, Second Edition. Harrisonburg, Virginia, USA. Springer- Verlag, 988. ISBN 0-387-96085-6. [5] Gerald B. Folland. Real Analysis, Modern Techniques and their Applications. USA. Wiley, 984. ISBN 0-47-80958-6. 36