MULTIPLE RECURRENCE AND CONVERGENCE FOR CERTAIN AVERAGES ALONG SHIFTED PRIMES

Similar documents
BERNARD HOST AND BRYNA KRA

POINCARÉ RECURRENCE AND NUMBER THEORY: THIRTY YEARS LATER

Dept of Math., SCU+USTC

ERGODIC AVERAGES FOR INDEPENDENT POLYNOMIALS AND APPLICATIONS

From combinatorics to ergodic theory and back again

CONVERGENCE OF POLYNOMIAL ERGODIC AVERAGES

CONVERGENCE OF MULTIPLE ERGODIC AVERAGES FOR SOME COMMUTING TRANSFORMATIONS

Ergodic methods in additive combinatorics

RESEARCH STATEMENT WENBO SUN

From combinatorics to ergodic theory and back again

Nonconventional ergodic averages and nilmanifolds

VARIATIONS ON TOPOLOGICAL RECURRENCE

ANALYSIS OF TWO STEP NILSEQUENCES

AVERAGING ALONG CUBES

OPTIMAL LOWER BOUNDS FOR MULTIPLE RECURRENCE

Ergodic theorem involving additive and multiplicative groups of a field and

DYNAMICAL CUBES AND A CRITERIA FOR SYSTEMS HAVING PRODUCT EXTENSIONS

Arithmetic progressions in primes

Roth s theorem on 3-arithmetic progressions. CIMPA Research school Shillong 2013

MATH 117 LECTURE NOTES

Rationally almost periodic sequences, polynomial multiple recurrence and symbolic dynamics

FURSTENBERG S ERGODIC THEORY PROOF OF SZEMERÉDI S THEOREM

Generalizing the Hardy-Littlewood Method for Primes

Roth s Theorem on 3-term Arithmetic Progressions

A STRUCTURE THEOREM FOR MULTIPLICATIVE FUNCTIONS OVER THE GAUSSIAN INTEGERS AND APPLICATIONS

Szemerédi s Theorem via Ergodic Theory

3 (Due ). Let A X consist of points (x, y) such that either x or y is a rational number. Is A measurable? What is its Lebesgue measure?

Van der Corput sets with respect to compact groups

Higher-order Fourier analysis of F n p and the complexity of systems of linear forms

On Shalom Tao s Non-Quantitative Proof of Gromov s Polynomial Growth Theorem

Approximate Transitivity for Zero Entropy Systems

The dichotomy between structure and randomness. International Congress of Mathematicians, Aug Terence Tao (UCLA)

CHARACTERISTIC POLYNOMIAL PATTERNS IN DIFFERENCE SETS OF MATRICES

The Hopf argument. Yves Coudene. IRMAR, Université Rennes 1, campus beaulieu, bat Rennes cedex, France

NORMAL NUMBERS AND UNIFORM DISTRIBUTION (WEEKS 1-3) OPEN PROBLEMS IN NUMBER THEORY SPRING 2018, TEL AVIV UNIVERSITY

On families of anticommuting matrices

1 Math 241A-B Homework Problem List for F2015 and W2016

Roth s Theorem on Arithmetic Progressions

Functional Analysis. Franck Sueur Metric spaces Definitions Completeness Compactness Separability...

WEIERSTRASS THEOREMS AND RINGS OF HOLOMORPHIC FUNCTIONS

REAL AND COMPLEX ANALYSIS

A VERY BRIEF REVIEW OF MEASURE THEORY

INVARIANTS FOR COMMUTATIVE GROUP ALGEBRAS

Square-Difference-Free Sets of Size Ω(n )

Chapter 3: Baire category and open mapping theorems

Introduction to Bases in Banach Spaces

Idempotent Ultrafilters, Multiple Weak Mixing and Szemerédi s Theorem for Generalized Polynomials

(1) Consider the space S consisting of all continuous real-valued functions on the closed interval [0, 1]. For f, g S, define

Remarks on a Ramsey theory for trees

The Green-Tao Theorem on arithmetic progressions within the primes. Thomas Bloom

Notions such as convergent sequence and Cauchy sequence make sense for any metric space. Convergent Sequences are Cauchy

The Green-Tao theorem and a relative Szemerédi theorem

MATHS 730 FC Lecture Notes March 5, Introduction

A n (T )f = 1 n 1. T i f. i=0

On the convergence of spherical multiergodic averages for Markov action groups

A new proof of Gromov s theorem on groups of polynomial growth

The structure of unitary actions of finitely generated nilpotent groups

Real Analysis Notes. Thomas Goller

Real Analysis Math 131AH Rudin, Chapter #1. Dominique Abdi

Horocycle Flow at Prime Times

METRIC HEIGHTS ON AN ABELIAN GROUP

Some topics in analysis related to Banach algebras, 2

Maths 212: Homework Solutions

A PROPERTY OF STRICTLY SINGULAR 1-1 OPERATORS

ERGODIC DYNAMICAL SYSTEMS CONJUGATE TO THEIR COMPOSITION SQUARES. 0. Introduction

The Möbius function is strongly orthogonal to nilsequences

Entropy dimensions and a class of constructive examples

l(y j ) = 0 for all y j (1)

arxiv:math/ v1 [math.co] 20 Apr 2006

ON NEARLY SEMIFREE CIRCLE ACTIONS

The Structure of C -algebras Associated with Hyperbolic Dynamical Systems

Manfred Einsiedler Thomas Ward. Ergodic Theory. with a view towards Number Theory. ^ Springer

ON MATRIX VALUED SQUARE INTEGRABLE POSITIVE DEFINITE FUNCTIONS

Analysis Finite and Infinite Sets The Real Numbers The Cantor Set

Measurable functions are approximately nice, even if look terrible.

Real Analysis Problems

THE AUTOMORPHISM GROUP OF A MINIMAL SHIFT OF STRETCHED EXPONENTIAL GROWTH

Spectral Continuity Properties of Graph Laplacians

Three hours THE UNIVERSITY OF MANCHESTER. 24th January

5 Measure theory II. (or. lim. Prove the proposition. 5. For fixed F A and φ M define the restriction of φ on F by writing.

= 1 2x. x 2 a ) 0 (mod p n ), (x 2 + 2a + a2. x a ) 2

On combinatorial properties of nil Bohr sets of integers and related problems

1 The Heisenberg group does not admit a bi- Lipschitz embedding into L 1

IRRATIONAL ROTATION OF THE CIRCLE AND THE BINARY ODOMETER ARE FINITARILY ORBIT EQUIVALENT

Fragmentability and σ-fragmentability

MAA6617 COURSE NOTES SPRING 2014

Problem Set 6: Solutions Math 201A: Fall a n x n,

Representations of moderate growth Paul Garrett 1. Constructing norms on groups

COARSELY EMBEDDABLE METRIC SPACES WITHOUT PROPERTY A

Part II Probability and Measure

arxiv: v1 [math.ds] 13 Jul 2015

Exponential triples. Alessandro Sisto. Mathematical Institute, St Giles, Oxford OX1 3LB, United Kingdom

The dichotomy between structure and randomness, arithmetic progressions, and the primes

Formal Groups. Niki Myrto Mavraki

January 3, 2005 POINTWISE CONVERGENCE OF NONCONVENTIONAL AVERAGES

Real Analysis, 2nd Edition, G.B.Folland Elements of Functional Analysis

Lecture 9 Metric spaces. The contraction fixed point theorem. The implicit function theorem. The existence of solutions to differenti. equations.

A note on a construction of J. F. Feinstein

Application of Ergodic Theory to Uniform distribution mod 1. Oleg Ivrii

DIOPHANTINE APPROXIMATION AND CONTINUED FRACTIONS IN POWER SERIES FIELDS

Transcription:

MULTIPLE RECURRECE AD COVERGECE FOR CERTAI AVERAGES ALOG SHIFTED PRIMES WEBO SU Abstract. We show that any subset A with positive upper Banach density contains the pattern {m, m + [nα],..., m + k[nα]}, for some m and n = p for some prime p, where α R\Q. Making use for the Furstenberg Correspondence Principle, we do this by proving an associated recurrence result in ergodic theory along the shifted primes. We also prove the convergence result for the associated averages along primes and indicate other applications of these methods.. Statement of Results.. Introduction. A measure preserving system (X, X, µ, T ) consists of a probability space (X, X, µ) and a measurable, measure preserving transformation T acting on it. Throughout this paper, we assume T is invertible. The limit behavior (existence and positivity) in of a measure preserving system (X, X, µ, T ) of multiple averages of the form lim f (T a(n) x)... f k (T ak(n) x) has been widely studied in recent years for different choices of functions a i (n), where a i (n): are integer sequences (see, for example, [], [0], [], [7], [8], [2], [25]). In this paper, we study similar results for averages over shifted primes, meaning, the primes or the primes plus or minus. Let P denote the set of all primes and π() denote the number of primes up to. For all x R, {x} denotes the fractional part of x and [x] denotes the largest integer which is no larger than x. We prove the following theorem: Theorem.. Let k, α R\Q, (X, X, µ, T ) be a measure preserving system. Let f,..., f k L (µ). Then the average T [pα] f... T k[pα] f k π() p [] P converges in as. The conclusion is also true if i[nα] is replaced by [inα] for i =,..., k.

2 WEBO SU Frantzikinakis, Host and Kra [8] first proved the linear polynomial case of Theorem. (i.e. the case all i[nα] are replaced with in), with the proof of the case k 3 conditional upon the results of [4] and [6] that were subsequently proven. Then Wooley and Ziegler [24] proved the convergence result for the polynomial case (i.e. the case all i[nα] are replaced with some polynomial with integer coefficients) for a single transformation. After that, these results were generalized to the multi-dimensional polynomial case for commutative transformations by Frantzikinakis, Host and Kra [9]. It is natural to ask whether one can prove a version of these theorems for generalized polynomials. Theorem. extends these results to linear generalized polynomials. By what is now referred to as the Furstenberg Correspondence Principle, first introduced in [0], the positivity of f T a (n) f... T a k(n) fdµ for some measure preserving system (X, X, µ, T ) and some function f 0 not identically 0 is equivalent to the positivity of the upper Banach density d of the set ) d (E (E a (n)) (E a k (n)) for some E, where d I E (E) = lim sup I. So it is natural to ask when I the limit in Theorem. is positive. We prove the following theorem: Theorem.2. Let (X, X, µ, T ) be a measure preserving system, A be a measurable set with µ(a) > 0 and α R\Q be an irrational number. Then the set of integers n such that µ(a T [nα] A T [knα] A) > 0 has nonempty intersection with P +, and with P. The conclusion is also true if [inα] is replaced by i[nα] for i =,..., k. We immediately deduce the following: Corollary.3. For any k, α R\Q, every subset A with positive upper Banach density contains the following pattern: {m, m + [nα],..., m + [knα]}, for some m, n P +, and for some m, n P. The conclusion is also true if [inα] is replaced by i[nα] for i =,..., k. Remark.4. It is worth noting that Theorem.2 fails if P + is replaced with P. For example, if (X, X, µ, T ) is the -dimensional torus with T x = x + 2 2 ( mod ) and A = ( /8, /8), then µ(a T [n 2] A) > 0 only if n is even.

MULTIPLE AVERAGES FOR CERTAI AVERAGES ALOG SHIFTED PRIMES 3 The first result in this direction was obtained by Sarközy [23], who proved that the difference set of E E for a set E with positive upper Banach density contains a shifted prime p for some p P (and a similar result holds for p + ). The linear polynomial case of Theorem.2 was proved in [8], with the proof of the case k 3 conditional upon the results of [4] and [6] that were subsequently proven. Then Wooley and Ziegler [24] proved the recurrence result for the polynomial case for a single transformation. Then Bergelson, Leibman and Ziegler [5] proved the linear polynomial case of Theorem.2 for the multi-dimensional case. The multidimensional polynomial version of Theorem.2 for commutative transformations was proved by Frantzikinakis, Host and Kra [9]. The method used in proving Theorems. and.2 also applies to other results on shifted primes for cubes and certain weighted averages. We indicate how such results can be obtained in Section 5..2. Strategy and Organization. The technique used in [9] was to compare the average under consideration along the shifted primes to the analogous average along all the integers. In order to obtain results for averages along [nα] (Theorems. and.2), we need to modify the technique. Roughly speaking, the difficulty is that the difference a(n + h) a(n) is independent of n when a(n) = cn, c, but this is not the case when a(n) = [nα]. To overcome this obstacle, we need to consider only those n such that {nα} lies in some short interval to guarantee a(n + h) a(n) is constant. The background material is presented in Section 2. Section 3 is used to obtain some estimate for Gowers norms that is used in later sections. We present the proofs of Theorems. and.2 in Section 4. Other applications of the estimate for Gowers norms are presented in Section 5. Ackonwledgment. The author would like to thank. Frantzikinakis for helpful discussions related to the estimate for Gowers norms and the material in Section 4, B. Kra for all the instructive advices in preparation of this article, and the referee for the patient reading and helpful suggestions. 2. Background 2.. otation. Throughout this paper, we denote the set of positive integers by, the set of real numbers by R, and the set of rational numbers by Q. Write Z = Z/Z. When needed, the set Z is identified with []: = {,..., }. We use o () to denote a quantity that converges to 0 when and all other parameters are fixed. The expression a(n) b(n) stands for a(n) Cb(n) for some constant C. If C depends on some parameter d, we write a(n) d b(n). If A is a subset of a space X, we write A (x) to be the index function of A, taking value for x A and 0 elsewhere. We denote the space of continuous functions on X by C(X). For any complex-valued function g(n), denote Cg(n) = g(n). If S is a finite set and a: S C is a function on S, we write E n S a(n) = S Σ n Sa(n).

4 WEBO SU For any t and any element ɛ = (ɛ,..., ɛ t ) {0, } t, denote ɛ = t i= ɛ i. 2.2. Gowers norms. For any function a: Z C, we inductively define: and a U (Z ) = E n Z a(n) a U d+ (Z ) = (E ah h Z a 2d U d (Z ) ) /2 d+ where a h (n) = a(n + h). Gowers [2] showed that this defines a norm on functions on Z for d. These norms were later used by Green, Tao, Ziegler and others in studying the primes (see, for example, [3], [5] and [6]). Analogous semi-norms were defined in the ergodic setting by Host and Kra [8]. 2.3. ilsequences and ilmanifolds. Let G be a Lie group. Denote G = G, G i+ = [G, G i ] = {ghg h : g G, h G i }, i. We say that G is d-step nilpotent if G d+ = {} and the system (G/Γ, X, µ, T ) is called a d-step nilsystem, where Γ G is a discrete cocompact subgroup, µ is the measure induced by the Haar measure of G, X is the Borel σ-algebra and T is the transformation given by T (x) = gx for some fixed g G. If F is a continuous function on a d-step system (G/Γ, X, µ, T ), for any x G/Γ, we say that {F (T n x)} n is a basic d-step nilsequence. A sequence is called a d-step nilsequence if it is the uniform limit of basic d-step nilsequences. ilsequences were introduced by Bergelson, Host and Kra [3] and they play an important role in the study of multiple averages and in the estimate of Gowers norms (see, for example, [3], [3], [5], [6] [7], [8], [9], [2] and [25]). As we need to be quantitative regarding these nilmanifolds, we endow each manifold with an arbitrary smooth Riemannian metric. We then define the Lipschitz constant of a basic nilsequence {F (T n x)} n to be the Lipschitz constant of F. otice that the Lipschitz constant of a basic nilsequence {F (T n x)} n is independent of the transformation T. 2.4. van der Corput Lemma. The use of the van der Corput Lemma in studying multiple averages in ergodic theory was introduced by Bergelson []. In this paper, we use a variation of the estimation of van der Corput: Lemma 2.. Let {v(n)} be a sequence of elements in a Hilbert space with norm and inner product,. Then v(n) 2 v(n) 2 + h v(n + h), v(n). 2 The proof of a special case of this lemma can be found in [20] and the proof of the general case is essentially the same. h=,

MULTIPLE AVERAGES FOR CERTAI AVERAGES ALOG SHIFTED PRIMES 5 3. Estimation of Modified von Mangoldt Function Throughout this paper, we let T = R/Z denote the -dimensional torus with Haar measure λ, associated σ-algebra X and transformation R α (x) = x + α(mod ), and we call the system (T, X, λ, R α ) the -dimensional torus. For convenience, we sometimes identify T with the interval [0, ]. Let Λ: R denote the von Mangoldt function, taking the value log p on the prime p and its powers and 0 elsewhere, and let Λ (n) = P (n)λ(n). We use the von Mangoldt function to replace the indicator function P (n) since it has better analytic properties. We have the following lemma: Lemma 3.. If a function a: k C is bounded, then lim π() k p i P,p i a(p,..., p k ) k n,...,n k i= k Λ (n i )a(n,..., n k ) = 0. The proof of the case k = can be found in for example [8], and the general case can be derived easily from the special case. Throughout this paper we denote W = Π p P,p<w p for w > 2, i.e. W is always assumed to be an integer depending on w. For r, we always denote Λ w,r(n) = φ(w ) W Λ (W n + r), where φ(n) = d n,d n is the Euler function, and Λ w,r(n) is referred to as a modified von Mangoldt function. The key to the study of convergence and recurrence results along primes is the estimate of the Gowers norms of this modified von Mangoldt function. The purpose for this section is to establish a variation of Theorem 7.2 of [3], which allows us to estimate not only the Gowers norms of the modified von Mangoldt function, but also that of the product of the modified von Mangoldt function and some well-behaved sequences. The method we use basically follows the one used in [3]. We prove: Proposition 3.2. () For any bounded basic k-step nilsequence {F (g n x)} n with bounded Lipschitz norm, both sup (Λ w,r ) [,] (n)f (g W n x) and r<w,(r,w)= sup r<w,(r,w)= (Λ w,r ) [,] (n)f (g n x) U k (Z k ) U k (Z k ) converge to 0 as and then w. (2)Let (T, X, λ, R α ) be the -dimensional torus with α R\Q. Let A X be an interval. Then for any x T, both sup (Λ w,r ) [,] (n) A (Rα W n x) r<w,(r,w)= U k (Z k )

6 WEBO SU and sup r<w,(r,w)= (Λ w,r ) [,] (n) A (Rαx) n converge to 0 as and then w. U k (Z k ) Remark 3.3. Our definition of nilsequences is different from the one used in [3]. The basic nilsequence defined in this paper is called a nilsequence in [3]. Proof. For any δ > 0, we can find two non-negative smooth functions Fl δ(x) A (x) Fu(x) δ such that (F δ T u Fl δ)dλ < δ/4. We temporally write G W (n) to represent one of the following functions: F (g W n x), F (g n x), A (Rα W n x), A (Rαx). n Suppose the statement is not true for one of the functions G W (n) listed above. Then there exists δ > 0 and sequences of non-negative integers {w }, {r }, {W } and I an infinite subset, such that W = p P,p<w p, W = o(log ), r is coprime with W, and W as, and (Λ w,r ) [,] (n)g W (n) > δ U k (Z k ) for all I. Moreover, I can be chosen such that k Fc δ (R W n () α x) Fc δ dλ < δ /8 k and (2) k k Fc δ (Rαx) n Fc δ dλ < δ /8 T for c = l, and u, I(we can do so because the order of the choice is that first a sequence of w is picked and then we attach to each w with some sufficiently large). Here δ is some constant to be chosen latter depending only on δ and k. By Propositions 0. and 6.4 of [3], (Proposition 0. of [3] was based on the inverse conjecture for the Gowers norms which was later proved in [6]) there exists δ > 0 and a finite collection of (k )-step nilmanifolds U such that for any I, there exists a basic (k )-step nilsequence {F (hn x)} n Z from one of the manifolds in U with bound and Lipschitz constant O δ,k () such that E n k (Λ w,r ) [,] (n)g W (n)f (h n (3) x) > δ. We now choose the δ appearing in () and (2) to be the same δ appearing in (3) (this is possible because δ is independent of ). By passing to an infinite subset of I, we can assume all {F (hn x)} n comes from the same nilmanifold Y. We still denote this subset by I for convenience. Case that G W (n) = F (g W n x) or F (g n x). In this case, for each I, {S (n): = G W (n)f (hn x)} n is a basic nilsequence from X Y with a uniform Lipschitz bound. Then (3) contradicts Proposition 0.2 of [3]. T

MULTIPLE AVERAGES FOR CERTAI AVERAGES ALOG SHIFTED PRIMES 7 Case that G W (n) = A (Rαx) n or A (Rα W n x). We assume G W (n) = A (Rα W n x) since the proof of the other case is identical. We may assume without loss of generality that all functions F (x), I are nonnegative (otherwise we can split F (x) as the difference of two continuous nonnegative functions and use the argument for each one). Then (4) E n k (Λ w,r ) [,] (n) A (R W n α x)f (h n x) E n k (Λ w,r (n)f δ u (R W n α x) Fl δ E n k (Λ w,r ) [] (n)f δ u (R W n α x)f (h n x) + E n k (F δ u (R W n α x) Fl δ (R W n α x)). (R W n α x)) [,] (n)f (h n x) Similar to the discussion of the first case, by Proposition 0.2 of [3], the first term on the right hand side goes to 0 as. For the second term, by () we have E n k (F δ u (R W n (F δ T δ /2. α x) Fl δ u Fl δ )dλ + δ /4 (R W n α x)) Thus the left hand side of (4) is less than δ. Similarly, the left hand side of (4) is greater than δ. This contradicts (3). 4. Convergence and Recurrence along Shifted Primes on [nα] 4.. Comparison with Averages along Integers. The following proposition shows that the weighted averages along some properly chosen subsets of [] for [nα] are controlled by the Gowers norms of a related function: Proposition 4.. Let α R\Q. Let a i (n) = [inα] for all i k, n [], or a i (n) = i[nα] for all i k, n []. Let (X, X, µ, T ) be a measure preserving system and f 0, f,..., f k L (µ). Let b: C be a sequence of complex numbers satisfying b(n)/n c 0 for all c > 0. Let ξ : T R be a function with absolute value bounded by whose support is contained in an interval that does not contain any internal point of the form (5) i (k+)!, 0 i < (k + )!. Then b(n)ξ({nα})f 0 T a(n) f... T ak(n) L f k 2 (µ) U b(n)ξ({nα}) [,] + o (). k+ (Z (k+) ) Furthermore, the implicit constant is independent of {b(n)} n and the o () term depends only on the integer k and {b(n)} n.

8 WEBO SU Remark 4.2. The function i[nα] is generally easier to treat than [inα], and we can replace (k + )! with 2 in the first case. But we write them together since the proofs are similar. Proof. Without loss of generality, we assume that all functions f i are bounded by. Before we prove the general case, we give an example for the case k = which is easier and illustrates the main idea. By Lemma 2., the Cauchy-Schwartz inequality and the invariance of T, we have (6) ( b(n)ξ({nα})f 0 T [nα] 4 f h= h= h= h h h b(n)ξ({nα})b(n + h)ξ({(n + h)α})t [nα] f 0 f 0 f T [(n+h)α] f L (µ)) 2 + o () b(n)ξ({nα})b(n + h)ξ({(n + h)α})t [nα] f 0 f 0 f T [(n+h)α] f 2 b(n)ξ({nα})b(n + h)ξ({(n + h)α})f T [(n+h)α] [nα] f 2 L (µ) + o () + o (), where o () depends on {b(n)} n. Suppose ξ(x) is supported in an interval I (0, ), I /2. otice that [(n + h)α] [nα] = hα {(n + h)α} + {nα}. Thus if {nα}, {(n+h)α} I, then {hα} < /2 implies [(n+h)α] [nα] = [hα], and {hα} > /2 implies [(n+h)α] [nα] = [hα]+. Let A = {h []: {hα} < /2}. Then by splitting the summand of h along [] into A and A c, we get h b(n)ξ({nα})b(n + h)ξ({(n + h)α})f T [(n+h)α] [nα] 2 f h= h A h b(n)ξ({nα})b(n + h)ξ({(n + h)α}) + h b(n)ξ({nα})b(n + h)ξ({(n + h)α}) h A c 4 b(n)ξ({nα}) [,]. U 2 (Z 2 ) This finishes the proof of the case k =. We now give the proof of the general case. Write a 0 (n) = 0. Denote g(n) = b(n)ξ({nα}) and g(n; h,..., h t ) = ɛ {0,} t C ɛ g(n+ t j= ɛ jh j )(Recall that Cf(n) = f(n), ɛ = t j= ɛ j). 2 2

MULTIPLE AVERAGES FOR CERTAI AVERAGES ALOG SHIFTED PRIMES 9 Claim. For any 0 t k, there exists a function F ( ; h,..., h t ) L (µ) bounded by (this function depends on h,..., h t but not on n) such that g(n)f 0 T a(n) f... T ak(n) 2 k+ f k t h,...,h t F ( ; h,..., h t ) h h t g(n; h,..., h t ) k T ai(n+ i=t+ ɛ {0,} t t j= ɛ jh j ) a t(n) C ɛ 2 k+ t f i + o (). If the claim is true, let t = k, we get the left hand side of (5) is bounded by ( t h,...,h t h h k g(n; h,..., h k ) 2 ) 2 k+ + o (), which is exactly the right hand side of (5) since this is the expansion of the Gowers norm U k+ (Z (k+) ) of g(n) [,] (n). In order to prove the claim, it suffices to show that for any t k, any F ( ; h,..., h t ) L (µ) bounded by, there exists F ( ; h,..., h t ) L (µ) bounded by, such that (7) h h t g(n; h,..., h t ) F ( ; h,..., h t ) h t= F ( ; h,..., h t ) k h h t T ai(n+ i=t ɛ {0,} t k g(n; h,..., h t ) T ai(n+ i=t+ ɛ {0,} t t j= ɛ jh j ) a t (n) C ɛ f i 2 k+2 t t j= ɛ jh j ) a t(n) C ɛ 2 k+ t f i + o (), for any h,..., h t. Similar to (6), by Lemma 2., the Cauchy-Schwartz inequality and the invariance of T, the left hand side of (7) is bounded by o () plus k h t= h h t T ai(n+ i=t ɛ {0,} t g(n; h,..., h t ) t j= ɛ jh j ) a t(n) a t (n+ɛ th t)+a t (n) C ɛ 2 k+ t f i,

0 WEBO SU where o () depends only on t and g(n; h,..., h t ) {b(n)} n, and thus depends only on k and {b(n)} n since ξ is bounded by. So in order to prove the claim, it suffices to show that if g(n; h,..., h t ) 0, F = t j= ɛ jh j ) a t(n) a t (n+ɛ th t)+a t (n) C ɛ f i ɛ {0,} t T at(n+ is a function bounded by and is independent of the choice of n. It suffices to show that for any ɛ {0, } t, if g(n; h,..., h t ) 0, then value of t (8) a v (n + ɛ j h j ) a v (n) j= is independent of n, where v k. Suppose ξ(x) is supported in an interval i I T where is not an internal point of I for all 0 i < (k + )!. We discuss (k+)! case by case: Case that a v (n) = v[nα]. otice that t t ( r r ) v[(n + ɛ j h j )α] v[nα] = v [(n + ɛ j h j )α] [(n + ɛ j h j )α]. j= r= If g(n; h,..., h t ) 0, then for each r t, we must have {(n + r j= ɛ jh j )α}, {(n + r j= ɛ jh j )α} I. By the property of I, the length of I does not exceed 2 and I does not contain 0 as an internal point. Thus it is easy to verify that r r [(n + ɛ j h j )α] [(n + ɛ j h j )α] = ɛ r ([h r α] + D(h r )), j= j= where D(h r ) is the unique closest integer to {h r α}. This means (8) is independent of n if g(n; h,..., h t ) 0. Case that a v (n) = [vnα]. otice that t t ( r r ) [v(n + ɛ j h j )α] [vnα] = [v(n + ɛ j h j )α] [v(n + ɛ j h j )α]. j= r= If g(n; h,..., h t ) 0, then for each r t, we must have {(n + r j= ɛ jh j )α}, {(n + r j= ɛ jh j )α} I, so {v((n + r j= ɛ jh j ))α}, {v(n + r j= ɛ jh j )α} I v : = {vx: x I}. By the property of I, the length of I v does not exceed and I v+ v does not contain 0 as an internal point. Thus it is easy to verify that [v(n + j= j= j= j= r r ɛ j h j )α] [v(n + ɛ j h j )α] = ɛ r ([vh r α] + D v (h r )), j= j= where D v (h r ) is the unique closest integer to v{h r α}. This means (8) is independent of n if g(n; h,..., h t ) 0.

MULTIPLE AVERAGES FOR CERTAI AVERAGES ALOG SHIFTED PRIMES 4.2. Proof of Theorem.. We prove Theorem. in this subsection. Definition 4.3. Let (X, X, µ, T ) be a measure preserving system. A factor Z is called a characteristic factor, or characteristic, for the family of integer sequences {a (n),..., a k (n)}, if for any f,..., f k L (µ),at least one of which is orthogonal to Z, the average T a(n) f (x)... T ak(n) f k (x) converges to 0 in as. The following theorem proved plays an important role in the study of the characteristic factors: Theorem 4.4. (Host and Kra, [8]) For any k and any measure preserving system, there exists a factor Z k which is characteristic for {n, 2n,..., kn}, and a.e. ergodic component of Z k is an inverse limit of k-step nilsystems (See Section 2 for the definition). We refer the reader to [8] for details about characteristic factors. The following proposition can be deduced directly from Proposition 4. of [7]: Proposition 4.5. Let (X, X, µ, T ) be a measure preserving system. Let α R\Q, a, b Z. Then there exists l such that the factor Z l in Theorem 4.4 is characteristic for both {[(an + b)α],..., k[(an + b)α]} and {[(an + b)α],..., [k(an + b)α]}. Remark 4.6. Proposition 4. of [7] is stated for the case {[(an + b)α],..., k[(an + b)α]} but the proof of the case {[(an + b)α],..., [k(an + b)α]} is included in the proof. The following result is partially due to Frantzikinakis [6], Lemma 4.7: Proposition 4.7. Let α R\Q, a, b Z. For any nilmanifold X = G/Γ, with G connected and simply connected, any g G, x 0 X and any F (x),..., F k (x) C(X), the limit lim F (g [(an+b)α] x 0 )... F k (g k[(an+b)α] x 0 ) exists. The conclusion still holds if i[nα] is replaced with [inα] for i =,..., k. Proof. The case a = 0 is trivial, so in the rest of the proof we assume a 0. Without loss of generality, we assume x 0 = Γ. The case {[nα],..., k[nα]} is Lemma 4.7 of [6]. So we only need to prove for the case {[nα],..., [knα]}. Let ˆX = Ĝ/ˆΓ, where Ĝ = Rk G k, ˆΓ = Z k Γ k and ĝ = (,...,, g, g 2,..., g k ). Then Ĝ is also connected and simply connected. It is well known that if G is connected

2 WEBO SU and simply connected, the exponential map exp(x): g G is a bijection, where g is the Lie algebra of G. So for any t R, we may define g t = exp(ta), where A g is the unique element such that g = exp(a). We define a function ˆF : ˆX C by ˆF (t Z,..., t k Z, g Γ,..., g k Γ) = F (g {t } g Γ) F 2 (g {2t 2} g 2 Γ)... F k (g {kt k} g k Γ). (We caution the reader that ˆF may not be continuous.) otice that ˆF (ĝ (an+b)αˆγ) = F (g {(an+b)α} g (an+b)α Γ)... F k (g {k(an+b)α} g k(an+b)α Γ) = F (g [(an+b)α] Γ)... F k (g [k(an+b)α] Γ), so it suffices to show that the limit lim ˆF (ĝ (an+b)αˆγ) exists. For any δ > 0 (and sufficiently small) there exists a function ˆF δ C( ˆX) that agrees with ˆF on ˆX δ = I δ ˆX, where I δ = {(t Z,..., t k Z): min ti 0 j<i j i δ, i =,..., k}, and is uniformly bounded by 2 ˆF. Denote I 0 = {tz : min 0 j<i k t j i δ}. If t I0, then (t,..., t) I δ. Since ({(an + b)α}) n is uniformly distributed on T, the density of the set of n in {,..., } (as ) such that ˆF (ĝ (an+b)αˆγ) ˆFδ (ĝ (an+b)αˆγ) is at most I0 = C k δ for some C k > 0 depending on k. So lim sup ˆF (ĝ (an+b)αˆγ) ˆFδ (ĝ (an+b)αˆγ) (9) 4Ck ˆF δ. By Theorem B of [22], the limit lim ˆF δ (ĝ (an+b)αˆγ) exists. It is then easy to deduce from (9) that { ˆF (ĝ (an+b)αˆγ)} is a Cauchy sequence. So the limit exists. With the help of all the above material, we are now able to prove the following multiple convergence result: Proposition 4.8. Let α R\Q, k. For any measure preserving system (X, X, µ, T ) and any f,..., f k L (µ), the average T [(an+b)α] f (x)... T k[(an+b)α] f k (x) converges in as for all integers a, b. The conclusion also holds if i[(an + b)α] is replaced with [i(an + b)α] for i =,..., k.

MULTIPLE AVERAGES FOR CERTAI AVERAGES ALOG SHIFTED PRIMES 3 Remark 4.9. It is worth noting that the case {[nα],..., k[nα]} is proved in Theorem 2. of [7]. Proof. The method follows from [6] and [7]. We can assume f i, i k are continuous by an approximation argument. By Proposition 4.5, we can replace the original space with Z l for some l. Using an ergodic decomposition argument, it suffices to prove this proposition when the system is an inverse limit of nilsystems. By an approximation argument, we can simply consider the case when the system X = G/Γ is a nilsystem. We further simplify our discussion as follows (see [22]): since the average we deal with involves only finitely many actions on X, we may assume the discrete group G/G 0 is finitely generated, where G 0 is the connected component of G containing the unit element id G. In this case one can show that X is isomorphic to a subnilmanifold of a nilmanifold ˆX = Ĝ/ˆΓ, where Ĝ is connected and simply connected and for any F C(X), b G, x 0 X, there exists ˆF C( ˆX), ˆb Ĝ, ˆx 0 ˆX such that F (b n x 0 ) = ˆF (ˆb nˆx 0 ) for any n. So it suffices to consider the case when G is connected and simply connected, and the conclusion follows from Proposition 4.7. Proof of Theorem.. We only prove for the case a i (n) = i[nα] since the other case follows analogously. By Lemma 3., it suffices to prove the convergence in for the corresponding averages A() = Λ (n)t [nα] f... T k[nα] f k. Equivalently, it suffices to show that the sequence of functions {A()} is Cauchy in. Let ɛ > 0. Fix w, r and let B w,r () = T [(W n+r)α] f... T k[(w n+r)nα] f k. Let A i () = Λ (n)ξ i ({nα})t [nα] f... T k[nα] f k, A i,w,r () = B i,w,r () = Λ w,r(n)ξ i ({(W n + r)α})t [(W n+r)α] f... T k[(w n+r)α] f k, ξ i ({(W n + r)α})t [(W n+r)α] f... T k[(w n+r)nα] f k, where ξ i (x) = ( i (k+)!, i )(x). By Propositions 3.2 and 4., we have that for any (k+)! w 0 (and corresponding W 0 ) large enough and any r W 0, (r, W 0 ) =

4 WEBO SU, if is large enough, then A i,w0,r() B i,w0,r() Since A i (W 0 ) = φ(w 0 ) A i (W 0 ) φ(w 0 ) Thus A(W 0 ) φ(w 0 ) ɛ (k + )!. r W 0,(r,W 0 )= A i,w 0,r(), we deduce B i,w0,r() r W 0,(r,W 0 )= r W 0,(r,W 0 )= ɛ (k + )!. B w0,r() ɛ, when is large. By Proposition 4.8, for r =,..., W 0, the sequence {B w0,r()} converges in. Therefore, if and are sufficiently large, then for r =,..., W 0 we have B w0,r() B w0,r( ) ɛ. Thus if is large enough, A(W 0 ) A(W 0 ) B w0,r() B w0,r( ) + 2ɛ 3ɛ. otice that for r =,..., W 0, (0) lim A(W 0 + r) A(W 0 ) = 0. Thus if and are sufficiently large, A() A( () ) 4ɛ. Therefore the sequence {A()} is Cauchy and this finishes the proof. 4.3. Proof of Theorem.2. We prove Theorem.2 in this subsection. The following proposition is from Theorem 2. of [4]: Proposition 4.0 (Bergelson, Host, McCutcheon and Parreau, [4]). For any δ > 0, k, there exists c(δ), (δ) > 0, such that for any probability system (X, X, µ), any commuting measure preserving transformations T,..., T k, and any µ(a) δ, there exists 0 < n < (δ) such that µ(a T n A T n A) > c(δ). We need to use the following uniform multiple recurrence result along integers: k

MULTIPLE AVERAGES FOR CERTAI AVERAGES ALOG SHIFTED PRIMES 5 Proposition 4.. Let k. For any δ > 0, there exists c(δ) > 0 such that for any α R\Q, any measure preserving system (X, X, µ, T ) and any µ(a) δ, we have lim inf µ(a T [nα] A T k[nα] A) > c(δ). The conclusion also holds if i[nα] is replaced with [inα] for i =,..., k. Remark 4.2. otice the c(δ) in this theorem does not depend on α. Proof. The proof follows the method used in [2]. Denote q(n) = [nα], p m (n) = [mα]n. For any r, denote S r = {m : {mα} < /kr}. It is easy to see if m S r, then q(inm) = p m (in) = iq(nm), n r, i k. Let c(δ), (δ) > 0 be the same as in Proposition 4.0 and let T i = T i[mα]. We assume without loss of generality that (δ). Then there exists n (δ), such that µ(a T pm(n) A T pm(kn) A) > c(δ). Clearly, lim inf lim inf µ(a T q(n) A T q(kn) A) [ n ] m= µ(a T q(mn) A T q(kmn) A), for any n. otice that the set S (δ) has density /k(δ), thus lim inf lim inf (δ) lim inf (δ) c(δ) k(δ) 3 > 0. µ(a T q(n) A T q(kn) A) [ (δ) ] (δ) m= µ(a T q(mn) A T q(kmn) A) m S (δ) {,...,[ (δ) ]} (δ) µ(a T pm(n) A T pm(kn) A) This finishes the proof of the case {[nα],..., [knα]}. The proof of the case {[nα],..., k[nα]} follows by replacing q(in) with iq(n). Proof of Theorem.2. We only need to prove the situation for P since the other case is similar. By Proposition 4., there exists a positive constant c depending

6 WEBO SU only on µ(a) and k such that for any W > 0 (k+)! i= = lim sup lim sup where ξ i (x) = ( i ξ i ({nw α})µ(a T [nw α] A T [knw α] A) µ(a T [nw α] A T [knw α] A) > c, )(x). So one term on the left hand side is larger than (k+)!, i (k+)! c. Suppose this is the term corresponding to ξ (k+)! i, i {,..., (k + )!}. If we apply Proposition 4. with W α, f 0 = = f k = A, b(n) = Λ w,(n), ξ(x) = ξ i (x) and use the fact that when w (and corresponding W ) large enough, (Λ w, (n) )ξ i ({nw α}) U [,] can be sufficiently small as (by k (Z k ) Proposition 3.2), we can deduce that if w is large enough, then lim sup Λ w,(n)ξ i ({nw α})µ(a T [nw α] A T [knw α] A) > 0. This finishes the proof of the case {[nα],..., [knα]} by Lemma 3.. The proof of the case {[nα],..., k[nα]} can be obtained by replacing T [jnw α] with T j[nw α] in the discussion. 5. Other Applications 5.. Cube Averages along Shifted Primes. Every element ɛ {0, } d is identified as a sequence ɛ = (ɛ,..., ɛ d ), ɛ i {0, }. Write {0, } d = {0, } d \{(0,..., 0)}. The convergence result along cubes along integers was obtained in [9]. We indicate how the same method used for proving Theorem.2 can be applied for the averages along cubes along primes: Theorem 5.. For any k, any measure preserving system (X, X, µ, T ) and any functions f ɛ L (µ), ɛ {0, } k, the average T p ɛ + +p k ɛ k f π() k ɛ converges in as. p,...,p k [] P ɛ {0,} k Theorem 5.2. Let P = P + or P. For any sequence of integers {W } with lim W =, W = o(log ), any k, any measure preserving system (X, X, µ, T ) and any A X, µ(a) > 0, we have lim inf π() k n,...,n k P [] ( ) µ T W (nɛ+ +nkɛk) A > 0. ɛ {0,} k

MULTIPLE AVERAGES FOR CERTAI AVERAGES ALOG SHIFTED PRIMES 7 Remark 5.3. It is not known whether one can drop the factor W in this theorem (i.e. W = ). The reason we need W is that Proposition 3.2 is the estimate for the modified von Mangoldt function, and we do not have an estimate for the ordinary von Mangoldt function. We outline the key ingredients for the proofs. Proposition 5.4. Let k, (X, X, µ, T ) be a measure preserving system, f ɛ L (µ), ɛ {0, } k be functions bounded by. Let b,..., b k : C be k sequences of complex numbers satisfying b i (n)/n c 0 for all c > 0 and b i(n) ( i k). Then for any j k,, we have k n,...,n k i= k b i (n i ) ɛ {0,} k T ɛ n + +ɛ k n k L bj U f ɛ k [,] + o (). 2 (µ) 2 (Z 2 ) Furthermore, the implicit constant is independent of {b (n)} n,..., {b k (n)} n and the o () term depends only on {b (n)} n,..., {b k (n)} n. Remark 5.5. Proposition 7. of [3] shows that this proposition is true for X = Z. Proof. We may assume j = k since it is the same for other cases. Fix f 0 L (µ) with norm. Denote u(n,..., n k ) = b (n )... b k (n k ) n k = ɛ {0,} k T ɛn+ +ɛknk f ɛ. We only need to show that X k n,...,n k i= k b i (n i ) Let g ɛ = f ɛ0, g ɛ = f ɛ. Then X k k n,...,n k i= n,...,n k ɛ {0,} k T k b i (n i ) i= k i= ɛ in i f ɛ dµ k bk [,] U 2 (Z 2 ) + o (). ɛ {0,} k T k b i (n i ) k i= ɛ in i f ɛ dµ b k (n) ɛ {0,} k T k i= ɛ in i +n g ɛ.

8 WEBO SU By Lemma 2. and the invariance of T, k b k (n) i= ɛ in i +n 2 g ɛ = h= C h h= h h ɛ {0,} k T b k (n)b k (n + h) k T ɛ {0,} k b k (n)b k (n + h) + o () The proposition follows from the fact that converges. k n,...,n k i= ɛ in i +n (g ɛ T h g ɛ )dµ + o () b k [,] 2 k b i (n i ) i= U 2 (Z 2 ) + o (). As the detail of the proofs of Theorem 5. and 5.2 are similar to (in fact, easier than) that of Theorems. and.2, we just indicate the main steps. By using Lemma 3. and corresponding convergence and recurrence results along integers (Theorems.2 and.3 of [8]), it suffices to show that the difference between the averages along primes and that along integers is small. And this is achieved by the upper bound obtained in Proposition 5.4 and the estimate of Gowers norm in Proposition 3.2. We left the details to the reader. 5.2. Weighted Averages along Primes. We indicate how the same method used for proving Theorem.2 can be applied to generalize Theorem 2.24 of [9] to primes: Theorem 5.6. Let a(n) be a bounded k-step nilsequence. Then for any measure preserving system (X, X, µ, T ) and any f,..., f k L (µ), the average a(p)t p f (x)... T kp f k (x) π() p [] P converges in as. We outline the key ingredients for the proof. Proposition 5.7. Let (X, X, µ, T ) be a measure preserving system and f,..., f k L (µ). Let b(n): C be a sequence of complex numbers satisfying b(n)/n c 0, c > 0. Then b(n)t n f... T kn f k b(n) [,] U k (Z k ) + o ().

MULTIPLE AVERAGES FOR CERTAI AVERAGES ALOG SHIFTED PRIMES 9 Furthermore, the implicit constant is independent of {b(n)} n and the o () term depends only on the integer k and {b(n)} n. This proposition is proved in [9]. The proof is similar and actually easier than the proofs of Propositions 4. and 5.4. As the detail of the proof of Theorem 5.6 is similar to (in fact, easier than) that of Theorem.2, we just indicate the main steps. By using Lemma 3. and the corresponding convergence result along integers (Theorem 2.24 of [9]), it suffices to show that the difference between the averages along primes and that along integers is small. And this is achieved by the upper bound obtained in Proposition 5.7 and the estimate of Gowers norm in Proposition 3.2. We left the details to the reader. References [] V. Bergelson, Weakly mixing PET. Ergod. Th. and Dynam. Sys. 7 (987), 337-349. [2] V. Bergelson, I. J. Håland, Sets of Recurrence and Generalized polynomials. Convergence in ergodic theory and probability (Columbus, OH, 993), Ohio State Univ. Math. Res. Inst. Publ., de Gruyter, Berlin. 5 (996), 9-0. [3] V. Bergelson, B. Host, B. Kra, with an appendix by I. Rusza, Multiple recurrence and nilsequences. Inventiones Math. 60, 2, (2005) 26-303. [4] V. Bergelson, B. Host, R. McCutcheon, F. Parreau, Aspects of uniformity in recurrence. Colloq. Math. 84/85 (2000), part 2, 549-576. [5] V. Bergelson, A. Leibman, T. Ziegler, The shifted primes and the multidimensional Szemerédi and polynomial van der Waerden theorems. Comptes Rendus Mathematique Volume 349, Issues 3-4, February 20, 23-25. [6]. Frantzikinakis, Equidistribution of sparse sequences on nilmanifolds. J. Analyse Math. 09 (2009), -43. [7]. Frantzikinakis, Multiple recurrence and convergence for Hardy sequences of polynomial growth. J. Anal. Math. 2 (200), 79-35. [8]. Frantzikinakis, B. Host, B. Kra, Multiple recurrence and convergence for sequences related to the prime numbers. J. Reine Angew. Math. 6 (2007), 3-44. [9]. Frantzikinakis, B. Host, B. Kra, The polynomial multidimensional Szemerédi theorem along shifted primes. to appear in Isr. J. Math. Available at arxiv:009.484. [0] H. Furstenberg, Ergodic behavior of diagonal measures and a theorem of Szemerédi on arithmetic progressions. J. Analyse Math. 7 (977), 204-256. [] H. Furstenberg, B. Weiss, A mean ergodic theorem for f(t n x)g(t n2 x). Convergence in Ergodic Theory and Probability. Walter de Gruyter, Berlin, ew York (996), 93-227. [2] W. Gowers, A new proof of Szemerédi theorem. Geom. Funct. Anal. (200), 465-588. [3] B. Green, T. Tao, Linear equations in primes. Ann. of Math. (2) 7 (200), no. 3, 753-850. [4] B. Green, T. Tao, The mobius function is strongly orthogonal to nilsequences. Ann. of Math. (2) 75 (202), no. 2, 54-566. [5] B. Green, T. Tao, The primes contain arbitrarily long arithmetic progressions. Ann. of Math. 67 (2008), 48-547. [6] B. Green, T. Tao, T. Ziegler, An inverse theorem for the Gowers U s+ -norm. Ann. of Math. (2) 76 (202), no. 2, 23-372.

20 WEBO SU [7] B. Host, B. Kra, Convergence of polynomial ergodic averages. Isr. J. Math. 49 (2005), -9. [8] B. Host, B. Kra, onconventional ergodic averages and nilmanifolds. Ann. of Math. (2) 6 (2005), no., 397-488. [9] B. Host, B. Kra, Uniformity seminorms on l and applications. J. Anal. Math. 08 (2009), 29-276. [20] L. Kuipers, H. iederreiter, Uniform distribution of sequences. Pure and Applied Mathematics. Wiley-Interscience, ew York-London-Sydney, (974). [2] A. Leibman, Convergence of multiple ergodic averages along polynomials of several variables. Isr. J. Math. 46 (2005), 303-36. [22] A. Leibman, Pointwise convergence of ergodic averages for polynomial sequences of translations on a nilmanifold. Ergodic Theory Dynam. Systems 25 (2005), no., 20-23. [23] A. Sarközy, On difference sets of sequences of integers, III. Acta Math. Acadm. Sci. Hungar. 3 (978), 355-386. [24] T. Wooley, T. Ziegler, Multiple recurrence and convergence along the primes. to appear in Amer. J. of Math. Available at arxiv:00.408. [25] T. Ziegler, Universal characteristic factors and Furstenberg averages. J. Amer. Math. Soc. 20 (2007), 53-97. Department of Mathematics, orthwestern University, 2033 Sheridan Road Evanston, IL 60208-2730, USA E-mail address: swenbo@math.northwestern.edu