Beruflich Dokumente
Kultur Dokumente
TOM SANDERS
1. Arithmetic functions
An arithmetic function is a function f : N C; there are many interesting and natural
examples in analytic number theory. To begin with we consider what is perhaps the best
known
X
(n) :=
1P (x),
x6n:
the usual counting function of the primes. Various heuristic arguments suggest that one
should expect x to be prime with probability 1/ log x, and coupled with a body of numerical
evidence this prompted Gauss to conjecture that
Z n
1
n
dx
.
(n) Li(n) :=
log n
1 log x
It is the purpose of the first section of the course to prove this result.
As it stands can be a little difficult to evaluate because the indicator function of the
primes is not very smooth. To deal with this we introduce the von Mangoldt function
defined by
(
log p if n = pk
(n) =
0
otherwise.
It will turn out that is smoother than the indicator function of the primes and while
the von Mangoldt function is supported on a larger set (namely all powers of primes), in
applications this difference will be negligible. For now we note that our interest lies in the
sum of the von Mangoldt function, defined to be
X
(x),
(n) :=
x6n
and which is closely related to as the next proposition shows. The key idea in the proof of
the proposition is a technique called partial summation or, sometimes, Abel transformation.
This is the observation that
X
X
f (x)(g(x) g(x 1)) = f (n)g(n)
g(x)(f (x + 1) f (x)),
x6n
x6n1
TOM SANDERS
Proof. First note that power of primes larger than 1 make a negligible contribution to (n):
X
X
(n) =
(x) =
log p + O( n log n).
x6n
p6n
x6n
= (n) log n +
(x)(log(x + 1) log x)
x6n1
= (n) log n +
X (x)
+ O(log n),
x
x6n1
whence
(1.1)
X (x)
x
x6n
+ O( n log n).
o(1) n.
x6n
In fact this argument leads to explicit error terms not just asymptotic results but
this will not concern us since it turns out that the main contribution to the error term
|(n) n/ log n| comes from approximating Li(n) by n/ log n.
There is a natural convolution operation on arithmetic functions. Given two arithmetic
functions f and g we define their convolution f g point-wise by
X
f g(x) =
f (z)g(y).
zy=x
This convolution has an identity and it turns out that 1, the function that takes the
value 1 everywhere, has an inverse: we define the Mobius -function by
(
(1)k if x = p1 . . . pk for distinct primes p1 , . . . , pk
(x) :=
0
otherwise,
with the usual convention that 1 has a representation as the empty product.
d|p1 ...pl
S[l]
Convolution has the effect of smoothing or concentrating in Fourier space which is desirable because it means that the function is easier to estimate. As an example of this
we shall estimate the average value of the divisor function (n) (sometimes denoted d(n))
defined by
X
(n) :=
1 = 1 1(n),
d|n
that is the number of divisors of n. Before we begin we recall that the Euler constant is
Z
{x}
dx,
:=
xbxc
1
where bxc is the integer part of x and {x} := x bxc. There are many open questions
about the Euler constant, although they will not concern us. For our work it is significant
only as the constant in the following elementary proposition.
Proposition 1.3. We have the estimate
X1
= log n + + O(1/n)
x
x6n
Proof. Given the definition of this is essentially immediate:
X1
X 1 Z n+1 1
log n =
dx + O(1/n)
x
x
x
1
x6n
x6n
Z n+1
1
1
dx + O(1/n)
=
bxc x
1
Z n+1
{x}
=
dx + O(1/n)
xbxc
1
Z
= +
O(1/x2 )dx + O(1/n) = + O(1/n).
n+1
We shall now use Dirichlets hyperbola method to estimate the average value of the
divisor function.
TOM SANDERS
ab6n
a6 n
a,b6 n
This is called the hyperbola method because it is a way of counting lattice points below
the hyperbola xy = n. Now, as before we have that
X
X n
+ O(1) ( n + O(1))2 .
(x) = 2
a
x6n
a6 n
thus if we put
(n) :=
x6n
then we showed above that (n) = O( n). With additional ideas of a rather different
nature Voroni showed that (n) = O(n1/3 log n) and this has since been improved to O(n )
for some < 1/3. In the other directly Hardy and Landau showed that (n) = (n1/4 ),
but the true order of the error term is not known.
We now return to the von Mangoldt function. By the Fundamental Theorem of Arithmetic if n N then n = pe11 . . . pel l for some primes p1 , . . . , pl and naturals e1 , . . . , el . Now,
1 (n) =
X
d|n
(n) =
X
pk |n
log p =
l
X
i=1
ei log pi = log n.
The Mobius -function takes the values 1 and 1 (and 0) and so we have the trivial upper
bound |M (n)| 6 n. Since does not display any obvious additive patterns we might hope
that takes the values 1 and 1 in a random way. If it did it would follow from the central
limit theorem that we would have square-root cancellation:
M (n) = O (n1/2+ ) for all > 0.
The Riemann hypothesis is the conjecture that this is the case and it is far from being
know. On the other hand we shall be able to show that there is some cancellation so that
M (n) = o(n) and this already implies the prime number theorem.
Proposition 1.5. If M (n) = o(n) then (n) n.
Proof. We use the hyperbola method again: let B be a parameter to be optimised later,
and then note that
X
(n) n + 2 =
((x) 1(x) + 2(x))
x6n
(a)(log b (b) + 2)
ab6n
M (n/b)(log b (b) + 2)
b6B
X
a6n/B
(a)
B<b6n/a
which covers the first term. For the second we note that
X
X
X
|
(a)
(log b (b) + 2)| 6
|(n/a) (B)|
a6n/B
B<b6n/a
a6n/B
O(
n/a + B)
a6n/B
by the bound for given by Proposition 1.4. On the other hand by integral comparison
we have
Z n/B
X
p
1
da = O( n/B),
1/ a 6
a
0
a6n/B
TOM SANDERS
whence
X
a6n/B
There is a special measure on G called Haar measure which is the unique (up to scaling)
translation invariant measure on G which we shall denote G . If f L1 (G ) we may thus
define the Fourier transform of f by
Z
b
f () := f dG .
The advantage of this is that we have an inverse theorem.
Theorem 2.1 (The Fourier inversion theorem). Suppose that G is a locally compact abelian
group, f L1 (G ) and fb L1 (Gb ). Then
Z
f (x) = fb()(x)dGb () for a.e. x G.
We have been deliberately vague about scaling the Haar measure on G; in all applications
the scaling will be clear.
It may be instructive to consider an example. Suppose that G = Z/pZ for some prime
p. Then the characters on G are just the maps
x 7 exp(2irx/p),
b is isomorphic to Z/pZ the natural measure on each of the groups is different
and so G
b as
however. We shall think of G as endowed with normalised counting measure and G
endowed with counting measure so that
G (A) := |A|/|G| and Gb () := ||.
Given a function f : G C, we may think of it as a sum of canonical basis vectors (x )xG
where
(
|G| if y = x;
x (y) :=
0
otherwise.
We then have the decomposition
Z
f (x) =
the Fourier transform changes this basis to the set of characters. Indeed
Z
b
f () = f dG = hf, i
and the inversion formula just says that
Z
X
X
b
f=
hf, i =
f () = fb()dGb ().
b
G
b
G
TOM SANDERS
and the Fourier transform serves to diagonalize this operator. Indeed if we decompose g
in the Fourier basis:
X
g=
hg, i,
b
G
fb()hg, i.
b
G
Note, of course, that fb() = hf, i too but we use the above notation to make things more
suggestive.
Many problems in mathematics involve convolution: in probability theory the law of the
sum of two independent random variables is the convolution of their laws. For enumerative
problems 1A 1B (x) is the (scaled) number of ways of writing x as a sum a + b where a A
and b B. Indeed, before the end of the course we shall find ourselves in a position where
we want to examine
1A 1A 1A (x)
for A the set of primes less than or equal to N and so we shall be pleased to have access
to a basis which diagonalizes this expression.
Returning now to the more immediate concern of arithmetic functions we suppose that
G is the reals under addition and f is an arithmetic function. Then we write mf for the
atomic measure defined by
X
mf (A) :=
f (n)1A (log n).
nN
Shortly we shall damp our measures so that they are better behaved, but before that it is
easy to see that if f and g are arithmetic functions then
mf mg = mf g .
An important additional class of measures is the class of damped versions of the above.
Indeed, suppose that > 1, and write mf, for the exponentially damped measure defined
by
X
mf, (A) :=
f (n)1A (log n) exp( log n).
nN
where
(
exp(x)
I (x) =
0
if x 6 log n
otherwise.
Unfortunately the function I is not smooth enough and we are not able to control the
Fourier transform of m, well enough.
We shall estimate the Fourier transform of m, through m1, via the usual convolution
identity:
m
d
d
, (t) = 1/m
1, (t).
First we note a simple estimate we shall make further use of the basic method in Lemma
3.5 so it is worth becoming familiar now.
Lemma 3.1. We have the estimate
1
+ O(log(1 + |t|)).
1 + it
Proof. Naturally we proceed by integral comparison: let T > 1 be a parameter to be
optimised later.
Z
Z
it
(bxcit xit )dx
x
dx +
m
d
1, (t) =
1
1
Z
Z T
1
=
(bxcit xit )dx
O(x )dx +
+
1 + it
1
Z T
1
=
+ O(log T ) +
(bxcit xit )dx.
1 + it
T
To estimate the second integral we just note that
Z
Z 1
X
((1 + /n)+it 1)
it
it
(bxc
x
)dx =
d
+it
(n
+
)
T
0
n=T
m
d
1, (t) =
O(|t|/n+1 ) = O(|t|/T ).
n=T
10
TOM SANDERS
Proof. Write
PN :=
(1 pit )1 =
p6N
(1 + p1(+it) + p2(+it) + . . . ).
p6N
N
X
it
|6
n=1
n .
n=N +1
N
X
it
n=1
so
|PN m
d
1, (t)| 6 2
|6
n ,
n=N +1
n = O(( 1)1 N 1 )
n=N +1
by the triangle inequality and integral comparison. The lemma is proved on letting N
.
We now record the number theoretic content of this section.
Lemma 3.3. There is an absolute constant c > 0 such that if (1, 1 + c) then we have
the estimates
km, k = ( 1)1 + O(1)
and
3/4
|m
d
log1/2 (2 + |t|)).
, (t)| = O(( 1)
11
the result now follows from Lemma 3.1 for |t| > 1/100 say. On the other hand, by Lemma
3.1 if |t| < 1/100 then
1
1
1
+ O(1)
= O(1),
m
d
d
=
, (t) = m
1, (t)
1 + O(1)
provided 1 is sufficiently small. The result is proved.
X
1
m
d
(t)
=
.(n)exp(it log n)
,
n
n=1
X
1
= ( 1)1 + O(1).
n
n=1
xit dx
f (t) = ( 1 + it)
1
+( 1 + it)
1
Z
= 1 + ( 1 + it)
1
12
TOM SANDERS
Differentiating this we see that the first term is just O(1); to estimate the second we
introduce an additional auxiliary function
Z
(bxcit xit )dx.
g(t) :=
1
(3.1)
As before we split the integral in two according to whether or not x is larger or smaller
than some parameter T > 1. In the first instance
Z T
Z T
q
it
q it
(i logbxc) bxc
(i log x) x
dx = O(
(log x)q x dx)
1
= O(logq+1 T ).
In the second instance
Z
is equal to
Z
X
n=T
n
0
which is O(log n)q |t|/n+1 . Setting T = 2 + |t| and combining this we conclude that
g (q) (t) = O(log(2 + |t|))q+1 ,
and the lemma follows on inserting this into (3.1) with q = r and q = r 1.
...
. . ..
f a +2a ++ka =k
a1 ! . . . ak !
1!f
i!f
1
13
Proof of Lemma 3.4. Now we can leverage our knowledge of the auxiliary function f to
provide the estimate that we are looking for. First note that
dk m
d
,
(t) = ( 1 + it)(1/f )(k) (t) + ki(1/f )(k1) (t).
dtk
On the other hand by the previous corollary and lemma we get immediately that
dk m
d
,
(t) = O(k!2 ( 1)3(k+1)/4 O(log(2 + |t|))O(k) )
dtk
as claimed.
S1
It remains to show that the integrals along S2 , S3 and S4 are negligible. By the ML-Lemma
we see that
Z
|
f (s)ds| 6 sup | exp(xs)||s|2 .|S2 |
sS2
S2
t[T,T ]
|
S2
14
TOM SANDERS
S4
Thus
S1 = 2ix + O(T /S 2 ) + O,x (S/T 2 ).
Setting T = S and letting it tend to infinity gives us that
Z
exp(x( + it))
dt = 2ix
( + it)2
when x > 0.
The case x 6 0 is covered similarly except that this time C is taken to be the rectangle
with sides S1 := [ iT, + iT ], S2 := [S iT, S + iT ], S3 = [ iT, S iT ] and
S4 = [ + iT, S + iT ], and the integral in Cauchys theorem is 0 because f is holomorphic
in and on C. The result is proved.
Finally we can prove the main result of this section.
Theorem 3.9. We have the estimate
X
(x) logk x log+ (n/x) = Ok (n log3(k+1)/4 n)
F (n) :=
x6n
= ik
n+it
dk m
d
,
(t)
dt.
dtk
( + it)2
15
F (n + m) F (n) =
x=n+1
n+m
n+m
+ log
H(n)
x
n
n+m
n+m
) + log
H(n).
n
n
n2
log3(k+1)/4 n)).
m
x6n
= M (n) logk n +
x6n1
= M (n) logk n +
O(x).O(kx1 logk1 x)
x6n1
k
We should remark that with a little more care the error term can be made quite explicit
and, in particular, larger than any power of log, that is
M (n) = OA (n/ logA n) for all A > 0.
Combining this last result with Propositions 1.1 and 1.5 we get the Prime Number Theorem.
Theorem 3.11 (The Prime Number Theorem). (n) n/ log n.
16
TOM SANDERS
(1 p )1
k
Y
Gi ,
i=1
where Gi = Z/qi Z for some qi Z. As usual we write 0G for the identity of a group G,
but the reader should be warned that, for example, 0S 1 = 1.
Now, if : G S 1 is a homomorphism then,
i : Gi S 1 ; x 7 (0G1 , . . . , 0Gi1 , x, 0Gi+1 , . . . )
is also a homomorphism. Since Gi is cyclic there is a qi th root of unity i such that
i (x) = ix ,
which the reader may care to check is well-defined. Since is a homomorphism it follows
that
k
Y
(x1 , . . . , xk ) =
ixi .
i=1
17
for all x G.
b we extend it to Z in
Now, returning to our group G = Z/qZ , given a character G
the obvious way:
(
(x (mod q)) if (x, q) = 1
(x) :=
0
otherwise;
such a function is sometimes called a Dirichlet character. Crucially they inherit their
orthogonality properties from characters on Z/qZ . In particular if and 0 are (Dirichlet)
characters then
(
q
1 X
1 if = 0
(x)0 (x) = h, 0 iL2 (Z/qZ ) =
(q) x=1
0 otherwise.
By the inversion formula, A := {x Z : x a (mod q)}, we have the important relation
1A (x) =
X
1
(x)(a)
(q)
\
Z/qZ
\ | = |Z/qZ| = (q).
since |Z/qZ
In anticipation of the above we consider m
[
, (t) an L-function. For s = + it and
> 1 we shall write
X
(n)
.
L(s, ) := m
[
, (t) =
ns
n=1
For > 1 is follows that L(s, ) is a uniform limit of analytic functions and hence analytic
itself. Moreover we have an Euler-product formula as in Lemma 3.2:
Lemma 4.1. For > 1 we have the equivalence
Y
L(s, ) =
(1 (p)ps )1 .
p
18
TOM SANDERS
The proof is left as an exercise following from the fact that is induced by a homomorphism. Now, it is easy enough to check from the above that
XX
(pm )
log L(s, ) =
.
mpms
p m=1
Note that we fixed a branch of log here which we can do since L(s, ) is non-zero when
> 1. This follows from the fact that the product formula for L(s, ) converges absolutely
in this range and the prodands (the terms in the product) are never zero.
Now, by the inversion formula we then have
(4.1)
X
1 X
(a) log L(s, ) =
(q)
p
X
m:pm a
(mod q)
1
.
mpms
(1 ps ).
p|q
The proof of this is left as an exercise. Crucially, on combination with Lemma 3.1 we
see that
log L(s, 0 ) as s 1+ .
We should like to show that the remaining contributions in (4.1) are bounded; doing this
will lead to the Dirichlets theorem. We shall split into two cases according to whether or
not the character is complex valued. Before that, however, we have an analytic extension
of L(s, ).
Lemma 4.3. For all non-principal characters , the function L(s, ) can be extended
analytically in the range > 0 and satisfies
Z
L(s, ) = s
1
n6x
(n).
S(x)x(s+1) dx
19
X
(n)
L(s, ) =
ns
n=1
n=1
S(n)(ns (n + 1)s )
n=1
Z
= s
S(x)x(s+1) dx.
Thus L(s, ) satisfies the equality for > 1. However, by orthogonality of characters we
see that S(x) = O(q) since is non-principal. Thus the right hand side is analytic in the
range > 0 and so L(s, ) may be continued to this range and the result is proved.
It will be similarly convenient to have a meromorphic extension for the function.
Lemma 4.4. The function (s) can be meromorphically extended to the plane > 0 with
a simple pole at s = 1 so that
Z
s
(s) =
s
{x}xs1 dx.
s1
1
The proof is left as an exercise which can be done by the method in Lemma 3.1.
It follows from Lemma 4.3 that L(1, ) = O(1) for all non-principal . As it happens
it turns out to be harder to show that it is non-zero. If is complex valued it is a little
easier because it has a companion character .
Lemma 4.5. For all complex valued characters we have
L(1, ) 6= 0.
Proof. Non-negativity of the right hand side of (4.1) when s = > 1 and a = 1 gives
X
log L(, 0 ) > 0.
0
It follows that
Y
|L(, 0 )| > 1,
whenever > 1. On the other hand for all non-principal 0 we have L(1, 0 ) = O(1).
Furthermore, by Lemma 4.4 we have lims1+ L(s, 0 )(s 1) = 1. However, if L(1, ) = 0
then so does L(1, ) whence
lim L(s, )L(s, )(s 1)2 = O(1).
s1+
20
TOM SANDERS
We now turn our attention to the harder job of dealing with real characters. The rather
mysterious proof below is due to de la Vallee Poussin.
Proposition 4.6. For all real valued characters we have
L(1, ) 6= 0.
Proof. We assume that L(1, ) = 0 so that L(s, ) has a zero at s = 1 and then we see that
L(s, )L(s, 0 ) is analytic in the region > 0. Moreover, since L(2s, 0 ) 6= 0 if > 1/2
we have that
L(s, )L(s, 0 )
(s) :=
L(2s, 0 )
is analytic in that region. Additionally L(2s, 0 ) as s 1/2+ so
(4.2)
lim (s) = 0.
s1/2+
Y (1 (p)ps )1 (1 ps )1
(1 p2s )1
p-q
Y
p-q
1 + ps
1 (p)ps
Y
p:(p)=1
1 + ps
.
1 ps
Since we know the product is absolutely convergent we can expand it out and we see that
we get
X
an ns
(s) =
n=1
n=1
X
X
1 (
bm
m)(2)(s 2)m =
(2 s)m
m!
m!
m=0
m=0
21
(mod q)
The above sort of arguments coupled with partial summation techniques yield asymptotics for
X
1
.
p
pa
(mod q),p6N
These do not tend to be terribly useful though and we should much prefer to have an
analogue of the prime number theorem. We write
X
(x; q, a) :=
(n)
n6x,na
(mod q)
22
TOM SANDERS
Proof. We can clearly achieve the hypothesis (a, q) = 1 by cancellation so it will be sufficient
to prove the conclusion without this hypothesis.
We consider the fractional parts 0, {}, . . . , {Q}. There are clearly two that are within
1/Q of each other by the pigeon-hole principle, say {r} and {s} with r < s. We put
q := s r so that
q = a + {r} {s}
for some integer a. Thus
|q a| 6 1/Q
and the result follows.
Now, what happens with {n2 }? The situation is much less clear. Weyl introduced an
approach for dealing with this sort of problem which will inform our work with representation functions later on.
The basic question will boil down to estimating terms of the form
|
N
X
exp(2in2 )|
n=0
If is rational then the non-uniformity of quadratic residues in congruence classes will mean
that we cannot expect much cancellation in this term. Indeed, suppose that = 1/3. Then
it is easy to see that
|
N
X
n=0
where is a primitive cube root of unity. If is irrational or, rather, not close to a rational
with small denominator then we shall see that we do get cancellation in this quantity. To
estimate it we consider its square:
2
N
N
X
X
2
exp(2i(n2 m2 ))
exp(2in ) =
n=0
n,m=0
N
X
n,m=0
N
X
u
X
u=0
v=u
vu (mod 2)
exp(2iuv)
2N
X
2N
u
X
u=N +1
v=u2N
vu (mod 2)
exp(2iuv).
We shall estimate this through the inner sums. To this end we record the following decay
estimate for the Fourier transform.
23
t
X
exp(2in)| 6 min{t s + 1,
n=s
1
}.
2kk
Proof. Without loss of generality s = 0. There are t + 1 terms in the sum and each term
has modulus 1 so it follows that the sum is at most the t + 1. For the second estimate we
sum the geometric progression:
t
X
exp(2in) =
n=0
exp(2i(t + 1)) 1
exp(2i) 1
provided kk =
6 0; if it is then we certainly have the estimate. However
| exp(2i) 1| = | exp(ik) exp(ik)| = 2 sin kk > 4kk,
whence we have the result.
2N
X
u=N +1
exp(2iuv)|
(mod 2)
2N
u
X
exp(2iuv)|,
v=u2N
vu (mod 2)
2N
X
1
1
}+
min{2N u + 1,
},
2k2uk
2k2uk
u=N +1
min{N + 1,
1
}.
2kuk
The next lemma will let us show that kuk stays away from 0 enough to provide some
cancellation in the preceding.
Lemma 5.3. Suppose that 1 , . . . , k are -separated, i.e. ki j k > for all i 6= j. Then
k
X
min{Q,
i=1
1
} = O((Q + 1 ) log Q).
2ki k
Proof. We may certainly assume that i [1/2, 1/2], that at least half the sum comes
from i with i > 0 and that 0 6 1 < 2 < < l . Thus
S :=
k
X
i=1
X
1
1
}62
min{Q,
}.
min{Q,
2ki k
2
i
i=1
24
TOM SANDERS
l
X
i=1
for any R > 1 by Proposition 1.3. Now l 6 k 6 1 and we may certainly assume that
< 1/Q and thus put R = 1 /Q. We conclude that
S 6 O( 1 log Q),
and the result is proved.
N
X
n=0
u=0
Split the range of u up into 1 + O(N/q) intervals of length at most bq/2c, so that I1 :=
{0, 1, . . . , bq/2c1}, I2 = {bq/2c, . . . , 2bq/2c1} etc. with a possibly shorter final interval.
Now, suppose that u, u0 Ii are distinct. Then
k(u u0 )k > ka(u u0 )/qk |u u0 |/qQ > 1/2q
by the triangle inequality whence {u : u Ii } is a 1/2q-separated set. It follows from
Lemma 5.3 that
N
X
|
exp(2in2 )|2 = (1 + O(N/q)).O((N + q) log N ).
n=0
We now turn to the question of how we use this information. The basic idea is that if
kn2 k is never small for n 6 N then there is a large interval around the origin which it
misses. This, in turn, implies that it must have a large Fourier coefficient which we know
is not so if q is large; if q is small the result is easy.
To begin with we recall that if P is a large prime then the characters on Z/P Z are just
the maps
x 7 exp(2ixr/P ),
\
so we identify Z/P
Z with Z/pZ in the obvious way. Thus if f : Z/P Z C then
Z
fb(r) := f (x)exp(2ixr/P )dPG (x).
25
2
b
|1c
A (r)||1I (r)| .
r6=0
sup
06=|r|6(P/L)2
|r|<(P/L)2
|r|>(P/L)2
2
|1c
A (r)|.(L/P ) + (P/L) /2
N
X
n=1
26
TOM SANDERS
16n6N
or else
N 1 = O(N 1/2+/2+o(1) ).
Putting = 3 tells us that we can take = 1/5 o(1) giving the result.
1
log3 N
a+b+c=x
k>2,pk +b+c=x
27
we can show that RN (x) is large then we shall have that x has a representation as the sum
of three primes.
Hardy and Littlewood introduced the idea of using the Fourier transform to study RN (x)
noting that
Z 1
cN ()3 exp(2ix)d
RN (x) =
0
by the inversion formula. They then split the range of integration into two sets: the major
arcs, denoted
M := { T : | a/q| 6 1/qQ for some q 6 Q0 and (a, q) = 1},
and the minor arcs
m := { T : | a/q| 6 1/qQ for some q > Q0 and (a, q) = 1}.
cN ()
By Dirichlets pigeon-hole principle these sets cover T. On M we shall estimate
using the Siegel-Walfisz theorem; on m we shall need to develop a Weyl type estimate due
cN () is small.
to Vaughn to show that
The reason for the names major and minor is that the major arcs contribute the main
term to the integral form of RN (x) and the minor arcs an error term.
6.1. The minor arcs. As stated we shall estimate the minor arcs using a technique developed by Vaughn simplifying an earlier approach of Vinogradov quite considerably. In
the first instance we want to introduce long intervals because we can estimate their Fourier
transform using Lemma 5.2. To do this we need a trick of the form we used in Weyls
inequality to generate long intervals to sum over. In this instance we have the convolution
identities of 1 to fall back on. In particular
(x) = log(x),
so that
cN () =
(n) exp(2in) =
n6N
ab6N
Since log is smooth long exponential sums involving log have a lot of cancellation which we
shall exploit. We shall let X = N 2/5 be a parameter and naturally split into two ranges:
X
X
S1 :=
(a)
log b exp(2iab).
a6X
b6N/a
and
S2 :=
X
X<a<N
(a)
X
b6N/a
log b exp(2iab).
28
TOM SANDERS
In this sum S2 we shall decompose log as 1 (in the sense of Dirichlet convolution):
X
X
S2 =
log b
(a) exp(2ab)
b6X
X<a6N/b
X
cd6N
=
=
(a) exp(2acd)
X<a6N/cd
(d)
cd6N
(d)
(a) exp(2acd)
X<a6N/cd
(d)
d6N
X<u6N/d
(d)
d6N
exp(2iud)
(a)
a|u,X<a
exp(2iud)((u)
X<u6N/d
(a)).
a|u,a6X
Thus
S2 =
X
d6N
(d)
exp(2iud)
X<u6N/d
(a)
a|u,a6X
(a)
X<u6N a|u,a6X
(d) exp(2iud).
d6N/u
If d is not too large then we get a long range of u over which to sum exp(2iud) which
will, again, lead to good cancellation. Thus we write S2 = S3 + S4 , where
X X
X
S3 =
(a)
(d) exp(2iu)
X<u6N a|u,a6X
d6min{X,N/u}
and
S4 =
(a)
X<u6N a|u,a6X
(d) exp(2iud).
X<d6N/u
(a)
a6X
d6min{X,N/u}
v6N/a d6min{X,N/av}
Writing
S5 :=
X
a6X
(a)
(d) exp(2iavd)
v6N/a d6min{X,N/av}
29
To proceed with estimating these terms we shall use the following variant of Lemma 5.3.
Lemma 6.2. Suppose that R has | a/q| 6 1/qQ with q 6 Q and R N. Then
R
X
min{
x=1
1
N
,
} = O((q + R + N/q) log N log R).
x kxk
Proof. We consider the sum in two parts. In the first instance we address the case when x
is small:
bq/2c
bq/2c
X
X 1
N
1
min{ ,
S :=
}6
.
x kxk
kxk
x=1
x=1
As in the proof of Weyls inequality the numbers x are 1/2q-separated as x ranges
{0, . . . , bq/2c} so
bq/2c
X 2q
= O(q log q).
S6
x
x=1
Now we dyadically decompose the remaining range:
Li :=
2i+1
X1
min{
x=2i
N
1
,
}.
i+1
2
kxk
As before we split the xs into intervals of length bq/2c so that by Lemma 5.3 we get
Li = O(2i /q).O(N/2i + q) log N = O(N/q + 2i ) log N.
Summing over the range of i with q/2 6 2i 6 R we get the result.
We shall now use this lemma to estimate the three terms S1 , S4 and S5 .
Lemma 6.3. With conditions as in Lemma 6.2 we have
X
X
(a)
log b exp(2iab) = O((q + N/q) log3 N ).
S1 =
a6X
b6N/a
b6M
= 1d
[M ] (a) log M +
b6M 1
1c
[b] (a)/b.
b6M 1
30
TOM SANDERS
Thus
|S1 | 6
x6X
v6N/a d6min{X,N/av}
= O((q + N
4/5
+ N/q) log N ).
d6X
v6N/ad
Now the fact that X 2 is significantly smaller than N comes in to ensure that N/ad is large:
by Lemma 5.2 we get that
XX
(d) min{N/ad, 1/kadk}.
|S5 | 6
a6X d6X
But by grouping the terms where ad = u and noting non-negativity of the summand we
get
X X
|S5 | 6
(d) min{N/u, 1/kuk}.
u6X 2 ad=u
S5 = O(log N
min{N/u, 1/kuk}).
u6X 2
Finally we turn to S4 . It will be useful to have a trivial estimate for the second moment
of from 1.
Lemma 6.5. We have the estimate
X
x6N
31
Proof. It is easy to check that is sub-multiplicative so that (ab) 6 (a) (b). Then
X
X
(x)2 =
(ab)
x6N
ab6N
(a)
a6N
(b)
b6N/a
N
log N )
a
a6N
X
= O(N log N ).
(a)/a)
X
(a)O(
a6N
= O(N log N ).
X 1
= O(N log3 N ),
bc
bc6N
X<d6N/u
X
X<u6N
w(u)
(d) exp(2iud),
X<d6N/u
ready for splitting up the range of u. Dyadically decompose the range of values of u via
the numbers
R := {X, 2X, . . . , 2k X}
where k is maximal such that 2k1 X 6 N/X. Thus
X
|S4 | 6
TR
RR
where
X
TR :=
|w(u)|
(d) exp(2iud).
X<d6N/u
R<u62R
X
32
TOM SANDERS
The first sum here is O(R log3 N ) by the previous lemma; it is the second that we hope to
get some cancellation from. Reordering summation and the trivial logarithmic bound on
gives
X
X
5
2
.
|TR | = O(R log N )
exp(2u(d
d
))
1
2
X<d1 ,d2 6N/R R<u6min{2R,N/d1 ,N/d2 }
Thus, by Lemma 5.2 we get
X
Now the number of representation of a number r in the form d1 d2 is at most N/R whence
N X
min{R, 1/krk}
|TR |2 = O(R log5 N ).
R
r6N/R
Combining the three previous lemmas with Vaughns identity we get the following.
Lemma 6.7. Suppose that R has | a/q| 6 1/qQ for some 1 6 q 6 Q 6 N and
(a, q) = 1. Then
p
33
p6N
6.9. The major arcs. These are conceptually easier to work with although they will still
take time. Our main tool is the Siegel-Walfisz theorem which we leverage though the
Fourier transform. It will be convenient to write e() := exp(2i) and to begin with we
cN (a/q) for a/q a rational in lowest terms with small denominator. First, we
estimate
record a short calculation.
Lemma 6.10. For a and q integers such that (a, q) = 1 we have
X
e(ra/q) = (q).
16r6q,(r,q)=1
we see that
X
c(d) =
e(r/q) = (q).
16r6q
d|q
(q)
Proof. We do the obvious thing and group the terms into congruence classes:
X
cN (a/q) =
N (n)e(an/q)
n6N
q
N (n)e(ar/q).
(mod q)
(mod q)
N (n)e(ar/q) = O(log N ),
34
TOM SANDERS
thus
X
cN (a/q) =
(mod q)
16r6q,(r,q)=1
cN (). We write
As a corollary we get the major arcs estimate for
a
1 a
1
M(a, q) :=
, +
.
q qQ q qQ
Corollary 6.12. Suppose that M(a, q) for some q 6 Q0 . Then for any B > 0 we have
that
A+2A0 B
cN () (q) 1d
N ).
[N ] ( a/q) = OA,A0 ,B (N log
(q)
Proof. We examine the difference which we denote by D:
X
(q)
)e(n( a/q))
D =
(N (n)e(a/q)
(q)
n6N
=
cn (a/q)
((
n6N
(q)
(q)
[
n)) (
(n 1))))e(n( a/q))
n1 (a/q)
(q)
(q)
(q)
N )e(N ( a/q))
(q)
X
cn (a/q) (q) n))(e((n + 1)( a/q)) e(n( a/q))).
+
(
(q)
n6N 1
cN (a/q)
= (
Since
(e((n + 1)( a/q)) e(n( a/q))) = O(| a/q|)
we are done by the previous lemma.
It follows immediately from the above estimate that
3
A+2A0 B
3
3
cN ()3 (q) 1d
N)
[N ] ( a/q) = OA,A0 ,B (N log
3
(q)
35
At this point we have the crucial fact that the sets M(a, q) are disjoint since q 6 Q0 and
N is large so that 2Q0 < Q. Thus integrating the above shows us that
Z
cN ()3 e(N )d =
M
X X (q)3 Z
3
1d
[N ] ( a/q) e(N )d
3
(q)
M(a,q)
q6Q (a,q)=1
0
16a6q
X X
q6Q0
(a,q)=1
16a6q
The second integral is rather easy to estimate by Lemma 5.2 and Parsevals theorem:
Z
Z 1/qQ
3
0 3
0
0
1d
1d
[N ] ( a/q) e(N )d = e(N a/q)
[N ] ( ) e(N )d
M(a,q)
1/qQ
Z
0 3
0
0
= e(N a/q) 1d
[N ] ( ) e(N )d
T
Z
2
d
+ sup |1[N ] ()| |1d
[N ] () d
kk>1/qQ
Z
= e(N a/q)
2
+O(N log
T
A
0 3
0
0
1d
[N ] ( ) e(N )d
N ).
Of course
Z
0 3
1d
[N ] ( ) e(N )d = 1[N ] 1[N ] 1[N ] (N ) =
T
N 1
,
2
so
Z
cN ()3 e(N )d =
M
X X (q)3
N 1
e(aN/q)
(q)3
2
q6Q (a,q)=1
0
16a6q
X X
q6Q0
(a,q)=1
16a6q
Since Q = N/ logA+A0 N and Q0 = logA0 N the second double sum in the error term is of
size at most
X 2(q)
OA,A0 ,B (N 3 logA+2A0 B N ) = OA,A0 ,B (N 2 log2A+4A0 B N ).
qQ
q6Q
0
36
TOM SANDERS
Of course (q) = (q 3/4 ) (and, in fact, a much stronger estimate is true) so that by integral
comparison we have shown
Z
N 1 X X (q)3
3
c
N () e(N )d =
e(N a/q)
3
2
(q)
M
(a,q)=1
q6Q
0
16a6q
16a6q
It is possible to simplify this expression slightly more in the style of Lemma 6.10.
Lemma 6.14. For integers n and q we have
X
(q/(n, q))(q)
e(rn/q) =
.
(q/(n, q))
16r6q,(r,q)=1
(r 0 ,q/(q,n))=1
37
We conclude that
RN (N ) =
N 1 X (q)(q/(q, N ))
+ OC (N 2 logC N ).
2 (q/(q, N ))
2
(q)
q=1
p|N
)
>
(1
)
(1 +
3
2
(p 1)
(p 1)
(p 1)2
p-N
p|N
p|N
> exp(2
X
p|N
1
) > exp(4)
(p 1)2
Giles, Oxford OX1 3LB,