ANT Web
ANT Web
ANT Web
Ben Green
Contents
Preface 1
0.1. A brief introduction 1
0.2. These notes 2
0.3. Prerequisites 2
Every positive integer greater than one may be factored into primes, and this
factorisation is unique up to the ordering of the primes. You have known this fact
since school (though the first time you saw a proof may well have been last year,
in Part A). It is impossible to imagine doing number theory without it.
Does unique factorisation into primes generalise? To understand why one might
care about this question, let us look at some theorems about diophantine equations
(equations to be solved in integers) that have been proven by mathematicians in
the past.
p = (x + iy)(x − iy),
(x + y)(x + ζy)(x + ζ 2 y) = z 3
(where ζ = e2πi/3 ) and
√ √
(x + −7)(x − −7) = 2n .
To proceed further, one needs to understand the more general “number systems” in
which we have written these factorisations. This – especially the question of unique
factorisation into primes – is the main theme of the course.
1
2 PREFACE
These are notes I wrote to teach the course B3.4 Algebraic Number Theory at
Oxford in Hilary Term 2020. They cover the examinable material of that course as
well as some extra material on the link between class groups of imaginary quadratic
fields and binary quadratic forms. I was able to cover the material fairly comfortably
in sixteen 55-minute lectures, omitting only the proof of the Minkowski bound in
the general case.
In making these notes available to a general audience, I have omitted any men-
tion of which topics and proofs are considered examinable in the Oxford course.
In preparing the notes I used various books, particularly Stewart and Tall, as well
as previous notes by Victor Flynn, building on earlier notes of Neil Dummigan, Alan
Lauder and Roger Heath-Brown. In particular, most of the illustrative examples
are lifted directly from those notes.
Please send any corrections to
ben.green@maths.ox.ac.uk.
0.3. Prerequisites
Students taking B3.4 are expected to have attended the second year course Rings
and Modules, which covers the basic theory of rings and ideals, UFDs, PIDs and
so on. In the notes I recall a lot of this material, often with self-contained proofs.
A third year course on Galois theory is also listed as a prerequisite, but very little
Galois theory is actually used in this introductory course. The few sections requiring
Galois theory are starred.
I assume that students have taken a first course in number theory and cer-
tainly that they are familiar with modular arithmetic, as well as the statements of
quadratic reciprocity.
CHAPTER 1
Algebraic numbers
Given any complex number α, write Q(α) for the smallest field containing Q and
α; this will consist of all fractions p(α)/q(α), where p, q ∈ Q[X] are polynomials.
3
4 1. ALGEBRAIC NUMBERS
Recall that if K, L are two fields with K ⊇ L then the degree [K : L] is the degree
of K, considered as a vector space over L (it may be infinite).
Lemma 1.1.3. Let α ∈ C. Then α is algebraic if, and only if, [Q(α) : Q] < ∞.
Suppose that α is algebraic. Then Q(α) = Q[α]. Suppose that mα , the minimal
polynomial for α, has degree n. Then a basis for Q(α) as a vector space over Q is
1, α, . . . , αn−1 , that is to say Q(α) may be identified with the polynomials in α of
degree < n, and hence [Q(α) : Q] = deg mα = n.
Proof. Suppose first that [Q(α) : Q] is finite, say equal to n. In particular, the
numbers 1, α, . . . , αn must be linearly dependent over Q, which means precisely
that α satisfies some polynomial equation with coefficients in Q (of degree 6 n)
and hence is algebraic.
In the other direction, suppose that α ∈ Q, and that mα is the minimal poly-
nomial of α, with deg mα = n. Consider the evaluation map Q[X] → Q[α], which
sends f (X) to f (α). This is a surjective ring homomorphism whose kernel is the
set of polynomials in Q[X] satisfied by α. As we saw above, this is precisely (mα ),
the ideal generated by mα . Therefore
Q[α] ∼
= Q[X]/(mα ).
Now (mα ) is a maximal ideal in Q[X] (since all ideals in Q[X] are principal, and
if (mα ) ⊆ (f ) then f |mα and so (f ) = (1) or (mα )). Therefore the quotient
Q[X]/(mα ) is actually a field. We have shown that the polynomial ring Q[α] is in
fact a field, and so of course it must be Q(α).
Suppose that f (α) ∈ Q[α]. By the Euclidean algorithm, f (X) = mα (X)q(X) +
r(X) where deg r < n, and so f (α) = r(α). That is, Q[α] is spanned by 1, α, . . . , αn−1 .
In the other direction, these elements are independent over Q since otherwise there
would be a nonzero polynomial of degree < n satisfied by α.
Remark. To help in understanding all this, let us explain a little more explicitly
and algorithmically why inverses exist in Q[α], a fact which is surprising at first
sight. Let f (α) ∈ Q[α], f (α) 6= 0. Then f is not divisible by mα and so is coprime
it. By the Euclidean algorithm there are polynomials q, p such that f (X)q(X) +
mα (X)p(X) = 1. Thus f (α)q(α) = 1, so q(α) is the inverse of f (α).
Examples. The field Q(i) = {a + bi : a, b ∈ Q}, with the inverse being given by
1 a−bi
a+bi = a2 +b2 .
√ √ 1√
√
a−b 2
Similarly Q( 2) = {a + b 2 : a, b ∈ Q}, with a+b 2
= a −2b2 .
2
Proof. The minimal polynomial of α has degree 6 n, so the result follows straight
away from Lemma 1.1.3.
We will need this twice. In Lemma 14.3.1 we will need it when k = Q(α) in
which case, since this field is contained in C, the proof goes exactly as for k = Q.
Later, in Lemma 9.2.1, we will need the case k = Fp .
(1.2) {ei fj : 1 6 i 6 n, 1 6 j 6 m}
spans K3 over K1 . (In fact (1.1) is an equality, the so-called tower law for field
extensions. This is because (1.2) is actually a basis for K3 over K1 , which is another
easy exercise). Applying (1.1) with K1 = Q, K2 = Q(α), and K3 = Q(α, β) we get
Proof. *The key fact we will need is that there are only finitely many fields
intermediate between Q and K. This follows from the fundamental theorem of
Galois theory: consider some K̃ ⊇ K (for example, the normal closure) such that
K̃/Q has finite degree and is Galois. Then the subfields of K̃ are in one-to-one
correspondence with the subgroups of Gal(K̃/Q). This being a finite group, it only
has finitely many subgroups.
Turning to the proposition at hand, certainly every number field is finitely gen-
erated, that is to say K = Q(α1 , . . . , αn ) for some n (if not, keep adding new
elements; the degree increases each time).
1.4. MORE EXAMPLES 7
Example 2 (Cubic fields). We have already discussed the example Q(21/3 ). This
is an example of a pure cubic field. More generally, one may consider α with a
minimal polynomial mα (X) = X 3 + pX + q; there is more on this, including the
criterion for irreducibility, on the first example sheet. This is the most general type
of cubic field since one may always remove the X 2 term from a cubic X 3 + aX 2 +
bX + c by substituting Y = X − a3 , and the resulting field will be the same. We
will occasionally touch on cubic fields as a source of examples on the sheets, but
already they can be difficult to work with by hand.
Example 4 (Quartic fields). General quartic (i.e. degree 4) fields are too com-
plicated as a source of examples in this course. However we will occasionally look
√ √
at biquadratic fields such as K = Q( 2, 3). In this case, the primitive element
theorem is not obvious just by looking at the field; it is an exercise to show that
√ √
K = Q(θ) where θ = 2 + 3 (for example).
Conjugates are distinct. In Lemma 1.5.2 below we will show that the field Q
is perfect, which means that the roots (in Q) of any irreducible polynomial are
distinct. Thus the conjugates of any algebraic number are distinct. These facts will
be familiar to anyone having taken a course on Galois theory, but we include the
(short) proof here.
We isolate a general lemma from the proof. We state it for general fields since
we will need the case k = Fp later, for a different purpose.
Lemma 1.5.1. Let k be a field, and suppose that f (X), g(X) ∈ k[X]. Suppose
that f, g gave a common root in some field extension of k. Then f (X) and g(X)
have a common factor in k[X].
Proof. Suppose not. Then f (X), g(X) are coprime in k[X], and so by Euclid’s algo-
rithm there are polynomials a(X), b(X) ∈ k[X] such that f (X)a(X) + g(X)b(X) =
1. If α is a common root of f, g (in some extension field of k) then substituting
X = α immediately gives a contradiction.
Lemma 1.5.2. Let f (X) ∈ Q[X] be irreducible. Then the roots of f in Q are
distinct. Thus the conjugates of any algebraic number are distinct.
Proof. If f had a repeated root β in C then f (X) = (X − β)2 g(X) (for some
f ∈ C[X]) and hence the derivative f 0 (X) = 2(X − β)g(X) + (X − β)2 g 0 (X) would
also have β as a root. By Lemma 1.5.1, f and f 0 would have a common factor
over in Q[X]. Since f 0 is not zero, this is contrary to the assumption that f is
irreducible.
1.5. CONJUGATES AND EMBEDDINGS 9
Remarks. The only place we used the fact that the underlying field is Q was
when we asserted that f 0 is not zero. Indeed, if
(1.3) f (X) = an X n + · · · + a0
then
Note that mα , since it is irreducible and satisfied by each αj , is also the minimal
polynomial for each of the conjugates αj .
Q(θi ) ∼
= Q[X]/(mθi ) = Q[X]/(mθj ) ∼
= Q(θj ).
Examples. When K = Q(i), the two embeddings are the identity map and
complex conjugation.
√
When K = Q( 2), the two embeddings are the identity map and the map which
√ √ √ √
sends 2 to − 2, thus σ(a + b 2) = a − b 2.
√
More generally the same holds for any quadratic field K = Q( d) with d a
squarefree integer.
When K = Q(21/3 ), there are three embeddings: the identity σ1 , the map σ2
defined by σ2 (21/3 ) = ω21/3 , and the map σ3 (21/3 ) = ω 2 21/3 . Note in particular
that for these embeddings (unlike the two quadratic examples) we do not have
σ(K) = K. (The reason for this is that K/Q is not Galois.)
1.6. Norms
n
Y
(1.5) NK/Q (α) := σi (α).
i=1
√ √ √ √
If K = Q( d) then NK/Q (a + b d) = (a + b d)(a − b d) = a2 − db2 . An
important thing to note here is that this will be nonnegative if d < 0 but not if
√ √
d > 0. For instance when K = Q( 2) we have NK/Q (a + b 2) = a2 − 2b2 which
certainly takes negative values.
The following facts follow immediately from the fact that the embeddings σi are
field homomorphisms preserving Q:
Thus NK/Q (α) is invariant under the whole Galois group G and hence, by Galois
theory, is rational.
Lemma 1.7.1. Suppose that e01 , . . . , e0n is another basis for K over Q and that
the change of basis is given by
X
(1.6) e0j = Akj ek ,
k
Proof. By the preceding lemma, we need only find one basis for which this is so.
Suppose K = Q(θ), and take the basis 1, θ, · · · , θn−1 , that is to say ej = θj−1 . Then
Mij = σi (θj−1 ) = xj−1
i , where xi := σi (θ). Note that the xi , being the conjugates
of θ, are distinct by Lemma 1.5.2. The determinant det M is then what is known as
Q
a Vandermonde determinant, and its value can be shown to be i<j (xi − xj ) 6= 0.
See Appendix D for the short proof.
Lemma 1.7.3. Let α ∈ K. Then NK/Q (α) is the determinant of the multiplication-
by-α map from K to K, considered as a Q-linear map.
Proof. Let e1 , . . . , en be some basis for K over Q. Let e0j := αej , and suppose that
X
(1.7) e0j = Akj ek
k
with Akj ∈ Q. Thus A is the matrix of the multiplication-by-α map, with respect
to the basis e1 , . . . , en . Let M = M (e1 , . . . , en ) and M 0 = M (e01 , . . . , e0n ). Then, as
1Note, however, that this is not canonically defined, since there is no natural ordering on the
embeddings σ1 , . . . , σn . Different orderings permute the rows of the matrix.
1.7. NORMS AND DETERMINANTS 13
we saw above,
(1.8) M 0 = M A.
and so
(1.9) M 0 = DM
where D is the diagonal matrix with Dii = σi (α). It follows, since M is nonsingular
(by Lemma 1.7.2), that A = M −1 DM , and therefore
Y
(1.10) det A = det D = σi (α) = NK/Q (α).
i
Examples. Let us first check a quadratic example. When K = Q(i), a basis for
K over Q is {e1 , e2 } = {1, i}. Let α = 2 + i. Then
Thus
a b c
NK/Q (α) = 2c a b = a3 + 2b3 + 4c3 − 6abc.
2b 2c a
14 1. ALGEBRAIC NUMBERS
1.8. Discriminants
In this section we introduce the notion of discriminant. We will use the word in
two different ways in these notes. First, in this chapter, a discriminant is associated
with an n-tuple of elements. In the next chapter we will use this notion to define
the discriminant ∆K of a number field, which is a single quantity associated to K
and somehow measuring its “size”.
Let K be a number field with embeddings σ1 , . . . , σn .
Definition 1.8.2. Suppose that α ∈ K. Then the trace trK/Q (α) is defined to
P
be i σi (α), the sum being over all embeddings of K.
Proof. *As with the norm, a short proof may be given using Galois theory, and
in fact the proof is almost exactly the same as for the norm: suppose K = Q(θ),
and let K̃ be the splitting field of θ, so that K̃/Q is Galois. For σ ∈ Gal(K̃/Q) the
embeddings σσ1 , . . . , σσn are a rearrangement of σ1 , . . . , σn , and so
X X
σ(trK/Q (α)) = σσk (α) = σk0 (α) = trK/Q (α).
k k0
The link between the discriminant and the trace is as follows. First note that
Remark. The discriminant, whilst being rational and the square of something
(det M ), is not necessarily positive. For instance,
1 i 2
∆Q(i)/Q (1, i) = = −4.
1 −i
The following fact about how discriminants fare under base change is immediate
from the corresponding fact for M , namely Lemma 1.7.1.
Lemma 1.8.5. Suppose that e1 , . . . , en and e01 , . . . , e0n ∈ K are related by e0j =
P
k Akj ek , where the matrix A has rational entries. Then
Algebraic integers
Shortly (in Proposition 2.1.4 below) we are going to prove that the algebraic
integers form a ring. The following lemma is very useful in that regard.
Lemma 2.1.3. Let K be a number field. Then α ∈ K is an algebraic integer if
and only if there is a nonzero finitely-generated Z-module V ⊆ K such that αV ⊆ V .
Proof. First suppose that α is an algebraic integer. Then we have αd =
Pd−1 i d
i=0 ai α for some rational integers ai . Thus α is in the Z-module generated
by 1, α, . . . , αd−1 , which therefore has the required property.
17
18 2. ALGEBRAIC INTEGERS
Proof. Suppose that α, β ∈ O. Then by Lemma 2.1.3 we can find finitely generated
Z-modules V (generated by e1 , . . . , en ) and W (generated by f1 , . . . , fm ) such that
αV ⊆ V and βW ⊆ W . Let V W be the Z-module generated by the products vw.
This is finitely generated, by the ei fj . Moreover,
and similarly
(αβ)V W ⊆ (αV )(βW ) ⊆ V W.
By the other direction of Lemma 2.1.3, both α + β and αβ are algebraic integers.
This completes the proof.
αn + an−1 αn−1 + · · · + a0 = 0,
By choosing m suitably, we may clear all the denominators and ensure that all of
man−1 , m2 an−1 , . . . , mn a0 are all integers.
Proof. We have already shown (just with the assumption that the ei lie in
K) that discK/Q (e1 , . . . , en ) ∈ Q. Recall that the definition of discriminant was
(det M )2 , where the (i, j)-entry of M is σi (ej ). By Lemma 2.2.1, each of these
entries is an algebraic integer. Therefore (since O is a ring) (det M )2 ∈ O. Hence
discK/Q (e1 , . . . , en ) ∈ O ∩ Q = Z.
2.3. Units
Let K be a number field, and OK its ring of integers. Note that OK (being
contained in a field) is an integral domain. A unit is an element u for which there
is v ∈ OK with uv = 1. Equivalently, the inverse u−1 (in the field K) in fact lies in
OK . It is easy to see that the units form a group under multiplication.
We will sometimes write U (OK ) for the group of units in OK .
Example. The units in Q are ±1. The units in Q(i) are {±1, ±i}. However,
√
Q( 3) has infinitely many units, and they can be very large (in the Euclidean norm
20 2. ALGEBRAIC INTEGERS
√ √ √
on R). Indeed, 7 + 4 3 is a unit since (7 + 4 3)(7 − 4 3) = 1, and hence so is any
√
power (7 + 4 3)n .
Observe that if e1 , . . . , en are an integral basis then they are also a basis for K as
a vector space over Q. This is because in any nontrivial Q-relation q1 e1 +· · ·+qn en =
0 we may clear denominators to get a Z-relation, which cannot exist by the definition
of integral basis. Thus e1 , . . . , en are n Q-linearly independent elements of K, and
must therefore be a basis.
Proof. [Proof of Theorem 2.4.1.] First observe that there is some Q-basis for K con-
sisting of elements of OK . This follows by taking an arbitrary basis and multiplying
up each element to get an element of OK , using Lemma 2.1.5. If e1 , . . . , en is such a
basis then discK/Q (e1 , . . . , en ) is a non-zero integer, by Corollary 2.2.3 and Lemma
1.7.2. Suppose that e1 , . . . , en ∈ OK are chosen so that | discK/Q (e1 , . . . , en )| is
minimal (subject to being non-zero). We claim that e1 , . . . , en is then an integral
basis.
Suppose this is not the case. Then (subtracting integer multiples of the ei )
P
there is some element i ci ei ∈ OK with, for some i, 0 < |ci | < 1. Without loss of
generality, i = 1. Set e01 := i ci ei . Then e01 , e2 , . . . , en is a basis for K as a vector
P
space over Q, all of whose elements lie in OK . Its base change matrix A relative to
e1 , . . . , en is given by Aj1 = cj and Aji = δij when i > 2. Thus det(A) = c1 and so
by Lemma 1.8.5
Integral bases are not unique. Let e1 , . . . , en and e01 , . . . , e0n be two bases for K
L L 0
over Q. Then the sums Zei and Zei are indeed both direct sums. If the base
change matrix is given by e0i = j Aji ej then it is easy to see that
P L 0 L
Ze ⊆ Zei
L i L 0
if, and only if, A ∈ Matn (Z), the n × n integer matrices. Similarly Zei ⊆ Zei
if, and only if, A−1 ∈ Matn (Z) is an integer matrix. This implies the following.
Lemma 2.4.3. Suppose that A ∈ Matn (Z). Then A is unimodular if and only if
det A = ±1.
Proof. The only if direction is easy: we have 1 = (det A)(det A−1 ), and if A is
unimodular then both det A and det A−1 are integers.
The if direction requires some nontrivial linear algebra, specifically Cramer’s
formula for the inverse of a matrix, that is to say 1/ det A times the adjoint matrix.
This formula makes it clear that if A ∈ Matn (Z) and det A = ±1 then A−1 ∈
Matn (Z).
Corollary 2.4.4. Suppose that e1 , . . . , en and e01 , . . . , e0n are two integral bases
for OK . Then
discK/Q (e01 , . . . , e0n ) = discK/Q (e1 , . . . , en ).
We have layered many definitions on top of one another. For the moment one
should, roughly thinking, imagine that ∆K describes the “size” or “density” of the
ring of integers OK . This interpretation will become a little clearer in Section 10.6.
Let us work through some of the concepts just discussed for quadratic fields
√
Q( d), d 6= 1 a squarefree integer.
√
Proposition 2.5.1. Let K = Q( d), d 6= 1 squarefree. Then an integral basis
for K is given by
√
• 1 and d if d ≡ 2, 3(mod 4);
2.6. COMPUTING AN INTEGRAL BASIS 23
√
• 1 and 12 (1 + d) if d ≡ 1(mod 4).
• 4d if d ≡ 2, 3(mod 4);
• d if d ≡ 1(mod 4).
√
Proof. Suppose that a + b d ∈ OK , where a, b ∈ Q. Then (by Lemma 2.2.1)
√ √ √
a − b d ∈ OK . In particular (a + b d) + (a − b d) = 2a (i.e., the trace) lies in OK ,
√ √
which means that a = 2` for some rational integer `. Also, (a + b d) − (a − b d) =
√
2b d lies in OK and hence so does its square 4b2 d. Since d is squarefree, the only
m
denominator b could have is 2. Thus we also have b = for some m ∈ Z. Thus
√2
everything in OK is, up to adding elements of Z ⊕ Z d, an element of the set
√ √
S := {0, 12 , 2d , 12 (1 + d)}. The middle two elements of S are easily seen not to be
√
algebraic integers, so we need only decide whether or not α = 21 (1 + d) ∈ O. The
minimal polynomial mα (X) is X 2 −X + 1−d
4 , so this is so if and only if d ≡ 1(mod 4).
The discriminants may now be calculated by simply evaluating 2 × 2 determi-
nants – we leave this to the reader.
It follows from Proposition 2.5.1 that quadratic fields are monogenic, meaning
that OK = Z[α] for some α. (Sometimes this is called a “power integral basis”).
Whilst many fields share this property, it is not universal and indeed there are cubic
fields which are not monogenic.
Proof. Let e01 , . . . , e0n be some integral basis. Let the base change matrix from the e0i
to the ei be A, thus A ∈ Matn (Z). Then by Lemma 1.8.5 we have discK/Q (e1 , . . . , en ) =
(det A)2 discK/Q (e01 , . . . , e0n ), and so
Lemma 2.6.2. Suppose that e1 , . . . , en and e01 , . . . , e0n are linearly independent
tuples in OK , and that e0i = j Aji ej , where A ∈ Matn (Z). Set M := Ze1 ⊕ · · · ⊕
P
Corollary 2.6.3. Suppose that e01 , . . . , e0n ∈ OK are linearly independent over
Q. Write M 0 = Ze01 ⊕ · · · ⊕ Ze0n . Then
Remark. This is tautologous (given what we have already proven) when e01 , . . . , e0n
is an integral basis. The point, of course, is that it applies more generally.
Proof. Let e1 , . . . , en be an integral basis for OK , and let A be the base-change
matrix expressing the e0i in terms of the ei . Then, by Lemma 1.8.5 and the definition
of ∆K ,
[OK : M 0 ] = [M : M 0 ] = det A.
Finally, we come to the lemma which is actually useful for computing integral
bases in practice.
Lemma 2.6.4. Suppose that K is a number field and that e1 , . . . , en are elements
of OK , independent over Q, which do not form an integral basis. Then there exists
a prime p with p2 | discK/Q (e1 , . . . , en ) and integers m1 , . . . , mn ∈ {0, . . . , p − 1}, not
1
all zero, such that p (m1 e1 + · · · + mn en ) ∈ OK .
This allows us to give an algorithm for computing an integral basis which, although
potentially painful, is guaranteed to terminate in finite time. The algorithm goes
as follows:
• Start with elements e1 , . . . , en of OK spanning K as a vector space over
Q (for example, one might start with a power basis 1, θ, . . . , θn−1 , the
existence of which is guaranteed by Proposition 2.1.6).
1
• For each prime p with p2 | discK/Q (e1 , . . . , en ), test all p (m1 e1 + ··· +
mn en ), 0 6 mi < p, not all mi zero, to see if they lie in OK .
• If none do, e1 , . . . , en is an integral basis.
1
• Suppose that p (m1 e1 + · · · + mn en ) ∈ OK , with (say) m1 6= 0. Set
e01 := 1
p (m1 e1 + · · · + mn en ), then replace e1 , . . . , en with e01 , e2 , . . . , en
and return to the start.
Let us additionally remark that we can save a factor of roughly p in the time
taken for the second step by observing that if there is some p1 (m1 e1 + · · · + mn en ) ∈
OK with p - mi , then we can find such an element with m1 ≡ 1(mod p), by multi-
plying up by the inverse of mi (mod p). Then we may reduce so that all the mi lie
between 0 and p − 1, and in particular mi = 1.
CHAPTER 3
α = x1 · · · xm = y1 · · · yn
Remark. One often says that if x and y differ by a unit then they are asso-
ciate. Thus, in a UFD, factorisations into irreducibles exist and are unique up to
reorderings and associates.
We start with the good news, which is that when R = OK factorisation into
irreducibles does always exist.
Lemma 3.1.3. Let OK be the ring of integers of a number field. Then every
x ∈ OK may be written, in at least one way, as a product of irreducibles.
Proof. We proceed by induction on the absolute value of the norm |NK/Q (x)|
which, by Lemma 2.2.2. If x is itself irreducible, we are done. Otherwise, we
have x = yz with neither y nor z a unit. Taking norms, we have NK/Q (x) =
NK/Q (y)NK/Q (z). Since neither y nor z is a unit, NK/Q (y), NK/Q (z) 6= ±1. (Here
27
28 3. IRREDUCIBLES AND FACTORISATION
we used Lemma 2.3.1.) It follows that |NK/Q (y)|, |NK/Q (z)| < NK/Q (x), and so
by induction y, z admit decompositions into irreducibles. Hence so does x.
Remark. This lemma holds in any commutative noetherian ring, a concept you
may wish to read up on.
There is more good news: the rings of integers in many small number fields such
√ √
as Q(i), Q( 2) and Q( −2) are UFDs. You have probably already seen this in an
earlier course by showing that these fields are Euclidean domains. We will not be
saying very much about Euclidean domains in this course. However, the fact that
these examples are UFDs may also be proven using the techniques we develop in
this course. We do this explicitly for Q(i) in Section 11.1.
√
3.2. Failure in Q[ −5]
However, there is bad news - it is not hard to come up with an example where
OK does not admit unique factorisation into irreducibles.
√
Lemma 3.2.1. When K = Q( −5), OK is not a UFD.
√ √
Proof. First note that, by Lemma 2.5.1, OK = Z[ −5] = {a + b −5 : a, b ∈ Z}.
Now observe that
√ √
6 = 2 × 3 = (1 + −5) × (1 − −5).
√ √
We claim that 2, 3, 1 + −5, 1 − −5 are all irreducible, and that neither 2 nor 3
√
are associate to 1 ± −5.
To see this, we use norms. Note that
√ √ √
NK/Q (a + b −5) = (a + b −5)(a − b −5) = a2 + 5b2 .
(3.1) 1, 4, 5, 6, 9, . . . .
Note that
√
NK/Q (2) = 4, NK/Q (3) = 9, NK/Q (1 ± −5) = 6.
None of these numbers 4, 6, 9 factors as a product of two smaller numbers in the
√
sequence (3.1), and so 2, 3, 1 ± −5 are all irreducible. Indeed, if we had 2 =
xy with neither x nor y a unit then, taking norms, we would have NK/Q (2) =
NK/Q (x)NK/Q (y), with neither NK/Q (x), NK/Q (y) being 1 by Lemma 2.3.1.
√
Neither 2 nor 3 is associate to 1 ± −5, because associate elements have the
same norm.
3.3. THE USEFULNESS OF UNIQUE FACTORISATION 29
We will be spending most of the rest of the course discussing unique factorisation.
√
As justification for this, let us see how to use unique factorisation in Z[ −2] to
solve the equation y 2 + 2 = x3 mentioned in the introduction.
1 = b(3a2 − 2b2 ).
This is a vert easy equation to solve over the integers. We must have either b = −1,
in which case 3a2 − 2 = −1, which is impossible, or else b = 1, in which case
√ √
3a2 − 2 = 1 and so a = ±1. This leads to y + −2 = (±1 + −2)3 and so y = ±5.
In the next few chapters we come to the main theme of the course: whilst OK is
not necessarily a UFD, we may recover a theory of unique factorisation by working
in the enlarged world of ideals.
First a word on notation. It is not uncommon (for example in previous iterations
of this course in Oxford) to use capital letters such as I, J, P, Q for ideals in OK .
However, it is also somewhat standard to use fraktur letters a, b, p, q. This is what
is done in the book of Stewart and Tall, and we will follow this convention too. In
particular, p and q will always denote prime ideals (we will recall the definition in
the next section).
Let us first recall the basic definitions, adapted to the notation of this course.
(x) := {αx : α ∈ OK }.
(x1 , . . . , xr ) := {α1 x1 + · · · + αr xr : α1 , . . . , αr ∈ OK }.
You have seen in a first course on rings that a PID is a UFD, not just for rings
of integers OK but for general integral domains. Indeed, when showing that a
Euclidean domain is a UFD, one first shows that it is a PID and then one shows
that all PIDs are UFDs.
The converse is not true in general: for instance Z[X, Y ] is a UFD (because a
polynomial ring over a UFD is a UFD) but it is not a PID since, for example, the
ideal (X, Y ) is not principal.
We will show later on that the converse is true in number fields.
Theorem 4.1.3. Let OK be the ring of integers of a number field. Suppose that
OK is a UFD. Then OK is a PID.
Proof. See Chapter 6. As we have remarked, this is not true for arbitrary integral
domains and so we must rely on properties at least somewhat specific to number
fields.
The picture we have at the moment (not all proven!) is as follows. We have a
map OK → Ideals(OK ). This is surjective if and only if OK is a UFD. Our plan is
to show that unique factorisation can always be recovered by working in the larger
world Ideals(OK ).
Let us pause to check that we are indeed building a nonempty theory, by giving
an example of a nonprincipal ideal. But the remarks above, to find such an ideal
we need to look in some K where OK is not a UFD. We have already discussed
√
such an example, K = Q( −5).
√ √
Lemma 4.2.1. Let K = Q( −5). Then the ideal a = (2, 1 + −5) generated by
√
2 and 1 + −5 is not principal.
a ( (1).
√ √ √
Indeed if 1 ∈ a then we would have 1 = 2(a+b −5)+(1+ −5)(c+d −5) for some
integers a, b, c, d. Comparing coefficients gives 1 = 2a + c − 5d, so c + d ≡ 1(mod 2),
and 2b + c + d = 0, so c + d ≡ 0(mod 2). This is a contradiction.
It follows that if a = (α) were principal then 1 < NK/Q (α) < 4 (in fact, that
√
NK/Q (α) = 2). However, recalling that NK/Q (a + b −5) = a2 + 5b2 , we see that
there is no such element.
4.4. NORMS. INTEGRAL BASIS FOR AN IDEAL. 33
Let us record some simple properties of ideals, somewhat specific to the number
field case.
Lemma 4.3.2. Let a be a nonzero ideal. Then the quotient ring OK /a is finite.
Proof. First note that if b ⊆ a then there is a natural surjective map from OK /b to
OK /a. Therefore it suffices to prove the statement for any nonzero ideal b contained
in a. By Lemma 4.3.1, it suffices to prove that OK /(a) is finite, for any non-zero
rational integer a. Switching a to −a if necessary, we may assume a > 0. Let
e1 , . . . , en be an integral basis for OK . Then
As we have seen, OK is a free abelian group of rank n (that is, it has an integral
basis). It is a general fact (see Appendix A) that any finite index subgroup of a
free abelian group of rank n is also free abelian of rank n. Thus a is free abelian of
rank n, or in other words a has an integral basis, that is to say
n
M
a= Ze0i
i=1
Lemma 4.4.3. Suppose that a = (α) is a principal ideal, for some α ∈ OK \ {0}.
Then N (a) = |NK/Q (α)|.
det A = NK/Q (α). The result follows immediately from Lemma 4.4.2.
In other words, the absolute value of the norm is respects the embedding OK →
Ideals(OK ), and generalises the notion of (absolute value of) norm on OK to a
notion on Ideals(OK ).
a · (1) = a.
Definition 4.5.2. Let a, b be two ideals in OK . Then we say that b|a if there
is an ideal c such that a = bc.
Note that if b|a then a ⊆ b. That is, division implies containment. Remarkably,
the converse is also true, but much harder to prove (Theorem 5.0.2). However, we
strongly suggest the reader keep this fact in mind when reading what follows.
Prime ideals. The notion of prime ideal is the standard one from ring theory,
specialised to the setting of number fields.
Lemma 4.5.4. An ideal p is prime if and only if the following is true: whenever
ab ⊆ p, either a ⊆ p or b ⊆ p.
Proof. Suppose first that p is prime, that ab ⊆ p, and that a is not contained in
p. Let x ∈ a \ p, and let y ∈ b be arbitrary.
Then xy ∈ ab ⊆ p and hence xy ∈ p. But p is prime, so either x or y lies in p.
Since x ∈
/ p we must have y ∈ p. Therefore b ⊆ p.
Conversely, suppose that p is not prime, and find x, y ∈
/ p with xy ∈ p. Then
if we take a = (x) and b = (y) we see that ab = (xy) ⊂ p, but neither a nor b is
contained in p.
The main theorem of this chapter, and one of the main theorems of the course,
is the following.
Theorem 5.0.1. Let K be a number field with ring of integers OK . Then any
non-zero proper ideal a admits a unique factorisation a = p1 · · · pk into prime ideals.
Remark. This statement is actually cleaner than the statement of unique fac-
torisation over the integers, because there is no ambiguity up to multiplication by
units. Indeed if x and y are associates then the ideals (x) and (y) are the same.
During the proof of Theorem 5.0.1, we will establish two facts of independent
interest. First, we will prove that containment of ideals is equivalent to division:
Second, we will show that prime ideals behave like prime numbers in the follow-
ing sense.
Lemma 5.0.3. Let p be a prime ideal, and suppose that p|ab. Then p|a or p|b.
Once these results are proven, one can easily establish analogues of facts familiar
from elementary number theory. For instance, we can say that two ideals a and
b are coprime if there is no prime ideal p dividing both of them. Using unique
factorisation one may then show that if a and b are coprime ideals dividing a third
ideal c, then ab|c.
We turn now to the proof of Theorem 5.0.1, starting with some basic preliminary
facts.
Lemma 5.1.1. Let a be a proper ideal in OK . Then there is a prime ideal p with
a ⊆ p.
only continue for finitely many steps before we reach a maximal (and hence prime)
ideal.
Remark. In fact, any any ring with 1, every ideal is contained in a maximal (and
hence prime) ideal; this is a standard application of Zorn’s lemma (and hence relies
on the axiom of choice). The proof of Lemma 5.1.1 uses the fact that the index of
nonzero ideals in OK is finite to give a more down-to-earth proof in this case.
Lemma 5.1.2. Let a be a nonzero ideal in OK . Then there are prime ideals
p1 , . . . , pk such that p1 · · · pk ⊆ a.
Proof. Suppose the result is false. Then there is a counterexample a with minimal
norm. Clearly a is not itself prime, and therefore we may find x, y ∈ OK with
/ a. The ideals a0 := a + (x) and a00 := a + (y) strictly contain a. It
xy ∈ a but x, y ∈
is immediate from the definition of norm that N (a0 ), N (a00 ) < N (a), and hence by
minimality we have
p01 · · · p0k0 ⊆ a0 ,
p001 · · · p00k00 ⊆ a00 .
Finally, observe that a0 a00 ⊂ a, since a is an ideal and xy ∈ a.
Remark. What we are really using is the fact that OK is noetherian, that is to
say there is no infinite ascending chain of ideals. This property follows immediately
from the fact that nonzero ideals have finite index, which is (of course) the main
ingredient in the proof of Lemma 5.1.2.
The key ingredient in the proof of Theorem 5.0.1 is the following, which is a far
less obvious result than the ones we have established so far.
Proposition 5.2.1. Let a be an ideal in OK . Then there is an ideal b such that
ab is principal.
Remarks. The title of the section comes from the fact that b is indeed an inverse
to a in the ideal class group, which we shall introduce later.
Before proving Proposition 5.2.1, we assemble some lemmas. Here is the first of
them.
Lemma 5.2.2. Suppose that a is a nonzero proper ideal (thus it is not all of
OK ). Then there is some θ ∈ K \ OK such that θa ⊆ OK .
Proof. Let x be a nonzero element of a. Thus (x) ⊆ a. By Lemma 5.1.2 there are
prime ideals p1 , . . . , pr such that
p1 · · · pr ⊆ (x).
5.2. FINDING AN INVERSE 39
(5.1) p1 · · · pr ⊆ (x) ⊆ a ⊆ p,
so p1 · · · pr ⊆ p.
Since p is prime, by Lemma 4.5.4 there is some i, without loss of generality
i = 1, such that p1 ⊆ p. Since prime ideals are maximal (specifically, by Lemma
4.5.5) we in fact have p = p1 , and so by (5.1)
(5.2) a ⊆ p1 .
Here is the second preparatory lemma for the proof of Proposition 5.2.1.
Proof. This is a special case of Lemma 2.1.3, since a is a Z-module. (Recall the
proof: Let e1 , . . . , en be an integral basis for a. Certainly θai ∈ a for all i, and
P
so for some integer matrix A we have θei = j Aji ej , for all i. Thus the column
vector (e1 , . . . , en )T lies in the kernel of A − θI, which is therefore singular, and so
det(A − θI) = 0. This is a monic polynomial with integer coefficients, satisfied by
θ.)
With these two preparatory lemmas in hand, we may prove Proposition 5.2.1
itself. In fact we will show more: that for any nonzero x ∈ a there is an ideal b
such that ab = (x).
40 5. UNIQUE FACTORISATION INTO PRIME IDEALS
Define
b := {y ∈ OK : ya ⊆ (x)}.
That is, b is the biggest ideal for which ab ⊆ (x). To complete the proof we need
to show that ab is not properly contained in (x).
1
Define c := x ab. Then c is an ideal in OK , and we want to show that c is in
fact all of OK . Suppose, as a hypothesis for contradiction, that this is not the case.
By our first preparatory lemma, Lemma 5.2.2, there is some θ ∈ K \ OK such that
1 1
θc ⊆ OK . Since x ∈ a, b = x (x)b ⊂ x ab = c, that is to say b ⊆ c. Therefore
θb ⊆ OK .
Also, θba = θc(x) ⊆ OK (x) = (x). It therefore follows from the definition of b
that θb ⊆ b.
From Lemma 5.2.3, θ is an algebraic integer. This is a contradiction, since
θ ∈ K \ OK . This concludes the proof.
The proof of Proposition 5.2.1 was quite involved. However, now we have it in
hand, we can reach a number of pleasant consequences quite quickly.
Proposition 5.0.2. Suppose that a ⊆ b. Then there is some c such that a = bc.
In other words, b|a if and only if a ⊆ b.
Recall Lemma 4.5.4: this stated that if p is a prime ideal and ab ⊆ p then either
a ⊆ p or b ⊆ p. In the light of Proposition 5.0.2, this may be rephrased in the
following much more suggestive form.
Lemma 5.3.2. Let p be a prime ideal, and suppose that p|ab. Then p|a or p|b.
As we shall shortly see, Lemma 5.3.2 implies unique factorisation into prime
ideals quite easily.
5.5. FINDING THE PRIME IDEALS 41
We may now proceed to the proof of unique factorisation, which is quite straight-
forward now that we have prepared the ground. Let us recall the statement.
Theorem 5.0.1. Let K be a number field with ring of integers OK . Then any
non-zero proper ideal a admits a unique factorisation a = p1 · · · pk into prime ideals.
Proof. We first show existence of some factorisation into prime ideals. This we
do by induction on N (a). We know from Lemma 5.1.1 that there is some prime
ideal p with a ⊆ p or, (as we now know) p|a. Let b be such that a = pb. Then
a ⊆ b. Moreover, a is a proper subset of b, since if not we would have bp = b
which, by cancellation, would imply p = OK . It follows that N (b) < N (a), and so
by induction b is a product of primes. (Once again, what we are really using here
is the fact that OK is noetherian, that is to say has no infinite ascending chain of
ideals.)
To prove uniqueness, we use Lemma 5.3.2 repeatedly, in a manner entirely anal-
ogous to the proof of unique factorisation in Z. Suppose that
p1 · · · pk = q1 · · · qm .
Then, by Lemma 5.3.2, p1 divides some qi , say p1 |q1 . Thus q1 ⊆ p1 , which means,
by Lemma 4.5.5, that p1 = q1 .
Applying the cancellation property, Corollary 5.3.1, we see that
p2 · · · pr = q2 · · · qm .
Proposition 5.5.1. Every prime ideal p occurs as the prime factor of a unique
(p), where p is some rational prime.
In this brief chapter we prove Theorem 4.1.3: that is, if OK is a UFD, then it
is a PID. Recall that this fails for general rings (for example Q[X, Y ]) and so we
must use some specific properties of OK . The key fact we will use is Lemma 5.3.2:
if p is a prime ideal in OK , and if p|ab, then p|a or p|b.
Most of this material should be in a first course in ring theory but there is
certainly no harm in refreshing our memory.
Let R be an integral domain (such as OK ). Recall that x ∈ R is prime if x|yz
implies that x|y or x|z.
Proof. Suppose that x is prime and that x = ab. Then either x|a or x|b, without
loss of generality the former. Then a = xv for some v. Thus x = (xv)b and so
1 = vb, which means that b is a unit.
The converse is not true: irreducibles need not be prime. However, this is true
when R is a UFD. (In fact, this characterises UFDs, but we do not need this fact
here.)
Proof. Suppose x is irreducible and that x|yz. Then xv = yz for some v. Factor
v, y, z into irreducibles, obtaining xv1 · · · vn = y1 · · · yk z1 · · · zm . By uniqueness of
this factorisation, x must be one of the yi (say) up to a unit, which means that x|y.
Lemma 6.1.3. Let x ∈ OK be prime. Then the principal ideal (x) ∈ Ideals(OK )
is prime. Conversely, suppose the principal ideal (x) is prime; then x is prime.
43
44 6. IRREDUCIBLES AND FACTORISATION, REVISITED
We can now prove Theorem 4.1.3, that is to say if OK is a UFD then it is also
a PID.
Every ideal can be factored into prime ideals. Therefore it is enough to show
that if OK is a UFD then all prime ideals p in OK are principal.
Let p be a prime ideal. Let α ∈ p, so that p|(α). Let α = α1 · · · αk be the
(essentially unique) factorisation of α into irreducibles in OK . By Lemma 6.1.2,
the αi are all primes in OK . By Lemma 6.1.3, all of the (αi ) are prime ideals.
Therefore the factorisation of (α) into prime ideals is (α1 ) · · · (αk ). Since p|(α),
it follows from Lemma 5.3.2 that p is one of the (αi ), and therefore it is principal.
This concludes the proof.
CHAPTER 7
So far, we have made very limited use of the concept of the norm of an ideal.
We have used the fact that |OK /a| is finite to avoid Zorn’s lemma (in the proof
of Lemma 5.1.1) and (essentially) to prove that OK is noetherian (in the proof of
Lemma 5.1.2, and again in final part of the proof of Theorem 5.0.1 itself).
Now that we Theorem 5.0.1 in hand, we can revisit the notion of norm of an
ideal and establish some important further facts about it.
The main result of this section is the following very useful fact.
Proposition 7.1.1. For any two ideals a and b we have N (ab) = N (a)N (b).
We say that two ideals a and b are coprime if they do not have any prime (ideal)
factors in common.
Proof. It is always the case that ab ⊆ a ∩ b, thus a ∩ b|ab. In other other direction,
note that a ∩ b ⊆ a and so a|a ∩ b. Similarly b|a ∩ b. Thus, since a, b do not share
any prime factors, ab|a ∩ b. The result follows.
N (ab) = |OK /ab| = |OK /(a ∩ b)| = |(OK /a) ⊕ (OK /b)| = N (a)N (b).
Lemma 7.1.3. Let p be a prime ideal and t an integer. Then N (pt ) = N (p)t .
45
46 7. MORE ON NORMS OF IDEALS
Remark. Here, when writing the quotient pi /pi+1 , we are ignoring the ideal
structure and taking the quotient as abelian groups.
Proof. By the cancellation lemma for ideals, pi+1 is strictly contained in pi .
Therefore we may pick some α ∈ pi \ pi+1 . Note that
Define a homomorphism
π : OK → pi /pi+1
by
π(x) := xα + pi+1 .
By (7.1), π is surjective.
We claim that ker π = p. Write (α) = pi a, where a is coprime to p. Now
Corollary 7.1.5. Let a be an ideal for which N (a) is prime. Then a is prime.
We have already seen in Lemma 4.3.1 that every ideal a contains some rational
integer a, so that (a) ⊆ a. We now know that this means a|(a). That is, every ideal
divides the ideal generated by some rational integer. (The same result follows from
Proposition 5.5.1 and the fact that a factors into primes.)
Here is a more precise version of the same fact, which will be useful when
bounding class numbers later on.
7.3. AUTOMORPHISMS 47
Proof. Let m := N (a). By the definition of norm, |OK /a| = m. Therefore the
×m map is trivial on the additive group OK /a, and so in particular m ∈ a. This is
precisely what it means for a to divide (m).
A corollary of this, and unique factorisation into prime ideals, is there are only
finitely many ideals of a given norm.
7.3. Automorphisms
In this section we record a small lemma, Lemma 7.3.1, which is not really
important in the theoretical development but is occasionally useful in computations,
as we shall see in the next chapter.
Suppose that K is a number field and that σ = σi : K → C is an embedding
which fixes K. That is, σ : K → K is a field automorphism fixing Q.
By Lemma 2.2.1, σ maps OK to itself.
Proof. We leave (i) and (ii) as exercises. For (iii), note that there is a bijection
OK /a → OK /aσ given by
t + a 7→ σ(t) + aσ ,
thus
N (a) = |OK /a| = |OK /aσ | = N (aσ ).
This completes the proof.
CHAPTER 8
√
Q( −5) revisited
49
√
50 8. Q( −5) REVISITED
Finally, we remark (and you should check) that in fact p1 = p2 , but q1 and q2
are distinct. (Later, we will introduce some terminology for this: 2 is “ramified” in
√
Q( −5), but 3 is not. )
CHAPTER 9
In this chapter we will examine some strategies for factoring ideals into prime
ideal factors. We begin with the case of rational prime ideals (p), where there is a
useful tool – Dedekind’s lemma. At the end of the chapter we indicate a general
strategy for reducing to this case.
for distinct prime ideals pi and positive integer exponents ei , called the ramification
index of pi .
Taking norms, we see that each N (pi ) must equal some power pfi of p; the
number fi is called the degree of pi . Taking norms of both sides of (9.1) yields
r
X
(9.2) n= ei fi .
i=1
There are bits of language to describe various extreme situations. For instance,
There are also notions such as wild and tame ramification, which have to do with
the possibility that p divides ei ; these are not relevant in this course.
51
52 9. FACTORING INTO PRIME IDEALS IN PRACTICE
Let f (X) ∈ Z[X], and let f (X) ∈ Fp [X] be its reduction mod p. If f is reducible,
then so is f . However, the converse is not true: X 2 + 1 is irreducible in Z[X], but
factors as (X + 1)2 in F2 [X].
The main tool in the proof of Dedekind’s lemma is the following result about
this situation. This is perhaps a little subtle and the proof is even less examinable
than many of the others in the course.
Lemma 9.2.1. Suppose that α ∈ O has minimal polynomial m(X) ∈ Z[X]. Let
m(X) ∈ Fp [X] be the reduction of m mod p, and let g(X) be any monic irreducible
factor of m(X). Let α be a root of g (in the algebraic closure of Fp ). Then
(i) There is a natural ring homomorphism π : Z[α] → Fp [α] given by π(f (α)) =
f (α);
(ii) ker π = (p, g(α));
(iii) (p, g(α)) is a maximal ideal in Z[α] of index pdeg g .
(iv) If g 1 , g 2 are different irreducible factors of m, the corresponding ideals
(p, g1 (α)) and (p, g2 (α)) are distinct.
Proof. (i) It needs to be checked that π is well defined, in other words that if
f (α) = 0 then f (α) = 0. However, if f (α) = 0 then m(X)|f (X), thus f (X) =
m(X)q(X) for some q ∈ Z[X]. Reducing mod p, we see that m(X)|f (X), and
hence certainly g(X)|f (X). Since g(α) = 0, it follows that f (α) = 0.
(ii) It is clear that π(p) = π(g(α)) = 0, so certainly (p, g(α)) ⊆ ker π.
For the other direction, suppose that π(f (α)) = 0, or in other words that f (α) =
0. Now note that g is irreducible in Fp [X] and is satisfied by α, and hence it is
the minimal polynomial of α (over Fp ). It follows that g|f , that is to say f (X) =
g(X)q(X) for some q(X) ∈ Fp [X]. Lifting (arbitrarily) to Z[X], we have f (X) =
g(X)q(X) up to some multiple of p, and so indeed f (α) ∈ (p, g(α)).
(iii) The map π is clearly surjective, and so
Fp [α] ∼
= Z[α]/ ker π.
By Lemma 1.1.5, Fp [α] is a field; this implies that ker π is a maximal ideal. Moreover
the degree [Fp [α] : Fp ] is deg g, so in particular it has size pdeg g .
(iv) As a consequence of the first three parts, Z[α]/(p, g(α)) is a field extension of
Fp , and α maps under the quotient to a root of g. Thus if we did have (p, g1 (α)) =
(p, g2 (α)) then g 1 , g 2 would have a common root in some extension of Fp . By Lemma
9.3. DEDEKIND’S LEMMA 53
Proof. Much follows immediately from Lemma 9.2.1. Indeed, from (iii) of that
Lemma, pi is prime, and
N (p1 )e1 · · · N (pr )er = pe1 deg g1 +···+er deg gr = pdeg m = pdeg m = pn ,
which is the norm of the right-hand side. It follows that the inclusion (9.3) is in
fact an equality.
One fairly commonly finds the need to factor an arbitrary ideal a ⊆ OK into
prime ideals. This can be a little tedious, but here is a general strategy which will
always work. Things can often be sped up with ad hoc observations.
• Begin by finding a rational integer m ∈ a. To do this, first pick α ∈ a,
and then find a polynomial f ∈ Z[X], f (X) = cn αn + · · · + c0 satisfied by
α (a good choice is the minimal polynomial). Then c0 = −α(c1 + c2 α +
· · · + cn αn−1 ) lies in a.
• We have a|(m). Factor m into rational primes pi . We may then apply
Dedekind to each (pi ).
• We now have a list of all possible prime ideal factors of a. Note they may
occur with multiplicity. To find out which of them actually are prime
factors of a, we need to be able to test when b|a, or in other words when
a ⊆ b. This can often be done in an ad hoc way; if necessary, one can
explicitly see if each generator of a is in the OK span of the generators
of b by writing everything in terms of an integral basis and then solving
9.5. FACTORING A GENERAL IDEAL 55
Fractional ideals. The class group looks more natural if we introduce the notion
of a fractional ideal. This is a subset of K of the form
x−1 a := {x−1 a : a ∈ a} ⊆ K,
57
58 10. THE CLASS GROUP
Unlike the ideals, however, the non-zero fractional ideals form a group under
multiplication. This follows from Proposition 5.2.1 and the fact that every non-
zero principal fractional ideal is invertible, since (x)(x−1 ) = (1). This group is
often denoted by Div(OK ).
The ideal class group Cl(K) is then isomorphic to the quotient of Div(OK ) by
the subgroup of principal ideals.
In this section we will state, and set up the proof of, the most important theorem
about the ideal class group. This is the fact that it is a finite group. We establish
this together with additional information, the Minkowski bound, which can be used
to calculate the group in practice (we will present several examples in the next
chapter). The key statement is Theorem 10.2.3 below.
The proof is by no means trivial. It involves tools from the geometry of numbers
(see Section 10.4 for a brief introduction, and Appendix B for proofs) as well as
quite a number of other nontrivial ideas. Because the proof is quite hard, we will
present the imaginary quadratic case (which is conceptually easier) first, in Section
10.5, and then the general case in Section 10.6.
√
Lemma 10.2.2. Let Q( d), d 6= 1 a squarefree integer, be a quadratic field.
Then MK is given as follows:
√
(i) If d > 0 and d ≡ 2, 3(mod 4), MK = d;
√
(ii) If d > 0 and d ≡ 1(mod 4), MK = 12 d;
p
(iii) If d < 0 and d ≡ 2, 3(mod 4), MK = π4 |d|;
p
(iv) If d < 0 and d ≡ 1(mod 4), MK = π2 |d|.
Remark. (ii) is the key statement; the others follow almost immediately from it.
Indeed, recall Lemma 7.2.1, which states that a|(N (a)). Then (ii) implies that the
(ideal) divisors of the ideals (a), with a a rational integer 6 MK , represent every
class in Cl(K). (i) follows immediately. Factoring each such a into rational primes,
(iii) also follows straight away.
Definition 10.2.4. The size of Cl(K) is called the class number of K and it is
denoted hK .
In this section we give an initial reduction toward the proof of Theorem 10.2.3,
showing that it is a consequence of the following result, which states that every
ideal a contains an element of small norm (relative to the norm of a).
Proposition 10.3.1 (Elements of small norm). Let K be a number field and let
a be a nonzero ideal in OK . Then there is some x ∈ a with |NK/Q (x)| 6 MK N (a).
This proposition contains all the real difficulties in the proof of Theorem 10.2.3
and occupies the last few sections of this chapter. To conclude this section, we
deduce Theorem 10.2.3 from it.
10.3.1, c contains an element y with |NK/Q (y)| 6 MK N (c). Now (y) ⊆ c, that is to
say c divides (y), and so there is a with ca = (y). In the ideal class group, we have
[b] = [c]−1 = [a]. Taking norms, and using Lemma 4.4.3, we have
In the proof of Proposition 10.3.1, we will use the geometry of numbers, which
can be roughly defined as the study of when convex bodies intersect lattices.
A lattice Λ in Rn is the free abelian group generated by n linearly independent
vectors v1 , . . . , vn . The determinant det(Λ) is | det(v1 , . . . , vn )|; it depends only on
Λ, and not on the choice of integral basis v1 , . . . , vn . For more on lattices, including
a proof of this fact, see Appendix A.
The result from the geometry of numbers that we shall need is the following
result, known as Minkowski’s first theorem.
We turn now to the proof of Proposition 10.3.1. We will give the proof in the
√
imaginary quadratic case K = Q( d), d < 0, as it is rather easier to understand
than the general case, and also most of the examples we will consider will be of this
form.
As usual, we think of K as embedded in C. Now let Φ : C → R2 be the usual
map picking out real and imaginary parts, that is to say Φ(z) := (Re z, =z).
The key observation is that Φ(a) is a lattice in R2 , and that the set of elements
of small norm pushes forward under Φ to be contained within a convex body, and
therefore we may apply the geometry of numbers.
√
Let K = Q( d) be an imaginary quadratic field, and let a be an ideal in OK . We
know from Section 4.4 that OK has an integral basis e1 , e2 . Thus Φ(OK ) ⊆ R2 is
the Z-module generated by Φ(e1 ), Φ(e2 ), which is a lattice of determinant | det N |,
where !
Re e1 Re e2
N := .
=e1 =e2
10.6. ELEMENTS WITH SMALL NORM: GENERAL CASE 61
(We will see shortly that this determinant is nonzero, so this is a lattice.) On the
other hand, the two embeddings σ1 , σ2 of K into C are the identity, and complex
conjugation. Therefore from the definition of discriminant we have ∆K = (det M )2 ,
where !
e1 e2
M := .
e1 e2
One may easily check that | det N | = 21 | det M |, and so
1p
(10.2) | det(Φ(OK ))| = |∆K |.
2
Note that if desired one could also simply check this directly, dividing into two cases
√
according to whether d ≡ 1(mod 4) or not. For instance, in Q( −5) the integral
√ √
basis {1, −5} pushes forward under Φ to {v1 , v2 } with v1 = (1, 0), v2 = (0, 5),
√
and | det(v1 , v2 )| = 5. As we have already seen, ∆Q(√−5) = −20.)
Φ : K → Rr 1 × Cr 2 ∼
= Rn
given by
Φ(x) = (σ1 (x), . . . , σr1 (x), σr1 +1 (x), . . . , σr1 +r2 (x)).
To spell it out,
Φ(x) = (σ1 (x), . . . , σr1 (x), Re σr1 +1 (x), =σr1 +1 (x), . . . , Re σr1 +r2 (x), =σr1 +r2 (x)).
Remark. One should probably think of Φ(K) as “K ⊗Q R” but I will not elaborate
on this comment.
√
Example. Suppose that K = Q( 2). Then
√ √ √
Φ(a + b 2) = (a + b 2, a − b 2).
is a lattice in R2 . This, it turns out, is a general feature, and moreover we have the
following lemma, which directly generalises (10.2).
σ1 (e1 ) . . . σr1 (e1 ) Re σr1 +1 (e1 ) =σr1 +1 (e1 ) ... =σr1 +r2 (e1 )
.. ..
NT
:=
. .
σ1 (en ) . . . σr1 (en ) Re σr1 +1 (en ) =σr1 +1 (en ) . . . =σr1 +r2 (en )
10.6. ELEMENTS WITH SMALL NORM: GENERAL CASE 63
Proof. The deduction of this from Lemma 10.6.1 is the same as the deduction of
(10.3) from (10.2), so we do not repeat it.
This is generally not convex (although, as we saw in the last section, it is when
r1 = 0 and r2 = 1).
Solution. B contains a relatively large convex set B 0 , and we can use this
instead. Indeed, set
Now we have
1 r1 π r2
(10.6) vol(B 0 ) = 2 ( ) (nR1/n )n .
n! 2
(this is a multivariable integration calculation, which I have put on Sheet X).
Using Lemmas 10.6.2 and (10.6), a short computation now confirms that vol(B 0 ) >
2n det(Φ(a)) if and only if
n! 4 r2 p
R> n
( ) N (a) |∆K |,
n π
that is to say if and only if
R > MK N (a).
If R does satisfy this inequality, Minkowski’s Theorem (Theorem 10.4.1) tells us
that B 0 contains a point of Φ(a) which, by (10.5), implies that a contains an element
of norm at most R.
The proof of Proposition 10.3.1 in the general case is now finished.
CHAPTER 11
In this chapter we compute the class groups of some example imaginary qua-
dratic fields K. The general procedure is always
(i) Observe the basic features of K (ring of integers, integral basis, discrimi-
nant etc) and write down the Minkowski bound MK . By Theorem 10.2.3,
generators for Cl(K) may be found amongst the prime divisors of (p),
p 6 MK .
(ii) Factor all of the ideals (p), where p 6 MK is a rational prime, using
Dedekind’s theorem. This will give an explicit list of prime ideals gener-
ating Cl(K).
(iii) Figure out what relations there are, in the ideal class group, between the
prime ideals generated in (ii).
Items (i) and (ii) are purely formulaic, but there is a little bit of an art to (iii), at
least as we shall do things in this course. However, in the imaginary quadratic case
there is a key trick available: one can easily list the elements of OK (if any) of a
given norm, since the norm takes only positive values.
If a = (α) is principal then (Lemma 4.4.3) N (a) = |NK/Q (α)| = NK/Q (α). Thus
one can test whether or not an ideal a is principal by writing down all the elements
α ∈ OK with NK/Q (α) = N (a) and then testing whether a = (α) or not, which in
practice is pretty straightforward. In particular, if N (a) is not the norm of some
element, a cannot be principal. (However, the converse is not true.)
We will work through four examples according to the scheme detailed above.
In all cases, the basic features of K have already been worked out in Propositions
2.5.1 (integral bases) and 10.2.2 (Minkowski constant), which the reader should
recall now.
Let us begin by giving a new proof of the following fact, which is likely to be
familiar from a first course on rings.
4
Proof. By Lemma 10.2.2 (part (iii)), MK = π < 2. Since there are no primes less
than 2, Theorem 10.2.3 (ii) immediately implies that Cl(K) is trivial.
Corollary 11.1.2. Let p be an odd prime with p ≡ 1(mod 4). Then p is a sum
of two squares.
Proof. Let K = Q(i). Recall Proposition 9.4, which details the manner in which
rational primes split in OK = Z[i]. If p ≡ 1(mod 4) then (p) splits as p1 p2 in OK ,
where p1 , p2 have norm p. Since (as we now know) OK is a PID, p1 is principal,
say p1 = (a + ib) for some a, b ∈ Z. Taking norms, we see that
√
11.2. Q( −5)
We have already said a lot about this field, but let us revisit it in the light of
our new techniques.
√ 4
√
(i) Since d ≡ 3(mod 4), OK = Z[ −5]. By Lemma 10.2.2 (iii), MK = π 5<3
(to check this without resorting to a calculator, square up both sides to see that it
is enough to show that π 2 > 80/9, which is obvious since π > 3). It follows from
the Minkowski bound, Theorem 10.2.3, that generators of Cl(K) may be found
amongst the (ideal) prime factors of (2).
√
(ii) The minimal polynomial m(X) for −5 is X 2 + 5. Over F2 , this factors as
√
(X + 1)2 . By Dedekind’s lemma we therefore have (2) = p2 where p = (2, 1 + −5)
is a prime ideal of norm 2.
√
(iii) Since NK/Q (a + b −5) = a2 + 5b2 , there is no element of OK of norm 2.
Therefore p is not principal.
The only conclusion now is that Cl(K) is a cyclic group of order two, generated
by [p]. In particular, hK = 2.
√
11.3. Q( −29)
√ 4
√
(i) Since d ≡ 3(mod 4), OK = Z[ −29]. By Lemma 10.2.2 (iii), MK = π 29 <
7. Thus, by the Minkowski bound, generators of Cl(K) may be found amongst the
(ideal) prime factors of (2), (3) and (5).
√
(ii) The minimal polynomial m(X) for −29 is X 2 + 29.
√
11.3. Q( −29) 67
√
Over F2 this factors as (X+1)2 , so by Dedekind (2) = p2 where p = (2, 1+ −29)
has norm 2.
Over F3 this factors as (X + 1)(X − 1), so by Dedekind (3) = q3 q03 where
√ √
q3 = (3, 1 + −29), q03 = (3, −1 + −29) are distinct prime ideals of norm 3.
Over F5 this factors as (X + 1)(X − 1), so by Dedekind (5) = q5 q05 where
√ √
q5 = (5, 1 + −29), q05 = (5, −1 + −29) are distinct prime ideals of norm 5.
(iii) Since [q03 ] = [q3 ]−1 , [q05 ] = [q5 ]−1 , the class group is generated by p, q3 , q5 .
However, we need to do quite a lot more work to determine it completely. We make
the following preliminary observations.
The above are at least somewhat scientific, but we got stuck with q3 , and to finish
the job it really helps to “observe” the relation
√ √
(2)(3)(5) = (30) = (1 + −29)(1 − −29).
The prime factorisation of the left-hand side is of course p2 q3 q03 q5 q05 , and the two
√
(principal) ideals on the right hand side both have norm 30. Thus (1 + −29)
must be one of pq3 q5 , pq3 q05 , pq03 q5 , pq03 q05 . Whichever holds, we see that [q3 ] is in
√
the group generated by [p] and [q5 ]. (For instance, if (1 + −29) = pq03 q5 then
[p][q3 ]−1 [q5 ] is the identity).
We are now done: Cl(K) is generated by [p], which has order 2, and [q5 ], which
has order 3, and therefore Cl(K) is cyclic of order 6. (It is easy to conclude from all
this that in fact [q3 ] has order 6, which explains why it was troublesome to analyse!)
68 11. EXAMPLE CLASS GROUP CALCULATIONS
Here is another way in which we could have finished the argument, once we
found elements of order 2 and 3 in the class group. By Theorem 10.2.3 (ii), every
ideal class contains an ideal a with N (a) 6 MK < 7. However, the distinct ideals
of norm less than or equal to 6 are (1), p, q3 , q03 , (2), q5 , q05 , pq3 and pq03 . Thus the
class group has size at most 9, and the only such group with elements or order 2
and 3 is Z/6Z.
√
11.4. Q( −163) and the Rabinowitch Phenomenon
Remarks. At first sight1, the implication (i) ⇒ (ii) seems completely remarkable.
Proof. We will show (i) ⇒ (iii) ⇒ (ii).
To show (i) ⇒ (iii), we will try to evaluate the class number hK , where K =
√
Q( −A), in the same manner that we did for the examples in Chapter 11. We
√
have OK = Z[ 1+ 2−A ], since −A ≡ 1(mod 4). By Lemma 10.2.2 (iv) the Minkowski
√ √
constant MK is π2 A < π4 a. Thus generators of Cl(K) may be found amongst
√
the (ideal) prime factors of the principal ideals (p), where p 6 π4 a is a rational
prime.
√
1+ −A
Let p be such a prime. The minimal polynomial m(X) of 2 is m(X) =
2
X + X + a. If this has a root x(mod p) then the other root is −1 − x ≡ p − 1 −
x(mod p), since the sum of the roots if −1(mod p). Thus m(X), if it has a root mod
√
p, has a root in the range 0, 1, 2, . . . , 21 (p − 1). Note that 21 (p − 1) < π2 a. Since
we are assuming (i), it follows that x2 + x + a is prime for x = 0, 1, 2, . . . , 12 (p − 1),
and so the only way it can be 0(mod p) for one of these x is if it equals exactly p.
√
But this is impossible, since x2 + x + a > a whilst p < π4 a. It follows that m(X)
is irreducible (mod p) and so Dedekind tells us that (p) is inert. That is, all ideals
√
(p) with p 6 π4 a are principal and so indeed Cl(K) is trivial, and so (iii) holds.
Now we show that (iii) ⇒ (ii). For this, we more-or-less reverse the above
argument. Suppose that x2 + x + a is not prime for some 0 6 x 6 a − 2. On this
range, x2 +x+a 6 (a−2)2 +(a−2)+a = (a−1)2 +1 < a2 , so x2 +x+a has a prime
factor p with p < a. Thus m(X) has a root (mod p) and so by Dedekind’s lemma,
(p) splits in OK as a product of two ideals of norm p. Since Cl(K) is trivial, these
ideals must be principal. Thus there is some α ∈ OK with NK/Q (α) = N ((α)) = p.
√
Suppose that α = x + y 1+ 2−A , with x, y ∈ Z. Then p = NK/Q (α) = x2 + xy + ay 2 .
Obviously p is not a square, and so y 6= 0. Therefore
y y A
p = x2 + xy + ay 2 = (x + )2 + A( )2 > > a − 1.
2 2 4
But p < a, and so this is a contradiction.
It is now rather easy to check (using (i)) that hQ(√−A) for the following values
of A: A = 11, 19, 43, 67, 163. The last of these implies (by (ii)) the famous fact,
observed by Euler, that x2 + x + 41 is prime for x = 0, 1, . . . , 39.
A much deeper fact (the solution of the so-called “class number one problem”)
is that there are no larger values of A with this property.
CHAPTER 12
An elliptic curve
We look at an example of how to use the ideas of the course to solve a specific
diophantine equation, specifically to find all the integral points on a certain cubic
curve (elliptic curve). The example is somewhat similar to the equation y 2 + 2 = x3
considered by Fermat and Euler, which we solved in Theorem 3.3.1. However, in
this example unique factorisation fails.
of ideals.
We are going to prove that the two ideals on the left are coprime. Suppose
√ √
some prime ideal p divides both terms on the LHS. Then y + −37, y − −37 ∈ p,
√ √
and so, taking the difference, 2 −37 ∈ p. Therefore p|(2 −37). (Here, of course,
we are using the fact that containment and division of ideals are the same thing,
Theorem 5.0.2.)
Taking norms, we have
√
(12.2) N (p)|N (2 −37) = 22 · 37.
√
Also, since p|(y + −37), we have p|(x)3 and so
in OK , where u is a unit. The only units are ±1; by replacing a, b with −a, −b
if necessary, we may in fact assume that u = 1. Expanding out and comparing
√
coefficients of −37 (which, of course, is irrational) we obtain
The second of these implies that b = ±1 and hence that 3a2 − 37 = ±1, which is
obviously impossible. This concludes the proof.
CHAPTER 13
(13.1) x3 + y 3 + z 3 = 0.
and therefore it is not very surprising that we will be working in the field Q(ω).
√ √
Observe that in fact ω = 21 (−1 + −3), so K = Q(ω) is the quadratic field Q( −3)
and the ring of integers is Z[ω]. We will show the more general result that (13.1)
has no nontrivial solutions in Z[ω].
Basic facts about Z[ω]. We leave it to the reader to check using the methods
of Chapter 11 that the class number hK is one (in fact, this is easier than all of
the examples presented there; since OK is also a Euclidean domain, you may also
have done this in Rings and Modules). Thus Z[ω] is a unique factorisation domain.
In particular, primes and irreducibles are the same thing. We remark that there
are six units in Z[ω], namely {±1, ±ω, ±ω 2 }: this is easily seen by noting that
NK/Q (a + bω) = a2 − ab + b2 .
√
The prime λ = −3. In the argument, we will be working “mod λ”, where
√
λ = −3. Note that λ is prime, since NK/Q (λ) = 3 is prime. The main reason for
this is that cubes have very special behaviour modulo powers of λ, as the following
lemma (which generalises the fact that m3 ∈ {0, ±1}(mod 9) for m ∈ Z) shows.
Proof. We work modulo λ. Note that 9 = λ4 . Since N ((λ)) = NK/Q (λ) = 3, the
quotient Z[ω]/(λ) has size three. The three equivalence classes are represented by
0, 1, −1, which are mutually incongruent mod λ. Thus x ≡ ±1(mod λ). Suppose
73
74 13. THE CASE n = 3 OF FERMAT’S LAST THEOREM
However, a3 ≡ a(mod λ), since a is congruent to one of 0, ±1(mod λ). The result
follows.
(13.3) x3 + y 3 + λ3n z 3 = 0,
where now x, y, z are pairwise coprime and none is divisible by λ, and n > 1.
Consider the slightly more general equation
(13.4) x3 + y 3 = uλ3n z 3 ,
where u is one of the six units in Z[ω]. Let P (n) denote the statement that this
equation has no solution in coprime elements x, y, z ∈ Z[ω]. By the above discus-
sion, if we know P (n) for all n > 1 then Theorem 13.0.1 follows. We will now show
P (1), and that P (n − 1) ⇒ P (n). As the reader will see, the argument requires us
to work with (slightly) more general equation (13.4), rather than just (13.3).
Proof of P (1). Again, we work modulo λ. By Lemma 13.0.2, x3 + y 3 ∈
{0, ±2}(mod λ4 ), thus the power of λ dividing x3 + y 3 is either 0 or at least 4.
However, the power of λ dividing uλ3 z 3 is 3. This is a contradiction.
The inductive step. Suppose now that n > 2, and suppose we have established
P (n − 1). Suppose P (n) is false, thus (13.4) has a solution in coprime elements
x, y, z ∈ Z[ω]. Finally we use the factorisation of the LHS of (13.4), so the equation
becomes
Evidently, this means that λ divides one of the factors on the LHS. However, if it
divides one of them, then it divides all of them: this is because 1 − ω and 1 − ω 2
are associates of λ (in fact, λ = ω(1 − ω) = (−ω 2 )(1 − ω 2 )). Moreover, λ is the
only common factor of each pair of factors on the LHS of (13.5). For instance, if
δ divides x + y and x + ωy then it also divides (ω − 1)y = (x + ωy) − (x + y) and
13. THE CASE n = 3 OF FERMAT’S LAST THEOREM 75
(1 − ω)x = (x + ωy) − ω(x + y). Since x and y are coprime, we have δ|ω − 1 and
so δ|λ. Thus (13.5) becomes
x + y x + ωy x + ω 2 y
( )( )( ) = uλ3n−3 z 3 ,
λ λ λ
with the three factors on the left being coprime elements of Z[ω].
The power λ3n−3 still divides the LHS. Since the three factors on the LHS are
coprime, it divides one of them. Replacing y with ωy or ω 2 y if necessary, we may
assume that λ3n−3 | x+y
λ , and so our equation now becomes
x + y x + ωy x + ω 2 y
( )( )( ) = uz 3 ,
λ3n−2 λ λ
with the three terms on the left being coprime elements of Z[ω].
Using the fact that Z[ω] is a UFD, and considering prime factorisations, this
implies that we have
where the ui are units and the zi are coprime elements of Z[ω], none divisible by
λ (and u1 u2 u3 = u, z1 z2 z3 = z, but we will not need this). Since (x + y) + ω(x +
ωy) + ω 2 (x + ω 2 y) = 0, we have (after a little rearrangement)
Throughout this chapter, let K be imaginary quadratic field with ring of integers
OK and discriminant ∆. Our aim is to describe a beautiful connection between the
ideal class group of such fields and binary quadratic forms. One application of this
is an algorithm for computing class numbers hK .
• every fractional ideal a has an integral basis, that is to say is of the form
Ze1 ⊕ Ze2 for some e1 , e2 ∈ a.
By an ideal class we mean an element of Div(OK )/K ∗ (the fractional ideals
modulo the principal fractional ideals) which, as remarked in Chapter 10, is iso-
morphic to the class group Cl(K). In fact, many texts take this as the definition of
the class group.
Proof. (i) Suppose that a = Ze1 ⊕Ze2 is some fractional ideal in the class. Since K
is imaginary, R ∩ K = Q and so we cannot have e1 /e2 ∈ R, since this would entail
e1 /e2 ∈ Q and so e1 , e2 would not generate a free abelian group. By swapping e1 , e2
1
if necessary, we may assume that τ := e2 /e1 ∈ H. Then e1 a = Z ⊕ Zτ is in the
same (fractional) ideal class as a.
(ii) Suppose that τ 0 = γτ , where γ = a b
c d ∈ Γ. Then, since γ is unimodular it
follows from Proposition 2.4.2 that
Definition 14.1.2. Write H(K) for the set of all τ ∈ H for which Z ⊕ Zτ is a
fractional ideal in K. These are called the Heegner points for K.
In this language, Lemma 14.1.1 shows that H(K) is a union of Γ-orbits, and the
number of such orbits is precisely the class number hK . That is,
(14.1) |Γ \ H(K)| = hK .
(14.2) g(τ, 1) = (g11 τ + g12 , g21 τ + g22 ) = (g21 τ + g22 )(gτ, 1).
The action of SL2 (R) on R2 gives rise to a (right-) action of SL2 (R) on qua-
dratic forms of any given discriminant D via (gq)(x) = q(g −1 x). To see that
the discriminant is preserved, note that if q(x) = xT M x with M symmetric then
D(q) = −4 det M . We have (gq)(x) = q(g −1 x) = xT g −T M g −1 x, and so since
det g = 1
D(gq) = −4 det(g −T M g −1 ) = −4 det M = D(q).
Lemma 14.2.1. Let τ ∈ H. Then we have gqτ = qgτ . That is, the SL2 (R)-
actions on H and on quadratic forms are the same under the correspondence between
these two sets.
80 14. QUADRATIC FORMS AND THE CLASS GROUP
and that moreover this intertwines two natural actions of SL2 (R), the left action on
H given by Möbius transformations, and the right action on quadratic forms given
by (gq)(x) = q(g −1 x).
In this section we specialise this to the action of the modular group Γ.
Define
1 1 1
F := {τ ∈ H : − 6 Re τ < , |τ | > 1} ∪ {τ ∈ H : − 6 Re τ 6 0, |τ | = 1}.
2 2 2
Thus F is the shaded area in Figure 14.1 (but we have been precise about what
the boundary is).
(14.3) |cτ + d| 6 1.
1
√
Taking imaginary parts, we have |c=τ | 6 1 which, since |=τ | > 2 3, means that
c ∈ {−1, 0, 1}. Taking real parts, we have Re(cτ + d) 6 1 and so |d| 6 1 + 21 |c| and
so d ∈ {−1, 0, 1} as well.
Case c = 0. Then d = ±1. The two cases are similar, so we look at d = 1. Then
a = 1 and γτ = τ + b. Since τ, γτ ∈ F, taking real parts gives b = 0 and so γ is the
identity, contrary to the assumption that τ, γτ are distinct.
Case c = ±1. The cases are similar, so suppose that c = 1. If d = 1 then (14.3)
√
3
gives |τ + 1| 6 1. The only point of F with this property is τ = ω = − 12 + i 2 .
Also a − b = ad − bc = 1 and so
aτ + b
γτ = = −τ (aτ + b) = a + (a − b)τ = a + τ.
τ +1
This only lies in F if a = 0, and so γτ = τ , contrary to assumption. The case
d = −1 is similar. Finally, if d = 0 then (14.3) gives |τ | 6 1, and therefore since
τ ∈ F we have |τ | = 1. Also, b = −1 and γτ = a − τ1 = a − τ . This only lies in F if
a = 0, in which case γτ = −τ . Thus τ and γτ both lie in F, on |z| = 1, and their
real parts have opposite signs. This is impossible.
Remark. The proof shows that any point of H may be moved into F using
elements of hS, T i. Take τ ∈ F to be a point with trivial Γ-stabiliser (exercise:
these exist, and in fact any interior point of F has this property). Then, for any
γ ∈ Γ, we may find γ 0 ∈ hS, T i such that γ 0 γz = z which, since z has trivial
stabiliser, implies that γ ∈ hS, T i. Thus Γ is generated by S and T .
Definition 14.3.2. Let q(x) = ax21 + bx1 x2 + cx22 be a positive definite form
over R. Then we say that q is reduced if |b| 6 a 6 c and if either |b| = a or a = c
then b > 0.
As a consequence of Lemmas 14.3.1 and 14.3.3 and the fact that the actions of
Γ on H and on quadratic forms are equivalent, we have the following.
We say that two quadratic forms q, q 0 are equivalent if they are in the same
Γ-orbit. Thus q, q 0 are equivalent if and only if there is some γ ∈ Γ such that
q 0 (x) = q(γx).
We can summarise the findings of this section as follows: for each fixed D < 0
there is a one-to-one correspondence
F∼
= Γ \ H ←→ equivalence classes of quadratic forms of discriminant D
←→ reduced quadratic forms of discriminant D.
The material in the last two sections was purely geometric and contained no
number theory. Let us now reintroduce the imaginary quadratic field K, with
discriminant ∆.
A positive definite binary quadratic form over R is integral if its coefficients
a, b, c all lie in Z. It is easy to see that the action of Γ on quadratic forms preserves
the property of being integral.
In particular, by (14.1) and Corollary 14.3.4, the class number hK is precisely the
number of reduced integral quadratic forms of discriminant ∆.
√
14.5. Example: Q( −29)
Proof. We have
|∆| = 4ac − b2 > 4a2 − a2 = 3a2 ,
so the result follows immediately.
84 14. QUADRATIC FORMS AND THE CLASS GROUP
Unsolved problems
There are very many quite basic unsolved problems about number fields, easily
stated with the language we have developed in this course.
For instance
√
• It is not known if there are infinitely many real quadratic fields Q( d)
whose rings of integers are UFDs, although it is conjectured (and sup-
ported by numerical evidence) that as d ranges over primes, more than
75% of them are.
√
• It is known that there are only nine imaginary quadratic fields Q( d),
d < 0, whose rings of integers are UFDs, but this was only proven in the
√
1960s. The largest of them is Q( −163). (Note that we did show that
the ring of integers of this field is a UFD, but we certainly did not show it
√
is the biggest such field.) It is also known that the class number of Q( d)
tends to infinity as d → −∞, but the question of exactly how quickly is
related to notorious questions in analytic number theory, connected with
the generalised Riemann Hypothesis.
• Even less about unique factorisation is known for fields of degree > 3.
• As we saw in the notes, the classification of quadratic fields is quite
straightforward. Cubic fields already present significant computational
challenges. It turns out that even roughly counting how many fields there
are of a given degree is an unsolved problem in general. It is conjectured
that the number of number fields with degree n and discriminant at most
X grows like a linear function cn X. This is easily checked for n = 2. The
case n = 3 was established by Davenport and Heilbronn in the 1970s, and
the cases n = 4 and 5 only in the last fifteen years or so, by Bhargava.
All cases with n > 6 are open.
85
APPENDIX A
In this chapter we record some basic facts about free abelian groups and lattices.
Free abelian groups. A free abelian group G or rank n is a group of the form
Ln
G = i=1 Zei , for some e1 , . . . , en .
All such groups are isomorphic, and they are all isomorphic to the “standard
lattice” Zn ⊆ Rn .
The following is the key result about free abelian groups.
Ln
Proposition A.0.1. Let G = i=1 Zei be a free abelian group of rank n. If
H 6 G is a finite index subgroup, H is also a free abelian group of rank n, that is
Ln
to say H = i=1 Ze0i with e0i ∈ G. Suppose that e0i = j Aji ej . Then [G : H] =
P
| det A|.
Remark. We use the absolute values since otherwise det(Λ) is only defined up
to sign, depending on the ordering of the ei .
Lemma A.0.4. The determinant det(Λ) depends only on Λ, and not on the
particular choice of ei .
Proof. Suppose that e01 , . . . , e0n is another basis for the lattice, and suppose that
e0i = j Aji ej . Then
P L 0 L
Zei = Zei . We saw in the main text that this is the
case if and only if A is unimodular, that is to say A ∈ Matn (Z) and det A = ±1.
However we have
and so
| det(e01 . . . , e0n )| = | det(e1 , . . . , en )|.
This completes the proof.
Proof. Suppose that a basis for Λ is e1 , . . . , en , and that a basis for Λ0 is e01 , . . . , e0n .
Since Λ0 ⊆ Λ, we have e0i = j Aji ej for some A ∈ Matn (Z). By Proposition A.0.1,
P
and so
det(Λ0 ) = | det A| det(Λ).
Combining these facts concludes the proof.
Ln
Suppose that Λ = i=1 Zei is a lattice. Then the region
Geometry of numbers
In this section we give the proof of Minkowski’s first theorem, the key ingredient
in the proof of the Minkowski bound. Let us begin by recalling the statement.
89
90 B. GEOMETRY OF NUMBERS
Suppose that there do not exist distinct points x, y ∈ K whose difference lies in Λ.
Then the Kλ are all disjoint. Since they all lie in F, we therefore have
X
(B.2) vol(Kλ ) 6 vol(F) = det Λ.
λ
Gauss’s Lemma
There are more general versions of Gauss’s lemma than the one we are about to
state, but this is all we need in the course.
Lemma C.0.1 (Gauss’s lemma). Let f (X) ∈ Z[X] be monic. Suppose that f is
reducible in Q[X]. Then f factors into monic polynomials in Z[X].
Proof. Take the factorisation of f (X) in Q[X], and clear denominators. Then we
find some positive integer d such that
df (X) = g(X)h(X),
g(X) = a0 + a1 X + · · · + am X m ,
h(X) = b0 + b1 X + · · · + bn X n .
Since f is monic, d = am bn and therefore any common factor of the ai would have
to divide d. We may then divide through by such a common factor, and in this way
we may suppose that the ai are coprime, and similarly that the bj are coprime.
Suppose that d 6= 1. Then some prime p divides d. Let i be maximal so that
p - ai , and j be maximal so that p - bj . Then the coefficient of X i+j in g(X)h(X)
is ai bj + . . . , where everything in . . . is divisible by p. Thus the coefficient of
X i+j in g(X)h(X) is not divisible by p, which is evidently a contradiction since all
coefficients of df (X) are divisible by p.
Therefore d = 1 and the result is proven.
91
APPENDIX D
Vandermonde determinants
93