Stark-Heegner points attached
to Cartan non-split curves
Juan Ignacio Restrepo
Doctor of Philosophy
Department of Mathematics and Statistics
McGill University
Montreal,Quebec
July 2015
A thesis submitted to McGill University in partial fulfillment of the requirements of
the degree of Doctor in Philosophy
c Juan Ignacio Restrepo 2015
ACKNOWLEDGEMENTS
I want to thank first my advisor, Professor Henri Darmon, whose guidance was
vital during the development of this project. All these years, his valuable comments
during my talks and our meetings helped me grasp a deeper understanding of mathematics. His patience and calm helped me get courage to ask questions even when I
was feeling overwhelmed and pointed me in the right direction.
Next, I must thank Daniel Kohen, who spent a significant amount of time discussing with me his work with Professor Ariel Pacetti, which is a cornerstone of this
thesis. He also spent countless hours doing computations with me trying to pin down
the correct analogies that led to the main contributions of this project.
I must thank Marc Masdeu and Cameron Franc, who were senior graduate students at the time I started. Their guidance starting to navigate the mathematical
world was extremely important. On the same alley, Francesc Castellà explained to
me a plethora of things when we shared an office. Luiz Takei, who made me do
mathematics in Portuguese while talking with me in a way as down to earth as possible about topics we both found hard many times. And, Andrew Fiori and Nicolas
Simard, with whom I had multiple discussions about various topics in Number Theory and computing.
I would be remiss not to express my gratitude towards NSERC, who funded half
of my stay at McGill, and the Department of Mathematics and Statistics, McGill
University and my advisor for funding the other half.
ii
I would like to thank my friends Ben Smith, Daphna Harel and Katherine Daignault, who provided moral support in moments of distress. The list of people in this
category is rather long, so unfortunately I cannot mention every one.
Finally, I want to thank my family for their continued support throughout these
years.
iii
ABSTRACT
Let E be an elliptic curve of conductor pq 2 , where p and q are prime numbers,
and let K be a quadratic extension of Q. If K is imaginary and p and q are split in
K, there are Heegner points on the modular curve X0 (pq 2 ) defined over ring class
fields attached to orders in K, which can be mapped to points on E. If q is inert,
there are no Heegner points on the modular curve, but points can be obtained from
ε
the Cartan non-split curve Xns
(q, p). If K is real the panorama is quite different. If
p is inert and q is split, Stark-Heegner points have been defined on E, whose field of
definition is conjecturally the narrow ring class field attached to an order in K. This
work combines these two ideas, defining Stark-Heegner points when p and q are inert
in the real quadratic field K, using Cartan non-split curves, which are conjecturally
defined over narrow ring class fields attached to orders in K.
iv
ABRÉGÉ
Soit E une courbe elliptique de conducteur pq 2 , où p et q sont des nombres
premiers, et soit K une extension quadratique de Q. Si K est imaginaire et p et q sont
décomposés dans K, on dispose de points de Heegner sur E construits en appliquant
la paramétrisation par la courbe modulaire X0 (pq 2 ) aux points CM attachés à K.
Ces points sont définis sur des corps de classes d’anneau (ring class fields) attachés
à des ordres dans K. Lorsque q est inerte, le système de points algébriques analogue
ε
s’obtient en remplaçant X0 (pq 2 ) par la courbe Xns
(q, p) associée au sous-groupe de
Cartan non-déployé en q, par une construction rendue explicite par Kohen et Pacetti.
Lorsque K et réel, la situation et radicalement différente, du fait que l’on ne dispose
plus de la construction des points de Heegner. Néanmoins, si p est inerte et q se
décompose dans K, des soi-disant points de Stark-Heegner ont été définis sur E,
dont le corps de définition est conjecturalement l’anneau de corps de classes au sens
restreint attaché à un ordre dans K. Notre travail combine ces deux idées, pour
définir des points de Stark-Heegner en niveau pq 2 quand p et q sont inertes dans
le corps réel quadratic K, à partir de la cohomologie des groups p-arithmétiques
associés à des sous-groupes Cartan non-déployés en q. Il s’agit donc, en fin de compte,
d’adapter les constructions de Kohen-Pacetti au cadre des points de Stark-Heegner.
v
TABLE OF CONTENTS
ACKNOWLEDGEMENTS . . . . . . . . . . . . . . . . . . . . . . . . . . . .
ii
ABSTRACT . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
iv
ABRÉGÉ . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
v
1
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
1
2
Background . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
4
2.1
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
4
4
5
6
7
7
9
10
11
13
13
19
25
34
37
42
Modular Forms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
45
3.1
3.2
3.3
3.4
50
58
67
71
2.2
2.3
3
Algebraic Curves . . . . . . . . . . . . . . .
2.1.1 Affine Curves . . . . . . . . . . . . .
2.1.2 Projective Curves . . . . . . . . . . .
2.1.3 M -rational points. . . . . . . . . . . .
2.1.4 Function fields . . . . . . . . . . . . .
2.1.5 Smoothness . . . . . . . . . . . . . .
2.1.6 Divisors . . . . . . . . . . . . . . . . .
2.1.7 Differentials . . . . . . . . . . . . . .
2.1.8 Genus . . . . . . . . . . . . . . . . . .
Curves of genus zero . . . . . . . . . . . . .
2.2.1 Conics and Ternary Quadratic Forms
2.2.2 Binary Quadratic Forms . . . . . . .
Elliptic Curves . . . . . . . . . . . . . . . . .
2.3.1 Group Structure . . . . . . . . . . . .
2.3.2 Isogenies . . . . . . . . . . . . . . . .
2.3.3 L-functions . . . . . . . . . . . . . . .
Elliptic Curves arising from Modular Forms
The Modular Curve X0 (N ). . . . . . . . . .
Hecke operators . . . . . . . . . . . . . . . .
L-functions associated to Modular Forms . .
vi
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
3.5
4
Modularity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
73
Heegner and Stark-Heegner points; the classical case . . . . . . . . . . .
76
4.1
4.2
4.3
.
.
.
.
.
76
81
87
88
91
.
.
.
.
.
.
.
.
.
.
93
98
102
111
111
112
115
118
121
122
4.4
5
Stark-Heegner points attached to Cartan Non-split curves . . . . . . . . 125
5.1
5.2
5.3
5.4
5.5
6
Complex Multiplication and Class Field Theory . . . . . . . . .
Heegner points . . . . . . . . . . . . . . . . . . . . . . . . . . .
Stark-Heegner points . . . . . . . . . . . . . . . . . . . . . . . .
4.3.1 p-adic measures and p-adic line integrals . . . . . . . . . .
4.3.2 Modular Forms on Γp,M . . . . . . . . . . . . . . . . . . .
4.3.3 Measures associated to an Elliptic Curve and the double
integrals . . . . . . . . . . . . . . . . . . . . . . . . . .
4.3.4 Tate’s uniformization . . . . . . . . . . . . . . . . . . . .
4.3.5 The Stark-Heegner point . . . . . . . . . . . . . . . . . .
4.3.6 Computational remarks . . . . . . . . . . . . . . . . . . .
Heegner points attached to Cartan Non-Split curves . . . . . . .
4.4.1 Cartan Non-split curves . . . . . . . . . . . . . . . . . . .
4.4.2 Modular Forms over Γεns (p) . . . . . . . . . . . . . . . . .
4.4.3 Modular Parametrization . . . . . . . . . . . . . . . . . .
4.4.4 Higher levels . . . . . . . . . . . . . . . . . . . . . . . . .
4.4.5 Heegner points . . . . . . . . . . . . . . . . . . . . . . . .
The group . . . . . . . . . . . . . . . . . . . . . . . . .
5.1.1 Cusps . . . . . . . . . . . . . . . . . . . . . . . .
Modular Forms . . . . . . . . . . . . . . . . . . . . . .
Measures, double Integrals and semi-indefinite Integrals
The Stark-Heegner point . . . . . . . . . . . . . . . . .
Setup for computations . . . . . . . . . . . . . . . . . .
5.5.1 The case q = 3. . . . . . . . . . . . . . . . . . .
5.5.2 More general values of q. . . . . . . . . . . . . .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
126
132
138
147
154
156
156
159
Further directions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 176
References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 182
vii
Chapter 1
Introduction
Finding rational points on elliptic curves is not as easy as finding rational points
on conic sections. Actually, determining their existence, or lack thereof, is also a
much harder problem. For conic sections we have the Hasse-Minkowski Principle at
our disposal (see [Ser73]) which helps us determine readily when a conic section has
rational points. Moreover, once we have a rational point on the conic, finding all of
the points can easily be done using the chord and tangent method. The counterpart
for elliptic curves is much harder; the techniques are much more sophisticated and
are not even guaranteed to succeed (see [ST92]). However, often they do work and
we manage to characterize the set (a posteriori, group) of rational points.
One of the most used techniques in order to find rational points on elliptic
curves stems from the work of Heegner on the class-number problem for imaginary
quadratic fields. He defined some points, under the hypothesis that all the primes
dividing N are split in K (or ramify, with a further restriction), on the modular curves
X0 (N ), which can be seen as the quotient of the complex upper half-plane by the
congruence group Γ0 (N ). These points are called Heegner points. Birch recognized
that the modular parametrization mapped these points into points on the elliptic
curve defined over an algebraic extension of the base field. Upon taking the trace
of one of these points, we can obtain a point with rational coordinates (or, with
1
coordinates in the field with which we started) and under some further conditions,
they can be shown to be non-torsion (see [Gro84] and [GZ86]).
Darmon proceeded to extend Birch’s ideas to cases where the base field is not
an imaginary quadratic field (see [Dar04]). The so-obtained points do not lie on the
complex upper half-plane, so the usual modular parametrization will not yield any
points. Hence, Darmon used a p-adic version of the modular parametrization and
found that these points share a lot of similarities with the points Birch found before.
Again, we need suitable conditions for these points to exist. The setup is a quadratic
real field K and an elliptic curve of conductor N = pM , where p is inert in K and
all the primes dividing M are split in K. Since Darmon drew a parallel between this
generalization and Stark units relative to elliptic or circular units, he called these
Stark-Heegner points, although sometimes, in the literature, these points are also
referred to as Darmon points.
Recently, Kohen and Pacetti (see [KP14]) explored cases where the suitable conditions required for Birch’s and Darmon’s constructions do not hold. This approach
allowed for the finding of new points which could not be obtained with the previous methods, as the classical modular curves do not have Heegner points when a
prime dividing the conductor is inert. Their setup involves a quadratic imaginary
field K and an Elliptic Curve of conductor N = n2 m, where n is square-free, all
the primes dividing n are inert in K and all the primes dividing m are split in K.
Their construction is based on the consideration of different modular curves called
Cartan non-split curves, which can also be seen as the quotient of the complex upper
half-plane by a congruence subgroup. However, the construction of these curves can
2
also be presented (as is the case with the classical modular curves) as the solution
to a moduli problem, allowing for a possible analogous generalization to the one of
Darmon above.
This thesis emulates Darmon’s generalization of the classical case in conditions
similar to those where Kohen and Pacetti obtained new points. Specifically, for an
elliptic curve of conductor pq 2 and a quadratic real field K where the primes p and q
are inert, we describe a construction that yields points on E, which are conjecturally
defined over narrow ring class fields attached to orders in K.
3
Chapter 2
Background
In this chapter we provide a brief introduction to the objects that will be used
throughout the work. We will be looking for special points on curves. The most
basic invariant about a curve is its genus, which we define in Section 2.1.8 below.
This chapter mainly follows [Sil86] and [Har77].
2.1
Algebraic Curves
2.1.1
Affine Curves
Let K be a field and L an algebraic closure. Let S ⊆ K[x1 , . . . , xn ] be a set and
let Z(S) = {P ∈ Ln |∀f ∈ S, f (P ) = 0}. Let R = L[x1 , . . . , xn ]. The set Z(S) is
called an algebraic set. Clearly Z(S) = Z((S)), where (S) is the ideal generated by
S in R. Also, if S1 ⊆ S2 ⊆ R, Z(S1 ) ⊇ Z(S2 ). Since R is a noetherian ring, every
ideal has a finite generating set, so we can safely assume S is finite (or an ideal.) If
S1 and S2 are two ideals of R, Z(S1 ) ∪ Z(S2 ) = Z(S1 S2 ). If {Si }i∈I is a collection of
S
T
n
subsets of R, i∈I Z(Si ) = Z
i∈I Si . Also, L = Z(0) and ∅ = Z(1). This means
that we can endow Ln with a topology using {Z(S)|S ⊆ R}, the algebraic sets, as
the collection of closed sets. This topology is referred to as the Zariski topology.
Now, let T ⊆ Ln and let I(T ) = {f ∈ R|∀P ∈ T, f (P ) = 0}. This is seen to be
an ideal of R. If T1 ⊆ T2 ⊆ Ln , I(T1 ) ⊇ I(T2 ). To every ideal, we are associating
an algebraic set and to every algebraic set we are associating an ideal. We have the
√
relations Z(I(T )) = T and I(Z(S)) = S (Hilbert’s Nullstellensatz), where T is the
4
closure of T in Ln and
√
S is the radical of the ideal generated by S. This establishes
a correspondence between algebraic sets and radical ideals.
An algebraic set Z is said to be irreducible if whenever Z = Z1 ∪ Z2 with Z1 and
Z2 algebraic sets, we have Z = Z1 or Z = Z2 . An irreducible algebraic set is called
an algebraic variety. It should be noted that the defining ideal of an algebraic variety
can always be taken to be prime. Every point (a1 , . . . , an ) = Z(x1 − a1 , . . . , xn − an )
is an irreducible set. An affine algebraic curve is an irreducible algebraic set whose
only irreducible subsets are points.
2.1.2
Projective Curves
We define the projective space Pn (L) = (Ln+1 − {0})/ ∼, where (x0 , . . . , xn ) ∼
(y0 , . . . , yn ) if and only if there exists λ ∈ L× such that xi = λyi for i = 0, . . . , n.
It does not make sense to evaluate a polynomial in R = L[x0 , . . . , xn ] at a point in
Pn (L), as the polynomial does not evaluate to the same element at every representative of it. However, if we only restrict to homogeneous polynomials, we have that
f (λ(x0 , . . . , xn )) = λd f (x0 , . . . , xn ), where d is the degree of the polynomial. This
does not ensure the value is well defined, but it does ensure the vanishing of the
polynomial at a point in the projective space is well defined. Let S ⊆ L[x0 , . . . , xn ]
be a subset of homogeneous polynomials. A projective algebraic set is a set of the
form Z(S) = {P ∈ Pn (L)|∀f ∈ S, f (P ) = 0}. The ideal (S) generated by S is called
a homogeneous ideal. The discussion above for affine algebraic curves follows almost
verbatim, except for the fact that the homogeneous ideals we deal with must be
contained in the ideal R+ = (x0 , . . . , xn ) to have the correct correspondence. In this
case, we call it a projective algebraic curve. Because most curves we consider in this
5
thesis are algebraic, the use of the word curve will refer to algebraic curves unless
explicitly stated otherwise.
Every projective curve C can be covered by affine patches. Let Ci ⊆ C be the
subset of points where xi 6= 0. The dehomogenization map ϕi : Ci −→ Ln given by
(x0 , . . . , xi−1 , xi , xi+1 , . . . , xn ) 7→ (x0 /xi , . . . , xi−1 /xi , xi+1 /xi , . . . , xn /xi )
maps Ci isomorphically into an affine curve, which is one of the said affine patches.
Every affine curve C ′ is an affine patch of a projective curve via the homogenization
map φi : C ′ −→ Pn (L) given by
(y1 , . . . , yn ) 7→ (y1 , . . . , yi , 1, yi+1 , . . . , yn ).
Note further that these maps are inverses of each other, i.e., φi ◦ ϕi = idCi and
ϕi ◦ φi = idC ′ . Often, when dealing with projective curves, we work with the affine
patch C0 and understand tacitly that the actual curve we are referring to is φ0 (C0 )
together with the points that satisfy the same equations and x0 = 0.
2.1.3
M -rational points.
The polynomials we chose have coefficients in K, so we say that the curve C is
defined over K. For any algebraic extension M of K together with an embedding
M ֒→ L, we have natural embeddings M n ֒→ Ln and Pn (M ) ֒→ Pn (L). The Galois
group Gal(L/M ) acts on the left (in contrast to Silverman’s notation, where it acts
on the right) on Ln and Pn (M ) and this action restricts to C. We denote by C(M )
the set of M -rational points of C, i.e., the set of points in Ln (or Pn (L) if the curve
is projective) that are invariant under the action of Gal(L/M ). This is also the set
6
of points in Ln (or Pn (L)) in the image of the respective embedding which satisfy
the polynomial equations defining the curve C.
2.1.4
Function fields
Let C be a curve defined over K. We define the coordinate ring K[C] to be the
quotient of the ring K[x1 , . . . , xn ] (or K[x0 , . . . , xn ] if C is projective) by I(C). We
note that if C is affine, K[C] is isomorphic to the ring of regular functions on C, this
is, the functions that can be locally represented as the quotient of two polynomials
in K[x1 , . . . , xn ]. On the other hand, if C is projective K[C] is isomorphic to K.
The function field of C, denoted K(C), is the field defined by equivalence classes
of regular functions defined on open subsets of C, where we say that two classes are
equivalent if the functions agree on a nonempty open set contained in the overlap of
the domains. We remark that if C is affine K(C) is isomorphic to the field of fractions
of K[C]. (Since C is irreducible, I(C) is prime, making K[C] an integral domain.)
When C is projective, K(C) is isomorphic to the function field of an affine patch of
C, which is also isomorphic to the degree zero elements of the field of fractions of
the coordinate ring K[C]. Note that when the denominator does not vanish, these
quotients evaluate to some value in K, despite the homogeneous coordinates having
many representatives. This is because the degrees being equal will cancel the λ value
attesting for equivalence, and the value the function takes on at each point (where
it is defined) is independent of the choice of homogeneous coordinates.
2.1.5
Smoothness
Let C ⊆ Ln be an affine curve with defining ideal hf1 , . . . , fr i and let P ∈ C be
a point. The Jacobian matrix at P is defined as the matrix of partial derivatives of
7
the fj with respect to each of the variables evaluated at P . The Jacobian matrix has
dimensions n × r and its rank can be at most n − 1. We say that C is smooth (or
nonsingular) at P if this rank is actually equal to n − 1. Note that in the case of a
plane curve defined by only one polynomial, we have a 2 × 1 matrix and the rank is
at most 1. Smoothness is given by this matrix having a nonzero entry. The curve C
is said to be smooth it is smooth at every point P ∈ C.
For a projective curve C ⊆ Pn (L) things are not very different. We say the curve
is smooth at P if an affine patch (and therefore any) containing P is smooth. Also,
we can check the smoothness at P via a similar test to the one employed for affine
curves. Let the defining ideal be hf1 , . . . , fr i and choose a fixed representative of P .
This Jacobian matrix has dimensions (n + 1) × r but its rank can be at most n − 1
as well. The projective curve is smooth at P when this rank attains its maximum
possible value, n − 1 again.
In contrast with the affine case, elements of the function field of smooth projective curves have a very special property relating its zeroes and poles. Let P ∈ C
and denote by mP the ideal in L[C] consisting of functions vanishing at P . The
localization of L[C] away from mP is a local ring with maximal ideal mP . FurtherT
n
more, it is a discrete valuation ring and ∞
n=1 mP = 0. Let πP be an element of mP
which does not lie in m2P . (This element, called a uniformizer, exists because an
equivalent condition for smoothness of a curve is that the vector space mP /m2P be
a one-dimensional L-vector space.) For a nonzero element f ∈ L[C], we define the
order of vanishing of f at P , and denote it ordP (f ), to be the largest integer n such
that f ∈ mnP . Since the intersection of all the powers of mP is just 0, this n must
8
exist. Note that this integer can be 0. Now, let t ∈ L(C)× and choose f, g ∈ L[C]
of the same degree such that t = f /g. We extend the definition of ordP (·) to L(C)×
by defining ordP (t) = ord(f ) − ordP (g). It should be noted that this definition is
independent of the choice of uniformizer and the representation chosen for t. If
ordP (t) > 0 we say that t vanishes or has a zero at P , and if ordP (t) < 0 we say that
t has a pole at P . If ordP (t) ≥ 0 we say that t is defined at P . For any t ∈ L(C)× ,
P
ordP (t) = 0 for all but finitely many P . Moreover, the sum P ∈C ordP (t) = 0 and
if t has no poles (and therefore no zeroes) t ∈ L.
2.1.6
Divisors
Let C be a curve. Let Div(C) be the abelian free group generated by the points
of C. The elements of Div(C) can be seen as formal sums indexed by the points of
C, this is
Div(C) =
(
X
P ∈C
nP [P ] : nP ∈ Z and nP = 0 for all but finitely many P
)
.
There is a natural group homomorphism deg : Div(C) −→ Z defined by
!
X
X
deg
nP [P ] =
nP .
P ∈C
P ∈C
This sum is well defined because the nP are almost all 0 and we refer to this homomorphism as the degree map. Its kernel, ker(deg), is denoted Div0 (C).
Div(C) has a natural action of the Galois group Gal(L/M ) defined by
!
X
X
nP [P ] =
nP [σ(P )].
σ
P ∈C
P ∈C
9
A divisor is said to be defined over M if it is invariant under the action of Gal(L/M ).
In particular, divisors where nP = 0 for all P ∈ C(L) − C(M ) are defined over M ,
but these are not the only ones. It suffices (and it is enough) that for every point
P ∈ C, the values of the nQ be equal to nP for all Q ∈ C which are conjugates of P
under some automorphism in Gal(L/M ). The set of divisors of C defined over M is
denoted DivM (C), and Div0M (C) is its degree zero counterpart.
P
For an element f ∈ L(C) we let div(f ) =
P ∈C ordP (f )[P ] be the divisor
whose coefficient at P is the order of vanishing of f at P . Note that this divisor is
well defined, as the order of vanishing has finite support for a fixed f . The divisors
associated to an element of the function field of C will be referred to as principal
divisors. We say that two divisors P and Q are linearly equivalent if there exists a
principal divisor div(f ) such that P − Q = div(f ). The Picard group of C, Pic(C),
is the quotient Div(C) modulo linear equivalence. The remark at the end of the
last subsection asserts that div(f ) ∈ Div0 (C) for all f ∈ L(C). This implies that
this quotient descends to Pic0 (C), Div0 (C) modulo linear equivalence, which is also
known as the Jacobian of C. We define PicM (C) and Pic0M (C) as the subgroups of
Pic(C) and Pic0 (C) fixed by Gal(L/M ), respectively.
2.1.7
Differentials
Let Ω(C) be the L(C)-vector space generated by the formal elements dx, x ∈
L(C), quotiented by the subspace generated by the elements of the form da for all
a ∈ L, d(x + y) − dx − dy and d(xy) − xdy − ydx for all x, y ∈ L(C). Ω(C) happens to
be a 1-dimensional L(C)-vector space (Theorem II.4.2(a) in [Sil86]), so it is generated
by any nonzero element.
10
Let P ∈ C and let πP ∈ L(C) be a uniformizer at P . For a given ω ∈ Ω(C), we
have a unique element g ∈ L(C) such that ω = gdπP (since Ω(C) is 1-dimensional).
Denote this g by ω/dπP . We define the order of ω at P as ordP (ω) = ordP (ω/dπP ).
This order is independent of the uniformizer πP .
Like in the case of elements in the function field, ordP (ω) = 0 for all but finitely
P
many P ∈ C. This allows us to define div(ω) = P ∈C ordP (ω)[P ] and this will be
an element of Div(C). For ω ∈ Ω(C) we say that the differential is holomorphic (or
regular ) if ordP (ω) ≥ 0 for all P ∈ C. We say that it is nonvanishing if ordP (ω) ≤ 0
for all P ∈ C.
Notice that if ω1 and ω2 are two nonzero differentials, there must exist some
f ∈ L(C)× such that ω1 = f ω2 , because of the 1-dimensionality of the vector
space of differential forms in C.
Hence , whenever ω2 = gdπP , we have that
ω1 = f ω2 = f gdπP , so ordP (ω1 ) = ordP (f ) + ordP (ω2 ). This equality holds for all
P ∈ C, which implies that div(ω1 ) = div(f ) + div(ω2 ), making div(ω1 ) and div(ω2 )
linearly equivalent, which in turns implies that the image of div(ω1 ) and div(ω2 ) in
the Picard group is the same. This allows for a definition: the canonical divisor class
on C is the image in Pic(C) of any nonzero differential ω ∈ Ω(C).
2.1.8
Genus
Now we have enough material to define the invariant we mentioned at the beginning of the introduction.
If D1 , D2 ∈ Div(C), we say that D1 ≥ D2 whenever n1P ≥ n2P for all P ∈ C. For
a divisor D ∈ Div(C) we associate the following two objects: the vector space
L(D) = {f ∈ L(C)× : div(f ) ≥ −D} ∪ {0}
11
and its dimension ℓ(D) = dimL (L(D)) < ∞.
Note that if deg D < 0, this vector space is trivial (and hence, its dimension is
0) because whenever f ∈ L(D) and f 6= 0 we have that
0 = deg(div(f )) ≥ deg(−D) = − deg(D).
Also, if D1 , D2 are linearly equivalent, the corresponding vector spaces are isomorphic
via the isomorphism
L(D1 ) −→ L(D2 ),
f 7−→ f g,
where g is such that D1 = D2 + div(g). In particular, if KC ∈ Div(C) is a canonical
divisor on C, say L(C) ∼
= {ω ∈ Ω(C) : ω is holomorphic}. The genus of the curve C
is the number ℓ(KC ).
Theorem 2.1 (Riemann-Roch). Let C be a curve and KC a canonical divisor on C.
There is an integer g ′ ≥ 0 such that for every divisor D ∈ Div(C),
ℓ(D) − ℓ(KC − D) = deg(D) − g ′ + 1.
Applying Riemann-Roch to D = 0 we obtain g = ℓ(KC ) = g ′ , so g ′ is actually
the genus of C and we will drop the prime henceforth. Also, applying Riemann-Roch
to D = KC (together with what was just mentioned) we obtain deg(KC ) = 2g − 2.
Finally, if deg(D) > 2g − 2, then deg(KC − D) < 0, making ℓ(KC − D) = 0, yielding
ℓ(D) = deg(D) − g + 1. We record these in the following corollary.
Corollary 2.2. Let C be a curve, KC be a canonical divisor on C and g ′ the integer
given by the Riemann-Roch Theorem.
a) The genus of C, ℓ(KC ), is equal to g ′ .
12
b) deg(KC ) = 2g − 2.
c) If deg(D) > 2g − 2, then ℓ(D) = deg(D) − g + 1.
2.2
Curves of genus zero
If we classify curves according to their genus, the first natural example to look at
is curves of genus 0. This theory has been thoroughly studied and there are techniques
to easily decide whether or not there are K-rational points on these curves, and how
to find them. (See [Ser73].)
2.2.1
Conics and Ternary Quadratic Forms
We keep the notations from the previous section. For any algebraically closed
field, there is essentially just one curve of genus 0. Furthermore, any smooth ternary
quadratic form over K (a conic) is isomorphic over L to this curve.
Theorem 2.3. The only curve of genus 0 over L is P1 (L).
Proof. First, let us show that the genus of P1 (L) is actually 0. P1 (L) = Z(0),
making P1 (L) a projective variety (as 0 is a homogeneous prime ideal in L[x, y]).
Let ∞ = (1, 0), 0 = (0, 1) and P = (a, b). Let t = x/y be a uniformizer at 0,
π∞ = y/x = 1/t be a uniformizer at ∞ and πP = (xb − ay)/by = t − a/b. We can
see that these lie in L(P1 (L)), being the quotient of homogeneous polynomials of
the same degree in x and y. Let ω = dt, which is a nonzero differential. dπP =
d(t − a/b) = dt − d(a/b) = ω. Also,
1
0 = d(1) = d t ·
t
1
1
1
= td
+ dt = tdπ∞ + · ω,
t
t
t
whence ω = −t2 dπ∞ . This shows that ord0 (ω) = ordP (ω) = 0, as ω/dt = ω/dπP = 1
−2
and ord∞ (ω) = −2, as ω/dπ∞ = −t2 = −π∞
. This implies that a canonical divisor
13
for P1 (L) is −2[∞], whose degree is −2. We know that the degree of a canonical
divisor is 2g − 2, so we may conclude that g = 0.
For the converse, let C be a curve of genus 0 with canonical divisor KC and let
P and Q be two distinct points of C. Consider the divisor D = [P ] − [Q] and apply
Riemann-Roch to it. Note that deg(KC − D) = deg(KC ) − deg(D) = −2 < 0, so
ℓ(KC − D) = 0 and we obtain ℓ(D) = ℓ(D) − ℓ(KC − D) = deg(D) − 0 + 1 = 1. Let
P
f ∈ L(D). We have that R∈C nR [R] = div(f ) ≥ [Q] − [P ], so nQ ≥ 1, nP ≥ −1
and nR ≥ 0 for all other R ∈ C. Adding all these inequalities we find that
0 = nQ + nP +
X
R∈C
′
nR ≥ 1 + (−1) + 0 = 0,
where the primed sum ignores the points P and Q. In order to satisfy this, we must
have equality at every point, implying that div(f ) = D.
This means that D = 0 in Pic(C). If we let D′ be any divisor of degree 0, we can
write it as a finite sum of divisors of the form [P ] − [Q] (for different points P and
Q), which makes D′ = 0 in Pic(C) as well. This implies that every degree 0 divisor
is principal, making Pic(C) ∼
= Z, which in turn implies that C ∼
= P1 (L).
Theorem 2.4. Assume the characteristic of K is not 2. Every smooth ternary
quadratic form over K is isomorphic over L to P1 (Q).
Proof. Consider a homogeneous smooth ternary quadratic form Q in P2 (K),
a d/2 f /2 x
y .
Q(x, y, z) = ax2 +by 2 +cz 2 +dxy +eyz +f zx = x y z
d/2
b
e/2
f /2 e/2 c
z
14
If d = e = f = 0 we say that Q is diagonal. Denote by
a d/2 f /2
.
A=
d/2
b
e/2
f /2 e/2 c
Since the curve is smooth, we find that the system
2ax + dy + f z = 0
x
or,
2A
dx + 2by + ez = 0
y = 0,
f x + ey + 2cz = 0
z
does not have nontrivial solutions. This clearly amounts to the matrix A being
non-singular.
A linear change of variables can be represented by an invertible 3 × 3 matrix P ,
with coefficients in K, multiplying the vector on the left:
x X
X
x
−1
y = P Y
or,
equivalently,
P
y = Y .
Z
z
Z
z
This way, the form QP defined by
QP (X, Y, Z) =
is isomorphic to Q.
X Y
X
x
T
= x y z A y = Q(x, y, z)
Y
Z P AP
z
Z
15
If a 6= 0, we can complete squares to obtain a quadratic form with no cross
terms involving x. This can be obtained via the matrix
0
0
a
1 −d/2a −f /2a
, since P T AP = 0 b − d2 /4a e/2 − df /4a ,
P =
0
1
0
0 e/2 − df /4a c − f 2 /4a
0
0
1
so
QP (X, Y, Z) = aX 2 + (b − d2 /4a)Y 2 + (c − f 2 /4a)Z 2 + (e − df /2a)Y Z.
Note that QP is the sum of a unary quadratic form and a binary quadratic form,
which is closer to diagonal. If b 6= 0 or c 6= 0, we can carry out a similar process
to get rid of cross terms involving y or z, so, if at least one amongst a, b and c is
different from 0, we can eliminate cross terms of one of the variables and express the
quadratic form as the sum of a unary quadratic form and a binary quadratic form.
If a = b = c = 0, det A = def /4, so d 6= 0 for det A to be nonzero. We set
0
0
d
1 1 −(e + f )/2d
T
P = 1 −1 −(e + f )/2d , whence P AP = 0
−d
(f − e)/2
0 (f − e)/2 −(f + e)2 /4d
0 0
1
and we obtain a form QP isomorphic to Q which is the sum of a unary quadratic
form and a binary quadratic form.
16
So far, we have shown that Q is isomorphic to a quadratic form with matrix
0
h 0
0 k l/2 ,
0 l/2 m
so assume henceforth that Q(x, y, z) = hx2 + ky 2 + mz 2 + lyz. If k 6= 0, we can
diagonalize in the same way as before via the matrix
0
0
h 0
1 0
T
,
P = 0 1 −l/2k , since P AP = 0 k
0
2
0 0 m − l /4k
0 0
1
obtaining the diagonal quadratic form QP (X, Y, Z) = hX 2 + kY 2 + (m − l2 /4k)Z 2 .
If m 6= 0 we can still do this prioritizing the third variable. Hence, the only problem
may arise when k = m = 0, in which case det A = −hl2 /4, so l 6= 0. In this situation,
we set
1 0 0
,
P =
0
1
1
0 1 −1
h 0 0
,
whence P T AP =
0
l
0
0 0 −l
obtaining again a diagonal quadratic form. All the previous isomorphisms are over K.
Without loss of generality, we may assume that Q(x, y, z) = ax2 + by 2 + cz 2 , where
√
abc 6= 0. If we want to consider also isomorphisms over L, Q(x, y, z) = ( ax)2 +
√
√
( by)2 − ( −cz)2 , so all quadratic forms are isomorphic to x2 + y 2 − z 2 . Consider
the curve C in P2 (L) defined by x2 + y 2 − z 2 (which is smooth, as the rank of the
17
matrix
2x 2y −2z
is 1). Let
φ : P1 (L) −→C
(m, n) 7−→(m2 − n2 , 2mn, m2 + n2 ),
and
ψ : C −→P1 (L)
(x, y, z) 7−→(x + z, y) = (y, z − x).
Notice that (m2 − n2 )2 + (2mn)2 − (m2 + n2 )2 = 0, and if m2 − n2 = m2 + n2 = 0 then
m = n = 0 so φ is well defined. If x + z = y = 0 then we have the point (1, 0, −1), in
which case z −x 6= 0. Also, (x+z, y) = (y(x+z), y 2 ) = (y(x+z), z 2 −x2 ) = (y, z −x),
so ψ is also well defined. Finally it is easy to check that ψ◦φ = idP1 (L) and φ◦ψ = idC ,
as
(m, n) 7→ (m2 − n2 , 2mn, m2 + n2 ) 7→ (2m2 , 2mn) = (m, n)
(m, n) 7→ (m2 − n2 , 2mn, m2 + n2 ) 7→ (2mn, 2n2 ) = (m, n)
and, using y 2 = z 2 − x2 on C,
(x, y, z) 7→ (x + z, y) 7→((x + z)2 − (z 2 − x2 ), 2(x + z)y, (x + z)2 + (z 2 − x2 ))
(2x(x + z), 2y(x + z), 2z(x + z)) = (x, y, z)
(x, y, z) 7→ (y, z − x) 7→((z 2 − x2 ) − (z − x)2 , 2y(z − x), (z 2 − x2 ) + (z − x)2 )
(2x(z − x), 2y(z − x), 2z(z − x)) = (x, y, z),
18
so the maps are inverses of each other, making both curves isomorphic.
This implies that all smooth ternary quadratic forms have genus 0, and these
are the only curves over K with this property.
2.2.2
Binary Quadratic Forms
This section follows [Cox13]. A binary quadratic form is a homogeneous polynomial of degree two in two variables. We will be interested in binary quadratic forms
of the form
F (x, y) = ax2 + bxy + cy 2 ,
a, b, c ∈ Z,
gcd(a, b, c) = 1,
which are referred to as integral primitive binary quadratic forms. To it, we associate
the quantity D = b2 − 4ac, which we call the discriminant of F. Note that the
b
discriminant is the additive inverse of the determinant of the matrix ( 2a
b 2c ) and it
is always congruent to 1 or 0 modulo 4. It may happen that D = dk 2 where d is
another discriminant. If the largest such k is 1, we say that D is fundamental.
An integer n is said to be represented by F if there exist integers x, y such that
F (x, y) = n. If gcd(x, y) = 1, we say that n is properly represented by F . If F
represents n, we have that
(2ax + by)2 − Dy 2 = 4an.
If D < 0 we can see that 4an ≥ 0, so F only represents integers of the same sign.
Because of this, if D < 0 we say that F is definite. In addition, if a > 0 we say that F
is positive definite and if a < 0 we say that F is negative definite. Studying negative
definite quadratic forms is equivalent to studying positive definite quadratic forms
19
so in the case of definite binary quadratic forms we will restrict ourselves to positive
definite binary quadratic forms. If D > 0, F represents integers of both signs, so we
say that F is indefinite.
If the matrix α = ( vt wu ) ∈ GL2 (Z), then left multiplication by α provides a
bijection from Z2 to itself (viewed as column vectors), which descends to the subset
of pairs of integers which are relatively prime, for if gcd(x, y) = 1 we have integers
A and B such that Ax + By = 1, so
x
x
−1
A B = 1 ⇒ A B α α = 1,
y
y
whence, denoting by (A′ B ′ ) the matrix (A B)α−1 we find that A′ x′ + B ′ y ′ = 1,
where the pair x′ , y ′ is the image of the pair x, y by α.
Define an action of GL2 (Z) on the set of binary quadratic forms by
(F · α)(x, y) = F ((x, y)αT ) = F (tx + uy, vx + wy)
= a(tx + uy)2 + b(tx + uy)(vx + wy) + c(vx + wy)2
= F (t, v)x2 + (2atu + btw + buv + 2cvw)xy + F (u, w)y 2 .
It is a right action because we can write
F (x, y) =
a b/2 x
,
x y
y
b/2 c
20
so if α, β ∈ GL2 (Z) we have
x
a b/2
(F · (αβ))(x, y) = x y (αβ)T
αβ
y
b/2 c
x
a b/2
= x y β T αT
αβ
y
b/2 c
=((F · α) · β)(x, y).
By the preceding comments, the binary quadratic forms F and F · α (properly)
represent the same set of integers. We say that F and F · α are equivalent. If,
furthermore, α ∈ SL2 (Z), we say that F and F · α are properly equivalent. The
orbits of the action of SL2 (Z) are referred to as classes, as the action induces an
equivalence relation in the set of binary quadratic forms. Note that the discriminant
of two equivalent forms is the same (as the determinant will be multiplied by the
determinants of α and αT , both equal to 1), so each class is comprised of forms with
the same discriminant. For a fixed D we may partition the set of binary quadratic
forms of discriminant D into the classes mentioned above. This set is denoted Cl(D)
and it can be endowed with a group structure called the composition law, due to
Gauss. See [Cox13] for details.
A positive binary quadratic form ax2 + bxy + cy 2 is said to be reduced if
|b| ≤ a ≤ c,
and b ≥ 0 if either |b| = a or a = c.
Then, every positive definite binary quadratic form is properly equivalent to a unique
reduced form, yielding canonical representatives for the classes of Cl(D) when D < 0.
21
Furthermore, if a positive binary quadratic form is reduced, we have that
r
−D = 4ac − b2 ≥ 4a2 − b2 ≥ 3a2 ,
−D
, implying that there are only finitely many positive definite
3
binary quadratic forms with a fixed discriminant, as this bounds the value of b as
whence a ≤
well, and c is the quotient of b2 − D and 4a. (In particular, we just need 4a to divide
b2 − D at this point.)
For example, for D = −164, a ≤
p
164/3 < 8. Since b2 − 4ac = −164, b is an
even integer whose absolute value is bounded by a. If a = 1, b can only be 0, making
c = 41 and yielding the form x2 + 41y 2 . If a = 2, b can only be 0 or 2. In the former
case, 4a does not divide b2 − D and and in the latter we obtain c = 21, giving only
the form 2x2 + 2xy + 21y 2 . For a = 3, b can be −2, 0 and 2. If b = ±2, c = 14 and if
b = 0, c would not be an integer. We obtain the forms 3x2 ± 2xy + 14y 2 . For a = 4,
the possible values of b are −2, 0, 2 and 4, but none of these make b2 + 164 divisible
by 16, so there are no forms with a = 4. a = 5 has ±4, ±2 and 0 as possible values
for b. The only ones that work are b = ±4, where c = 9. This gives us the forms
5a2 ± 4xy + 9y 2 . If a = 6, the possible values of b are ±4, ±2, 0 and 6. For b = ±2
we obtain c = 7 and the other values do not give any integral values of c, yielding
the forms 6x2 ± 2xy + 7y 2 . Finally, if a = 7, b can be ±6, ±4, ±2 and 0. For b = ±2,
we would get c = 6, which is smaller than 7. For the other values, the divisibility
relation is not satisfied, so there are no reduced positive definite binary quadratic
22
forms in this case. Summarizing, we have that
Cl(−164) = {[x2 + 41y 2 ], [2x2 + 2xy + 21y 2 ], [3x2 + 2xy + 14y 2 ], [3x2 − 2xy + 14y 2 ]
[5x2 + 4xy + 9y 2 ], [5x2 − 4xy + 9y 2 ], [6x2 + 2xy + 7y 2 ], [6x2 − 2xy + 7y 2 ]}
The case of indefinite forms is a little bit more subtle. The remaining of this
subsection follows [Fla89]. A form F (x, y) = ax2 + bxy + cy 2 of discriminant D > 0
is said to be reduced if
0<b<
√
D
and
√
D − b < |2a| <
√
D + b.
Every indefinite binary quadratic form is properly equivalent to a reduced form,
but it is not unique. However, all the reduced forms equivalent to a given form
are of a very particular kind. Define the right neighbor RF of F to be the form
cx2 + Bxy + Cy 2 of the same discriminant as F such that B is the integer in the
√
√
interval ( D − |2c|, D) satisfying b + B being divisible by 2c. From 2c | b + B
we can see that b and B have the same parity, so 2 | b − B, further implying that
4c | b2 − B 2 = D + 4ac − B 2 . Whence 4c | B 2 − D and C is just the corresponding
quotient, an integer, which shows that the right neighbor is also an integral binary
quadratic form. Moreover, if b + B = 2cδ, RF = F · ( 10 −1
δ ) , so RF and F are
properly equivalent. We have the following theorem:
Theorem 2.5. Let F and G be two indefinite integral binary quadratic forms of
discriminant D. The sequences defined by
{Rn F }n≥0
and
23
{Rn G}n≥0
are periodic. Furthermore, all the forms appearing in the periodic part of the sequences are reduced and F and G are properly equivalent if and only if the sequences
overlap.
This gives us an algorithm to compute representatives for Cl(D), albeit not
canonical. The bounds imposed on the definition of being reduced imply that b and
a have a finite range of values, and the discriminant gives c also a finite range of
values, implying that Cl(D) is finite.
For example, for D = 145, 0 < b <
√
145 < 13, and since b and D have the
same parity, b can only take on the values 1, 3, 5, 7, 9 and 11. We summarize the
possible values a can take on in each case in the following table:
b
√
⌊ D − b⌋
√
⌊ D + b⌋
1
11
13
±6
3
9
15
±5, ±6, ±7
5
7
17
±4, ±5, ±6, ±7, ±8
7
5
19
±3, ±4, ±5, ±6, ±7, ±8, ±9
9
3
21
±2, ±3, ±4, ±5, ±6, ±7, ±8, ±9, ±10
11
1
23
±1, ±2, ±3, ±4, ±5, ±6, ±7, ±8, ±9, ±10, ±11
possible a’s
The only possible combinations such that 2a divides b2 − 145 are recorded in
the following table:
24
b
a’s
b2 − D
respective c’s
1
±6
−144
∓6
3
−
−136
−
5
±5, ±6
−120
∓6, ∓5
7
±3, ±4, ±6, ±8
−96
∓8, ∓6, ∓4, ∓3
9
±2, ±4, ±8
−64
∓8, ∓4, ∓2
−24
∓6, ∓3, ∓2, ∓1
11 ±1, ±2, ±3, ±6
This gives us 28 possible forms. After finding the right neighbor of all of them,
we can find the different orbits and we just need to take one element per orbit. We
end up with
Cl(145) = {[6x2 + xy − 6y 2 ], [5x2 + 5xy − 6y 2 ], [3x2 + 7xy − 8y 2 ], [8x2 + 7xy − 3y 2 ]}.
2.3
Elliptic Curves
After dealing with curves of genus 0, the next natural example is curves of genus
1. This theory already poses more problems than the previous case, as there are no
known algorithms that can guarantee the existence of K-rational points on the curve
or find all of the points provided that one already has one point. However, there are
algorithms that often work (see [ST92]). In this section we follow mainly [Sil86].
A genus 1 curve together with a point is referred to as an Elliptic Curve. Elliptic
curves, as in the case of genus 0 curves, always have planar models. Let us start our
brief discussion about these by establishing the nature of their equations. In order to
do this we will use again the Riemann-Roch theorem. Denote the special point (the
25
one that is part of the definition) by O and keep the notations from the previous
sections.
Consider the spaces L(n[O]) for n ∈ {1, 2, 3, 4, 5, 6}. Since deg(n[O]) = n ≥
0 = 2 · 1 − 2, Corollary 2.2(c) applies and ℓ(n[O]) = n − 1 + 1 = n. The function
1 ∈ L([O]), as div(1) = 0 ≥ −[O], and since ℓ([O]) = 1, we find that 1 is a
basis for L([O]). Since ℓ(2[O]) = 2 and ℓ(3[O]) = 3, let x ∈ L(2[O]) such that
x 6∈ L([O]) and y ∈ L(3[O]) such that x 6∈ L(2[O]). Note that x and y have poles
of order exactly 2 and 3 at O, respectively, and {1, x} and {1, x, y} are bases for
L(2[O]) and L(3[O]), respectively. The functions x2 and xy have poles of order
exactly 4 and 5, respectively, and the dimensions given by Corollary 2.2(c) give
that {1, x, y, x2 } and {1, x, y, x2 , xy} are bases for L(4[O]) and L(5[O]), respectively.
Finally, the elements x3 and y 2 both lie in L(6[O]), which has dimension 6, making
the set {1, x, y, x2 , xy, x3 , y 2 } linearly dependent, whence, there are elements li ∈ K,
i ∈ {1, 2, 3, 4, 5, 6, 7}, such that l6 l7 6= 0 and
l1 + l2 x + l3 y + l4 x2 + l5 xy + l6 x3 + l7 y 2 = 0.
After multiplying by l62 l73 we obtain
l1 l62 l73 +l2 l6 l72 (l6 l7 x)+l3 l6 l7 (l6 l72 y)+l4 l7 (l6 l7 x)2 +l5 (l6 l7 x)(l6 l72 y)+(l6 l7 x)3 +(l6 l72 y)2 = 0,
so, after rescaling we can obtain an equation of the form
y 2 + a1 xy + a3 y = x3 + a2 x2 + a4 x + a6 ,
26
(2.1)
where a1 , a2 , a3 , a4 , a6 ∈ K. Thus, we can construct an isomorphism from C into a
subset of P2 (K) via
C −→
P2 (K)
P 7−→ (x(P ), y(P ), 1),
1
x
(O), 1(O), (O) = (0, 1, 0). Equation (2.1) is
where the point O is mapped to
y
y
called the Weierstrass equation of the curve C. Often, we refer to the pair (E, O)
just by E; in this case we have a Weierstrass equation in mind in which O is the
point at infinity.
When dealing with fields of characteristic different from 2, we can multiply by
4 and complete the square on the left side to obtain
(2y + a1 x + a3 )2 = 4x3 + (4a2 + a21 )x2 + 2(2a4 + a1 a3 )x + (4a6 + a23 ),
so the change of variables
X
x
Y = P1 y ,
1
1
yields the equation
1 0 0
where P1 =
a
2
a
3 ,
1
0 0 1
Y 2 = 4X 3 + b2 X 2 + 2b4 X + b6 ,
where
b2 = 4a2 + a21 ,
b4 = 2a4 + a1 a3
27
and
b6 = 4a6 + a23 .
Furthermore, if the characteristic is different from 3, we can multiply by 24 36 to
obtain
(22 33 Y )2 = (22 32 X)3 + 3(22 32 X)2 (3b2 ) + 25 36 b4 X + 24 36 b6
= (22 32 X)3 + 3(22 32 X)2 (3b2 ) + 3(22 32 X)(3b2 )2 + (3b2 )3
+ 25 36 b4 X + 24 36 b6 − 3(22 32 X)(3b2 )2 + (3b2 )3
= (22 33 X + 3b2 )3 + (22 32 X + 3b2 )(23 34 b4 − 33 b22 )
+ 24 36 b6 − 33 b32 − 3b2 (23 34 b4 − 33 b22 )
= (36X + 3b2 )3 + 27(24b4 − b22 )(36X + 3b2 ) + 54(216b6 − 36b2 b4 + b32 ),
so the change of variables
X
X
Y = P2 Y ,
1
1
yields the equation
36 0 3b2
where P2 =
0 108 0 ,
0
0
1
Y 2 = X 3 − 27c4 X − 54c6 ,
where
c4 = b22 − 24b4
and
c6 = −b32 + 36b2 b4 − 216b6 .
For simplicity of notation, write X = x and Y = y and let us work with the
equation
y 2 = x3 − 27c4 x − 54c6 ,
which is referred to as the short Weierstrass equation of the curve C.
28
(2.2)
Note that all these transformations preserve the field where the coefficients lie
and the point at infinity (0, 1, 0). After homogeneizing we obtain that C can be
modeled by the zero set of the polynomial
y 2 z + 27c4 xz 2 + 54c6 z 3 − x3 ,
whose matrix of partial derivatives is
2
27c4 z − 3x
2
2
2yz y + 54c4 xz + 162c6 z
2
.
For this matrix to not have rank 1, we require 2yz = 0 in particular. If z = 0, we
have the point at infinity O, so x = 0 and y = 1, yielding the matrix 0 0 1 ,
with rank 1. (In particular, the point at infinity is smooth.) If y = 0, we have that
z = 1 and the simultaneous equations
x2 = 9c4 ,
c4 x = −3c6 .
Squaring the latter, multiplying by c24 the former and equating yields 9c34 = 9c26 .
Notice that the discriminant of the cubic polynomial (in x) defining the elliptic
curve is ∆′ = 22 39 (c34 − c26 ), so the discriminant of this polynomial must be nonzero.
If we start with a short Weirstrass equation already, namely, a curve E where
a1 = a2 = a3 = 0, there is no need to apply the transformations we applied in order
to obtain a short Weierstrass model. However, after applying all these operations,
we obtain the equation
y 2 = x3 + 64 a4 x + 66 a6 .
29
The discriminant of this cubic polynomial is 212 312 times the discriminant of the
cubic polynomial associated to the original equation. It makes sense that in order to
define the discriminant of the elliptic curve we take this into account, so we define
the discriminant of E as
∆ = 16 ·
c34 − c26
−4(−27c4 )3 − 27(−54c6 )2
=
,
212 312
1728
where the extra 16 is introduced in order to preserve integrality of the discriminant
when all the ai are in a specific ring. If the ai are all integers, it can be shown that
c34 −c26 is always divisible by 1728 and that this is the largest possible integer with this
property. (It just suffices to expand the quotient in terms of the ai to see that 1728
divides, and plugging in the tuples (0, −1, −1, 0, 0) and (1, 0, −1, 0, 0) gives the values
−11 and −28 respectively, which do not share any common factors.) In the case of
starting with the short Weierstrass equation y 2 = x3 + ax + b, ∆ = −16(4a2 + 27b3 ).
Note that even if we start with coefficients in a number ring, we can reduce modulo its primes, resulting in a curve in the quotient ring. The value of the discriminant
will be reduced modulo the same prime, and if it is nonzero, we will obtain an elliptic
curve as well and the curve is said to have good reduction modulo this prime. If the
discriminant reduces to 0, the curve is said to have bad reduction modulo this prime.
We define the j-invariant of the curve as
j=
c34
1728c34
=
.
c34 − c26
∆
The j-invariant does not depend on the model chosen for the curve and since ∆ 6= 0,
this value is always in K. It classifies elliptic curves up to L-isomorphism, as we will
30
explain next. Also, its importance will become even more prominent when we talk
about modular parametrizations. For simpliticy of the argument, let us consider
only the case where char(K) 6= 2, 3. Let E and E ′ be two elliptic curves with the
same j-invariant and let c4 , c6 and c′4 , c′6 be the corresponding values obtained after
finding a short Weierstrass model for E and E ′ , respectively. This is,
E : y 2 = x3 − 27c4 x − 54c6
and E ′ : y 2 = x3 − 27c4 x − 54c6 .
If j = 0, c4 = c′4 = 0. Note that if c4 c′4 = 0, j = 0 as well. Since ∆∆′ 6= 0, c6 c′6 6= 0.
Let u be a sixth root of the quotient c6 /c′6 . After multiplying the equation for E by
u6 we obtain
(u3 y)2 = (u2 x)3 − 54u6 c6 = (u2 x)3 − 54c′6 ,
so the map E −→ E ′ , (x, y) 7−→ (u2 x, u3 y) is an isomorphism. If j = 1728, c4 =
c34 − c26 , implying c6 = 0. Likewise for E ′ , so c′6 = 0 as well. Note that if c6 c′6 = 0,
j = 1728. Let u be a fourth root of the quotient c6 /c′4 . After multiplying the equation
for E by u6 we obtain
(u3 y)2 = (u2 x)3 − 27u6 c4 x = (u2 x)3 − 27c′4 (u2 x),
so the map E −→ E ′ , (x, y) 7−→ (u2 x, u3 y) is an isomorphism again. If j 6= 0, 1728,
after equating the equations for the j-invariants, cross-multiplying and canceling, we
obtain the equation
c34 c′2
6
=
2
c′3
4 c6 ,
or, since
c4 c′4 c6 c′6
31
6= 0,
c′4
c4
3
=
c′6
c6
2
.
3
′ 2
c′4
c6
Let u ∈ L be a fourth root of
Its twelfth power is u =
=
,
c4
c6
so u′6 = ±c′6 /c6 . Let u = u′ if this sign is positive and −u′ if it is negative. We will
′
c′4 /c4 .
′12
still have u4 = c′4 /c4 but now we also have that u6 = c′6 /c6 . The same isomorphism
as before will do the job.
We define the invariant differential associated to the curve as
ω=
dy
dx
= 2
.
2y + a1 x + a3
3x + 2a2 x + a4 − a1 y
The functions y 2 + a1 xy + a3 y and x3 + a2 x2 + a4 x + a6 are equal in K(C), so after
applying the formal symbol d we obtain the same differential. It follows that
d(y 2 + a1 xy + a3 y) = d(x3 + a2 x2 + a4 x + a6 )
2ydy + a1 xdy + a1 ydx + a3 dy = 3x2 dx + 2a2 xdx + a4 dx
=⇒ (2y + a1 x + a3 )dy = (3x2 + 2a2 x + a4 − a1 y)dx
=⇒
3x2
dx
dy
=
,
+ 2a2 x + a4 − a1 y
2y + a1 x + a3
showing that both differentials given in the definition of ω are the same. The curve
is given by the polynomial f = y 2 z + a1 xyz + a3 yz 2 − x3 − a2 x2 z − a4 xz 2 − a6 z 3 , so
∂f ∂f ∂f
the matrix
, where
∂x ∂y ∂z
∂f
= −(3x2 + 2a2 xz + a4 z 2 − a1 yz)
∂x
∂f
= 2yz + a1 xz + a3 z 2
∂y
∂f
= y 2 + a1 xy + 2a3 yz − a2 x2 − 2a4 xz − 3a6 z 2 ,
∂z
32
has rank 1. Notice that
x
∂f
∂f
∂f
+y
+z
= 3f,
∂x
∂y
∂z
so, if 3x2 + 2a2 xz + a4 z 2 − a1 yz = 2yz + a1 xz + a3 z 2 = 0, since f = 0, it follows that
∂f
= 0, so for points such that z 6= 0 it would follow that the matrix has rank 0,
z
∂z
yielding a contradiction. This means that the two polynomials in the definition of ω
cannot vanish simultaneously at finite points.
∂f
6= 0, namely, 2y0 +a1 x0 +a3 6= 0. We can
Take a point P = (x0 , y0 ) ∈ C where
∂y
rewrite the polynomial x3 +a2 x2 +a4 x+a6 as (x−x0 )3 +A2 (x−x0 )2 +A4 (x−x0 )+A6
for some A2 , A4 , A6 ∈ L. Also, y 2 + a1 xy + a3 y = y 2 + a1 (x − x0 )y + (a3 + a1 x0 )y,
so putting both together and evaluating at P we find that y02 + (a3 + a1 x0 )y0 = A6 .
This means that
y 2 + (a3 + a1 x0 )y − A6 = (y − y0 )(y + y0 + a1 x0 + a3 ) ∈ hx − x0 i in L[C].
The polynomial y + y0 + a1 x0 + a3 does not vanish at P , which means that y − y0 ∈
hx − x0 i in the localization of L[C] away from mP , making x − x0 a uniformizer at
P , implying that ordP (ω) = 0.
∂f
∂f
If
= 0 at P , we have that
6= 0.
∂y
∂x
y 2 + a1 xy + a3 y = (y 2 − 2yy0 + y02 ) + a1 x(y − y0 ) + a3 y0 + 2yy0 − y02 + a1 xy0
= (y − y0 )2 + a1 x(y − y0 ) + a3 y0 + 2y0 (y − y0 ) + y02 + a1 xy0
= x3 + a2 x2 + a4 x + a6 ,
33
so
x3 + a2 x2 + a4 x + a6 − a1 xy0 − (y02 + a3 y0 ) =
x3 + a2 x2 + a4 x + a6 − a1 xy0 − (x30 + a2 x20 + a4 x0 + a6 − a1 x0 y0 ) =
(x3 − x30 ) + a2 (x2 − x20 ) + a4 (x − x0 ) − a1 y0 (x − x0 ) =
(x − x0 )(x2 + xx0 + x20 + a2 (x + x0 ) + a4 − a1 y0 ).
Putting both equations together we find that
(y − y0 )2 + (a1 x + 2y0 )(y − y0 ) = (x − x0 )(x2 + xx0 + x20 + a2 (x + x0 ) + a4 − a1 y0 ),
which lies in the ideal hy − y0 i in L[C]. The polynomial x2 + xx0 + x20 + a2 (x + x0 ) +
a4 − a1 y0 does not vanish at P so x − x0 ∈ hy − y0 i in the localization of L[C] away
from mP , making y − y0 a uniformizer at P , implying again that ordP (ω) = 0.
Since the curve has genus 1, Corollary 2.2(b) states that the degree of a canonical
divisor is 0, implying that ordO (ω) = 0 as well, so the differential ω is holomorphic
and nonvanishing. (It is also easy to take πO = x/y as a uniformizer, compute ω as
gdπO for some g ∈ K(C) and compute ordO (g).)
2.3.1
Group Structure
When we were dealing with a curve of genus 0, C, we showed that for two
different points, say P and Q, there exists an element in L(C)× such that div(f ) =
[P ] − [Q]. In big contrast with this idea, when we have a curve of genus 1, [P ] cannot
even be linearly equivalent to [Q]. We record this in the following lemma.
34
Lemma 2.6. Let (E, O) be an elliptic curve and let P, Q ∈ E. Then
[P ] ∼ [Q] in Pic(E)
⇐⇒
P = Q.
Proof. One direction is trivial so we will focus on the other one. Let f ∈ L(E)×
such that div(f ) = [P ] − [Q]. Since [P ] − [Q] ≥ −[Q], this implies that f ∈ L([Q]).
By 2.2(c), ℓ([Q]) = deg([Q]) = 1, so f actually generates L([Q]). Since constant
functions are in L([Q]), this shows that f is constant, div(f ) = 0 and [P ] = [Q] in
Div(E), so P = Q.
Proposition 2.7. Let (E, O) be an elliptic curve. For every D ∈ Div0 (E) there
exists a unique point P ∈ E such that D ∼ [P ] − [O] in Pic0 (E). The map from
Div0 (E) −→ E that maps D to [P ] as described above induces a bijection between
the sets Pic0 (E) and E.
Proof. Consider the divisor D + [O], of degree 1. By 2.2(c), ℓ(D + [O]) = 1, so,
let 0 6= f ∈ L(D + [O]). This implies that div(f ) ≥ −D − [O], but deg(div(f )) = 0
and deg(D − [O]) = −1, so there must exist a point P ∈ E such that div(f ) =
−D − [O] + [P ], whence D ∼ [P ] − [O] as claimed. Uniqueness follows from Lemma
2.6, as [P ] − [O] ∼ [P ′ ] − [O] implies [P ] ∼ [P ′ ]. This maps descends to Pic0 (E)
because if D1 ∼ D2 , D1 7→ P1 and D2 7→ P2 , we have that
[P1 ] − [O] ∼ D1 ∼ D2 ∼ [P2 ] − [O],
so [P1 ] ∼ [P2 ] and this implies P1 = P2 , where it is also clear that the map is
injective in this quotient. Surjectivity follows from mapping [P ] − [O] 7→ P for every
P ∈ E.
35
Proposition 2.7 allows us to assign a group structure to E, as it is in bijection
with an abelian group. The identity of Pic0 (E) is 0 = [O] − [O], so O is the identity
of E. If P and Q are two points in E, the bijection assigns to the point P + Q the
point corresponding to the degree-zero divisor [P ] + [Q] − 2[O]. Recall that O is a
K-rational point, so, if we start with a divisor D defined over M , where K ⊆ M ⊆ L
is an intermediate field, the divisor D + [O] is also defined over M . Let P be the
point that corresponds to D and f ∈ L(E)× such that
div(f ) = −D − [O] + [P ].
Let σ ∈ Gal(L/M ) and apply σ to this equality to obtain
div(σ(f )) = σ(div(f )) = −σ(D) − σ([O]) + σ([P ]) = −D − [O] + [σ(P )].
Since σ(f ) ∈ L(E)× , this implies that D ∼ [σ(P )] − [O], and by Lemma 2.6 we
find that P = σ(P ). This implies that P ∈ E(M ), so starting with an element of
Div0M (E) produces a point defined over M . From this, it is clear that if P, Q ∈ E(M ),
P + Q ∈ E(M ) as well, as the divisor [P ] + [Q] − 2[O] ∈ Div0M (E). Likewise, the
divisor −[P ] + [O] is defined over M , so its corresponding point, call it −P, is defined
over M as well and P + (−P ) is the point corresponding to
[P ] + [−P ] − 2[O] = ([P ] − [O]) + ([−P ] − [O]) ∼ ([P ] − [O]) + (−[P ] + [O]) = 0,
whence P + (−P ) = O. We conclude that −P ∈ E(M ) and that −P is the inverse
of P, making E(M ) a subgroup of E (and thus, giving it a group structure as well.)
If K is a number field, the group E(K) always has at least one element (namely, the
36
point O). It may consist of a finite or an infinite number of elements. Nevertheless,
we have the following structure theorem:
Theorem 2.8 (Mordell). Let K be a number field and let (E, O) be an elliptic curve
defined over K. The abelian group E(K) is finitely generated. This is, there exist a
finite group T and and a nonnegative integer r such that
E(K) ∼
= T × Zr .
The integer r of Mordell’s theorem is called the algebraic rank of E over K, and
T is the torsion subgroup of E.
2.3.2
Isogenies
Let E1 and E2 be two elliptic curves defined over K. An element f ∈ K(E2 )
can be seen as a map f : E2 −→ P1 (L). A map φ : E1 −→ E2 induces a pull-back
map φ∗ : K(E2 ) −→ K(E1 ) defined by precomposing by φ, i.e., φ∗ f = f ◦ φ. The
following diagram illustrates the definition.
E1 E
φ
E
φ∗ f
E
//
E2
f
E""
P1 (L)
If the map is a morphism of algebraic varieties and a homomorphism of groups,
we say that φ is an isogeny from E1 to E2 and that E1 and E2 are isogenous. Clearly
φ∗ 1 = 1, φ∗ (f +g) = φ∗ f +φ∗ g and φ∗ (f ·g) = φ∗ f ·φ∗ g, so φ∗ is a field homomorphism,
and for φ 6= 0, K(E1 ) is a finite extension of φ∗ (K(E2 )) (Theorem II.2.4, [Sil86]).
We define the degree of φ to be 0 if φ = 0 or if φ 6= 0, the index [L(E1 ) : φ∗ L(E2 )]
(where L is again an algebraic closure of K) and we denote it by deg φ. If we have
37
φ : E1 −→ E2 and ψ : E2 −→ E3 , we get the compositions
φ
E1
// E
ψ
2
L(E1 ) oo
φ∗
L(E2 ) oo
// E
3
ψ∗
L(E3 ),
so L(E1 ) is an algebraic field extension of φ∗ ◦ ψ ∗ L(E3 ) and
[L(E1 ) : (φ∗ ◦ ψ ∗ )(L(E3 ))] = [L(E1 ) : φ∗ (L(E2 ))] · [φ∗ (L(E2 )) : (φ∗ ◦ ψ ∗ )(L(E3 ))]
= [L(E1 ) : φ∗ (L(E2 ))] · [L(E2 ) : ψ ∗ (L(E3 ))],
where the last equality follows from the fact that a field is isomorphic to its image
under any field homomorphism. We can see that deg(ψ ◦ φ) = deg φ · deg ψ.
An isogeny is either constant or surjective (Theorem II.2.3, [Sil86]), so for any
Q ∈ E2 there is at least one P ∈ E1 with φ(P ) = Q. Let πP and πQ be uniformizers
at P and Q, respectively. Theorems II.2.6 and III.4.10 in [Sil86] can be adapted
to our needs as follows: If φ is a separable map (K(E1 ) is a separable extension of
φ∗ (K(E2 )), which is always the case when char(K) = 0), then the ramification index
eφ (P ) = ordP (φ∗ πQ ) = 1 for all P ∈ φ−1 (Q). For every Q ∈ E2 we have
X
eφ (P ) = #φ−1 (Q) = deg φ.
P ∈φ−1 (Q)
This means that every point has the same number of preimages, and that this number
is also equal to the degree of the isogeny. In particular, deg φ = # ker φ.
If the source and the target of the isogeny are the same elliptic curve E, we
can make two isogenies interact with each other to obtain new isogenies. Denote
38
by End(E) the set of isogenies from E to itself, together with the zero map. For
φ, ψ ∈ End(E), we define (φ + ψ)(P ) = φ(P ) + ψ(P ). If ϕ ∈ End(E), we have that
(ϕ ◦ (φ + ψ))(P ) = ϕ((φ + ψ)(P )) = ϕ(φ(P ) + ψ(P ))
= ϕ(φ(P )) + ϕ(ψ(P )) = (ϕ ◦ φ)(P ) + (ϕ ◦ ψ)(P ),
so ϕ ◦ (φ + ψ) = ϕ ◦ φ + ϕ ◦ ψ. Likewise, we have (φ + ψ) ◦ ϕ = φ ◦ ϕ + ψ ◦ ϕ. Denote by
[0] : E −→ E and [1] : E −→ E the maps defined by [0](P ) = O and [1](P ) = P for
all P ∈ E. They are clearly isogenies and the sum and composition endow End(P )
with a ring structure.
For a positive integer m we have a map
[m] : E −→ E defined as P 7−→ P
· · + P} = mP
| + ·{z
m times
called the multiplication by m map. Its kernel is denoted by E[m] and we refer to it
as the m-torsion of E. We also have the map [−1] : E −→ E which simply assigns
the inverse of P to P . [−1] is a homomorphism, as E is abelian. If m is a negative
integer, we define [m] := [−m] ◦ [−1] = [−1] ◦ [−m]. With these definitions, we can
easily see that [m] + [n] = [m + n] and [m · n] = [m] ◦ [n] for all m, n ∈ Z. This defines
a ring homomorphism from Z −→ End(E). Proposition III.4.2 in [Sil86] shows that
this is an injective homomorphism and that End(E) has no zero divisors.
Let us get back to the case where we have an isogeny φ : E1 −→ E2 . As mentioned
above, this induces a map φ∗ : K(E2 ) −→ K(E1 ), which can be extended to a map
φ∗ : L(E2 ) −→ L(E1 ). This isogeny also defines a pull-back at the level of divisor
39
groups, which by abuse of notation we also denote
φ∗ : Div(E2 ) −→ Div(E1 ) where φ∗ [Q] =
X
eφ (P )[P ].
P ∈φ−1 (Q)
The definition of the ramification index implies that φ∗ πQ lies in the eφ (P )-th power
of the maximal ideal mP , but not in its (eφ (P ) + 1)-st power. Since the powers
e (P )
φ
∗
of mP are principal, there exists t ∈ L(E1 )×
mP such that φ πQ = πP
· t, as πP
m
is any generator of mP . Let f ∈ K(E2 ) and m = ordQ (f ), so f = πQ
· g, where
g ∈ K(E2 )×
mQ . Then, we have
e (P )·m
m
φ∗ f = φ∗ (πQ
· g) = (φ∗ πQ )m · φ∗ g = πPφ
· tm · φ∗ g.
Since g is defined and does not vanish at Q, φ∗ g is defined and does not vanish at P.
This implies that ordP (φ∗ f ) = eφ (P ) ordQ (f ). It follows that
div(φ∗ f ) =
X
P ∈E1
=
X
ordP (φ∗ f )[P ] =
X
X
eφ (P ) ordQ (f )[P ]
P ∈E1
eφ (P ) ordQ (f )[P ] =
Q∈E2 P ∈φ−1 (Q)
=
X
ordQ (f )φ∗ [Q] = φ∗
Q∈E2
X
ordQ (f )
Q∈E2
X
ordQ (f )[Q]
Q∈E2
!
X
eφ (P )[P ]
P ∈φ−1 (Q)
= φ∗ (div f ),
so φ∗ maps principal divisors to principal divisors. Moreover,
X
X
deg φ∗ [Q] = deg
eφ (P )[P ] =
eφ (P ) = deg φ = deg φ deg[Q],
P ∈φ−1 (Q)
P ∈φ−1 (Q)
so deg(φ∗ D) = deg φ deg D and φ∗ maps degree zero divisors to degree zero divisors.
Putting both things together, we find that the isogeny φ induces a homomorphism
40
φ∗ : Pic0 (E2 ) −→ Pic0 (E1 ). Using this homomorphism, we construct φ̂ via the following commutative diagram:
∼
E 2
//
Pic0 (E2 )
φ∗
φ̂
∼
E1 oo
Pic0 (E1 )
This map can be realized as follows. Let Q ∈ E2 . The top isomorphism maps
Q to the class of [Q] − [O], which is mapped via φ∗ , in the unramified case we are
dealing with, to
X
P ∈φ−1 (Q)
[P ] −
X
[T ] =
T ∈φ−1 (O)
X
P ∈φ−1 (Q)
([P ] − [O]) −
X
T ∈φ−1 (O)
([T ] − [O]),
whose class is mapped to
X
P ∈φ−1 (Q)
X
P−
T.
T ∈φ−1 (O)
Fix a point P ∈ φ−1 (Q). For any other point in φ−1 (Q), say Pi , we have that
φ(Pi − P ) = φ(Pi ) − φ(P ) = Q − Q = O, so there exists Ti ∈ φ−1 (O) such that
Pi − P = Ti . This shows that
X
P ∈φ−1 (Q)
P−
X
T ∈φ−1 (O)
deg φ
T =
X
i=1
(Pi − Ti ) = [deg φ]P.
In particular, we find that (φ̂ ◦ φ)(P ) = φ̂(Q) = [deg φ]P. It can be shown that φ̂ is
ˆ
an isogeny as well, that deg φ̂ = deg φ and that φ̂ = φ. The isogeny φ̂ is referred to
as the dual isogeny to φ.
From above, it follows that φ ◦ φ̂ = [deg φ] on E1 and φ̂ ◦ φ = [deg φ] on E2 .
\
Theorem III.6.2 in [Sil86] shows that if φ, ψ : E1 −→ E2 are two isogenies, φ
+ψ =
41
ˆ = [1], it follows that [m
c [1], so if we have that [m]
c = [m],
\
φ̂+ ψ̂. Since [1]
+ 1] = [m]+
c ◦ [m] = [m] ◦ [m] = [m2 ], so
\
we find that [m
+ 1] = [m + 1]. Now, [deg[m]] = [m]
deg[m] = m2 , as Z embeds in End(E1 ). This last fact allows us to show a structure
theorem for the torsion points, which is Theorem III.6.4 in [Sil86].
Theorem 2.9. Let (E, O) be an elliptic curve defined over a field K, p be a prime
number and r a positive integer. We have that
Z/pr Z × Z/pr Z if char(K) 6= p
r
r
∼
E[p ] = E[p ](K̄) =
Z/pr Z or 0
if char(K) = p
When char(K) = p, we can see that there are two options. We say that E is
ordinary in the former case and supersingular in the latter. (The term supersingular
does not imply that E is singular, despite the name.)
In an abelian group, when two torison elements have relatively prime orders the
order of their product is the product of their orders, i.e., if ord x = h and ord y = k
with gcd(h, k) = 1, we have that ord(xy) = hk. This follows from trivial computations. In order to understand the structure of E[m], we just need to understand the
structure of E[pr ] for each pr dividing m. In particular, if char(K) ∤ m or char(K) 6= 0,
we have that E[m] ∼
= Z/mZ × Z/mZ.
2.3.3
L-functions
The Riemann ζ-function is defined on the right half-plane ℜ(z) > 1 by the
Dirichlet series
ζ(s) =
∞
X
n=1
n−s =
Y
p∈P
42
1 − p−s
−1
,
where P is the set of rational primes. The last equality follows from the Fundamental
Theorem of Arithmetic and the absolute convergence of the infinite series, and it is
called the Euler product of ζ(s). Each factor is called the local factor at p. More
generally, for a number field K, the Dedeking ζ-function is defined on the same right
half-plane as the Riemann ζ-function by the Dirichlet series
ζ(K, s) =
∞
X
an n−s =
n=1
X
NK/Q (I)−s =
I⊆OK
Y
p⊆OK
1 − NK/Q (p)−s
−1
,
where an is the number of ideals of norm n in OK , the ring of algebraic integers of K,
and the last product runs over prime ideals in OK . The last equality follows from the
fact that every ideal can be uniquely decomposed as a product of prime ideals, plus
the absolute convergence of the series in the convergence half-plane. These functions
have a simple pole at s = 1, can be extended holomorphically to the rest of the
complex plane and have a functional equation which gives the function some sort of
symmetry with respect to the vertical line s = 1/2.
For a Dirichlet character χ we can also define a Dirichlet L-function
L(χ, s) =
∞
X
χ(n)n−s =
n=1
Y
p∈P
(1 − χ(p)p−s )−1 ,
and these also have functional equations and can be analytically continued to the
whole plane, with a possible pole at 1 and/or 0.
It turns out that it is possible to attach to the elliptic curve E defined over Q a
function with very similar properties. In order to define these, for a prime number p
let us define the curve Ep as the curve obtained by reducing a minimal Weierstrass
model of E modulo p. When p ∤ ∆, this results in an elliptic curve over Fp , and
43
since P2 (Fp ) is a finite set, we can count the number of elements in Ep (Fp ), Np . Let
ap = p + 1 − Np , referred to as the trace of Frobenius at p. Define the local factor
at p to be Lp (s) = 1 − ap p−s + p1−2s . When p | ∆, we define the local factor at p to
be 1 + p−s , 1 − p−s or 1, depending on the type of bad reduction at the prime p. We
define the Hasse-Weil function as
L(E, s) =
Y
−1
Lp (s)
=
∞
X
an n−s ,
n=1
p∈P
where the ap are the traces of the Frobenii for prime numers p, follow specific recurrence relations for prime powers, and are multiplicative. We have the Hasse bound,
√
which states that |ap | ≤ 2 p. Thanks to it, these expressions are known to converge
for ℜ(s) > 3/2, and they can be extended holomorphically to the whole complex
plane (see Section 3.4 and Section 3.5). They have similar functional equations to
the ζ-functions discussed above and share some symmetry with respect to the vertical line s = 1. The value of L(E, s) at s = 1 is of utmost importance. The order of
the zero at s = 1 is called the analytic rank of the elliptic curve E.
Conjecture 2.10 (Birch-Swinnerton-Dyer). Let E be an elliptic curve over Q. Its
algebraic rank and its analytic rank coincide.
44
Chapter 3
Modular Forms
In this chapter we will introduce modular forms from a classical point of view,
following mainly [DS05]. Denote by H the complex upper half-plane throughout.
For a positive integer N we define the principal congruence subgroup of level N to
be
a b
a b 1 0
Γ(N ) =
≡
∈ SL2 (Z) :
c d
0 1
c d
(mod N ) .
Let Γ(N ) ⊆ Γ ⊆ SL2 (Z) be another group. We say that Γ is a congruence subgroup
of level N . The two most important examples of level N congruence subgroups are
a
Γ0 (N ) =
c
a
Γ1 (N ) =
c
b
a
∈ SL2 (Z) :
c
d
b
a
∈ SL2 (Z) :
c
d
b ∗
≡
0
d
b 1
≡
0
d
∗
∗
∗
1
(mod N )
(mod N ) .
Let f : H −→ C be a function and k be an integer. There is a left action of
GL2 (Q)+ on H (actually on C ∪ {∞}, the Riemann Sphere) via fractional linear
45
transformations, i.e.,
α(τ ) =
aτ + b
a b
, where α =
.
cτ + d
c d
Define the weight k slash operator, |k , as
f |k [α](τ ) = det(α)k−1 (cτ + d)−k f
aτ + b
cτ + d
a b
, where α =
,
c d
which gives a right action of GL2 (Q)+ on the set of functions with domain H and
codomain C. We say that f is weakly modular of weight k with respect to Γ if f is
meromorphic and
f |k [α] = f for all α ∈ Γ.
Since Γ ⊇ Γ(N ), we know that the matrix ( 10 N1 ) ∈ Γ. Let h be the smallest positive
integer such that ( 10 h1 ) ∈ Γ. For a weakly modular form of any weight with respect
to Γ, we have that f (τ ) = f (τ + h), so f is periodic with period h. The map
τ 7→ e2πiτ /h = qh wraps H into the punctured unit disk of radius 1 and is also
periodic of period h. This allows us to see f as a map with the punctured disk as its
domain and the meromorphicity gives us a Laurent expansion, so
f (τ ) =
X
an qhn ,
n∈Z
which is referred to as the q-expansion of f at the cusp ∞. We say that f is
holomorphic at ∞ if an = 0 for all n < 0. Furthermore, we say that f vanishes at ∞
46
if a0 = 0. For a rational number r/s, the matrix α = ( ac db ) acts by
α(r/s) =
a · r/s + b
ar + bs
=
c · r/s + d
cr + ds
a
and α(∞) = ,
c
where every time a fraction has denominator 0 we consider it to be ∞. Restricted
to SL2 (Z) this action is transitive on the set Q ∪ {∞}, as for every reduced fraction
a/c, we can find b, c ∈ Z such that ad − bc = 1 and
d −b
a b
and
(a/c) = ∞,
(∞) = a/c
−c a
c d
so for any two rational numbers r1 /s1 and r2 /s2 we can find matrices such that
r1 /s1 7→ ∞ 7→ r2 /s2 . From this, we can see that the action of Γ partitions the
set Q ∪ {∞} into equivalence classes (the number of such bounded by the index
[SL2 (Z) : Γ]). For a rational number r/s, let α ∈ SL2 (Z) such that ∞ 7→ r/s. If
f is weakly modular with respect to Γ, the function f |k [α] is weakly modular with
respect to α−1 Γα, and if the latter is holomorphic (resp. vanishes) at ∞ we say
that f is holomorphic (resp. vanishes) at r/s. This is independent of the matrix and
representative chosen, as f is invariant under the action of Γ.
A Modular Form of weight k with respect to Γ is a holomorphic function that is
weakly modular of weight k with respect to Γ and that is holomorphic at the cusps.
A modular form is said to be cuspidal if it vanishes at every cusp. We also call
these just cusp forms. The set of modular forms of weight k with respect to Γ forms
a (finite dimensional) complex vector space, which we denote by Mk (Γ). The cusp
forms form a subspace, denoted by Sk (Γ). If we drop the weight index, we obtain a
47
graded algebra, as the product of two modular forms is again a modular form, the
weight being the sum of the two weights.
The first examples of modular forms come from the so-called Eisenstein series.
For k ≥ 4, we let
Gk (τ ) =
X
′
(mτ + n)−k ,
(m,n)∈Z2
where the primed summation means we exclude the term corresponding to (m, n) =
(0, 0). Holomorphicity follows from convergence theorems. Weak modularity comes
from the fact that every element of SL2 (Z) induces a bijection on Z2 − (0, 0) (and
convergence theorems as well), and the exponent we have determines the weight k.
There is only one cusp, and holomorphicity at ∞ follows from the fact that Gk (τ ) is
uniformly bounded near ∞. Some manipulations show that
∞
(2πi)k X
Gk (t) = 2ζ(k) + 2
σk−1 (n)q n ,
(k − 1)! n=1
where σk−1 (n) denotes the sum of the (k − 1)-st powers of the positive divisors of n.
If we let g2 (τ ) = 60G4 (τ ) and g3 (τ ) = 140G6 (τ ), we obtain that g2 (τ )3 − 27g3 (τ )2 is
a cusp form, as each term is a weight 12 modular form, and the leading terms are
603 · 23 · ζ(4)3 and 27 · 1402 · 22 · ζ(6)2 , both equal to (2π)12 /1728. This cusp form is
called the discriminant function, it is denoted ∆(τ ) and its q-expansion admits the
representation
12
∆(τ ) = (2π) q
∞
Y
n=1
(1 − q n )24 = (2π)12 (q − 24q 2 + 252q 3 − 1472q 4 + O(q 5 )),
nowhere vanishing on H.
48
Finally, the function
j : H −→ C,
j(τ ) = 1728
1
g2 (τ )3
= + 744 + 196884q + 21493760q 2 + O(q 3 )
∆(τ )
q
is weakly modular of weight zero, and it is holomorphic on H. However, there is a
simple pole at the cusp, with residue 1. j is referred to as the modular function,
often called the j-invariant as well.
Consider χ : Z −→ C a Dirichlet character modulo N. Since Γ1 (N ) ⊆ Γ0 (N )
we have that Mk (Γ0 (N )) ⊆ Mk (Γ1 (N )). Let f ∈ Mk (Γ1 (N )). Slashing f by an
element α = ( ac db ) of Γ0 (N ) will most likely not return f . Consider the subspace
Mk (N, χ) ⊂ Mk (Γ1 (N )) of modular forms such that f |k [α] = χ(d)f (notice that
since ad − bc = 1 and N | c, gcd(d, N ) = 1) for every α ∈ Γ0 (N ). We can verify that
Mk (Γ1 (N )) =
M
Mk (N, χ),
\
χ∈(Z/N
Z)×
which gives us a decomposition of Mk (Γ1 (N )) into eigenspaces. In particular, the
eigenspace of the trivial character is Mk (Γ0 (N )).
We also have interaction between different levels. Since Γ0 (N d) ⊆ Γ0 (N ) for all
positive integers d, we have that Mk (Γ0 (N d)) ⊇ Mk (Γ0 (N )), the same containment
holding for cusp forms. Furthermore, if f ∈ Mk (Γ0 (N )), f |k [( d0 01 )] ∈ Mk (Γ0 (N d)),
as
a
(f |k [( d0 01 )]) |k [( cNa d δb )] = f |k [( d0 01 ) ( cNa d δb )] = f |k [( cN
a
= (f |k [( cN
49
bd )]) | [( d 0 )]
k 0 1
δ
bd ) ( d 0 )]
0 1
δ
= f |k [( d0 01 )]
a
when ( cNa d δb ) ∈ Γ0 (N d) (which implies ( cN
bd )
δ
∈ Γ0 (N )). The computation extends
to cusp forms, and even to spaces of weakly modular forms in which the holomorphicity conditions are relaxed to just being meromorphic (called Automorphic forms
instead).
These remarks prompt the following definition:
Definition 3.1. Let N > 1 be an integer and let Sk,M,d = {f |k [( d0 01 )]|f ∈ Sk (Γ0 (M ))}.
Define
Sk (Γ0 (N ))old = hSk,M,d : M proper divisor of N, d a divisor of N/M i ⊆ Sk (Γ0 (N ))
as the subspace of Sk (Γ0 (N )) that comes from cusp forms of lower levels. We refer
to its elements as oldforms.
Analogously, we can define Mk (Γ0 (N ))old and even put a character modulo N
there (which also implies we can define Sk (Γ1 (N ))old and Mk (Γ1 (N ))old ).
3.1
Elliptic Curves arising from Modular Forms
Let Λ be a lattice in C, with generators ω1 and ω2 . We may assume that ω1 /ω2 ∈
H without loss of generality.
Consider the Weierstrass ℘Λ function attached to the lattice Λ, defined as
℘Λ : C − Λ −→ C
X
1
′
z 7−→ 2 +
z
ω∈Λ
1
1
− 2
2
(z − ω)
ω
,
where the primed summation denotes addition over all nonzero elements of the lattice. This even function is not defined at any point ω ∈ Λ because of the vanishing
denominators. However, it converges absolutely and uniformly on compact subsets
50
away from the lattice, yielding a meromorphic function with poles only at the elements of the lattice. Its derivative,
℘′Λ : C − Λ −→ C
z 7−→ −
X
2
(z − ω)3
ω∈Λ
is clearly Λ-periodic (thanks to the absolute convergence), which means that ℘Λ (z +
ωj ) − ℘Λ (z) is constant for j = 1, 2. Plugging in z = −ωj /2 and using the fact ℘Λ is
even, we can see this constant is 0, and hence ℘Λ is also Λ-periodic. All the terms
of the primed summation are holomorphic on a small neighborhood around 0 (as
the pole at 0 comes from the first term), so ℘Λ has a double pole at the origin, and
hence, at every point of the lattice.
The Λ-periodicity makes ℘Λ and ℘′Λ descend to functions from C/Λ to Ĉ, with
a double pole and a triple pole at the class of 0, respectively. This is reminiscent of
the functions x and y introduced in section 2.3, which had a pole of order two and
three, respectively, at O. We can actually find their Laurent expansion, but we need
a little bit more of notation for this.
For k ≥ 4 an integer, define the Eisenstein function of weight k as
Gk (Λ) =
X
′
ω −k ,
ω∈Λ
where the primed summation means the same as above. This function is absolutely
convergent so we can rearrange terms. When we specialize to the lattice Λτ = τ Z+Z,
we obtain the Gk (τ ) we defined in the previous section, so they generalize the modular
forms we had before. Since those had special transformation properties, we can
51
expect these to have special transformation properties as well, although here, they
will be more transparent. Consider a nonzero complex number λ and the lattice cΛ.
Then
Gk (λΛ) =
X
ω∈λΛ
′
ω −k =
X
′
(λω)−k = λ−k
ω∈Λ
X
′
ω −k = λ−k Gk (Λ),
ω∈Λ
Every matrix in SL2 (Z) acts on Λ in a simple transitive way, so each matrix in
SL2 (Z) is only reorganizing the terms of the sum. If we let α = ( ac db ) ∈ SL2 (Z),
τ ∈ H and λ = (cτ + d)−1 , keeping in mind that αΛ = Λ, we obtain
Gk (α(τ )) = Gk (Λα(τ ) ) = Gk (λ(αΛτ )) = λ−k Gk (Λτ ) = (cτ + d)k Gk (τ ),
which is the crucial weak modularity property.
Let us go back to ℘Λ and focus on 1/(z − ω)2 . We have
∞
∞
z n
X
X
−ω 2
1
−2
−2
=
=ω
=ω +
(n + 1)
(n + 1)ω −(n+2) z n ,
(z − ω)2
(1 − (z/ω))2
ω
n=0
n=1
where we used the identity 1/(1 − x)2 =
that
℘Λ (z) =
P∞
n=0 (n
+ 1)xn from calculus. This implies
∞
∞ X
X X
X
1
1
′
′
−(n+2) n
+
+
(n + 1)ω −(n+2) z n
(n
+
1)ω
z
=
z 2 ω∈Λ n=1
z 2 n=1 ω∈Λ
∞
∞
X
X
X
1
1
′ −(n+2) n
= 2+
(n + 1)
(n + 1)Gn+2 (Λ)z n .
ω
z = 2+
z
z
n=1
n=1
ω∈Λ
52
Notice that Gk (Λ) = 0 for odd values of k due to cancellation, so we actually have
the Laurent expansions
℘Λ (z) =
℘′Λ (z)
∞
X
1
1
(2n + 1)G2n+2 (Λ)z 2n = 2 + 3G4 (Λ)z 2 + 5G6 (Λ)z 4 + O(z 6 )
+
2
z
z
n=1
∞
−2 X
−2
2n(2n + 1)G2n+2 (Λ)z 2n−1 = 3 + 6G4 (Λ)z + 20G6 (Λ)z 3 + O(z 5 ).
= 3 +
z
z
n=1
Also, squaring ℘′Λ (z) and cubing ℘Λ (z) gives a pole of order 6 at z = 0, yielding
4
− 24G4 (Λ)z −2 − 80G6 (Λ) + O(z)
z6
4
4℘Λ (z)3 = 6 + 36G4 (Λ)z −2 + 60G6 (Λ) + O(z),
z
℘′Λ (z)2 =
so
℘′Λ (z)2 − 4℘Λ (z)3 = −60G4 (Λ)z −2 − 140G6 (Λ) + O(z),
and hence, we find that
℘′Λ (z)2 − 4℘Λ (z)3 + 60G4 (Λ)℘Λ (z) + 140G6 (Λ) = O(z).
If we restrict it to a fundamental parallelogram, the left hand side is bounded on its
closure (it being compact), so the Λ-periodicity implies the left hand side is bounded
on C. The right hand side indicates the function is holomorphic, hence, Liouville’s
Theorem implies that it is a constant. As z → 0, the right hand side tends to 0 as
well, implying said constant is 0. We conclude that
℘′Λ (z) = 4℘Λ (z)3 − 60G4 (Λ)℘Λ (z) − 140G6 (Λ),
53
or, if we adopt the terminology from the previous section,
℘′Λ (z) = 4℘Λ (z)3 − g2 (Λ)℘Λ (z) − g3 (Λ).
The map that goes from C −→ P2 (C) defined as
z 7−→
(℘Λ (z), ℘′Λ (z), 1)
=
℘Λ (z)
1
, 1, ′
′
℘Λ (z)
℘Λ (z)
can be factored through the quotient C/Λ, and its target can be restricted to EΛ (C),
where E has a Weierstrass model given by
y 2 = 4x3 − g2 (Λ)x − g3 (Λ).
This map turns out to be a group isomorphism, yielding the so-called Weierstrass
uniformization
∼
C/Λ −→ EΛ (C)
z (mod Λ) 7−→ (℘Λ (z), ℘′Λ (z), 1).
(3.1)
When the lattice is Λτ , we denote this elliptic curve by Eτ . The group operation
on C/Λ is quite easy to understand. The kernel of the multiplication by m map,
denoted by Eτ [m], is isomorphic to Z/mZ × Z/mZ and can be seen as h1/N, τ /N i.
The subgroup Eτ [m] will be very important in the next section.
Let ω3 = ω1 + ω2 and let zj = ωj /2. We can see that the zj constitute the
elements of order exactly 2 in C/Λ. Notice that ℘′Λ (zj ) = 0, as zj ≡ −zj (mod Λ)
and ℘′Λ is an odd function. This implies that ℘Λ (zj ) is a root of the polynomial
4x3 − g2 (Λ)x − g3 (Λ). Let f : C −→ Ĉ be a non-constant meromorphic function that
54
is Λ-periodic and let P be the fundamental parallelogram of Λ. (P can be seen as
the convex hull of the set {0, ω1 , ω2 , ω3 }, or the set {x1 ω1 + x2 ω2 : x1 , x2 ∈ [0, 1]}.)
Let ∂P be the boundary of P and t be a complex number such that t + ∂P does
not contain any zero or pole. Such t exists because the meromorphicity of f gives it
finitely many poles and zeroes on C/Λ. It is straightforward to compute the integrals
1
2πi
Z
f (z)dz = 0
and
t+∂P
1
2πi
Z
t+∂P
f ′ (z)
dz = 0,
f (z)
which allow us to conclude two things. Firstly, thanks to the Residue Theorem, there
are no functions f with a simple pole (as the sum of the residues is 0). Secondly,
thanks to the Argument Principle, we can see that f has the same number of poles
and zeroes, counting multiplicity, which in turn, allows us to conclude that every
value is taken on the same number of times. Hence, ℘Λ takes on every value twice,
as it only has one double pole. Since ℘′Λ (zj ) = 0, the value ℘Λ (zj ) is taken on only
once, with multiplicity two, implying that the three values ℘Λ (zj ) are distinct. This
implies that the cubic has nonzero discriminant, hence g2 (Λ)3 − 27g3 (Λ)2 6= 0. Also,
this justifies the name of the weight 12 cusp form ∆, as it is the discriminant of the
elliptic curve obtained this way. Furthermore, we can see that the j-invariant also
coincides with the modular function j from the previous section, justifying again the
terminology.
So, every lattice Λ ∈ C produces an elliptic curve over C. The converse is also
true and its based on the fact that the modular function j is surjective, which follows
from a similar argument to the one exposed above to show ℘Λ takes on every value
twice. The end of the proof is virtually the same as when we showed in the previous
55
chapter that two elliptic curves with the same j-invariant are isomorphic over the
algebraic closure.
Let φ : C/Λ −→ C/Λ′ be a (holomorphic) non-constant map of elliptic curves
(which we refer to as an isogeny). Using topological arguments, this map lifts to a
(holomorphic) map φ̃ : C −→ C, which maps Λ to Λ′ (as the class 0 + Λ needs to be
mapped to the class 0 + Λ′ ). Moreover, if z1 and z2 differ by λ ∈ Λ, φ̃(z1 ) and φ̃(z2 )
differ by λ′ ∈ Λ′ , as φ̃ lifts a map between the quotients. For a fixed λ ∈ Λ, the
difference φ̃(z +λ)− φ̃(z) ∈ Λ′ , which is a discrete set. Continuity implies it has to be
constant. Upon differentiation, we find that φ̃′ (z + λ) − φ̃′ (z) = 0, so φ̃′ is Λ-periodic,
making it bounded, and then constant by Liouville’s Theorem. Upon integration,
we find that φ̃(z) = mz + b, with m 6= 0 and b ∈ Λ′ (by what was mentioned at the
beginning of the paragraph and the fact φ is not constant). φ̃ − b also lifts φ, so we
can assume without loss of generality that φ̃(z) = mz. Since mω1 , mω2 ∈ Λ′ , we find
that mΛ ⊆ Λ′ . Conversely, if we have m ∈ C× such that mΛ ⊆ Λ′ , we clearly have a
map φ : C/Λ −→ C/Λ′ defined by φ(z) = mz.
If mΛ 6= Λ′ , there exists λ′ ∈ Λ′ − mΛ, so if we let z = λ′ /m we find that
φ(z) = λ′ = 0 in C/Λ′ , but z 6∈ Λ by definition. This implies that φ is not injective,
so, φ is not an isomorphism. If mΛ = Λ′ , we have that
1 ′
Λ
m
= Λ ⊆ Λ, so there is a
map ψ : C/Λ′ −→ C/Λ, and we can see that ψ ◦ φ and φ ◦ ψ are both the identity
map on the corresponding torus, implying that φ is an isomorphism. If we denote
by τ = ω1 /ω2 and we let m = 1/ω2 we find that C/Λ and C/Λτ are isomorphic.
Furthermore, if α = ( ac db ) ∈ SL2 (Z), the two lattices τ Z+Z and (aτ +b)Z+(cτ +d)Z
are the same, yielding the same torus.
56
If we let m = (cτ + d)−1 , we find that C/Λτ ∼
= C/Λα(τ ) , so the orbit of τ ∈ H
under the action of SL2 (Z) gives us a set {Eτ ′ : τ ′ ∈ OrbSL2 (Z) (τ )}, which is comprised
of isomorphic elliptic curves. If Eτ ∼
= Eτ ′ , there exists an m ∈ C× such that Λτ =
mΛτ ′ , so there exist integers a, b, c, d such that
mτ ′ = aτ + b
and
m = cτ + d.
Multiplying the first equation by d, the second one by b and subtracting, and multiplying the first equation by c, the second one by a and subtracting, we obtain
m(dτ ′ − b) = (ad − bc)τ
m(−cτ ′ + a) = (ad − bc),
and
respectively. If ad − bc = 0 we would have b = d = 0 and a = c = 0, as τ ′ and 1 form
a basis for Λ′ . Dividing both equations by m(ad − bc), we find that
b
d
τ
=
τ′ −
m
ad − bc
ad − bc
a
1
−c
=
τ′ +
,
m
ad − bc
ad − bc
and
which implies that ad − bc divides a, b, c and d. We can see that the determinant of
( ac db ) divides each of its entries, so we may divide each entry by it and still obtain
an integer as the new determinant. This means that 1/(ad − bc) is an integer, so
ad − bc = ±1. Dividing the first two equations we get
aτ + b
τ =
, so ℑ(τ ′ ) = ℑ
cτ + d
′
(aτ + b)(cτ + d)
|cτ + d|
=ℑ
adτ + bcτ
|cτ + d|
=
(ad − bc)ℑ(τ )
|cτ + d|
and ℑ(τ ′ ), ℑ(τ ) > 0 show that ad − bc > 0. We conclude that Eτ ∼
= Eτ ′ implies that
τ and τ ′ lie in the same orbit induced by the action of SL2 (Z). Let us collect this
into a proposition:
57
Proposition 3.2. Let E be an elliptic curve over C. There exists a number τ ∈ H
such that E ∼
= Eτ . Moreover, for τ, τ ′ ∈ H, Eτ ∼
= Eτ ′ if an only if there exists a
matrix
a b
∈ SL2 (Z) such that
c d
Proof. See above.
3.2
aτ + b
a b
= τ ′.
(τ ) =
cτ + d
c d
The Modular Curve X0 (N ).
Consider the quotient Y0 (N ) := Γ0 (N )\H = {Γ0 (N )τ : τ ∈ H} of the complex
upper half-plane by the action of the level N congruence subgroup of integral matrices
of determinant 1 which are upper-triangular modulo N . We can define a topology
on Y0 (N ) by declaring a subset of Y0 (N ) open if and only if its inverse image in H
is open. More concretely, if π : H −→ Y0 (N ) is the natural projection, Ũ is open if
and only if U = π −1 (Ũ ) is open.
There is a slight complication at points π(τ ) such that the stabilizer of τ in
Γ0 (N ) is not trivial (and, by trivial, we mean {±I}). For any other point, just take
a neighborhood small enough such that it is mapped bijectively to a neighborhood
of π(τ ). When τ is fixed by an element γ ∈ Γ0 (N ), no neighborhood will have a
property as above. These are called elliptic points. If τ is an elliptic point, the
7 ∞. The
matrix δτ = 11 −τ̄τ , with determinant τ − τ̄ 6= 0, maps τ 7→ 0 and τ̄ →
stabilizer of τ also stabilizes τ̄ , and conjugated by δτ stabilizes 0 and ∞. Quotienting
by ±I we obtain the a subgroup of distinct transformations which stabilize τ (and
0 and ∞ after conjugating). This means that all of its elements are of the form az,
and it being finite, we can see that these are all rotations about the origin. The
58
chart can be fixed by the local coordinate z hτ , where hτ is the number of these
rotations (also called the period of τ ). The charts loosely described above endow
Y0 (N ) with a Riemann surface structure. (Note that all we mentioned applies also
to any congruence group.)
It would be desirable that Y0 (N ) be compact, but it turns out that we need
to add some points for this to happen. Intuitively, we need to add ∞ to H. As
we do this for N = 1, we see that we also would need to add every element in the
orbit of ∞ under the action of SL2 (Z). This orbit is precisely all rational numbers,
as we described in the introduction of this chapter. We compactify H by adding
all these elements (called cusps, just as before), so H∗ := H ∪ Q ∪ {∞}. We define
X0 (N ) := Γ0 (N )\H∗ = {Γ0 (N )τ : τ ∈ H∗ }. The map π extends to H∗ , and, in order
to endow X0 (N ) with a topology, we need to endow H∗ with one as well. The only
points that are new are the cusps. Neighborhoods about ∞ will be, as expected,
{∞} ∪ {τ ∈ H : ℑ(τ ) > c}, for any c ∈ R+ . Neighborhoods about any other cusp will
be images under SL2 (Z) of the neighborhoods we just described. For the charts, if
s ∈ Q we can bring it up to ∞ with a marix in SL2 (Z) and look at the stabilizer over
1
there. The stabilizer of ∞ in SL2 (Z) is generated by the two matrices ±1
0 ±1 , so,
in this conjugate of Γ0 (N ) it will be a subgroup of the group generated by them. Let
h be the smallest positive element that can occur as a top right entry in one of these
h
lies in Γ but ( 10 h1 ) ∈ Γ
matrices. (For some groups, it could happen that −1
0 −1
does not.) The map e2πiτ /h maps this neighborhood of ∞ biholomorphically into a
disk centered at 0, which works as the sought chart.
59
The compactification just mentioned turns X0 (N ) into a compact Riemann surface. The appendix of [GH94] contains the following theorem:
Theorem 3.3. Suppose C is a compact Riemann surface. Then there exists an
immersion
f : C −→ P2 (C)
such that f (C) has at most ordinary double points.
In particular, this means that compact Riemann surfaces are algebraic curves.
In the previous section, we saw how the Riemann surfaces C/Λ, for a lattice Λ, could
be described as a curve. We will now follow the same approach as before, albeit it has
a few extra complications. We need to find two meromorphic functions with the same
poles and try to find an algebraic relation between the two of them. Unfortunately,
modular forms of weight 0 do not exist, as the lack of poles in a compact Riemann
surface implies the function is constant. However, we have the j-invariant, which is
a level 1 weakly holomorphic modular form (meromorphic at the cusps) of weight
0, i.e., a modular function. Since it is invariant under the action of SL2 (Z), it is
certainly invariant under the action of Γ0 (N ). This time the derivative j ′ will not be
invariant under the action of Γ0 (N ) so we need to find a different function.
Define jN (τ ) := j(N τ ). By the comments preceding Definition 3.1, we know jN
is a meromorphic function on X0 (N ), as it is weakly modular of weight 0 with a pole
at the cusps. We have the following theorem from [DS05].
Theorem 3.4. The field of functions of X0 (N ) is C(j, jN ). It is a degree 1 transcendental extension of C. The polynomial ϕN (x, y) such that ϕN (j, jN ) = 0 has integral
coefficients.
60
The polynomial equations for ϕN (x, y) have huge coefficients and are highly
singular. The following method can be applied to find such polynomials. Assume N
is prime, to reduce the necessary computations.
Notice that Γ0 (N ) only has two cusps, ∞ and 0. The cusp ∞ has width 1, as
( 10 11 ) ∈ Γ0 (N ). The matrix ( 10 −1
0 ) brings 0 to ∞ and conjugation by it transforms
Γ0 (N ) into Γ0 (N ) (lower-triangular modulo N ). The stabilizer of ∞ here is generated
N
by ±1
0 ±1 , with index N , hence, the cusp 0 has width N.
Since j is invariant under SL2 (Z), we have that
j
−1
τ
= j(τ ),
so
jN |0 [( 01 −1
0 )](τ )
−1
=j N·
τ
=j
−1
τ /N
= j(τ /N ).
We conclude that
1
1
1
N
)
+ 744 + O(q), j|0 [( 01 −1
+ 744 + O(q) = N + 744 + O(qN
0 )](τ ) =
q
q
qN
1
1
+ 744 + O(qN ),
jN (τ ) = N + 744 + O(q N ), jN |0 [( 01 −1
0 )](τ ) = j(τ /N ) =
q
qN
j(τ ) =
which show that j has a pole of order N at 0 and a simple pole at ∞ and that jN
has a simple pole at 0 and a pole of order N at ∞. (When Γ0 (N ) has more than two
cusps, we need more equations like the ones mentioned above.)
Now, in order to find a relation between j and jN , the goal is to find an algebraic
combination of them such the poles at 0 and ∞ disappear (the same way we did with
℘ and ℘′ ). For a function ϕ ∈ C(X0 (N )) with poles only at 0 and ∞, we denote
by ϕ = O(m, n) the fact that ϕ has a pole of order at most m at ∞ and a pole
b
of order at most n at 0. The function j a jN
has a pole of order a + N b at ∞ and a
b
pole of order N a + b at 0, so in the previous notation, j a jN
= O(a + N b, N a + b).
61
If ϕ = O(m1 , n1 ) and φ = O(m2 , n2 ), then ϕ + φ = O(max{m1 , m2 }, max{n1 , n2 }),
as the pole of larger order prevails. Potentially, if m1 = m2 or n1 = n2 , we will be
able to reduce the order of the pole (if the corresponding leading coefficients add up
to 0). In order to start reducing the order of the pole, we need to find three pairs
(a1 , b1 ), (a2 , b2 ), (a3 , b3 ) such that:
a1 + N b1 = a2 + N b2
N a 1 + b1 = N a 3 + b 3
a1 + N b1 > a3 + N b3
N a1 + b1 > N a2 + b2 ,
b1
coincides with the one of
which will ensure that the order of the pole at ∞ of j a1 jN
b1
b3
b2
coincides with
, and the pole at 0 of j a1 jN
and is greater than the one of j a3 jN
j a2 jN
b2
b3
. With a linear programming
and is greater than the one of j a2 jN
the one of j a3 jN
approach, we add the variables x and y to obtain the system
a1 + N b1 = a2 + N b2
N a 1 + b1 = N a 3 + b3
a1 + N b1 = a3 + N b3 + x
N a1 + b1 = N a2 + b2 + y,
whose solution is given by
a1 = a3 − x/(N 2 − 1)
b1 = b3 + N x/(N 2 − 1)
a2 = a3 − x/(N 2 − 1) − N y/(N 2 − 1)
b2 = b3 + N x/(N 2 − 1) + y/(N 2 − 1).
The integrality conditions imply that both x and y are divisible by N 2 − 1 and
positive. The smallest such solution would have them both be equal to N 2 − 1
62
precisely, so
(a1 , b1 ) = (N, N ),
(a2 , b2 ) = (0, N + 1),
(a3 , b3 ) = (N + 1, 0).
N +1
N
, and we have
This means we should consider the functions j N jN
, j N +1 and jN
N
j N jN
=O(N (N + 1), N (N + 1))
j N +1 =O(N + 1, N (N + 1))
N +1
=O(N (N + 1), N + 1).
jN
b
Notice that the leading term of the functions of the form j a jN
is 1, so we will
never need to divide in the process of finding this polynomial. We illustrate how to
proceed with N = 2, which will yield a relation of degree 4 with coefficients that
already display a very large size.
We start with the function ϕ = j 2 j22 = q −6 + 1488q −5 + O(q −4 ), which is O(6, 6).
The functions j23 and j 3 are O(6, 3) and O(3, 6) respectively, so we update ϕ by
subtracting j23 + j 3 . We are left with ϕ = 1488q −5 + 946569q −4 + O(q −3 ), which is
O(5, 5).
Now, we take the functions jj22 and j 2 j2 , which are O(5, 4) and O(4, 5), respectively, and we update ϕ by subtracting 1488(jj22 + j 2 j2 ). This time we are left with
ϕ = −162000q −4 + 40773375q −3 + O(q −2 ), which is O(4, 4).
We proceed by taking the functions j22 and j 2 , which are O(4, 2) and O(2, 4),
respectively, and we further update ϕ by adding 162000(j22 + j 2 ). We obtain ϕ =
40773375q −3 + 39083391000q −2 + O(q −1 ), which is O(3, 3).
63
At this point, we can consider the function jj2 , which is O(3, 3) as well, and update ϕ by subtracting 40773375jj2 , thus obtaining ϕ = 8748000000q −2 +8748000000q −1 +
O(1), which is O(2, 2).
Finally, we use the functions j2 and j, which are O(2, 1) and O(1, 2), respectively, and after updating ϕ subtracting 8748000000(j2 + j) we obtain ϕ =
−157464000000000 + O(q 43 ), which is O(0, 0). A priori, we expected ϕ to be O(1, 1)
but the order of the pole dropped by 2. Equations (11.14) and (11.15) in [Cox13]
assert that the degree of the polynomial in this case is N + 1 in each of the variables
and monic, so the equation has degree at most 2N . This means that this procedure
must produce the sought polynomial. As we cannot find j a j2b that is O(1, 1), O(1, 0)
or O(0, 1), we actually expect to reduce the order of the pole by a larger amount in
this case. This will happen multiple times when the value of N is larger. The fact
that the current ϕ is O(0, 0) indicates that it has no poles at 0 or ∞, and being an
algebraic combination of j and j2 , it has no poles anywhere on the compact Riemann
surface X0 (2), making it actually constant.
By collecting all the performed updates to ϕ, letting X = j and Y = jN , we
obtain the relation
X 2 Y 2 − X 3 − Y 3 − 1488XY (X + Y ) + 162000(X 2 + Y 2 )−
40773375XY − 8748000000(X + Y ) = −157464000000000.
64
It is customary to write it as
Φ2 (X, Y ) = X 3 + Y 3 − X 2 Y 2 + 1488XY (X + Y ) − 162000(X 2 + Y 2 )
+ 40773375XY + 8748000000(X + Y ) − 157464000000000.
The algorithm, for N prime, goes as follows. The object ϕ = O(m, n) has
attributes ϕ.m and ϕ.n, which denote the order of the pole at ∞ and 0, respectively,
and the attribute ϕ.lead, which denotes the leading coefficient of the q-expansion.
Algorithm 1 Find Modular Polynomial
1: function FindExponents(ϕ, N)
2:
a = ϕ.m mod N
3:
b = ϕ.n/N
\\integer division
4:
return (a, b)
5: procedure ModularPolynomial(N)
N
6:
ϕ = j N jN
\\initialize ϕ at a function O(N (N + 1), N (N + 1))
7:
coeffs = [(N, N, −1)] \\initialize coeffs at an array with triple (N, N, −1)
8:
while ϕ.m 6= 0 and ϕ.n 6= 0 do
9:
a, b = FindExponents(ϕ, N )
10:
append to coeffs triple (b, a, −ϕ.lead)
11:
if a == b then
b
12:
ϕ = ϕ − ϕ.lead · j a jN
13:
else
b
a
14:
ϕ = ϕ − ϕ.lead · (j a jN
+ j b jN
)
15:
append to coeffs triple (0, 0, ϕ.lead)
16:
return coeffs
As we mentioned before, this algorithm could potentially get stuck not being
able to find values of a and b that will continue reducing the order of the poles. For
example, when N = 3 and ϕ = O(6, 6) we obtain (a, b) = (0, 2), which yields j32 and
j 2 , which are O(6, 2) and O(2, 6). After subtracting, we could expect ϕ to be O(5, 5),
which would give us (a, b) = (2, 1), requiring us to subtract a scalar multiple of jj32
65
(and of j 2 j3 ), but this function is O(5, 7) (and O(7, 5)) and this increases the order
of the poles. However, ϕ turns out to be O(4, 4) at this point, with (a, b) = (1, 1) and
jj3 = O(4, 4) as well. The restriction on the degree of the polynomial guarantees
that the procedure will finish.
The equations obtained above describe the curves Y0 (N ). Upon homogenizing,
we obtain equations for the curves X0 (N ). These are highly singular, and the method
is not very practical itself. For a more detailed account of how to find these polynomials (with more sophisticated methods), see [Elk98] or [CL05]. Nonetheless, there
are ways of finding smooth models. We will not delve into how to accomplish this.
There is another way of thinking about these curves, which is the so-called modular
interpretation. We follow [DS05] for this.
Let E be an elliptic curve over C and let S be a cyclic subgroup of order N . We
know that E is isomorphic to some Eτ ′ by Proposition 3.2, and the N -torsion of Eτ ′ is
′
τ′
1
1
τ 1
= Z+ Z ⊇ Λτ ′ , which
given by a quotient of the super-lattice Λτ ′ =
,
N
N N
N
N
is isomorphic to Z/N Z × Z/N Z, as predicted by Theorem 2.9 and the comments
following it. In order to describe S, we need to find an element of order N in
′
τ 1 ∼
,
S ֒→
= Z/N Z × Z/N Z, which will be the generator.
N N
Denote the image of such generator by (c, d), and let g = gcd(c, d, N ). Adding
(c, d) to itself N/g times yields (c/g, d/g) = (0, 0) in Z/N Z × Z/N Z, so g = 1. Hence,
there exist a, b, k ∈ Z such that ad − bc − kN = 1. Take α = ( ac db ), which reduces
modulo N to a matrix in SL2 (Z/N Z), and hence lifts to a matrix in SL2 (Z), so we
can assume without loss of generality that α ∈ SL2 (Z). If τ = α(τ ′ ), we have that
66
Eτ ′ ∼
= Eτ and the image of (c, d) under this isomorphism is (0, 1), so the pair (E, S)
is isomorphic to the pair (Eτ , h1/N i) for some τ ∈ H.
Now, if τ, τ ′ ∈ H lie on the same coset of Γ0 (N ), τ = α(τ ′ ) keeping the notation
of the previous paragraph, with α ∈ Γ0 (N ). Eτ and Eτ ′ are isomorphic and the
isomorphism maps 1/N in Eτ to (cτ ′ + d)/N in Eτ ′ , where N | c, so it is the same as
d/N. Since ad ≡ 1 (mod N ), gcd(N, d) = 1 and 1/N ∈ hd/N i inside the N -torsion
of Eτ ′ , so (Eτ , h1/N i) ∼
= (Eτ ′ , h1/N i).
On the other hand, if τ, τ ′ ∈ H are such that (Eτ , h1/N i) ∼
= (Eτ ′ , h1/N i). The
isomorphism of elliptic curves implies the existence of α ∈ SL2 (Z) as in the previous
paragraphs. It maps 1/N ∈ Eτ to (cτ ′ + d)/N ∈ Eτ ′ . Since the isomorphism maps
h1/N i ⊆ Eτ to h1/N i ⊆ Eτ ′ , h1/N i = h(cτ ′ + d)/N i, implying N | c, so α ∈ Γ0 (N ).
These paragraphs show that Y0 (N ) can be seen as the moduli space corresponding to the data pairs of elliptic curves over C together with a cyclic subgroup of order
N . This construction is actually functorial, so in general, we can drop the requirement that the elliptic curve be defined over C (as long as N is invertible, which
for us will always be the case, given that we are working on number fields). For a
number field K, the K-points of Y0 (N ) can be identified with isomorphism classes of
pairs (E, S) where E is an elliptic curve defined over K and S is a cyclic subgroup
of E[N ]. For more details, see [?].
3.3
Hecke operators
For the vector space Mk (Γ1 (N )), we can define a commuting family of opera-
tors, called Hecke operators, which encode arithmetic properties. There are several
equivalent ways of defining them, but we will limit ourselves to the more down to
67
earth definition, i.e., the one acting on q-expansions. For the equivalent definitions,
see [DS05].
Let p be a prime, and let βj =
1 j
0 p
, for j ∈ {0, 1, . . . , p − 1}, β∞ =
p 0
0 1
and
m n
γ = (N
p ), where m, n are integers such that mp − nN = 1. Let f ∈ Mk (Γ1 (N )).
Define
Tp f =
p−1
X
f |k [βj ] + f |k [γβ∞ ]
p∤N
j=0
p−1
X
f |k [βj ]
p | N.
j=0
It can be seen that Tp f ∈ Mk (Γ1 (N )), and the operator descends to an operator on
P
Sk (Γ1 (N )). Moreover, if f = n≥0 an q n ∈ Mk (N, χ), then Tp f ∈ Mk (N, χ) and
Tp f =
X
anp q n + χ(p)pk−1
X
an q np
n≥0
n≥0
and, naturally, the operator still descends to cusp forms.
Define T1 as the identity map, and for r ≥ 2, define
Tpr = Tp Tpr−1 − pk−1 χ(p)Tpr−2 .
Simple computations show that Tp Tq = Tq Tp where p and q are primes, so, it makes
sense to define
Tn =
Y
Tpei i ,
where n =
Y
pei i
The commutativity of the operators indexed by primes implies the commutativity of
the operators for every index.
For an integer d, we also define the diamond operator hdi, which acts by slashing
a modular form by any matrix in Γ0 (N ) whose lower-right entry is congruent to d
68
modulo N. When gcd(N, d) > 1, there is no such matrix, so hdi := 0. Notice that for
all integers d1 and d2 we have that hd1 d2 i = hd1 ihd2 i.
For a congruence subgroup Γ ∈ SL2 (Z) we can define the Petersson inner product on the space Sk (Γ),
h·, ·iΓ : Sk (Γ) × Sk (Γ) −→ C
(f, g)
1
7 → hf, giΓ =
−
VΓ
Z
f (τ )g(τ )(ℑ(τ ))k−2 dxdy,
X(Γ)
where X(Γ) := Γ\H∗ (so the integral runs over any lift of X(Γ) on H∗ ), τ = x + iy
Z
ℑ(τ )−2 dxdy. (See [DS05] for details.)
and VΓ =
X(Γ)
Specializing to Γ = Γ1 (N ) we obtain that for p ∤ N the diamond operators and
the Hecke operators satisfy the relations
hpi∗ = hpi−1
and
Tp∗ = hpi−1 Tp ,
where T ∗ is the adjoint operator of T (i.e., the unique operator such that hT f, giΓ =
hf, T ∗ giΓ ). These relations make such diamond operators and Hecke operators normal on Sk (Γ1 (N )) (and on Sk (Γ0 (N ))). A family of commuting normal operators on
a finite-dimensional inner product space is simultaneously diagonalizable. If we have
two vectors, say f and g, that have at least one different eigenvalue, say T f = λf
and T g = µg, where T is one member of the family, we have that
λhf, gi = hλf, gi = hT f, gi = hf, T ∗ gi = hf, µgi = µhf, gi,
69
so f and g are orthogonal. For subspaces generated by eigenvectors that share all
eigenvalues at each T , we can orthogonalize them and thus we obtain an orthogonal
basis of simultaneous eigenvectors. We refer to these as eigenforms.
Definition 3.1 showed us how to obtain modular forms of higher level coming
from lower levels. It turns out that the spaces Sk (Γ0 (N ))old and Sk (Γ1 (N ))old are stable under the action of the Hecke operators and the diamond operators. The spaces
Sk (Γ0 (N ))new and Sk (Γ0 (N ))new are defined as the orthogonal complement with respect to the Petersson inner product of Sk (Γ0 (N ))old and Sk (Γ1 (N ))old in Sk (Γ0 (N ))
and Sk (Γ1 (N )), respectively. Furthermore, both spaces are also stable under the action of the Hecke operators and the diamond operators, so all of the spaces mentioned
above have an orthogonal basis of eigenforms for the Hecke operators (and diamond
operators for Γ1 (N )) away from the level, i.e., for the set {Tn , hni : gcd(n, N ) = 1}.
P
If f ∈ Sk (Γ1 (N )) has a q-expansion n≥1 an q n such that an = 0 for all n with
gcd(n, N ) = 1, it turns out that f ∈ Sk (Γ1 (N ))old . This is not trivial and it is
a result due to Atkin and Lehner. Its proof can be found in several sources, for
example [AL70] or [DS05]. If f ∈ Sk (Γ1 (N ))new , the panorama is quite amicable.
The following is Theorem 5.8.2 in [DS05].
Theorem 3.5. Let f ∈ Sk (Γ1 (N ))new be an eigenform for all the Hecke operators
and diamond operators away from the level. Then
a) f is an eigenform for all Hecke operators and diamond operators.
b) If g ∈ Sk (Γ1 (N ))new is another eigenform for all Hecke operators and all diamond
operators away from the level whose corresponding eigenvalues (away from the
level) match those of f , then f and g are scalar multiples of each other.
70
Such an eigenform can be normalized in such a way that the coefficient of q in its
q-expansion is equal to 1, and is referred to as a newform. Part b) of Theorem 3.5
is referred to as the Multiplicity One theorem, which is inherent of newforms. Also,
being normalized this way, it is easy to compute that Tn f = an f, so the coefficients
of the q-expansion are precisely the eigenvalues of the Hecke operators. To conclude
this section, we mention the existence of some important involutions. For each
prime q dividing N , let α be the positive integer such that pα | N but pα+1 ∤ N. Let
α
x, y, z, w be any integers satisfying q α xw − (N/q α )yx = 1 and let Wq = qN zx qαyw .
Slashing by the Wq induces an involution on Sk (Γ0 (N )) and it does not depend
on the values x, y, z, w. These are known as the Atkin-Lehner involutions. Their
product becomes equivalent to slashing by the matrix ( N0
−1
0 ),
which induces an
involution on Sk (Γ1 (N )) (which is stable on the eigenspaces). Each newform has an
associated number wf corresponding to whether or not the involution flips the sign.
This involution is referred to as the Fricke involution. See [AL70] or [Cre92].
3.4
L-functions associated to Modular Forms
Let τ = it, where t is a positive real number (so τ ∈ H). The Mellin transform
of q n = e2πiτ n can be computed as
Z
∞
2πiτ n s−1
e
t
dt =
Z
∞
e−2πtn ts−1 dt = (2π)−s Γ(s)n−s .
0
0
This means that if f (τ ) =
Z
∞
P
n≥1
an q n is a cusp form, its Mellin transform will be
f (it)ts−1 dt = (2π)−s Γ(s)
0
∞
X
n=1
71
an n−s .
For f ∈ Sk (Γ1 (N )) as above, define the functions
L(f, s) =
∞
X
an n−s
and
Λ(f, s) = (2π)−s N s/2 Γ(s)L(f, s),
n=1
which are referred to as the L-function and the completed L-function. The growth
condition at the cusps puts a bound of the growth of the coefficients, which makes
the integral converge for ℜ(s) > 1 + k/2 (and the series defining the L-function). By
carefully splitting the integral in the Mellin transform and exploiting the modularity
of f , the integral can be shown to have an analytic continuation to the whole complex
plane, and hence, so does Λ(f, s). Since (2π)s , N −s/2 and 1/Γ(s) are entire, we can
affirm the same about L(f, s). Furthermore, for a newform we find the functional
equation
Λ(f, k − s) = (−1)k/2 wf Λ(f, s).
The number w = (−1)k/2 wf is referred to as the sign of the functional equation
associated to f . This equation naturally translates into an equation with the Lfunction without completing it, but it’s much less clean.
When w = −1 and s = k/2 we can see that Λ(f, s) = −Λ(f, s), so we conclude
that Λ(f, s) = 0, which also implies that L(f, s) = 0, (since the extra factors don’t
vanish when ℜ(s) > 0).
For the L-function of a newform in Sk (N, χ) we have an Euler product given by
L(f, s) =
∞
X
n=1
an n−s =
Y
p∈P
(1 − ap p−s + χ(p)pk−1−2s )−1
Y
Y
= (1 − ap p−s + χ(p)pk−1−2s )−1 (1 − ap p−s )−1 ,
p∤N
p|N
72
which follows from the relations between the Tn , the fact that an is the eigenvalue of
Tn and the absolute convergence for ℜ(s) > 1 + k/2.
3.5
Modularity
Let f ∈ S2 (Γ0 (N )) be a newform. The curve X0 (N ) is a Riemann surface of
genus g, which is a g-holed torus with H1 (X0 (N ), Z) = Z2g . If we choose a cusp as
the base point of the Riemann surface, every loop γ will lift to a path on H∗ , which
allows us to define the map
h·, f i : H1 (X0 (N ), Z) −→ C
7−→ hγ, f i =
γ
Z
2πif (τ )dτ,
γ
where the convergence of the integral is ensured by the fact that f is a cusp form.
If f has rational coefficients, the image of this map is a rank 2 lattice in C, denoted
by Λf . Then, the elliptic curve isomorphic to C/Λf , Ef , is known to be defined over
Q, has conductor N and its Hasse-Weil L-function coincides with the L-function
attached to f . This is due to Eichler and Shimura; see [Kna92] for theoretical details
and [Cre92] for computational details.
Slightly more generally, choose a basis {γ1 , . . . , γ2g } of the Z-homology of X0 (N )
and basis {f1 , . . . , fg } of eigenforms (extending a basis of newforms). Let ωj =
2πifj (τ )dτ and let Ω(X0 (N )) = hω1 , . . . , ωg i be the space of holomorphic differentials
in X0 (N ). Let
Ωj =
Z
γj
ω1 , . . . ,
Z
γj
ωg
!
∈C
g
and
Λ=
2g
M
j=1
73
Ωj Z ⊆ C g .
This lattice is discrete and Λ ⊗ R = Cg . The Jacobian of X0 (N ), denoted J(X0 (N )),
is isomorphic to Cg /Λ. If τ ∈ H∗ , the map
H∗ −→
τ 7−→
Cg
Z
τ
ω1 , . . . ,
Z
τ
i∞
i∞
ωg
−→
7−→
Z
τ
i∞
J(X0 (N ))
Z τ
ωg mod Λ
ω1 , . . . ,
i∞
is independent of the path and descends to a map from the quotient by Γ0 (N ) of H,
so we obtain a map X0 (N ) −→ J(X0 (N )) from the modular curve into its Jacobian.
This is the so-called Abel-Jacobi map.
The elliptic curve Ef is a quotient of the Jacobian of X0 (N ) (after projecting
on the appropriate component, and rescaling by the Manin constant, denoted by c)
so we further obtain a map
ϕ : X0 (N ) −→ Ef .
This map can be made very explicit using the q-expansion of f . For τ ∈ H∗ , let
zτ = c
Z
τ
i∞
2πif (z)dz = c
Z
τ
2πi
i∞
∞
X
n
an q dz = c
Z qX
∞
0
n=1
an q
n−1
dq = c
n=1
∞
X
an
n=1
n
qn,
after performing the substitution q = e2πiz , dq = 2πiqdz. zτ depends on τ modulo
Γ0 (N ) as mentioned above. Using the Weierstrass ℘-function we obtain
ϕ : X0 (N ) −→
τ
Ef
7−→ (℘(zτ ), ℘′ (zτ ), 1).
This map is referred to as the modular parametrization of Ef . Despite its apparent
analytic description, this is a map of algebraic curves defined over Q.
74
Conversely, for an elliptic curve E defined over Q of conductor N , there exists
a weight 2 newform f of level N , whose L-function coincides with the Hasse-Weil
L-function for which Ef is isogenous to E. The map described above composed with
this isogeny equips us with a map
X0 (N )
_ Y S
k e
M&&
// Ef
// E.
(3.2)
This was first known as the Shimura-Taniyama conjecture. Later on, Weil’s name
was added. It was first partially proved by Wiles in the early 1990s for semi-stable
elliptic curves in [Wil95] and his worked was extended to all elliptic curves over Q by
Breuil, Conrad, Diamond and Taylor in [BCDT01]. It is referred to as the modularity
theorem.
75
Chapter 4
Heegner and Stark-Heegner points; the classical case
As we mentioned in the introduction, finding algebraic points on elliptic curves
is not as easy as finding them on conics. Currently, the best methods that produce
algebraic points rely on the existence of algebraic points on the modular curve and,
under suitable conditions, the modular parametrization often yields non trivial points
on the elliptic curve. These points on the modular curve are called Heegner points,
and, by extension, the resulting points on the elliptic curve inherit the same name.
Class Field Theory together with the theory of Complex Multiplication for elliptic curves gives an important algebraicity result, which is key for the Heegner
construction.
4.1
Complex Multiplication and Class Field Theory
This section gives a quick survey of the results we need. For a more detailed
explanation, see [Cox13] and [Sil94] as well as the references provided there.
In section 2.3.2 we defined the endomorphism ring End(E) for an elliptic curve
E together with an embedding of Z into it. In most cases, this embedding is in fact
an isomorphism. When this embedding is not surjective we say that E has complex
multiplication, and often we just say that E has CM. The origin of this term is that
when an elliptic curve has CM, its endomorphism ring can be embedded into the
ring of integers of an imaginary quadratic extension of Q.
76
Theorem 4.1. Let E be an elliptic curve over a field of characteristic 0. Then
either End(E) = Z or there exist an imaginary quadratic extension K and a positive
integer c such that End(E) = Z + cOK , where OK denotes the ring of integers of K.
The latter kind of subrings of K are called orders. The integer c is called the
conductor of the order. The set O = Z + cOK forms a subring of K and we refer to
it as the order of conductor c in K.
For a given fractional O-ideal a, we can consider the subset of K of elements α
with the property that αa ⊆ a. Since a is a fractional O-ideal, O is clearly contained
in this subset, and it is closed under multiplication and addition, inheriting a ring
structure. This makes it into an order in K as well. We say that a is a proper
O-ideal if this order is still O and not a larger order. Denote by I(O) the set of
proper O-ideals, which can be endowed with an abelian group structure. Every
principal O-ideal is proper, so the set of principal O-ideals, denoted by P (O), forms
a subgroup of I(O). Furthermore, if we only take ideals which can be generated
by an element α ∈ K such that under every real embedding σ : K −→ R we
have σ(α) > 0, we obtain a subgroup of P (O), denoted P + (O). For K imaginary
quadratic, this requirement is automatic, so P + (O) = P (O). If K is real quadratic
and its fundamental unit is of norm −1, every principal ideal can be generated by a
totally positive element and in this case P + (O) = P (O) as well. If the fundamental
unit has norm 1, then [P (O) : P + (O)] = 2.
The class group and the narrow class group of O are defined as
Cl(O) := I(O)/P (O)
and
77
Cl+ (O) := I(O)/P + (O),
respectively. If O = OK , the class group measures the failure of the ring OK being a
principal ideal domain. The number of elements of Cl(OK ) is called the class number
of K and the number of elements of Cl+ (OK ) is called the narrow class number of
K. As mentioned before, in some instances the narrow class group coincides with
the class group, but sometimes it will result in a finer invariant.
We have the following theorem:
Theorem 4.2. Let D be a non-square discriminant and let
h√ i
Z D
D ≡ 0 (mod 4)
2
O=
h √ i
Z 1+ D
D ≡ 1 (mod 4).
2
The groups Cl+ (O) and Cl(D) are isomorphic.
To put these class groups in a more suitable language for class field theory, we
need a few definitions. For an order O ⊆ K of conductor c, let IK (c) be the subgroup
of fractional ideals in K generated by integral ideals which are prime to c, i.e., ideals
a such that a + cOK = OK . (Equivalently, the norm of the ideal is relatively prime
to c.) Let PK,Z (c) be the subgroup of IK (c) generated by principal ideals αOK where
α ∈ OK is such that α − a ∈ cOK for some integer a relatively prime to c. Finally, let
PK,1 (c) be the subgroup of PK,Z (c) generated by principal ideals as above, in which
a = 1.
Fix a number field K. Let c be a finite formal product of places in K (possibly
empty) such that the real infinite places appear at most once and the complex infinite
places do not appear. Such c is referred to as a modulus. The ray class field of K
modulo c is the unique field extension Kc/K with the following properties:
78
• It is abelian
• It is unramified outside of c
• Its conductor divides c
• It is maximal with respect to the three properties above.
A modulus c can be written as c0 c∞ , where c0 and c∞ are comprised of the finite and
infinite places, respectively. As before, we define IK (c) as the subgroup of fractional
ideals in K generated by ideals which are prime to c0 . In contrast with PK,1 (c), we
define PK,1 (c) as the subgroup of IK (c) generated by principal ideals αOK where
α ∈ OK is such that α − 1 ∈ c0 and σ(α) > 0 for every σ dividing c∞ .
Let H be a subgroup of IK (c) that contains PK,1 (c). Let L be the fixed field of
Kc by H/PK,1 (c). By Galois theory, we have the following diagram
Kc
IK /PK,1 (c)
H/PK,1 (c)
L = Kc
IK (c)/H
K
which yields a field extension whose Galois group is IK (c)/H.
Let K be a quadratic field and O = Z + cOK be the order of conductor c. Let c
be the modulus corresponding to c (i.e., c looks the same as the factorization of (c)
into primes ideals in K) and c+ the modulus corresponding to c and the infinite real
places. Let Hc and Hc+ be the fixed fields of Kc and Kc+ by PK,Z (c) and PK,Z (c+ ),
respectively. They are referred to as the ring class field and narrow ring class field
79
of K of conductor c. The Galois groups of these extensions are isomorphic to Cl(O)
and Cl+ (O), respectively.
When O = OK , c = 1 is an empty product and c+ is just the infinite places.
In this case, K1 is referred to as the Hilbert class field of K and K1+ as the Hilbert
narrow class field of K.
Now, the most important result of this section:
Theorem 4.3. Let E be an elliptic curve with complex multiplication by the ring
O ⊆ K of conductor c, where K is an imaginary quadratic field. Then j(E) is
an algebraic integer. Furthermore, j(E) ∈ Hc, where Hc is the ring class field of
conductor c, and [Q(j(E)) : Q] = [K(j(E)) : K] = [Hc : K].
When E is defined over C, we know that it is isomorphic to Eτ for some τ ∈ H,
and all endomorphisms of Eτ are given by a complex number α such that αΛτ ⊆ Λτ ,
by doing z 7→ αz in C/Λτ . For this to happen, we need the equations
α · τ = aτ + b
α · 1 = cτ + d
to hold for some a, b, c, d ∈ Z. If the endomorphism is not the trivial endomorphism, α 6= 0 and we can divide these two equations to find that τ is fixed by the
Möbius transformation induced by the matrix ( ac db ), which yields an equation in τ.
Upon solving it, we find that τ is the solution of a quadratic equation with integral
coefficients (and, a fortiori, so is α).
Conversely, if τ ∈ H ∩ K is a quadratic number, we can see that Eτ has complex
multiplication by the order comprised of numbers of the form cτ + d as above. The
80
minimal equation of τ is of the form Aτ 2 + Bτ + C = 0 where A, B, C are relatively
prime. Every quadratic equation with integer coefficients where τ vanishes is a scalar
multiple of this one so we have the polynomial relation
cX 2 + (d − a)X − b = AkX 2 + BkX + Ck,
whence c = Ak, d − a = Bk and b = −Ck. This shows that Eτ has complex
multiplication by the ring Z[Aτ ] ⊆ K. We can readily see that we can determine
exactly in what field j(τ ) lies for quadratic numbers.
Corollary 4.4. Let τ ∈ H ∩ K be a quadratic number and let A be the leading
coefficient of the minimal polynomial of τ . Let c be the conductor of the order O =
Z[Aτ ] ⊆ K. Then Eτ has complex multiplication by O and the special value j(τ ) is
an algebraic integer of degree [Hc : K] which lies in the ring class field Hc.
4.2
Heegner points
If τ and N τ are quadratic numbers in H such that the elliptic curves Eτ and EN τ
have complex multiplication by the same order O ⊆ K of conductor c, the previous
section implies that their j-invariants will lie in the same algebraic extension of Kc/Q,
where Kc is the ray class field of conductor c. Then, Equation (3.2) ensures that the
point (j(τ ), j(N τ )) of X0 (N ) is mapped to a point defined over the same field, thus
obtaining a point on an elliptic curve of conductor N defined over this ray class field.
In order to find a τ as above, let us analyze the minimal equations of τ and
N τ. Let D be a negative integer congruent to 1 or 0 modulo 4 and let a, b, c be
relatively prime integers such that b2 − 4ac = D. Let N τ ∈ H be the solution of the
quadratic equation ax2 +bx +c = 0 in the upper half-plane. Let g = gcd(aN 2 , bN, c),
81
so gcd(aN 2 /g, bN/g, c/g) = 1 and
(aN 2 /g)τ 2 + (bN/g)τ + (c/g) = (a(N τ )2 + b(N τ ) + c)/g = 0,
making (aN 2 /g)x2 + (bN/g)x + (c/g) = 0 the minimal equation of τ . By the first
part of Corollary 4.4, we find that
End(Eτ ) = Z[aN 2 τ /g]
and
End(EN τ ) = Z[aN τ ].
For these two orders to be equal, we must have g = N, and hence, the minimal
polynomial of τ becomes aN x2 + bx + C, where C = c/N and N | c. Its discriminant
is b2 − 4aN C = b2 − 4ac = D so the two binary quadratic forms
ax2 + bxy + cy 2
and
aN x2 + bxy + Cy 2
are in the same class group. Hence, we need to find a binary quadratic form F (x, y)
with discriminant D whose leading term is divisible by N and take τ ∈ H such that
F (τ, 1) = 0.
Let us flip things around and start with a discriminant D together with a binary
quadratic form Ax2 + Bxy + Cy 2 of discriminant D and let us try to find en element
in the same equivalence class in Cl(D) whose x2 coefficient is divisible by N . This
is, we need a matrix ( ac db ) ∈ SL2 (Z) such that
a b
2
2 2
2
2
Ax + Bxy + Cy = (Aa + Bac + Cc )x +
c d
(2Aab + B(ad + bc) + 2Ccd)xy + (Ab2 + Bbd + Cd2 )y 2
82
yields an x2 coefficient divisible by N . This boils down to finding a non-trivial
solution to the original quadratic form congruent to 0 modulo N .
By the Chinese remainder theorem, we need to find non-trivial solutions modulo
each of the prime powers dividing N . Let p | N be odd. If p ∤ A, we have
p | Aa2 + Bac + Cc2 ⇐⇒ p | 4A2 a2 + 4ABac + 4ACc2 = (2Aa + Bc)2 − c2 D.
Then p ∤ c because otherwise we’d conclude that p | a, so we can assume WLOG that
c = 1 (modulo p). This implies that we have a solution modulo p if and only if D is
√
a square modulo p, so p splits (or ramifies) in K = Q( D). If p | A then D ≡ B 2
(mod p) so p still splits (or ramifies) in K.
Let us look at a couple of examples:
- Let D = −11 and E = 15a1 (following Cremona labels). We have that the
√
conductor of E is N = 15 and the primes 3 and 5 split in K = Q( −11). The
cardinality of Cl(−11) is 1 so we only have the identity quadratic form, namely
x2 + xy + 3y 2 . The pair (a, c) = (3, 1) vanishes at this quadratic form modulo
15. We need to find a pair (b, d) such that ad − bc = 1, which can easily be
achieved by (2, 1). Then
3 2 2
2
2
2
x + xy + 3y = 15x + 23xy + 9y ,
1 1
√
−23 + −11
which vanishes at (τ, 1) with τ =
. The minimal polynomials of τ
30
and 15τ are 15x2 +23x+9 and x2 +23x+135, respectively, so both Eτ and E15τ
83
have CM by Z[15τ ] = OK , the maximal order in K. The modular parametriza√
√
3 − −11 −3 − 3 −11
tion yields the point
. It can be readily verified
,
2
2
that it satisfies the equation y 2 + xy + y = x3 + x2 − 10x − 10 corresponding
to E. Since the class number is 1, we expected this point to belong to E(K),
as it happened.
- Let D = −15 and E = 17a1. The conductor of E is N = 17, which splits in
√
K = Q( −15). The cardinality of Cl(−15) is 2 so we have two quadratic forms
this time, namely, x2 + xy + 4y 2 and 2x2 + xy + 2y 2 . The pairs (a1 , c1 ) = (11, 0)
and (a2 , c2 ) = (14, 1) vanish at these quadratics modulo 17, respectively. We
can complete them with the pairs (b1 , d1 ) = (10, 1) and (b2 , d2 ) = (13, 1), thus
obtaining
11 10 2
2
2
2
x + xy + 4y = 136x + 249xy + 114y
1 1
14 13
2
2
2
2
2x + xy + 2y = 408x + 759xy + 353y ,
1 1
√
−249 + −15
=
272
which vanish at (τ1 , 1) and (τ2 , 1), respectively, with τ1
√
−759 + −15
. Since these denominators are rather large, many
and τ2 =
816
more coefficients of the L-series are needed in order to compute the modular
parametrization, as the convergence is much slower. τ1 yields, approximately,
the point
(−0.381966 + 1.73205i, 0.427051 − 3.7948939i)
84
and τ2
(−2.6180339 − 1.73205i, −2.927051 + 5.7313855i)
which add up to
(1.0000 + 3.8729833i, −4.500000 + 6.809475i)
The point associated to τ1 can be recognized as
!
√
√
√
√
3 √
5 5 11 −3 3 5
−15
− + −3 +
,− −
+
+
2
2
4
4
4
4
and the one associated to τ2 as
!
√
√
√
√
3 √
5 5 11 −3 3 5
−15
,− +
−
+
− − −3 −
2
2
4
4
4
4
√
√
√
√
which can be verified to lie on E(Q( −3, 5)) as expected (Q( −3, 5)
√
being the Hilbert class field of Q( −15)) and they are conjugates (under
√
√
√
√
√
√
σ ∈ Gal Q( −3, 5)/K given by σ( −3) = − −3 and σ( 5) = − 5).
Their sum is
1+
√
√
−9 + 3 −15
−15,
2
√
which lies on E(Q( −15)), and coincides with the approximated sum obtained
before.
√
- Let D = −164 and E = 11a1. The conductor N = 11 splits in K = Q( −41)
and the cardinality of Cl(D) is 8. For each of the eight reduced binary quadratic
forms of discriminant −41 found in section 2.2.2 we can find a representative
in their class whose x2 coefficient is divisible by 11, compute its fixed point and
push it to the elliptic curve via the modular parametrization. Upon adding
85
these eight points we obtain the point
√
√
−32 + 6 −41 192 − 36 −41
,
25
125
- If in the last example we consider the curve E = 37a1, which has rank 1, more
things can be said. The conductor N = 37 splits over K again so we can find
representatives for each of the classes whose x2 term is divisible by 37, yielding
eight points on E(L), where L is the splitting field of the polynomial
x8 + 10x7 + 36x6 + 8x5 − 47x4 − 54x3 − 10x2 − 12x + 4,
where all the x coordinates of the Heegner points obtained vanish. The y
coordinates all vanish at
x8 + 24x7 + 243x6 + 524x5 + 95x4 − 324x3 + 149x2 − 24x − 32,
which splits in L. The Hilbert class field of K is defined by the polynomial
x8 − 3x7 + x6 + 4x5 − 4x4 + 4x3 + x2 − 3x + 1,
which also splits in L, so the Hilbert class field is a quadratic extension of L.
(Actually, the compositum of K and L.) After adding all of the points, which
are conjugates in L, we obtain the point (0, 0), which, needless to say, belongs
to E(Q).
In the light of the last example, we have the following theorem, due to GrossZagier. See [GZ86].
86
Theorem 4.5. Let E/Q be an elliptic curve of conductor N and K be an imaginary
quadratic field such that every prime dividing N splits in K. Let h be the class
number of K. Assume that the L-series of E has a zero of order 1 at s = 1. Then
the sum of the h Heegner points is a point of infinite order in E(Q).
4.3
Stark-Heegner points
The construction of Heegner points relies heavily on the modular parametriza-
tion and the algebraicity of the j values at imaginary quadratic numbers. A natural
question to ask is if there is an analogous process that yields algebraic points on
elliptic curves by starting with real quadratic numbers. At this point, the classical
modular parametrization seems to be of no use, since j cannot be evaluated at real
numbers, so the analogy will have to rely on slightly different techniques.
Let K be a quadratic extension of Q. Demanding that K be imaginary is equivalent to saying that the place ∞ does not split in K. If we are dealing with a real
quadratic extension of Q (which will be our case), we need to find a different place
to play the role of ∞. We choose a prime p dividing the conductor N of the elliptic
curve E such that p is inert in K. This prime, naturally, does not always exist, but
if we restrict ourselves to the case where N and the discriminant of K are relatively
prime and the sign of the functional equation of L(E/K, s) is −1, we can guarantee
its existence. (See Theorem 3.17 in [Dar04].) From now on, write N = pM , with
gcd(p, M ) = 1 and p inert in K.
Define the p-adic upper half-plane Hp to be P1 (Cp ) − P1 (Qp ). In opposition with
its archimedean counterpart, the p-adic upper half-plane does not split in a natural
way into two disjoint components and there is no canonical choice, so we just keep
87
all of them. The topology comes from affinoids and annuli, which are inverse images
of special subsets of P1 (F̄p ) under the reduction map red : P1 (Cp ) −→ P1 (F̄p ). If Γ is
a discrete subgroup of SL2 (Qp ) the quotient Γ\Hp is equipped with the structure of
a rigid analytic curve over Qp . For a quick survey, see section 5.1 on [Dar04].
Previously, we had the group Γ0 (N ) acting discretely on H. Let Γp,M be the
subgroup of SL2 (Z[1/p]) of matrices which are upper triangular modulo M. The
action of Γp,M is not discrete on either Hp or H, but it turns out to be discrete on
the product Hp × H. We will be dealing with Γp,M instead of the usual Γ0 (N ).
4.3.1
p-adic measures and p-adic line integrals
This section largely follows sections 5.2 and 5.3 of [Dar04]. The boundary of Hp
is P1 (Qp ), which is endowed with its natural p-adic topology generated by the open
balls
B(a, r) = {t such that |t − a| < p−r }
B(∞, r) = {t such that |t| > pr },
for every a ∈ Qp . Any compact open subset of P1 (Qp ) is a finite disjoint union of open
balls and any finite disjoint union of open balls is compact and open. By assigning
values in Cp in a coherent way to open balls we obtain a function µ which assigns a
value in Cp to every compact open subset of P1 (Qp ) and is finitely additive. If we
further require that µ(P1 (Qp )) be 0, we refer to µ as a p-adic distribution on P1 (Qp ).
Additionally, if the p-adic distribution is bounded (i.e., the set of values of |µ(U )|p
is bounded in the usual sense) we say that µ is a measure on P1 (Qp ).
88
For a p-adic measure µ, we can define a Cp -linear operator that assigns values
in Cp to continuous Cp -valued functions on P1 (Qp ) as follows:
Z
λ(t)dµ(t) = lim
P1 (Qp )
X
λ(tα )µ(Uα ),
(4.1)
α
where {Uα }α is a disjoint cover of P1 (Qp ) by compact open subsets, the limit is
taken over increasingly finer such covers and tα ∈ Uα is any sample point. The
finite-additivity and boundedness of µ are the necessary ingredients to show this is
a well defined operator. If λ is the characteristic function of a compact open set U ,
we write
µ(U ) = µ(λ) =
Z
λ(t)dµ(t) =
P1 (Qp )
Z
dµ(t).
U
For a discrete subgroup Γ of SL2 (Qp ) we say that µ is Γ-invariant if µ(γU ) = µ(U )
for all γ ∈ Γ and U open compact subset of P1 (Qp ).
For a Γ-invariant measure µ we define
fµ : Hp −→Cp
Z
z 7−→
P1 (Qp )
1
z−t
dµ(t).
Section 5.2 of [Dar04] shows that fµ is rigid analytic and it is Γ-invariant of
weight 2 as per Definition 5.5 in [Dar04]. It is a theorem of Schneider and Teitelbaum
(Theorem 5.9 in [Dar04]) that the map µ 7→ fµ is an isomorphism from the set of Γinvariant p-adic measures on P1 (Qp ) to the set of weight 2 Γ-invariant rigid analytic
functions on Hp .
89
The function
log(1 − z) =
∞
X
zn
n=1
n
has as domain the open disc in Cp of radius 1 centered at 1. For small real numbers x
and y, we know that log((1 − x)(1 − y)) = log(1 − x) + log(1 − y) from the properties
of the natural logarithm, so we have an equality of formal power series which lets
us conclude the same is true for z in said open disc. Choosing π ∈ C×
p such that
|π|p < 1 and setting log(π) = 0, we can extend log to C×
p by demanding that log
be a homomorphism from C×
p to Cp . This is referred to as choosing a branch of the
logarithm.
For a rational differential f (z)dz on P1 (Cp ), we can assign a formal antiderivative
F (z) = R(z) +
t
X
j=1
λj log(z − zj ),
where R(z) is a rational function, the λj ’s are some constants in Cp , and the sum is
taken over all the poles of f (z)dz. This antiderivative is unique up to an additive
constant. In the case of f (z) = 1/(z − t) one simply has F (z) = log(z − t).
For a Γ-invariant rigid analytic function f on Hp we have an associated measure
µf (given by Schneider and Teitelbaum’s isomorphism; see sketch of proof of Theorem
5.9 in [Dar04] for details). Define the line integral on Hp from τ1 to τ2 to be
Z
τ2
τ1
f (z)dz =
Z
log
P1 (Qp )
90
t − τ2
t − τ1
dµf (t),
which can be motivated by the formal computation
Z
log
P1 (Qp )
t − τ2
t − τ1
dµf (t) =
Z
P1 (Qp )
Z
log
Z
τ2 − t
τ1 − t
τ
dµf (t)
dz 2
=
dµf (t)
z − t τ1
P1 (Qp )
Z τ2 Z
1
dµf (t)dz
=
τ1
P1 (Qp ) z − t
Z τ2
=
f (z)dz.
τ1
Note that this line integral has the expected property
Z
τ2
f (z)dz +
τ1
Z
τ3
f (z)dz =
τ2
Z
τ3
τ1
f (z)dz for all τ1 , τ2 , τ3 ∈ Hp ,
which follows from linearity of the integral operator and the fact that the logarithm
is a homomorphism.
4.3.2
Modular Forms on Γp,M
In order to extend the notion of modular forms, we need to mention the BruhatTits tree of GL2 (Qp ), which we denote by T . The set of vertices, denoted T0 , is
comprised of Zp lattices Λ ⊆ Q2p such that Λ ⊗ Qp = Q2p , where we identify two
such lattices Λ1 and Λ2 if there exists α ∈ Qp such that αΛ1 = Λ2 . Two vertices, v1
and v2 , are connected if there exist representatives Λ1 and Λ2 of v1 and v2 such that
pΛ1 ⊂ Λ2 ⊂ Λ1 , where the both inclusions are proper. Note that this implies that
pΛ2 ⊂ pΛ1 ⊂ Λ2 , so the graph is indeed undirected. The set of unordered edges is
denoted T1 . An ordered edge e is an ordered pair of adjacent vertices (v1 , v2 ). The
source of e, denoted s(e) is v1 and the target, denoted t(e) is v2 . Let E(T ) denote the
set of ordered edges of T . For each oriented edge e, let ē denote the edge obtained by
91
interchanging the source and the target of e. The tree T is regular of degree p+1. The
class of Z2p is denoted v0 and gives us a distinguished vertex of T which lets us see it
as a rooted tree. The p + 1 adjacent vertices of v0 are index p sublattices of Z2p , which
can be put in bijection with P1 (Fp ) via k 7→ {(a, b) ∈ Z2p such that ak + b ∈ pZp }
(for k = ∞ this amounts to pZp × Zp ). The edges connecting v0 to each of its
neighbors are labeled e0 , e1 , . . . , ep−1 , e∞ . The action of GL2 (Qp ) on T induces a
graph automorphism.
Let Γ = Γp,M as above. A cusp form of weight 2 for Γ is a function
f : E(T ) × H −→ C
satisfying the following three properties:
a b
(1) f (γe, γτ ) = (cτ + d)2 f (e, τ ) for all γ =
∈ Γ.
c d
(2) For each vertex v ∈ T0 we have
X
f (e, τ ) = 0,
s(e)=v
and for each ordered edge e ∈ E(T ) we have f (ē, τ ) = −f (e, τ ).
(3) For each fixed oriented edge e ∈ E(T ) the function f (e, τ ), denoted fe (τ ), is a
weight 2 cusp form in the usual sense for the stabilizer of e in Γ. This group is
denoted Γe .
The space of weight 2 cusp forms for Γ is denoted by S2 (T , Γ). The action of Γ on T
has only two orbits in T0 and one orbit in T1 . The stabilizer of e0 is Γ0 (N ) and the
stabilizer of v0 is Γ0 (M ). The properties spelled out above imply that f is determined
92
by f0 := fe0 , which is a weight 2 cusp form for Γ0 (N ). We have the following very
important lemma, which is Lemma 9.2 in [Dar04].
Lemma 4.6. The map
S2 (T , Γ) −→S2 (Γ0 (N ))
f
7−→
f0
is injective. Furthermore, the image is the p-new part of S2 (Γ0 (N )).
4.3.3
Measures associated to an Elliptic Curve and the double integrals
In particular, Lemma 4.6 implies that to an elliptic curve E of conductor N
there is an associated weight 2 cusp form f ∈ S2 (T , Γ) such that f0 is the usual
normalized weight 2 eigenform associated to E.
Every edge e ∈ T1 has an associated annulus in Hp , whose closure in P1 (Cp ) can
be intersected with P1 (Qp ) to give a compact open subset of P1 (Qp ) denoted by Ue
(see Theorem 5.9 in [Dar04]). For x, y ∈ P1 (Q) we define the complex distribution
µ̃f {x → y} by
µ̃f {x → y}(Ue ) = c · 2πi
Z
y
fe (z)dz,
x
where c is the Manin constant associated to E. These values can all be expressed
in terms of the periods of f0 , which lie on a lattice ΛE ⊆ C. This lattice can be
generated by a real period Ω+ and a purely imaginary period Ω− (or contained with
index 2 in such a lattice, so Ω+ and Ω− are half-periods). We can write
c · 2πi
Z
x
y
−
fe (z)dz = κ+
f {x → y}(e) · Ω+ + κf {x → y}(e) · Ω− ,
93
where κ±
f {x → y} takes on integral values when fed an oriented edge e ∈ E(T ). Upon
a choice of sign w∞ = ±1, we obtained an integral measure
∞
µf {x → y}(Ue ) = κw
f {x → y}(e).
Since this measure takes on integral values, it can be seen as a p-adic measure
on P1 (Qp ). Using the line integral defined in the previous section and this p-adic
distribution we define the double integral attached to f from τ1 to τ2 , in Hp , and
from x to y, in P1 (Q), as
Z
τ2
τ1
Z
y
ωf =
Z
log
P1 (Qp )
x
t − τ2
t − τ1
dµf {x → y}(t),
which is a well defined number in Cp . In virtue of the integrality of the values of
the measure, we define its multiplicative counterpart by formally exponentiating,
ultimately resulting in a finer invariant (which can be recovered by taking logarithms,
and the logarithm is not an injective function).
Definition 4.7. For τ1 and τ2 in Hp and x and y in P1 (Q), the multiplicative double
integral attached to f is defined as
Z τ2 Z
×
τ1
x
y
Z
ωf = ×
P1 (Qp )
t − τ2
t − τ1
dµf {x → y}(t) = lim
Y t − τ2 µf {x→y}(Uα )
α
t − τ1
,
where {Uα }α is a disjoint cover of P1 (Qp ) by compact open subsets, the limit is taken
over increasingly finer such covers and tα ∈ Uα is any sample point.
From the additivity of the p-adic line integral, we can expect a multiplicativity property in the outer integral. From the additivity of the system of measures, which comes from complex line integrals, we can expect a multiplicativity
94
property in the inner integral.
From the Γ-invariance property in S2 (T , Γ), as
f (γe, γz) = (cz + d)2 f (e, z) can be rephrased as f (γe, γz)d(γz) = f (e, z)dz, we can
expect a Γ-invariance property. Likewise, the additive double integral has analogous
properties. More explicitly, we have the following lemma:
Lemma 4.8. For all τ1 , τ2 , τ3 ∈ Hp , x, y, z ∈ P1 (Q) and γ ∈ Γ, the double integrals
satisfy
(1)
Z τ3 Z
×
τ1
y
x
Z τ2 Z
ωf = ×
τ1
y
x
Z τ3 Z
ωf ·×
τ2
y
ωf
and
x
Z
Z
τ3
τ1
y
ωf =
x
Z
Z
τ2
τ1
y
x
Z
ωf +
τ3
τ2
Z
y
ωf
x
(2)
Z τ2 Z
×
τ1
(3)
x
z
Z τ2 Z
ωf = ×
τ1
x
Z γτ2 Z
×
γτ1
y
γy
γx
Z τ2 Z
ωf ·×
τ1
z
ωf
y
Z τ2 Z
ωf = ×
τ1
and
Z
τ2
τ1
y
ωf
and
x
Z
Z
z
ωf =
x
γτ2
γτ1
Z
Z
τ2
τ1
Z
γy
γx
ωf =
x
Z
τ2
τ1
y
Z
ωf +
τ2
τ1
Z
Z
z
ωf
y
y
ωf
x
Let Qp2 be the quadratic unramified extension of Qp and O its ring of integers.
The reduction map of the beginning of the section can be restricted to P1 (Qp2 ) and
its image falls in P1 (Fp2 ). Let Hp0 = {τ ∈ P1 (Qp2 ) such that red(τ ) 6∈ P1 (Fp )}. When
τ1 , τ2 ∈ Hp0 , they must have valuation 0. If t ∈ Qp also has valuation 0, (t − τ1 ) and
(t − τ2 ) do as well (otherwise, red(τ ) ∈ P1 (Fp )), so the quotient (t − τ2 )/(t − τ1 ) does
too. If t has positive valuation, clearly numerator and denominator have valuation
0, so the quotient has valuation 0 again. If t has negative valuation, each term has
the same valuation and hence the quotient has valuation 0. This implies that all the
terms in the Riemann product associated to the multiplicative double integral when
95
τ1 and τ1 are in Hp0 have valuation 0, hence, lie in O× . Hence, we can conclude that
Z τ2 Z
×
y
ωf ∈ O× .
x
τ1
Moreover, to compute it to an accuracy of p−α it suffices to compute
Z τ2 Z
×
Z
τ2
τ1
ωf
(mod p)
ωf
(mod pα )
x
τ1
and
y
Z
y
x
This is Lemma 1.5 in [DP06]. [DP06] also shows how to compute these integrals
in the case of M = 1. The procedure described there consists of splitting the integral
at ∞, using the additivity, and computing each integral separately. This is,
Z
τ2
τ1
Z
x
y
ωf =
Z
τ2
τ1
Z
y
∞
ωf −
Z
τ2
τ1
Z
x
ωf ,
∞
reducing this to the computation of integrals of the form
Z
τ2
τ1
Z
x
ωf ,
∞
where x ∈ Q. The sequence of convergents of x obtained from its continued fraction
expansion yields rational numbers pj /qj such that the matrix
j−1
(−1) pj pj−1
Mj =
(−1)j−1 qj qj−1
96
is in SL2 (Z), where p−1 = 1 and q−1 = 0. Hence,
Z
τ2
τ1
Z
x
∞
ωf =
k Z
X
j=0
τ2
τ1
Z
pj /qj
ωf =
pj−1 /qj−1
k Z
X
τ2
τ1
j=0
Z
Mj (∞)
ωf =
Mj (0)
k Z
X
j=0
Mj−1 τ2
Mj−1 τ1
Z
∞
ωf ,
0
reducing the problem to the computation of integrals of the form
Z
τ2
τ1
Z
∞
ωf
0
for any τ1 , τ2 ∈ Hp0 , as Hp0 is stable under the action of SL2 (Z). It might be necessary
to compute more integrals if the level is not p, as not every Mj will lie in Γ. In
this case, right coset representatives of Γ in SL2 (Z[1/p]) can be chosen and the
computation is reduced to computing finitely many integrals instead of just one.
Define
J∞ (τ1 , τ2 ) =
Z
log
P1 (Qp )−Zp
and
Ja (τ1 , τ2 ) =
Z
a+pZp
so
Z
τ2
τ1
Z
0
∞
log
t − τ2
t − τ1
t − τ2
t − τ1
ωf = J∞ (τ1 , τ2 ) +
dµf {0 → ∞}(t)
dµf {0 → ∞}(t),
p−1
X
Ja (τ1 , τ2 ).
a=0
Using the power series of the logarithm, each of these integrals can be easily
expressed as the difference of a power series in τ1 and a power series in τ2 (centered
at a, or at 0 in the case of ∞) whose coefficients are the moments of the measure
µf {0 → ∞} (divided by n, where n is the exponent of τ1 or τ2 ). These can be
effectively computed via the so-called overconvergent modular symbols. See the end
97
of Section 1.3 in [DP06] for details on how to obtain these power series and Chapter
2 for details about the overconvergent modular symbols.
4.3.4
Tate’s uniformization
This subsection mainly follows [Sil94], chapters 1 and 5.
Weierstass uniformization (Equation (3.1)) assigns to a complex number z a
point in the elliptic curve E(C) via the quotient C/Λ, where Λ is a rank two Zmodule. In the p-adic realm, rank two Z-modules are dense, so we do not have the
same lattices. However, if we exponentiate first, we can salvage the situation. In
the complex case, the map C −→ C× given by exponentiation (after multiplying
by 2πi) has Z as kernel. Let q = e2πiτ and consider the lattice Λτ ⊆ C. Denote
by q Z the multiplicative group generated by q. Now, the composition of the exponential map and projecting into q Z has as kernel all complex numbers z such that
e2πiz = q n = e2πinτ for some n ∈ Z. This is, z − nτ ∈ Z, implying z ∈ Λτ , so the
composition factors through C/Λτ . Since the composition is clearly surjective, the
resulting map is actually a group isomorphism between C/Λτ and C× /q Z . We can
exploit the isomorphism given by the Weierstrass uniformization to obtain an isomorphism between C× /q Z and Eτ . To find out what the map from C× /q Z to Eτ (C)
looks like, we need to understand what ℘ and ℘′ look like in terms of u = e2πiz and
q instead of z and τ. It turns out that
X
X
1
qnu
qn
1
℘
(z)
=
+
−
2
τ
(2πi)2
(1 − q n u)2 12
(1 − q n )2
n≥1
n∈Z
X q n u(1 + q n u)
1
′
℘
(z)
=
,
n u)3
(2πi)3 τ
(1
−
q
n∈Z
98
which is Theorem 6.2, Chapter 1 on [Sil94]. This yields an explicit isomorphism from
C× /q Z to Eτ (C) given by
u 7→
(2πi)2
X
n∈Z
X
qnu
1
qn
+
−
2
(1 − q n u)2 12
(1 − q n )2
n≥1
!
, (2πi)3
X q n u(1 + q n u)
n∈Z
(1 − q n u)3
!
,1 ,
which is transcendental. Recall that Eτ is given by
y 2 = 4x3 − g2 (τ )x − g3 (τ ),
(4.2)
where
g2 (τ ) = (2πi)4
g3 (τ ) = (2πi)6
where sk (q) =
X
X
1
+ 20
σ3 (n)q n
12
n≥1
!
7X
−1
+
σ5 (n)q n
216 3 n≥1
=
!
=
(2πi)4
1 + 240s3 (q)
12
(2πi)6
− 1 + 504s5 (q) ,
216
σk (n)q n . Let X = x/(2πi)2 and Y = y/(2πi)3 . Dividing Equation
n≥1
(4.2) by (2πi)6 we obtain
Y 2 = 4X 3 −
1 + 240s3 (q)
−1 + 504s5 (q)
X−
.
12
216
Let x′ and y ′ be such that Y = 2y ′ +x′ and X = x′ +
1
. Substituting and simplifying,
12
we find that
y ′2 + y ′ x′ = x′3 − 5s3 (q)x′ −
99
5s3 (q) + 7s5 (q)
.
12
Solving for x′ and y ′ we obtain
x′ = X −
y′ =
X
X
qn
qnu
1
−
2
=
12 n∈Z (1 − q n u)2
(1 − q n )2
n≥1
X (q n u)2
X
Y −X
qn
1
+
.
+
=
2
24 n∈Z (1 − q n u)3 n≥1 (1 − q n )2
Denote x′ and y ′ by Xq (u) and Yq (u), respectively. Let
a4 (q) = −5s3 (q)
a6 (q) = −
and
5s3 (q) + 7s5 (q)
,
12
and let
Eq : y 2 + xy = x3 + a4 (q)x + a6 (q).
Notice that 5m3 + 7m5 ≡ 0 (mod 12) for all integers m, as from m3 ≡ m (mod 3) we
obtain 5m3 + 7m5 ≡ −m + m (mod 3), for even m clearly 5m3 + 7m5 ≡ 0 (mod 12)
and for odd m, from m2 ≡ 1 (mod 4) we obtain 5m3 +7m5 ≡ 5m+7m ≡ 0 (mod 4).
This implies that the coefficients of a6 (q) are all integers, whence the coefficients of
Eq are in Z[[q]]. Eq is referred to as the Tate curve; its discriminant and j-invariant
are given by
∆(q) = q
Y
n≥1
(1 − q n )24
and j(Eq ) = j(t) =
1
+ 744 + 196884q + O(q 2 ),
q
as expected. To summarize, by following the diagram
exp
mod q Z
// C× /q Z
j55 u
j
jj u
mod Λτ
j j uu
j
j
zzu
jj
// Eq (C)
// Eτ (C)
C/Λτ
C
//
C×
100
we have obtained the so-called Tate parametrization, given by
∼
C× /q Z −→ Eq (C)
u 7−→ (Xq (u), Yq (u), 1).
This construction generalizes to other complete fields. This theorem is due to
Tate.
Theorem 4.9 (Tate). Let K be a p-adic field and let q ∈ K × with |q| < 1.
(a) The series defining a4 (q) and a6 (q) converge in K.
(b) The Tate curve Eq is an elliptic curve defined over K with discriminant and
j-invariant as above.
(c) Eq (K̄) is parametrized via
∼
K̄ × /q n −→ Eq (K̄)
u 7−→ (Xq (u), Yq (u), 1).
(d) If L is an algebraic extension of K then the parametrization above restricts to
L× in the natural way, i.e.,
∼
L× /q n −→ Eq (L)
u 7−→ (Xq (u), Yq (u), 1).
Notice that |j(Eq )| > 1, as the leading term is 1/q and q < 1 is necessary to
ensure convergence of the series defining Eq . We have this other theorem of Tate
Theorem 4.10 (Tate). Let K be a p-adic field and let E/K be an elliptic curve with
|j(E)| > 1. Then there is a unique q ∈ K × with |q| < 1 such that E is isomorphic
101
to Eq over K̄. Moreover, this isomorphism is over K if and only if E has split
multiplicative reduction.
4.3.5
The Stark-Heegner point
This section follows Sections 9.4-9.6 of [Dar04].
For E and K as above this section, let q be Tate’s p-adic period attached to E.
Consider the integral
Z γ1 τ Z
κτ,x (γ1 , γ2 ) = ×
τ
γ 1 γ2 x
ωf
(mod q Z ),
γ1 x
Z
×
Z
seen as a 2-cochain in C 2 (Γ, C×
p /q ), where the action of Γ on Cp /q is trivial.
Applying the d operator (mapping 2-cochains to 3-cochains) we obtain
dκτ,x (γ1 , γ2 , γ3 ) = γ1 κτ (γ2 , γ3 ) ÷ κτ (γ1 γ2 , γ3 ) · κτ (γ1 , γ2 γ3 ) ÷ κτ (γ1 , γ2 )
Z γ1Zτ γ1 γ2 x
Z γ2Zτ γ2 γ3 x
Z γ1Zτ γ1 γ2 γ3 x
Z γ1 γ2Zτ γ1 γ2 γ3 x
=×
ωf
ωf ÷ ×
ωf · ×
ωf ÷ ×
τ
γ1 x
τ
γ1 x
τ
γ1 γ2 x
τ
γ2 x
Z γ1 τZ γ1 γ2 γ3 x
Z γ1 τZ γ1 x
Z τ Z γ 1 γ 2 γ3 x
Z γ2 τZ γ2 γ3 x
ωf
ωf · ×
ωf · ×
ωf · ×
=×
γ1 x
τ
γ1 γ 2 x
τ
γ 1 γ 2 τ γ 1 γ2 x
γ2 x
τ
Z γ1 τZ γ1 γ2 γ3 x
Z τ Z γ 1 γ 2 γ3 x
Z γ2 τZ γ2 γ3 x
ωf
ωf · ×
ωf · ×
=×
τ
γ1 γ2 x
γ 1 γ2 τ γ1 γ 2 x
τ
γ2 x
Z γ2 τZ γ2 γ3 x
Z γ 1 τ Z γ 1 γ2 γ 3 x
=×
ωf · ×
ωf
τ
γ 1 γ2 τ γ 1 γ 2 x
γ2 x
Z γ2 τZ γ2 γ3 x
Z τ Z γ2 γ3 x
=×
ωf = 1,
ωf · ×
τ
γ2 τ
γ2 x
γ2 x
showing that in fact, κτ is a 2-cocycle. A priori, it would seem like this cocyle
depends on the base point x, and of τ. Define the 1-cochain
Z γτ Z
ρx,y (γ) = ×
τ
γy
γx
ωf
Z
(mod q Z ) ∈ C 1 (Γ, C×
p /q ).
102
Applying the d operator (mapping 1-cochains to 2-cochains) we obtain
dρx,y (γ1 , γ2 ) = γ1 ρx,y (γ2 ) ÷ ρx,y (γ1 γ2 ) · ρx,y (γ1 )
Z γ1 τ Z γ 1 y
Z γ1 γ 2 τ Z γ 1 γ 2 y
Z γ2 τ Z γ2 y
ωf · ×
ωf ÷ ×
=×
ωf
τ
γ2 x
τ
γ1 x
τ
γ1 γ 2 x
Z γ1 τ Z γ 1 y
Z τ Z γ1 γ2 y
Z γ 1 γ 2 τ Z γ1 γ 2 y
ωf
ωf · ×
ωf · ×
=×
τ
γ1 γ 2 τ γ 1 γ2 x
γ1 γ2 x
γ1 τ
γ1 x
Z τ Z γ1 γ2 y
Z γ 1 τ Z γ1 y
=×
ωf ,
ωf · ×
γ1 τ
τ
γ 1 γ2 x
γ1 x
so
Z γ 1 τ Z γ 1 γ2 y
Z γ 1 τ Z γ1 γ 2 x
ωf
ωf ÷ ×
κτ,x (γ1 , γ2 ) ÷ κτ,y (γ1 , γ2 ) = ×
τ
γ1 y
τ
γ1 x
Z γ1 τ Z γ1 y
Z γ 1 τ Z γ 1 γ2 y
=×
ωf ÷ ×
ωf
τ
τ
γ1 x
γ1 γ 2 x
= dρx,y (γ1 , γ2 ),
implying that the class of κτ,x in H 2 (Γ, C× /q Z ) does not depend on the choice of
base point, as they differ by a coboundary. Furthermore, if we define the 1-cochain
Z τ2 Z
ρτ1 ,τ2 (γ) = ×
τ1
x
γx
(mod q Z ) ∈ C 1 (Γ, C× /q Z ),
we have
dρτ1 ,τ2 (γ1 , γ2 ) = γ1 ρτ1 ,τ2 (γ2 ) ÷ ρτ1 ,τ2 (γ1 γ2 ) · ρτ1 ,τ2 (γ1 )
Z τ2 Z x
Z τ2 Z x
Z τ2 Z x
=×
ωf ÷ ×
ωf · ×
ωf
τ1
γ2 x
τ1
γ1 γ2 x
τ1
γ1 x
Z τ2 Z x
Z τ2 Z γ1 γ2 x
Z τ2 Z x
=×
ωf · ×
ωf · ×
ωf
τ1
γ1 x
τ1
x
τ1
γ2 x
Z τ2 Z x
Z τ2 Z γ1 γ2 x
ωf ,
ωf · ×
=×
τ1
τ1
γ1 x
103
γ2 x
so
Z γ1 τ2 Z γ1 γ2 x
Z γ1 τ1 Z γ1 γ2 x
ωf
ωf ÷ ×
κτ1 (γ1 , γ2 ) ÷ κτ2 (γ1 , γ2 ) = ×
γ1 x
τ2
γ1 x
τ1
Z γ1 τ2 Z γ1 γ2 x
Z τ2 Z γ1 γ2 x
ωf
ωf ÷ ×
=×
γ1 τ1
γ1 x
τ1
γ1 x
Z τ2 Z γ2 x
Z τ2 Z γ1 γ2 x
ωf ,
ωf ÷ ×
=×
τ1
γ1 x
τ1
x
whence, the class of κτ in H 2 (Γ, C× /q Z ) does not depend on τ either.
Z
Conjecture 4.11. The class of κτ is trivial in H 2 (Γ, C×
p /q ).
In [Dar01] it is shown that Conjecture 4.11 is a refinement of the Exceptional
Zero Conjecture (in [MTT86]), which was proven by Greenberg and Stevens in
[GS93]. Nevertheless, the vanishing of this cocycle is still conjectural, and corresponds to Conjecture 9.10 in [Dar04]. Henceforth, assume that Conjecture 4.11
holds.
−1
−1
Define κ#
τ (γ1 , γ2 ) = κτ (γ2 , γ1 ), a related 2-cocycle. It is clear that the class of
2
×
Z
κ#
τ in H (Γ, Cp /q ) is trivial if and only if that of κτ is, so Conjecture 4.11 implies
that the class of κ#
τ is trivial.
Denote by M the group of functions
m : P1 (Q) × P1 (Q) −→
(x, y)
7−→
Z
C×
p /q
m{x → y}
such that m{x → y} · m{y → z} = m{x → z} and m{y → x} = m{x → y}−1 .
These are the so-called modular symbols. Fix a cusp x and denote by M0 the group
104
Z
arising from restriction of modular symbols to Γx × Γx. Let F = F(C×
p /q ) be the
Z
group of C×
p /q -valued functions on Γx.
Both M0 and F have a natural left-action of Γ, given by
(γm){y → z} = m{γ −1 y → γ −1 z}
and
(γg)(y) = g(γ −1 y).
Define maps
Z
C×
p /q
ι
//
F
∆
//
M0
by
ι(zp ) = (y 7→ zp )
∆(g)(y, z) = g(y) ÷ g(z).
and
We can readily check that both, ∆ and ι, commute with the action of Γ and that
∆(g) is indeed a modular symbol. ι is clearly an injection. For any m ∈ M0 , define
gm (y) = m{y → x}. We have that
∆(gm )(y, z) = m{y → x} ÷ m{z → x} = m{y → x} · m{x → z} = m{y → z},
which shows that ∆ is surjective. Clearly
∀y, z, ∆(g)(y, z) = 1 ⇐⇒ ∀x, y, g(y) = g(z) ⇐⇒ ∃zp : g = ι(zp ),
so ker(∆) = im(ι) and we have a short exact sequence
0
// C× /q Z
p
//
F
105
//
M0
//
0.
This sequence induces a long exact sequence of cohomology groups, and, in particular,
we have the connecting homomorphism
Z
δ : H 1 (Γ, M0 ) −→ H 2 (Γ, C×
p /q ).
Lemma 4.8 shows that the map cτ from Γ to M0 defined by
Z γτ Z
cτ (γ){y → z} = ×
τ
z
ωf
y
is indeed a modular symbol. The d operator applied to cτ yields the 2-cochain
dcτ (γ1 , γ2 ) = γ1 cτ (γ2 ) ÷ cτ (γ1 γ2 ) · cτ (γ1 )
Z γ 1 γ2 τ Z z
Z γ2 τ Z γ1−1 z
Z γ1 τ Z z
ωf
ωf ÷ ×
ωf · ×
=×
y
τ
γ1−1 y
τ
y
τ
Z γ1 γ2 τ Z z
Z γ 1 γ2 τ Z z
Z γ1 τ Z z
ωf
ωf ÷ ×
ωf · ×
=×
y
τ
y
γ1 τ
y
τ
Z γ1 γ2 τ Z z
Z γ1 γ2 τ Z z
ωf = 1
ωf ÷ ×
=×
τ
τ
y
y
implying that cτ is a 1-cocycle, and as such, it sits naturally in H 1 (Γ, M0 ). The
slightly cumbersome computation of δ applied to the class of cτ turns out to be κ#
τ
(see the comments preceding Conjecture 9.14 in [Dar04]). Since the class of κ#
τ is
Z
trivial in H 2 (Γ, C×
p /q ), it is not entirely unreasonable to expect the class of cτ in
H 1 (Γ, M0 ) to be trivial as well. We have the following strengthening of Conjecture
4.11.
Conjecture 4.12. The class of cτ is trivial in H 1 (Γ, M0 ).
106
Conjecture 4.12 implies that cτ is a coboundary, so, there exists a 0-cochain (i.e.,
an element of M0 ) η̃τ such that cτ = dη̃τ , i.e.,
Z γτ Z
×
τ
y
z
ωf = cτ (γ){y → z} = η̃τ {γ −1 y → γ −1 z} ÷ η̃τ {y → z}.
(4.3)
The 0-cochain η̃τ can be multiplied by any 0-cocycle without affecting the property
mentioned in the equation above. The set Z 0 (Γ, M0 ) of 0-cocycles is precisely the
set of modular symbols in M0 which are invariant under the action of Γ, which we
denote MΓ0 . Consider the group homomorphism
h : MΓ0 −→
Z
Hom(Γ, C×
p /q )
m 7−→ (γ 7→ m{x → γx}).
Verifying that h(m) is indeed a homomorphism amounts to show that
h(m)(γ1 γ2 ) = m{x → γ1 γ2 x} = m{x → γ1 x} · m{γ1 x → γ1 γ2 x}
= m{x → γ1 x} · m{x → γ2 x} = h(m)(γ1 ) · h(m)(γ2 ).
The restriction of a modular symbol m to Γx×Γx is completely determined by h(m),
as for all y, z ∈ Γx there exist γ1 , γ2 ∈ Γ such that y = γ1 x and z = γ2 x, and
m{y → z} = m{γ1 x → γ2 x} = m{x → γ2 x} ÷ m{x → γ1 x},
whence h is an injection.
107
Notice that h(m) does not depend on the cusp x. If we choose any other cusp
y ∈ P1 (Q), we have that
m{y → γy} = m{y → x} · m{x → γx} · m{γx → γy}
= m{y → x} · m{x → γx} · m{x → y}
= m{y → x} · m{x → γx} ÷ m{y → x} = m{x → γx}.
Since the target of h(m) is abelian, h(m) vanishes at every commutator. Furthermore, if γ is any element of Γ such that γy = y for some cusp y ∈ P1 (Q), we have
that h(m)(γ) = m{y → γy} = m{y → y} = 1, so h(m) also vanishes at every such
γ as well.
Let Γ′ ⊆ Γ be the normal closure of the subset comprised of all the commutators
and the elements that fix cusps, as above. Then, Γ′ ⊆ ker(h(m)), so we have the
following commutative diagram
Γ
πΓ
Γ/Γ′
h(m)
// C× /q Z
p
w;;
w
w
w h′ (m)
w
where h(m) = h′ (m) ◦ πΓ .
Lemma 4.13. The map
h′ : MΓ0 −→
Z
Hom(Γ/Γ′ , C×
p /q )
m 7−→ (γΓ′ 7→ m{x → γx})
108
is a monomorphism.
Proof. See the comments leading to this statement.
Lemma 4.14. The group Γ′ is of finite index in Γ.
Proof. See Lemma 3.5 in [Dar01], Theorem 2 in [Men67] or Theorem 3 in
[Ser70].
Let eΓ be the exponent of Γ/Γ′ , which is a finite number by Lemma 4.14. Then,
Lemma 4.13 implies that every modular symbol in MΓ0 is annihilated when raised to
the eΓ -th power, whence the modular symbol η̃τeΓ does not depend on the choice of η̃τ
satisfying Equation (4.3). Putting all of the above together, we obtain the following
map, whose existence is conjectural due to the assumption of Conjecture 4.12.
Conjecture 4.15. There exists a unique function
Hp (Cp ) × Γx × Γx −→
7−→
(τ, r, s)
Cp /q Z
Z τZ s
eΓ ωf ,
×
r
such that, for all τ1 , τ2 ∈ Hp (Cp ), r, s, t ∈ Γx and γ ∈ Γ, we have
(1)
Z τZ t
Z τZ t
Z τZ s
eΓ ωf
eΓ ωf = ×
eΓ ωf × ×
×
(2)
Z τ2 Z s eΓ
Z τ1 Z s
Z τ2 Z s
ωf
eΓ ωf = ×
eΓ ωf ÷ ×
×
τ1
r
r
(3)
r
s
r
Z γτ Z
×
γs
γr
Z τZ s
eΓ ωf
eΓ ωf = ×
r
This map is referred to as the semi-indefinite integral.
109
r
√
Let K = Q( D) (where D > 0 and D is 0 or 1 modulo 4) and let τ ∈ Hp ∩ K.
Let M0 (M )[1/p] be the ring of 2 × 2 matrices with entries in Z[1/p] which are upper
triangular modulo M . Note that Γ = {γ ∈ M0 (M )[1/p] : det(γ) = 1}. Let Oτ ⊆
M0 (M )[1/p] be the order of matrices which leave τ invariant under the usual action.
For this order to be non-trivial, we need every prime dividing M to split or ramify
in K. Since N and the discriminant of K are relatively prime, we need every prime
factor of M to split in K. Every matrix in Oτ has as eigenvector the column vector
(τ, 1) and it can be identified with its eigenvalue, giving an isomorphism to a Z[1/p]order in K, playing the role of CM. The units of norm one of Oτ form the stabilizer
of τ in Γ. The set of units of a Z[1/p]-order in K has rank one, so we can find a
×
generator, γτ of Oτ,1
/h±1i.
For each τ we can define Pτ ∈ E(Qp2 ) as follows: The semi-indefinite integral
Z τZ
×
γτ r
eΓ ωf
r
depends only on τ , as
Z τZ
×
r
γτ r
Z τZ
eΓ ωf ÷ ×
s
γτ s
Z τZ
Z τZ s
eΓ ωf ÷ ×
eΓ ωf = ×
r
γτ s
eΓ ωf
γτ r
Z γτ−1 τ Z s
Z τZ s
eΓ ωf = 1,
eΓ ωf ÷ ×
=×
r
r
Z
since γτ−1 τ = τ. The image of the semi-indefinite integral is Q×
p2 /q , so it can be
mapped to a point Pτ ∈ E(Qp2 ) via Tate’s uniformization.
Conjecture 4.16 (Darmon). Let τ and Oτ as above. Let H + denote the narrow
ring class field of K attached to Oτ . Then Pτ ∈ E(H + ).
110
This conjecture has been tested numerically in many cases, but current tools are
still unable to provide theoretical reasons besides the analogies followed by Darmon to
formulate it. Many examples and experimental verifications can be found in [DP06]
and [GM15].
4.3.6
Computational remarks
As with the double integral, this computation can be reduced to computing
integrals of the form
Z τZ
×
∞
nωf
0
when M = 1 (and finitely many ones, coming from the right cosets of Γ in SL2 (Z[1/p])).
Again, for M = 1, the computation of the semi-indefinite integral can be expressed
in terms of double integrals by
Z τZ
×
0
∞
Z τ −1 Z ∞
Z −1/τ Z −1
Z τZ ∞
Z τZ 1
nωf
nωf × ×
nωf = ×
nωf × ×
nωf = ×
0
∞
1
0
Z τ −1 Z ∞
Z 1−1/τ Z 0
Z τ −1 Z ∞
nωf .
nωf = ×
nωf × ×
=×
∞
0
1−1/τ
0
In the case of M > 1, Guitart and Masdeu in [GM15] developed a different
technique for the computation of some of these integrals. It will be explained with
some detail in the next chapter.
4.4
Heegner points attached to Cartan Non-Split curves
This section is entirely based on [KP14]. It deals with an extension of the Heeg-
ner hypothesis to cases where there are no Heegner points on X0 (N ). The original
setting, as described in section 4.2, is to take E an elliptic curve of conductor N and
K a quadratic imaginary field such that all primes dividing N split in K. When N
111
is square-free, this implies that the sign of the functional equation of L(E/K, s) is
−1, but when N is not square-free, this implication is not quite correct.
Consider an elliptic curve E of conductor p2 (p odd), where p is inert in K. The
sign of L(E/K, s) is still −1 but the curve X0 (p2 ) will not have Heegner points, so we
cannot hope to push them through the classical modular parametrization to obtain
points in E. However, there is a modular curve, the so-called Cartan Non-split curve,
where we might find Heegner points.
4.4.1
Cartan Non-split curves
Let ε be a quadratic nonresidue modulo p. The Cartan non-split open modular
ε
curve of level p associated to ε, denoted Yns
(p), has the following modular interpre-
tation: it classifies elliptic curves together with an Fp -linear endomorphism of the p
torsion whose square is multiplication by ε. Two such pairs (E, φ) and (E ′ , φ′ ) are
said to be equivalent if there exists an isogeny ψ : E −→ E ′ such that the diagram
E[p]
φ
//
E[p]
ψ
ψ
E ′ [p]
φ′
//
E ′ [p]
+
commutes. The normalizer of the Cartan Non-split of level p, denoted Yns
(p) clas-
sifies the same pairs, but the diagram can either commute or anticommute (i.e.,
ψφ = ±φ′ ψ as opposed to ψφ = φ′ ψ). The compactifications obtained by adding
ε
+
the cusps are denoted by Xns
(p) and Xns
(p), respectively. An alternate moduli
interpretation can be found in [RW14].
ε
The map (E, φ) 7→ (E, −φ) is an involution on Xns
(p) whose fixed points can
+
be identified with Xns
(p).
112
For computations, it is useful to have the complex model of the curve. Let
ε
Mns
(p) be the ring of 2 × 2 matrices with integer entries such that
x y a b
(mod p).
≡
bε a
z w
+
Its normalizer, denoted by Mns
(p), also contains the 2 × 2 matrices such that
b
x y a
(mod p).
≡
−bε −a
z w
ε
+
The groups Γεns (p) and Γ+
ns (p) are the elements in Mns (p) and Mns (p) of deter-
minant 1, respectively. Both groups contain Γ(p), so they are level p congruence
subgroups. The normalizer of Γεns (p) in SL2 (Z) is precisely Γ+
ns (p), justifying the
terminology.
The complex points of the Cartan Non-split and its normalizer can be identified
with quotients of the complex upper half-plane,
ε
(p)(C) = Γεns (p)\H
Yns
and
+
Yns
(p)(C) = Γ+
ns (p)\H,
as follows:
For τ ∈ H associate the pair (Eτ , φτ ), where Eτ is the usual elliptic curve
associated to τ ∈ H and φτ is the Fp -endomorphism of Eτ [p] whose matrix in the
basis {1/p, τ /p} is ( 0ε 10 ) .
Notice that (Eτ , φτ ) ∼ (Eτ ′ , φτ ′ ) if and only if there exists a matrix ( ac db ) ∈
SL2 (Z) such that the induced isomorphism ψ : Eτ −→ Eτ ′ when restricted to the
p-torsion satisfies φτ ′ ψ = ψφτ (resp. ±ψφτ if we are considering the normalizer).
113
Since
(aτ ′ + b)ε
τε
1
=ψ
=
ψ φτ
p
p
p
′
1
cτ + dε
c + dτ ′ ε
φτ ′ ψ
= φτ ′
=
,
p
p
p
ǫ
+
the two points will be equivalent in Yns
(p)(C) (resp. Yns
(p)(C)) if and only if
d≡
a (mod p) and c ≡
bε
(resp. or d ≡ −a (mod p) and c ≡ −bε
(mod p)
(mod p)),
which shows ( ac db ) ∈ Γεns (p) (resp. Γ+
ns (p)), as expected.
The existence of a pair of the form (Eτ , φτ ) for any (E/C, φ) is a bit more subtle.
The details can be found in [KP14].
For a = (r, s) ∈ Q2 and τ ∈ H, let
fa (τ ) =
g2 (τ )g3 (τ )
℘τ (rτ + s),
∆(τ )
which satisfies the transformation property fa (γz) = faγ (z) for every γ ∈ SL2 (Z).
The Λτ -periodicity of ℘τ ensures that if (r′ , s′ ) ≡ (r, s) (mod 1) then f(r,s) = f(r′ ,s′ ) .
If pa ∈ Z2 and γ ∈ Γ(p), we can see that a ≡ aγ (mod 1), so fa becomes invariant
under the action of Γ(p). Proposition 7.5.1 in [DS05] states that the function field of
X(p) over C is comprised of the j-invariant together with all the fa such that pa ∈ Z2
(and we can choose a finite subset by the comments above). Choose representatives
ε
βi for ±Γ(p)\Γεns (p). The function field of Xns
(p) is then
!
X
ε
C(Xns
(p)) = C j,
faβi ,
i
114
where a runs over the finite set above.
An important difference with the modular curve for Γ0 (N ) is that the cusps
are rational there, while here they are defined over the cyclotomic extension Q(ξp ),
where ξpp = 1. There are p − 1 cusps and they are all Galois conjugates of each other.
For the normalizer, there are just (p − 1)/2, they are defined over Q(ξ + ξ −1 ), and
they are also Galois conjugates of each other. (See [Ser97], Appendix 5, or [BB12]
Section 6.1.)
Modular
over Γεns (p)
Forms
p 0
Let αp =
. Conjugating Γ(p) by αp gives an isomorphism
0 1
4.4.2
∼
Γ(p) −→ αp−1 Γ(p)αp = Γ0 (p2 ) ∩ Γ1 (p),
which yields an isomorphism between the weight two cusp form spaces given by
∼
S2 (Γ(p)) −→ S2 (Γ0 (p2 ) ∩ Γ1 (p))
f 7−→ f |2 [αp ].
Since Γεns (p) ⊇ Γ(p), we have S2 (Γεns (p)) ⊆ S2 (Γ(p)). Using the decomposition
M
S2 (Γ0 (p2 ) ∩ Γ1 (p)) =
S2 (Γ0 (p2 ), χ), where the sum is taken over all the (even)
χ
characters of conductor dividing p, for any weight two cusp form f ∈ S2 (Γεns (p)),
f |2 [αp ] can be written (uniquely) as the sum of weight two cusp forms of level p2
with nebentypus χ, for χ as above.
For ℓ a prime number different from p, we can define Hecke operators Tℓε using
+
the double coset Γεns (p)αℓ Γεns (p) (resp. Γ+
ns (p)αℓ Γns (p)), where αℓ is any matrix in
115
ε
Mns
(p) with determinant ℓ. Notice that this means that when ℓ ≡ 1 (mod p), we can
choose αℓ = ( 10 0ℓ ), and it can be shown that this makes it coincide with the classical
Hecke operator on S2 (Γ(p)). Likewise, if ℓ ≡ −1 (mod p) and we are considering the
+
1 0
operator induced by the double coset Γ+
ns (p)αℓ Γns (p), we can choose αℓ = ( 0 ℓ ) and
the same conclusion can be drawn.
The following is Theorem 1.12 in [KP14], which compiles results from Theorem
1 in [Che98], Theorem 1.1 in [Edi96] and Theorem 2 [dSE00].
Theorem 4.17 (Chen-Edixhoven). The new part of Jac(X0+ (p2 )) is isogenous to
+
ε
Jac(Xns
(p)). Also the new part of Jac(X0 (p2 )) is isogenous to Jac(Xns
(p)). Further-
more, these isogenies are Hecke equivariant.
Theorem 4.17 associates to every normalized newform g ∈ S2 (Γ0 (p2 ))new a form
f ∈ S2 (Γεns (p)) such that f and g have the same eigenvalues for all ℓ 6= p, i.e., if
Tℓ g = λℓ g, then Tℓε f = λℓ f for all ℓ 6= p.
Using the Fourier expansion of g we can compute the Fourier expansion of f .
For χ a character of conductor p, denote by g ⊗ χ the twist of g by χ, which lives
in S2 (Γ0 (p2 ), χ2 ). The following is Theorem 1.14 in [KP14], which relates the two
Fourier expansions.
Theorem 4.18. Let f and g as above. Let πp be the local automorphic representation
of g at p. Then
- If πp is supercuspidal g⊗χ is a newform in S2 (Γ0 (p2 ), χ2 ) when χ has conductor
p and there exist αχ ∈ C for every χ of conductor dividing p such that
f |2 [αp ] =
X
116
χ
aχ · g ⊗ χ
- If πp is Steinberg, there exists a newform h ∈ S2 (Γ0 (p)) such that g is the twist
of h by the quadratic character of conductor p and there exist a, aχ ∈ C such
that
f |2 [αp ] = a · h +
X
χ
aχ · g ⊗ χ
- If πp is a ramified Principal Series, there exist a non-quadratic character θp of
conductor p, newforms h ∈ S2 (Γ0 (p), θ̄2 ) and h̄ ∈ S2 (Γ0 (p), θ2 ) with h ⊗ θp =
h̄ ⊗ θ¯p = g, and a1 , a2 , aχ ∈ C such that
f |2 [αp ] = a1 · h + a2 · h̄ +
X
χ
aχ · g ⊗ χ
Rational Modular Forms
We know that a newform in Γ0 (N ) is normalized in the sense that its first
coefficient is equal to 1. Multiplicity-one for the space of newforms states that given
g ∈ S2 (Γ0 (p2 ))new with a prescribed packet of eigenvalues outside of p, all other forms
with these eigenvalues will be scalar multiples of g. Theorem 4.17 implies that the
same holds for forms in S2 (Γεns (p)), but, the natural question that arises is how to
normalize f . We start by pre-normalizing f following Theorem 1.22 in [KP14].
Theorem 4.19. Let f ∈ S2 (Γεns (p)) be an eigenform which has the same eigenvalues
as a rational newform g ∈ S2 (Γ(p2 )). Normalize f |2 [αp ] in such a way that the
first Fourier coefficient is rational. Then f (and f |2 [αp ]) has a q-expansion whose
coefficients lie in Q(ξp ).
ε
At the end of section 4.4.1 we mentioned that there are p − 1 cusps in Xns
(p)
defined over Q(ξp ) which are conjugate of each other under the Galois action of the
117
cyclotomic extension. For σℓ ∈ Gal(Q(ξp )/Q) there exists a matrix Aℓ ∈ SL2 (Z) such
that σℓ (∞) = A · ∞. The q-expansion of f at Aℓ · ∞ is given by the q-expansion of
f |2 [A−1
ℓ ]. We say that f is rational if for every σℓ ∈ Gal(Q(ξp )/Q) the q-expansion of
X
f at σℓ (∞) equals σℓ−1 (f ) =
σℓ−1 (an (f ))qhn (which is equivalent to saying f |2 [Aℓ ] =
n≥1
σℓ (f ) for every σℓ ∈ Gal(Q(ξp )/Q)).
Thanks to Theorem 4.19, we can see that any scalar c ∈ Q(ξp )× will yield qexpansions for cf with coefficients in Q(ξp ), but we further want f to be rational.
If there exists a c such that cf is rational, every rational multiple of c will turn cf
into a rational modular form. Furthermore, if c1 f and c2 f are rational, then c1 /c2
must be a rational number (as σℓ (c1 /c2 ) = c1 /c2 for all σℓ ∈ Gal(Q(ξp )/Q)). It is
the existence of such c that is more subtle. This is the content of Theorem 1.27 in
[KP14].
Theorem 4.20. Let f ∈ S2 (Γεns (p)) be an eigenform with rational eigenvalues. There
exists a unique c ∈ Q(ξp )× (modulo Q× ) such that cf is rational.
Recall that modular forms have bounded denominators. This means that we
can choose the constant in Theorem 4.20 in such a way that all the coefficients of
the q-expansion are algebraic integers and that for every integer n ≥ 2 at least one
coefficient of cf /n is not an algebraic integer. Unfortunately, this still requires a
choice of “sign,” as there will be two such constants (up to sign). Both choices are
equally good.
4.4.3
Modular Parametrization
At this point, we can emulate the construction in Section 3.5 with f ∈ S2 (Γεns (p))
ε
a normalized eigenform. Let g be the genus of Xns
(p). Choose a basis {γ1 , . . . , γ2g }
118
ε
of the Z-homology of Xns
(p). The map that assigns to every loop its corresponding
period
Z
dq
f (q) =
q
γ
Z
γ
2πi
f (τ )dτ,
p
where q = e2πiτ /p , forms a rank 2 lattice Λf ⊆ C when f has rational eigenvalues
(which is the case for us, as we started with a newform corresponding to an elliptic
curve of conductor p2 ). The curve Ef = C/Λf is isogenous to the elliptic curve we
started with. Lemma 1.33 in [KP14] shows that Ef does not depend on the choice
of ε.
As before, the dimension of S2 (Γεns (p)) is equal to the genus. Take {f1 , . . . , fg }
a basis of S2 (Γεns (p)) comprised of eigenforms, with f1 = f . Let
ωj = fj (q)
dq
2πi
fj (τ )
=
fj (τ )dτ = ′
dj
q
p
pj (τ )/2πi
(which is a rational holomorphic differential form when fj is rational) and let
ε
ε
Ω(Xns
(p)) = hω1 , . . . , ωg i be the space of holomorphic differentials in Xns
(p). For
j = 1, . . . , 2g, let
Ωj =
Z
ω1 , . . . ,
γj
Z
γj
ωg
!
∈C
g
and
Λ=
2g
M
j=1
Ωj Z ⊆ C g ,
which again, forms a discrete full rank lattice in Cg .
ε
(p) are
The Abel-Jacobi map is not rational anymore because the cusps of Xns
not rational, but the projection map from the Jacobian to Ef is, so this modular
parametrization is rational over Q(ξp ). To remedy this, let us analyze a bit further
the Abel-Jacobi map.
119
The map
H∗ −→
τ 7−→
Cg
Z
τ
ω1 , . . . ,
Z
τ
ωg
i∞
i∞
−→
Z
7−→
τ
i∞
ε
J(Xns
(p))
Z τ
ωg
ω1 , . . . ,
mod Λ
i∞
descends to the quotient by Γεns (p), but we are choosing, in a rather arbitrary fashion,
the cusp ∞. For any σℓ ∈ Gal(Q(ξp )/Q) we can define an Abel-Jacobi map based at
the cusp σℓ (∞)
H∗ −→
τ 7−→
Cg
Z
τ
ω1 , . . . ,
Z
τ
ωg
σℓ (∞)
σℓ (∞)
−→
7−→
Z
τ
σℓ (∞)
ε
J(Xns
(p))
Z τ
ω1 , . . . ,
ωg
σℓ (∞)
mod Λ,
so, each σℓ yields, after projecting into Ef , it being a quotient of the Jacobian, a
modular parametrization
ε
Xns
(p)
A-Jσℓ
// J(X ε (p)) .
ns
MM
MM
π
M M
Φσℓ
&&
Ef
The map A-J =
P
σℓ
A-Jσℓ will be invariant under the action of Gal(Q(ξp )/Q), which
makes it a map defined over Q. Hence, we obtain a map
ε
Xns
(p)M
A-J //
ε
J(Xns
(p)) ,
MM
π
MM
Φ
M&&
Ef
where Φ =
P
σℓ
Φσℓ , and this modular parametrization is rational.
120
(4.4)
This map can also be made explicit, as in Section 3.5. The Manin constant is a
rational number (independent from ε), so we define
zτ = c
2πi X
p σ
ℓ
Z
A−1
ℓ τ
σℓ (f )(z)dz
i∞
!
,
where the sum is taken over all σℓ ∈ Gal(Q(ξp )/Q). Thus, we obtain an analogue for
Equation (3.2)
ε
ϕ : Xns
(p) −→
Ef
τ 7−→ (℘(zτ ), ℘′ (zτ ), 1),
where ℘ is the Weierstrass ℘-function of the lattice Λf . As before, this map can be
composed with the isogeny relating Ef and E, yielding a map
ε
Xns
(p)
4.4.4
_ Y S
k e
M&&
// E.
// Ef
(4.5)
Higher levels
We will be very brief here, as we will only use the definition of the Cartan
non-split group in what follows. It is included for the sake of completeness, and the
details can be found in Section 2 of [KP14].
Let N = n2 m, with n square free and gcd(n, 2m) = 1. Write n = p1 · · · pr . For
every pj , let εj be a quadratic nonresidue modulo pj and denote by ~ε = (ε1 , . . . , εr ).
The Cartan non-split congruence subgroup associated to these data is
Γ~εns (n, m)
=
r
\
j=1
Γεnsj (pj ) ∩ Γ0 (m),
121
which is the determinant 1 elements of the ring
~
ε
Mns
(n, m)
=
r
\
j=1
εj
(pj ) ∩ M0 (m).
Mns
The Chen-Edixhoven Theorem takes the following form, which is Theorem 2.1 in
[KP14].
~
ε
Theorem 4.21. The n2 -new part of J(X0 (n2 m)) is isogenous to J(Xns
(n, m)). The
isogeny is Hecke equivariant.
For an elliptic curve E of level N , there is a weight 2 newform g of level N , for
which there exists an eigenform f ∈ S2 (Γ~εns (n, m)) with the same eigenvalues as g
away from n.
The Abel-Jacobi map this time will be defined over Q(ξp1 , . . . , ξpr ), but the same
averaging as before will bring it down to a rational map.
4.4.5
Heegner points
As before, we will try to embed orders of quadratic fields into rings of matrices
and consider fixed points. Let E be an elliptic curve of conductor N = n2 m as in
the previous section. Let K be a number field and O ⊆ K an order. Assume that
(1) The discriminant of O is relatively prime to nm.
(2) Every prime dividing m is split in K.
(3) Every prime dividing n is inert in K.
2
Let D0 be the fundamental discriminant of K and
√ let D = D0 r . O can be seen as
D+ D
the Z-module generated by 1 and ω = ωD =
. In order to find an embedding
2
~
ε
of O ֒−→ Mns
(n, m) it suffices to find a matrix where we can map ω, which is, a
matrix with the same trace as the trace of ω and determinant as the norm of ω. More
122
explicitly, we need a matrix ( ac db ) such that a + d = D and ad − bc =
D(D − 1)
.
4
Substituting the former in the latter we obtain
a(D − a) − bc =
D(D − 1)
.
4
We need to guarantee local conditions and glue them using the Chinese Remainder Theorem. The main problem will be solving the quadratic equation.
Let p be a prime dividing m. The local condition at p states that
a(D − a) ≡
D(D − 1)
4
(mod p),
which can be rewritten as
(2a − D)2 ≡ D
(mod p).
√
Condition (2) ensures that p splits in K = Q( D), so this equation has solution
modulo p. Condition (3) implies that we cannot find Heegner points directly via the
ideas in Section 4.2, because this equation would not have a local solution.
Now, let p be a prime dividing n. The local condition at p states that
D
2
D
D−
2
− b2 ε ≡
D(D − 1)
4
(mod p),
which can be rewritten as
D
≡ b2 ε
4
(mod p).
Condition (1) implies that b2 ε is not 0 modulo p. Condition (3) implies that this
√
quantity cannot be a square, as p is inert in K = Q( D). Since ε is a quadratic
nonresidue modulo p, this equation has a solution modulo p.
123
~
ε
Let M ∈ Mns
(n, m) be the image of ω under the sought embedding and let
τ ∈ C such that M · τ = τ under the Möbius action. Then
M ·τ =
aτ + b
= τ ⇐⇒ cτ 2 + (d − a)τ − b = 0,
cτ + d
so
p
p
(d − a)2 + 4bc
a − d ± (d + a)2 − 4(ad − bc)
τ=
=
p 2c
√ 2c
2
a − d ± D − D(D − 1)
a−d± D
=
,
=
2c
2c
a−d±
and τ can be chosen to lie in H.
Appendix 5 in [Ser97] asserts that the corresponding point in the modular curve
~
ε
Xns
(n, m)(C) under its identification with Γ~εns (n, m)\H∗ , described in Section 4.4.1,
~
ε
actually lies in Xns
(n, m)(H), where H is the ring class field attached to O.
Using the modular parametrization described in Section 4.4.3 (Equation (4.5))
points can be obtained on E(H). For examples, see Section 5 in [KP14].
124
Chapter 5
Stark-Heegner points attached to Cartan Non-split curves
This chapter will bring together the ideas of Sections 4.2, 4.3 and 4.4 setting up
the computations required to find points on an elliptic curve E, conjecturally defined
over narrow ring class fields of real quadratic extensions, in similar circumstances to
those Kohen and Pacetti had in their construction of Heegner points. We will follow
closely the path Darmon designed in his construction, which can be summarized as
follows:
(1) Isolate a special prime p dividing the conductor N of the elliptic curve E.
(2) Construct a group to play the role of Γ0 (N ) in the classical case, which we
denoted by Γp,M .
(3) Define a space of modular forms for Γp,M .
(4) Establish an isomorphism between a subspace of the space of classical weight 2
cusp forms for Γ0 (N ) and the space of weight 2 cusp forms for Γp,M .
(5) Define a system of integral measures.
(6) Define the double integrals using the measures above.
(7) Conjecture the existence of the semi-indefinite integral and relate it to the double
integrals.
(8) Use the semi-indefinite integral to construct points on E.
Let K be a real quadratic field and E an elliptic curve defined over Q such
that the sign of L(E/K, s) is −1. We will limit ourselves to elliptic curves E with
125
bad reduction only at two odd primes, one of which is multiplicative and the other
one additive. That is, N = pq 2 , so p will play the same role as it did in Darmon’s
construction. (In particular, p is inert in K.) The case where q is split in the real
quadratic field K was dealt with in [GM15] so we will deal with the case where q is
inert. Fix K throughout the remaining of this chapter.
5.1
The group
The group Γp,M was defined as the subgroup of SL2 (Z[1/p]) of matrices which
are upper-triangular modulo M. Notice that Γp,M can be seen as the determinant
1 elements of the tensor M0 (N ) ⊗ Z[1/p] = M0 (M )[1/p]. The case of imaginary
quadratic fields where p was split and q was inert used the group Γεns (q) ∩ Γ0 (p),
where ε was a quadratic nonresidue modulo q. The natural extension to the scenario
we want to consider is to take the determinant 1 elements of the tensor product
ε
ε
(Mns
(q) ∩ M0 (p)) ⊗ Z[1/p] = Mns
(q)[1/p],
ε
where Mns
(q)[1/p] is the ring of 2 × 2 matrices with entries in Z[1/p] such that
x y a b
(mod q).
≡
bε a
z w
When talking about the normalizer, we add the alternate condition
b
x y a
(mod q).
≡
−bε −a
z w
+
(q)[1/p]. The corresponding groups will be denoted Γεns (q)[1/p]
This ring is called Mns
and Γ+
ns (q)[1/p].
126
Both Γεns (q)[1/p] and Γ+
ns (q)[1/p] are finite index subgroups of SL2 (Z[1/p]). Since
the action of SL2 (Z[1/p]) on Hp and on H by Möbius transformations has dense
orbits, the same is true for Γεns (q)[1/p] and Γ+
ns (q)[1/p]. Likewise, since the action of
SL2 (Z[1/p]) on Hp × H is discrete, the same is true for Γεns (q)[1/p] and Γ+
ns (q)[1/p].
There is a relationship between the different Cartan Non-split groups upon varying the quadratic nonresidue ε. We state the result in the next proposition.
Proposition 5.1. Let ε and ε′ be two quadratic nonresidues modulo q. There exists
an inner automorphism of SL2 (Z[1/p]) which induces an isomorphism
′
Γεns (q)[1/p] ∼
= Γεns (q)[1/p].
Furthermore, it induces an isomorphism between the corresponding normalizers.
In order to prove it, we need a couple of lemmas.
Lemma 5.2. For any quadratic nonresidue ε, the index [Γεns (q)[1/p] : Γ(q)[1/p]] is
equal to q + 1.
Proof. This can be accomplished using the isomorphism
Γεns (q)[1/p]/Γ(q)[1/p] −→ (F×
q 2 )1
√
a b
7−→ a + b ε,
bε a
√
2
2
where (F×
q 2 )1 is the set of norm 1 elements of Fq = Fq ( ε), which has q − 1
elements.
127
Lemma 5.3. Let
a b
M =
∈ SL2 (Z)
c d
such that
ac ≡ bdε
(mod q).
The inner automorphism of SL2 (Z[1/p]) given by γ 7→ M γM −1 restricted to Γεns (q)[1/p]
′
yields an isomorphism into Γεns (q)[1/p], where
ε′ ≡ ε(a2 − b2 ε)−2 (mod q).
x y
Proof. For γ ≡
(mod q), a quick computation reveals that
yε x
(a2 − b2 ε)y
(ad − bc)x − (ac − bdε)y
M γM −1 ≡
−(c2 − d2 ε)y
(ad − bc)x + (ac − bdε)y
(mod q).
Since ac ≡ bdε (mod q) and ad − bc = 1, the top-left entry and the bottom-right
entry are both congruent to x modulo q.
Note that
(c2 − d2 ε)(a2 − b2 ε) = c2 a2 + d2 b2 ε2 − ε(c2 b2 + d2 a2 )
= (a2 c2 − 2acbdε + b2 d2 ε2 ) − ε(a2 d2 − 2adbc + b2 c2 )
= (ac − bdε)2 − ε(ad − bc)2
≡ −ε
(mod q),
128
so
−
c2 − d2 ε
−(c2 − d2 ε)(a2 − b2 ε)
≡ ε(a2 − b2 ε)−2 ≡ ε′
=
a2 − b2 ε
(a2 − b2 ε)2
(mod q).
Using this last congruence, we find that
(a2 − b2 ε)y · ε′ ≡ (a2 − b2 ε)y · −
c2 − d2 ε
≡ −(c2 − d2 ε)y
a2 − b2 ε
(mod q),
′
so M γM −1 lies indeed in Γεns (q)[1/p].
′
Let Γ′ ⊆ Γεns (q)[1/p] denote the image of Γεns (q)[1/p] under the inner automorphism induced by M. Using Lemma 5.2 for ε and ε′ , we have that
[SL2 (Z[1/p]) : Γ′ ] = [SL2 (Z[1/p]) : Γεns (q)[1/p]]
= [SL2 (Z[1/p]) : Γ(q)[1/p]] ÷ [Γεns (q)[1/p] : Γ(q)[1/p]]
= [SL2 (Z[1/p]) : Γ(q)[1/p]] ÷ (q + 1)
′
= [SL2 (Z[1/p]) : Γ(q)[1/p]] ÷ [Γεns (q)[1/p] : Γ(q)[1/p]]
′
= [SL2 (Z[1/p]) : Γεns (q)[1/p]],
but we also have
′
[SL2 (Z[1/p]) : Γ′ ] = [SL2 (Z[1/p]) : Γεns (q)[1/p]] · [Γεns (q)[1/p] : Γ′ ],
whence
′
[Γεns (q)[1/p] : Γ′ ] = 1.
′
This shows that Γ′ = Γεns (q)[1/p], as we wanted.
Proof of Proposition 5.1. This statement is meaningful only when q > 3, so
assume this is the case. Since both ε and ε′ are quadratic nonresidues, their quotient
129
is a quadratic residue. Let t be an integer such that t2 ≡ εε′−1 (mod q). Let us
consider two cases:
- Case 1: t is a quadratic residue modulo q.
Let a be an integer such that a2 ≡ t (mod q). Let M be a lift to SL2 (Z[1/p])
of the matrix
a 0
,
−1
0 t a
which has determinant 1 modulo q.
- Case 2: t is a quadratic nonresidue modulo q.
Consider the sum of Legendre symbols
X x
x∈Fq
q
= 0.
Squaring it and reorganizing we obtain
0=
X x 2
x∈Fq
q
= (q − 1) +
+
X X x x − a
x∈Fq
a∈F×
q
q
q
X X x x − a a−1 2
x∈Fq
a∈F×
q
q
X X xa
q
−1
q
xa−1 − 1
= (q − 1) +
q
q
x∈Fq
a∈F×
q
X x x − 1
= (q − 1) + (q − 1)
,
q
q
x∈F
q
whence
X x x − a
x∈Fq
q
q
130
= −1
for any a ∈ F×
q . In particular, it holds for a = t. This implies that there exists
t′ ∈ Fq − {t} such that t′ is a quadratic nonresidue and t′ − t is a nonzero
quadratic residue. Otherwise, for all quadratic nonresidues different from t,
both Legendre symbols would be −1. These account for (q − 3)/2 products
equal to 1. For x = t and x = 0 we obtain a zero product. This would mean
that all the remaining (q − 1)/2 products must be equal to −1, but all the
remaining values for x are quadratic residues, so x − t would have to be a
quadratic nonresidue. This gives us a total of (q − 1)/2 + (q − 3)/2 = q − 2
values of x for which x −t is a quadratic nonresidue, producing a contradiction.
Since ε is a quadratic nonresidue and t′ − t is a quadratic residue, (t′ − t)ε−1 is
a quadratic nonresidue. This is, there is no integer k such that
k 2 ≡ (t′ − t)ε−1
(mod q),
or, equivalently,
This implies that the set
(
t + 12 ε, t + 22 ε, . . . , t +
q−1
2
t + k 2 ε ≡ t′
(mod q).
2 )
ε
must contain a quadratic residue, as the quadratic nonresidue t′ is not in the
set. Denote it by x2 ≡ t + y 2 ε (mod q). Let M be a lift to SL2 (Z[1/p]) of the
matrix
which has determinant 1.
y
x
,
−1
−1
t yε t x
131
In both cases, the matrix
a b
M =
c d
has the property that
ac − bdε ≡
a · 0 − 0 · at−1 ε
x · t−1 yε − y · t−1 xε
≡0
(mod q),
so, by Lemma 5.3, M induces an isomorphism between the Cartan Non-split groups
corresponding to
ε and ε(a2 − b2 ε)−2 ≡ εt−2 ≡ εε′ ε−1 ≡ ε′
(mod q).
The last part of the proposition follows from the following fact: if H is a subgroup
of G with normalizer N , then for every g ∈ G, gN g −1 is the normalizer of gHg −1 , as
n ∈ NG (gHg −1 ) ⇐⇒ ngHg −1 n−1 = gHg −1 ⇐⇒ g −1 ngHg −1 n−1 g = H
⇐⇒ g −1 ng ∈ N ⇐⇒ n ∈ gN g −1 .
Let G = SL2 (Z[1/p]) and H = Γεns (q)[1/p] above. The result follows.
5.1.1
Cusps
If two cusps in P1 (Q) are equivalent under the action of Γεns (q), they must also
be equivalent under the action of Γεns (q)[1/p], as Γεns (q)[1/p] ⊇ Γεns (q). However, the
surplus of elements in the larger group allows for some inequivalent cusps in Γεns (q)
to become equivalent in Γεns (q)[1/p]. The same occurs with the normalizers. How
this happens is summarized in the following proposition.
132
Proposition 5.4. Let x, y and x′ , y ′ be two pairs of relatively prime integers. The
cusps x/y and x′ /y ′ are equivalent under the action of Γεns (q)[1/p] or Γ+
ns (q)[1/p] if
and only if the classes of x2 − y 2 ε−1 and x′2 − y ′2 ε−1 coincide in
(Z/qZ)× /hp2 i
or
(Z/qZ)× /hp2 , −p2 i,
respectively.
In order to prove Proposition 5.4, we will make use of the following lemma.
Lemma 5.5. Let a, b be two relatively prime integers. The cusp a/b is equivalent to
2
2 −1
∞ under the action of Γεns (q)[1/p] or Γ+
ns (q)[1/p] if and only if the class of a − b ε
is trivial in
(Z/qZ)× /hp2 i
(Z/qZ)× /hp2 , −p2 i,
or
respectively.
Proof. Suppose that ∞ is equivalent to a/b under the action of Γεns (q)[1/p]. This
implies that there exists a matrix
such that
x z
ε
M =
∈ Γns (q)[1/p],
y w
a
M (∞) = .
b
From xw − yz = 1 we can see that the ℓ-adic valuations of x and y (ℓ 6= p) cannot
both be positive. Since
a
x·1+z·0
x
= M (∞) =
=
b
y·1+w·0
y
133
and a and b are relatively prime integers, we find that a = ±xps and b = ±yps for
some integer s. Using x ≡ w (mod q) and y ≡ zε (mod q), we conclude that
a2 − b2 ε−1 = (x2 − y 2 ε−1 )p2s ≡ (xw − yz)p2s ≡ p2s
(mod q),
which implies that the class of a2 − b2 ε−1 is trivial in
(Z/qZ)× /hp2 i.
Conversely, suppose that a2 − b2 ε−1 ≡ p2s (mod q) for some integer s. Let b̄ be
an integer such that b̄ε ≡ b (mod q), so b2 ε−1 ≡ bb̄ (mod q). From a2 − b2 ε−1 ≡ p2s
(mod q), we find that p2s − (a2 − bb̄) is divisible by q.
Let k1 and k2 be integers such that
ak2 − bk1 =
p2s − (a2 − bb̄)
,
q
whose existence is guaranteed by the fact that a and b are relatively prime and the
RHS is an integer. The matrix
has determinant
s
s
a/p (b̄ + qk1 )/p
M =
b/ps (a + qk2 )/ps
(a2 − bb̄) + (p2s − (a2 − bb̄))
a(a + qk2 ) − b(b̄ + qk1 )
=
= 1.
p2s
p2s
134
Furthermore, (b̄ + qk1 )/ps · ε ≡ b/ps (mod q) and a/ps ≡ (a + qk2 )/ps (mod q),
implying that M lies in Γεns (q)[1/p]. Finally,
M (∞) =
a/ps
a
= ,
s
b/p
b
so the cusp a/b is equivalent to ∞ under the action of Γεns (q)[1/p].
The computations for the normalizer are completely analogous.
Proof of Proposition 5.4. Let x/y ∈ P1 (Q) and let ρ denote the class of x2 −y 2 ε−1
in (Z/qZ)× . Let
′
′
a b
M =
∈ SL2 (Z)
′
′
c d
be the matrix in Proposition 5.1 that induces an isomorphism between Γεns (q)[1/p]
′
and Γεns [1/p], where ε′ ≡ ερ2 (mod q), this is,
b
a
2
2
2
′−1
−2
M ≡
(mod q), a − b ε ≡ t(mod q), t ≡ εε ≡ ρ (mod q).
t−1 bε t−1 a
We can choose t to be precisely ρ−1 . The action of M on the cusp x/y is given by
M (x/y) =
a′ x + b′ y
.
c′ x + d′ y
135
Consider
(a′ x + b′ y)2 − (c′ x + d′ y)2 ε′−1 ≡ (a′ x + b′ y)2 − (c′ x + d′ y)2 ε−1 ρ−2
≡ (a2 − b2 ε)x2 + (b2 − a2 ε−1 )y 2
≡ (a2 − b2 ε)(x2 − y 2 ε−1 )
≡1
(mod q)
(mod q)
(mod q)
(mod q).
By Lemma 5.5, the cusp M (x/y) is equivalent to the cusp ∞ under the action of
′
Γεns (q)[1/p] = M Γεns (q)[1/p]M −1 (and, a fortiori, by the action of its normalizer).
Let x′ /y ′ ∈ P1 (Q) be another cusp and let ρ′ denote the class of x′2 − y ′2 /ε in
(Z/qZ)× . As before, we have
M (x′ /y ′ ) =
a′ x′ + b′ y ′
c′ x′ + d′ y ′
and, in order to test equivalence to ∞ we need to consider
(a′ x′ + b′ y ′ )2 − (c′ x′ + d′ y ′ )2 ε′−1 ≡ (a2 − b2 ε)(x′2 − y ′2 ε−1 )
≡ tρ′
(mod q)
(mod q).
By Lemma 5.5, the cusp M (x/′ y ′ ) is equivalent to ∞ (and hence, to M (x/y))
′
under the action of Γεns (q)[1/p] if and only if tρ′ is in hp2 i. By Proposition 5.1, this
translates into the existence of γ ∈ Γεns (q)[1/p] such that
(M γM −1 )(M (x/y)) = M (x′ /y ′ ) ⇐⇒ γ(x/y) = x′ /y ′
136
if and only if tρ′ ≡ ρ−1 ρ′ lies in hp2 i, which is the same as asking for ρ and ρ′ to lie
in the same class in
(Z/qZ)× /hp2 i.
As with Lemma 5.5, the computations for the normalizer are analogous.
Now, we are in position to determine the number of inequivalent cusps under
the action of the groups we are analyzing.
Corollary 5.6. Let r be the size of the multiplicative subgroup of (Z/qZ)× generated
by p2 . Let β be equal to 1 if −1 ∈ hp2 i and 2 otherwise. Then
q−1
# Γεns (q)[1/p]\P1 (Q) =
r
and
q−1
1
# Γ+
.
ns (q)[1/p]\P (Q) =
βr
Proof. By Proposition 5.4, the cusps x/y and x′ /y ′ are equivalent under the
action of Γεns (q)[1/p] if and only if the classes of x2 − y 2 ε−1 and x′2 − y ′2 ε−1 in
(Z/qZ)× /hp2 i coincide. In other words, the map
Γεns (q)[1/p]\P1 (Q) −→ (Z/qZ)× /hp2 i
x/y
7−→
x2 − y 2 ε−1
is a bijection.
Likewise, the cusps are equivalent under the action of Γ+
ns (q)[1/p] if an only if
the classes coincide in (Z/qZ)× /hp2 , −p2 i, yielding a bijection
1
×
2
2
Γ+
ns (q)[1/p]\P (Q) −→ (Z/qZ) /hp , −p i
x/y
7−→
137
x2 − y 2 ε−1 .
The first equality follows from
#((Z/qZ)× /hp2 i) =
q−1
#((Z/qZ)× )
=
,
2
#hp i
r
and the second one from
#((Z/qZ)× /hp2 , −p2 i) =
#((Z/qZ)× )
#((Z/qZ)× ) #hp2 , −p2 i
q−1
=
÷
=
,
2
2
2
2
#hp , −p i
#hp i
#hp i
βr
as the index of hp2 i in hp2 , −p2 i is precisely β.
5.2
Modular Forms
ε
+
Let Γ be Γεns (q)[1/p] or Γ+
ns (q)[1/p]. Denote by Γ̂ the group Γns (q, p) or Γns (q, p),
respectively. Recall that T = T0 ∪ T1 is the Bruhat-Tits tree of GL2 (Qp ), where
T0 is the set of vertices and T1 is the set of unordered edges and E(T ) is the set of
ordered edges. Let S2 (T , Γ) be the space of cusp forms of weight 2 for Γ, this is, all
the functions
f : E(T ) × H −→ C
satisfying the three properties
a b
(1) f (γe, γτ ) = (cτ + d)2 f (e, τ ) for all γ =
∈ Γ.
c d
(2) For each vertex v ∈ T0 we have
X
f (e, τ ) = 0,
s(e)=v
and for each ordered edge e ∈ E(T ) we have f (ē, τ ) = −f (e, τ ).
(3) For each fixed oriented edge e ∈ E(T ) the function fe (τ ) is a weight 2 cusp form
for Γe , the stabilizer of e in Γ.
138
Proposition 5.7. The stabilizers of v0 and e0 in Γεns (q)[1/p] (resp. Γ+
ns (q)[1/p]) are
+
Γεns (q) and Γεns (q, p) (resp. Γ+
ns (q) and Γns (q, p)), respectively.
Proof. By Chapter 9, Exercise 1 in [Dar04], we have
StabSL2 (Z[1/p]) (v0 ) = SL2 (Z)
and
StabSL2 (Z[1/p]) (e0 ) = Γ0 (p).
and
StabΓ (e0 ) = Γ0 (p) ∩ Γ,
From here, it’s clear that
StabΓ (v0 ) = SL2 (Z) ∩ Γ
whence the proposition follows.
As in the case of classical Stark-Heegner points, for f ∈ S2 (T , Γ) we denote by f0
the classical modular form fe0 attached to the edge e0 . By the previous proposition,
this is a modular form in S2 (Γ̂).
We have the following lemma, which is analogous to Lemma 4.6.
Lemma 5.8. The map
S2 (T , Γ) −→ S2 (Γ̂)
f
7−→
f0
is injective. Furthermore, the image is S2 (Γ̂)p−new .
Proof. This follows almost exactly as Lemma 1.3 in [Dar01] and the comments
before it. We will write the proof in the case of Γ = Γεns (q)[1/p] for simplicity of
notation.
ε
Let Γ̃ = Mns
(q)[1/p]×
+ , which is, 2 × 2 matrices with entries in Z[1/p] satisfying
the congruence condition modulo q from Section 5.1 with determinant pβ , β ∈ Z.
139
Let S2 (E, Γ̃) be the space of functions
f : E(T ) × H −→ C
satisfying the two properties
a b
(1) f (γe, γτ ) = (cτ + d)2 f (e, τ )/ det(γ) for all γ =
∈ Γ̃.
c d
(2) For each fixed oriented edge e ∈ E(T ) the function fe (τ ) is a weight 2 cusp form
for Γ̃e , the stabilizer of e in Γ̃.
Note that (1) for γ with determinant 1 (i.e., γ ∈ Γ) gives (1) for any γ of even
determinant, and (1) for γ with determinant p gives (1) for any γ of odd determinant.
Also, let S2 (T0 , Γ̃) be the space of functions
f : T0 × H −→ C
satisfying the two properties
a b
(1) f (γv, γτ ) = (cτ + d)2 f (v, τ )/ det(γ) for all γ =
∈ Γ̃.
c d
(2) For each fixed vertex v ∈ T0 the function fv (τ ) is a weight 2 cusp form for Γ̃v ,
the stabilizer of v in Γ̃.
We say that a vertex is even if there is an even number of edges in the path
connecting it to the root, and we say it is odd otherwise. The two orbits of the action
of Γ on T0 are the even vertices and the odd vertices, precisely. However, the action
of Γ̃ on T yields only one orbit for the vertices and one orbit for the oriented edges.
If γ ∈ Γ̃ has determinant an even power of p, it preserves the Γ-orbit, whereas if it
140
has determinant an odd power of p it changes the Γ-orbit. Let α ∈ Γ̃ such that α
has determinant an odd power of p.
We have a map
ι : S2 (T , Γ) −→ S2 (E, Γ̃)
defined by
ι(f )(e, τ )dτ =
f (e, τ )dτ
f (αe, ατ )dατ
if s(e) is even
if s(e) is odd.
Note that this definition is independent of α, as any other choice α′ would also
have as determinant an odd power of p, so α′ = γ ′ α, where γ ′ has determinant an even
power of p. The invariance of f for elements in Γ yields the required invariance for
γ ′ , which shows that f (αe, ατ )dατ = f (α′ e, α′ τ )dα′ τ. In particular, for the definition
of ι(f ), we may choose α with the further property of fixing the unordered edge e0
(but not the ordered one).
The invariance for every element of Γ̃ follows from the following computations:
- If s(e) is even and det(γ) is an even power of p we have
ι(f )(γe, γτ )dγτ = f (γe, γτ )dγτ = f (e, τ )dτ = ι(f )(e, τ )dτ.
- If s(e) is even and det(γ) is an odd power of p we have
ι(f )(γe, γτ )dγτ = f (αγe, αγτ )dαγτ = f (e, τ )dτ = ι(f )(e, τ )dτ,
as s(γe) is odd and αγ has determinant an even power of p.
141
- If s(e) is odd and det(γ) is an even power of p we have
ι(f )(γe, γτ )dγτ = f (αγe, αγτ )dαγτ = ι(f )(e, τ )dτ,
as s(γe) and s(e) are odd and αγ has determinant an odd power of p.
- If s(e) is odd and det(γ) is an odd power of p we have
ι(f )(γe, γτ )dγτ = f (γe, γτ )dγτ = ι(f )(e, τ )dτ,
as s(e) is odd, s(γe) is even, and γ has determinant an odd power of p.
Moreover, this is an injection from S2 (T , Γ) ֒→ S2 (E, Γ̃), since ι(f ) = 0 implies
that f (e, τ ) = 0 for all edges such that s(e) is even. From (2) in the definition
of S2 (T , Γ), for odd s(e) we have f (e, τ ) = −f (ē, τ ) = 0, as s(ē) = t(e) is even.
Whence, f (e, τ ) = 0 for all e ∈ E(T ), making f = 0.
We also have two maps
πs , πt : S2 (E, Γ̃) −→ S2 (T0 , Γ̃)
given by
πs (f )(v, τ ) =
X
f (e, τ )
and
s(e)=v
πt (f )(v, τ ) =
X
f (e, τ ),
t(e)=v
since any γ ∈ Γ̃ will map the set of edges with source (resp. target) v to the set of
edges with source (resp. target) γv. The properties of S2 (E, Γ̃) translate into the
properties for S2 (T0 , Γ̃) this way.
142
Thus, we have the sequence
0
// S2 (T
, Γ)
ι
//
S2 (E, Γ̃)
πs ⊕πt //
S2 (T0 , Γ̃) ⊕ S2 (T0 , Γ̃)
Recall that ι is injective, so in order to show exactness, we just need to show
that ker(πs ⊕ πt ) = im(ι).
Let f ∈ S2 (T , Γ). Then,
πs (ι(f ))(v, τ )dτ =
X
ι(f )(e, τ )dτ =
s(e)=v
X
f (e, τ )dτ
if v is even
s(e)=v
X
f (αe, ατ )dατ
if v is odd
s(e)=v
X
f (e, τ ) dτ
s(e)=v
=
X
f (e, ατ ) dατ
if v is even
if v is odd
s(e)=αv
= 0,
Where the last equality follows from (2) in the definition of S2 (T , Γ). By the same
token, and using the identity f (e, τ ) = −f (ē, τ ), we find that
πt (ι(f ))(v, τ ) = 0,
implying that im(ι) ⊆ ker(πs ⊕ πt ).
Now, let g ∈ ker(πs ⊕ πt ). Let
f : E(T ) × H −→ C
143
defined by
g(e, τ )dτ =
f (e, τ )dτ
if s(e) is even
f (α−1 e, α−1 τ )dα−1 τ
if s(e) is odd.
Since g ∈ ker(πs ⊕ πt ), we have that f satisfies condition (2) of the definition of
S2 (T , Γ). The invariance of g for Γ̃ yields the required invariance of f for Γ. Lastly,
it can readily be verified that ι(f ) = g. Hence, ker(πs ⊕ πt ) ⊆ im(ι), as required.
Note that S2 (E, Γ̃) ∼
= S2 (Γεns (q, p)). This, via the assignment f 7→ f0 , which
follows from Proposition 5.7. For each f0 in the latter space, we can build f using
the transitivity of the action of Γ̃ on E(T ). Furthermore, this implies that one edge
determines the behavior at all edges. In exactly the same way, we can see that
S2 (T0 , Γ̃) ∼
= S2 (Γεns (q)), by taking the modular form attached to v0 , the root of T .
The double coset operator given by Γεns (q, p) ( 10 01 ) Γεns (q) is the natural trace
map that symmetrizes a modular form in S2 (Γεns (q, p)), yielding a modular form
in S2 (Γεns (q)), analogous to the level lowering operator in the classical case. The
double coset operator given by Γεns (q, p)αΓεns (q) corresponds to the trace map that
symmetrizes a modular form in S2 (Γεns (q, p)) after applying the involution that arises
slashing by α, analogous to the level lowering operator after applying the AtkinLehner involution. We denote the former by ϕs and the latter by ϕt . Thus, we
obtain a map
ϕs ⊕ ϕt : S2 (Γεns (q, p)) −→ S2 (Γεns (q)) ⊕ S2 (Γεns (q)).
144
The kernel of this map, as in the classical case, is the space of forms that are new at
p, which is denoted by S2 (Γεns (q, p))p−new . By abuse of notation, let
ι : S2 (Γεns (q, p))p−new ֒−→ S2 (Γεns (q, p))
denote this inclusion.
This gives us an exact sequence
p−new ι
ns (q, p))
// S2 (Γε
0
//
S2 (Γεns (q, p))
ϕs ⊕ϕt
//
S2 (Γεns (q)) ⊕ S2 (Γεns (q)) ,
whence we derive the diagram
0
//
S2 (T , Γ)
S2 (Γεns (q, p))
ι
ι
//
//
S2 (E, Γ̃)
S2 (Γεns (q, p))
πs ⊕πt
ϕs ⊕ϕt
//
//
S2 (T0 , Γ̃) ⊕ S2 (T0 , Γ̃)
S2 (Γεns (q)) ⊕ S2 (Γεns (q)),
where the vertical maps are given by the assignment f 7→ f0 . Commutativity of the
left square follows trivially, as f0 (τ ) = f (e0 , τ ) and s(e0 ) = v0 is even. Commutativity
of the second square is a bit more subtle. The maps ϕs and ϕt can be expressed in
a more concrete way by choosing representatives. Let γ0 , γ1 , . . . , γp be elements of
Γεns (q) (the stabilizer of v0 ) which map all the different oriented edges e such that
s(e) = v0 to e0 . These form a set of coset representatives
Γεns (q) =
p
[
Γεns (q, p)γj ,
j=0
so
ϕs (f ) =
p
X
j=0
f |2 [γj ]
and
ϕt (f ) =
p
X
j=0
145
f |2 [αγj ].
Let f ∈ S2 (E, Γ̃) and let j(γ, τ ) be the automorphy factor of γ at τ, i.e.,
for g : H −→ C,
g|2 [γ] = det(γ)j(γ, τ )−2 g(γτ ).
Hence,
f (e0 , τ )|2 [γj ] = det(γj )j(γj , τ )−2 f (e0 , γj τ ) = det(γj )j(γj , τ )−2 f (γj γj−1 e0 , γj τ )
= f (γj−1 e0 , τ )
and
f (e0 , τ )|2 [αγj ] = det(αγj )j(αγj , τ )−2 f (e0 , αγj τ )
= det(αγj )j(αγj , τ )−2 f (αγj γj−1 α−1 e0 , αγj τ ) = f (γj−1 α−1 e0 , τ ).
These two identities show that
X
πs (f )0 (τ ) =
f (e, τ ) =
s(e)=v0
p
=
X
p
X
f (γj−1 e0 , τ ) =
j=0
p
X
f (e0 , τ )|2 [γj ]
j=0
f0 (τ )|2 [γj ] = ϕs (f0 )(τ )
j=0
and
πt (f )0 (τ ) =
X
t(e)=v0
p
=
X
f (e, τ ) =
p
X
f (γj−1 α−1 e0 , τ )
=
p
X
j=0
j=0
f0 (τ )|2 [αγj ] = ϕt (f0 )(τ ),
j=0
which establish the sought commutativity.
146
f (e0 , τ )|2 [αγj ]
Since the diagram is commutative, for every f ∈ S2 (T , Γ) we have that
(ϕs ⊕ ϕt )(ι(f )0 ) = ((πs ⊕ πt )(ι(f )))0 = 00 = 0,
so f0 = ι(f )0 ∈ ker(ϕs ⊕ ϕt ) = S2 (Γεns (q, p))p−new . This means that we actually have
the commutative diagram
//
0
0
//
S2 (T , Γ)
S2 (Γεns (q, p))p−new
ι
ι
//
//
S2 (E, Γ̃)
S2 (Γεns (q, p))
πs ⊕πt
ϕs ⊕ϕt
//
//
S2 (T0 , Γ̃) ⊕ S2 (T0 , Γ̃)
S2 (Γεns (q)) ⊕ S2 (Γεns (q)).
The required isomorphism follows from the five-lemma.
5.3
Measures, double Integrals and semi-indefinite Integrals
Starting with an elliptic curve E of conductor pq 2 we have a normalized weight
2 newform g associated to E. Theorem 4.21 associates to g an eigenform f0 ∈ S2 (Γ),
where Γ = Γεns (q, p) or Γ+
ns (q, p) depending on the sign of the Atkin-Lehner involution
at q, with the same eigenvalues as g away from q. Theorem 4.20 yields a multiplicative
constant that shows we can assume f0 to be a normalized rational eigenform, i.e., for
every σℓ ∈ Gal(Q(ξq )/Q) we have f0 |2 [Aℓ ] = σℓ (f0 ) (where Aℓ is a matrix such that
σℓ (∞) = Aℓ · ∞), its q-expansion has algebraic integers as coefficients and for every
integer m ≥ 2, f0 /m does not fulfil this last property. Since g is new at p (in fact,
everywhere), f0 will also be new at p. Finally, Lemma 5.8 associates to f0 a modular
form f ∈ S2 (T , Γ).
As in Section 4.3.3, in order to define a system of measures it suffices to define
the values at the open sets Ue associated to each e ∈ T1 . Let x, y ∈ P1 (Q) and c be
147
the Manin constant associated to E via f0 , as in Section 4.4.3. Let
µ̃f {x → y}(Ue ) = c · 2πi
Note that for γ ∈ Γ we have
µ̃f {γx → γy}(Uγe ) = c · 2πi
= c · 2πi
Z
Z
y
fe (z)dz.
x
γy
γx
Z y
x
f (γe, z)dz
f (e, τ )dτ = µ̃f {x → y}(Ue ),
(5.1)
using the identity f (γe, γτ )dγτ = f (e, τ )dτ.
Since the Abel-Jacobi map is not a rational map in this case, we need to average it following the recipe from Equation (4.4). The value µ̃f {x → y}(Ue ) can be
expressed as
µ̃f {x → y}(Ue ) = c · 2πi
Z
x
y
fe (z)dz = c · 2πi
Z
y
σℓ (∞)
fe (z)dz − c · 2πi
Z
x
fe (z)dz,
σℓ (∞)
so by averaging over all cusps we end up obtaining the measure
µ̃′f {x → y}(Ue ) = (q − 1)µ̃f {x → y}(Ue ).
All these values can, again, be expressed in terms of the periods of f0 , which lie
on a lattice ΛE ⊆ C. Denote by Ω̃+ and Ω̃− the smallest positive real period and the
smallest purely imaginary (with positive imaginary part) period of ΛE , respectively.
We know that ΛE is generated either by Ω̃+ and Ω̃− or contains the lattice generated
148
by these two periods with index 2 (see Chapter 2.8 in [Cre92]). Let
Ω̃±
if ΛE = hΩ̃+ , Ω̃− i
Ω± =
Ω̃± /2
otherwise.
Thus,
−
µ̃f {x → y}(Ue ) = κ+
f {x → y}(e) · Ω+ + κf {x → y}(e) · Ω− ,
where κ±
f {x → y} : E(T ) −→ Z. In order to account for the averaging of the cusps,
we multiply this quantity by q − 1. Choose a sign at infinity w∞ = ±1 and define
the system of integral distributions
∞
µf {x → y}(Ue ) = (q − 1)κw
f {x → y}(e).
The integrality of the distributions makes them p-adically bounded, so they are
actually p-adic measures on P1 (Qp ). The line integral from Section 4.3.1 provides us
with the same double integral we had in Section 4.3.3. This is, attached to f , from
τ1 to τ2 in Hp , and from x to y in P1 (Q), we have
Z
τ2
τ1
Z
y
ωf =
Z
log
P1 (Qp )
x
t − τ2
t − τ1
dµf {x → y}(t).
We also have the multiplicative counterpart, defined as
Z τ2 Z
×
τ1
x
y
Z
ωf = ×
P1 (Qp )
t − τ2
t − τ1
dµf {x → y}(t) = lim
Y tα − τ2 µf {x→y}(Uα )
α
tα − τ1
,
where the disjoint compact open sets Uα cover P1 (Qp ) and the limit is taken over
increasingly finer covers, with tα ∈ Uα is any sample point.
We also have an analogue of Lemma 4.8 in this scenario, exactly as expected.
149
Lemma 5.9. For all τ1 , τ2 , τ3 ∈ Hp , x, y, z ∈ P1 (Q) and γ ∈ Γ, the double integrals
satisfy
(1)
Z τ3 Z
×
τ1
y
Z τ2 Z
ωf = ×
τ1
x
y
Z τ3 Z
ωf ·×
τ2
x
y
ωf
and
Z
Z
τ3
τ1
x
y
ωf =
Z
Z
τ2
τ1
x
y
Z
ωf +
τ3
τ2
x
Z
y
ωf
x
(2)
Z τ2 Z
×
τ1
(3)
x
z
Z τ2 Z
ωf = ×
τ1
x
Z γτ2 Z
×
γτ1
y
γy
γx
Z τ2 Z
ωf ·×
τ1
z
ωf
τ2
τ1
y
Z τ2 Z
ωf = ×
τ1
and
Z
y
ωf
and
Z
Z
ωf =
γτ2
Z
Z
τ2
τ1
x
γτ1
x
z
Z
γy
ωf =
γx
x
Z
τ2
τ1
y
Z
ωf +
τ2
τ1
Z
Z
z
ωf
y
y
ωf
x
Proof. Properties (1) and (2) follow formally from the definitions. Property (3)
follows from Equation (5.1), as it implies
µf {γx → γy}(Uγe ) = µf {x → y}(Ue ),
whence (3) is clear.
Let Γ′ ⊂ Γ be the normal closure of the subset of Γ comprised of the all the
commutators and the elements of Γ whose fixed points belong to P1 (Q). Denote by
eΓ the exponent of the group Γ/Γ′ .
Proposition 5.10. The exponent eΓ is a divisor of q + 1.
Proof. Let
1 0
M =
,
q 1
150
which lies in Γ(q)[1/p], Γ and SL2 (Z[1/p]). Let H denote the normal closure of M in
SL2 (Z[1/p]). Theorem 1 in [Men67] establishes that H is precisely Γ(q)[1/p]. On the
other hand, we have that H is generated by the set {AM A−1 : A ∈ SL2 (Z[1/p])}.
Let r = A(0) ∈ P1 (Q). Note that
AM A−1 (r) = AM (0) = A(0) = r,
as M stabilizes 0. This implies that AM A−1 ∈ Γ′ , as it fixes the cusp r and AM A−1
belongs to Γ (it belonging to Γ(q)[1/p] ⊆ Γ). This, in turn, implies that H ⊆ Γ′ , as
the generating set of H is a subset of Γ′ .
Now, the matrix −I, where I is the identity, fixes all of the cusps and is in Γ, so
we actually have that ±Γ(q)[1/p] ⊆ Γ′ . From the multiplicativity of indices of nested
subgroups we obtain the relation
[Γ : Γ(q)[1/p]] = [Γ : Γ′ ] · [Γ′ : Γ(q)[1/p]].
From Lemma 5.2 we have
Γεns (q)[1/p]/Γ(q)[1/p] ∼
= Z/(q + 1)Z,
completing the proof.
Let q be Tate’s p-adic period attached to E. In Section 4.3.5 we constructed
cochains κτ , ρx,y , ρτ1 ,τ2 , κ#
τ and cτ . These were constructed in terms of the double
integrals attached to the modular forms with which we were dealing back then. The
same formal computations show that κτ is a cocycle and that, thanks to ρx,y and
Z
ρτ1 ,τ2 , it does not depend on the base point or on τ when seen in H 2 (Γ, C×
p /q ).
151
Choose a cusp x ∈ P1 (Q). Before, Γ = Γ0 (M )[1/p]. If Γ = Γεns (q)[1/p] or
Γ = Γ+
ns (q)[1/p], we obtain the same exact sequence
0
//
Z
C×
p /q
//
F
//
M0
//
0,
Z
where F is the group of C×
p /q -valued functions on Γx and M0 is the group arising
Z
from restriction on C×
p /q -valued modular symbols on Γx × Γx. This induces a long
exact sequence of cohomology groups which yields the connecting homomorphism
Z
δ : H 1 (Γ, M0 ) −→ H 2 (Γ, C×
p /q ).
Again, the same computations show that δ applied to cτ is κ#
τ . As a natural
analogue of Conjecture 4.12, we formulate the following conjecture.
Conjecture 5.11. The class of cτ is trivial in H 1 (Γ, M0 ).
As in Section 4.3.5, Conjecture 5.11 implies that cτ is a coboundary, so we obtain
a cochain η̃τ , unique up to multiplication by a 0-cocycle, such that
Z γτ Z
×
τ
y
z
ωf = cτ (γ){y → z} = η̃τ {γ −1 y → γ −1 z} ÷ η̃τ {y → z}.
Lemma 5.12. The map
h′ : MΓ0 −→
Z
Hom(Γ/Γ′ , C×
p /q )
m 7−→ (γΓ′ 7→ m{x → γx})
is a monomorphism.
Proof. See comments leading to Lemma 4.13.
152
(5.2)
Lemma 5.10 implies that eΓ is finite (and gives a bound for it). Since eΓ is the
exponent of Γ/Γ′ , Lemma 5.12 implies that any 0-cocyle is annihilated when raised
to the eΓ -th power. Then, the modular symbol η̃τeΓ , which does not depend on the
choice of η̃τ , yields a unique modular symbol, which we denote, momentarily, ητ ,
such that
Z γτ Z
×
τ
z
ωf
y
eΓ
= ητ {γ −1 y → γ −1 z} ÷ ητ {y → z}.
It is customary to use the notation
Z τZ
×
z
eΓ ωf = ητ {y → z}.
y
As in Section 4.3.5, we summarize all this in a conjecture. (Not a theorem, as it
relies on Conjecture 5.11.)
Conjecture 5.13. There exists a unique function
Hp (Cp ) × Γx × Γx −→
7−→
(τ, r, s)
Z
C×
p /q
Z τZ s
nωf ,
×
r
such that, for all τ1 , τ2 ∈ Hp (Cp ), r, s, t ∈ Γx and γ ∈ Γ, we have
(1)
Z τZ t
Z τZ t
Z τZ s
eΓ ωf
eΓ ωf = ×
eΓ ωf × ×
×
(2)
Z τ2 Z s eΓ
Z τ1 Z s
Z τ2 Z s
ωf
eΓ ωf = ×
eΓ ωf ÷ ×
×
τ1
r
r
(3)
r
s
r
Z γτ Z
×
γs
γr
Z τZ s
eΓ ωf
eΓ ωf = ×
r
153
r
5.4
The Stark-Heegner point
Let OD ⊆ K be the order in K of discriminant D, with conductor relatively
√
prime to N. Let ωD = (D + D)/2, so OD = Z[ωD ].
Recall that in Section 4.4.5 we showed that if the discriminant of OD was relatively prime to pq and q was inert in K we had an embedding
ε
ε
OD ֒−→ Mns
(q) ⊆ Mns
(q)[1/p],
ε
so regard OD as a subring of Mns
(q)[1/p].
a b
Let τ be a fixed point of ωD under the Möbius action, this is, if ωD =
,
c d
τ=
aτ + b
cτ + d
or
cτ 2 + (d − a)τ − b = 0.
τ
This is equivalent to saying that the column vector ~τ = is an eigenvector of
1
ωD . Thus, if r + sωD ∈ OD , we find that
τ
τ
(r + sωD ) = (r + sω) ,
1
1
where ω is the eigenvalue of ωD associated to ~τ , and r and s are, by abuse of notation,
scalar matrices on the LHS and integers on the RHS. Hence, τ is a fixed point of
every element in OD .
154
Note that the discriminant of the quadratic equation defining τ is
(d − a)2 + 4bc = (d + a)2 − 4(ad − bc) = (ωD + ω̄D )2 − 4ωD ω̄D
√
√ !2
√ !
√ !
D+ D D− D
D+ D
D− D
=
−4
+
2
2
2
2
= D2 − (D2 − D) = D,
√
√
so τ ∈ K. Furthermore, since K = Q( D) and p is inert in K, D and, a posteriori
τ , lie in Qp2 − Qp , where Qp2 is the unramified quadratic extension of Qp . Combining
these two, we find that τ ∈ Hp ∩ K.
×
)1 , which is a rank one abelian group, contained
Let γτ be a generator of (OD
ε
ε
in Mns
(q)[1/p]×
1 = Γns (q)[1/p]. If g is invariant under the Atkin-Lehner involution at
+
q, the embedding is in Mns
(q)[1/p] and we regard γτ as an element of Γ+
ns (q)[1/p].
Denote by Γ the group Γεns (q)[1/p] or Γ+
ns (q)[1/p] accordingly.
Definition 5.14. Let τ ∈ Hp ∩ K as above and let x ∈ P1 (Q) be a cusp. The
Stark-Heegner point associated to τ at the cusp x is the point Pτ,x ∈ E(Cp ) given
by the Tate’s uniformization corresponding to
Z τZ
×
r
γτ r
Z
eΓ ωf ∈ C×
p /q
for any r ∈ Γx.
155
Note that this point is independent of the choice of cusp in Γx, as choosing two
cusps r, s ∈ Γx yields
Z τZ
×
γτ r
r
Z τZ
eΓ ωf ÷ ×
γτ s
s
Z τZ
Z τZ s
eΓ ωf ÷ ×
eΓ ωf = ×
r
γτ s
eΓ ωf
γτ r
Z γτ−1 τ Z s
Z τZ s
eΓ ωf = 1,
eΓ ωf ÷ ×
=×
r
r
where the last equality comes from the fact that γτ stabilizes τ and the properties
of Conjecture 5.13.
We further formulate the following conjecture regarding the algebraicity of the
resulting points.
Conjecture 5.15. Let τ ∈ Hp ∩ K as above and let x ∈ P1 (Q) be a cusp. Let H +
be the narrow ring class field attached to OD . Then the point Pτ,x lies in E(H + ).
5.5
Setup for computations
Now that we defined the points, we would like to know how to actually compute
them. [DP06] shows how to effectively compute Stark-Heegner points in the case
of prime level. The techniques explained in Chapter 2 show how to compute the
p-adic multiplicative double integrals to high accuracy in a general setup, but the
techniques mentioned in Chapter 1 only allow for computation of the semi-indefinite
integrals in the case of prime level. In [GM15], there is a method that works for
composite levels, which again boils down to the effective computation of the double
integrals to high accuracy.
5.5.1
The case q = 3.
Throughout this subsection, we will assume that the newform g is invariant
under the usual Atkin-Lehner involution at q, so we will be working with the group
156
Γ = Γ+
ns (3)[1/p]. By Corollary 5.6, all cusps are equivalent under the action of Γ
(even without inverting p). This means that every cusp is in Γ∞ and we can focus
on only one indefinite integral, namely
Z τZ
×
γτ ∞
eΓ ωf .
(5.3)
∞
Utilizing the continued fraction expansion of the rational number γτ ∞, we obtain a
(finite) sequence {pj /qj }kj=−1 of rational numbers. The matrices
j−1
(−1) pj pj−1
Mj =
(−1)j−1 qj qj−1
lie in SL2 (Z). The group Γ+
ns (3) has index 3 in SL2 (Z), so it is the disjoint union of
its left-cosets
1 0
1 0 [ +
1 0 [ +
Γ
(3)
Γ
(3)
SL2 (Z) = Γ+
(3)
,
ns
ns
ns
2 1
1 1
0 1
and we can write Mj = γj rj , where γj ∈ Γ+
ns (3) and
1 0
1 0 1 0
rj ∈
.
,
,
0 1
2 1
1 1
By properties (1) and (3) in Conjecture 5.13, we have
Z τZ
×
γτ ∞
∞
k Z τ Z
Y
eγ ωf =
×
pj−1 /qj−1
j=0
=
k Z τ
Y
j=0
pj /qj
×
Z
γj rj (∞)
γj rj (0)
k Z τ Z
Y
eΓ ωf =
×
j=0
Mj (∞)
eΓ ωf
Mj (0)
k Z γj −1 τ Z
Y
eΓ ωf =
×
157
j=0
rj (∞)
rj (0)
eΓ ωf ,
(5.4)
so we reduce the computation of (5.3) to that of computing
Z τ′ Z
×
∞
eΓ ωf ,
Z τ′ Z
×
1
eΓ ωf ,
1/2
eΓ ωf ,
(5.5)
0
0
0
Z τ′ Z
×
for some τ ′ ∈ Hp . Note that in order to apply property (1), it is crucial that the
cusps pj−1 /qj−1 and pj /qj be equivalent, which is guaranteed by the assumption q = 3
together with g having eigenvalue 1 at 3.
Consider the matrices
0 −1
W0,∞ =
,
1 0
W0,1
−1 1
=
,
−2 1
W0,1/2
which lie in Γ+
ns (3) and have the property that
Wr,s (r) = s
and
−2 1
=
,
−5 2
Wr,s (s) = r,
so, using the properties spelled out in Conjecture 5.13, we have the equality
!2 Z ′ Z
−1
Z τ ′ Z Wr,s
Z τ′ Z s
Z τ′ Z r
Z τ′ Z s
(s)
τ
s
eΓ ωf
eΓ ωf ÷ ×
eΓ ωf = ×
eΓ ωf
eΓ ωf ÷ ×
×
=×
r
−1
Wr,s
(r)
r
s
r
Z τ′
Z Wr,s τ ′ Z s
Z τ′ Z s
eΓ ωf = ×
eΓ ωf ÷ ×
=×
r
r
Wr,s τ ′
Z
r
s
ωf
!eΓ
Finally, in order to compute double integrals of the form
Z τ2 Z s
,
×
τ1
r
we just need to follow what was explained at the end of Section 4.3.3.
158
.
(5.6)
5.5.2
More general values of q.
There are two obstructions in generalizing the procedure described in the previous subsection to larger values of q. On one hand, as mentioned right after Equation
(5.5), we require the cusps pj−1 /qj−1 and pj /qj to be equivalent for every j. This
obstruction can be salvaged by imposing a nice relationship between q and p. For
example, Proposition 5.6 says that when p2 generates all of the squares modulo q,
all the cusps are equivalent under the action of Γ+
ns (q)[1/p].
On the other hand, we exhibited coset representatives rj for which we found
matrices Wrj (0),rj (∞) ∈ Γ+
ns (q)[1/p] with the property that
Wrj (0),rj (∞) (rj (0)) = rj (∞)
and
Wrj (0),rj (∞) (rj (∞)) = rj (0).
A natural question to ask at this point is, when do these elements exist?
Proposition 5.16. Let
r∞ r0
A=
∈ SL2 (Z).
s∞ s0
Denote by r and s the cusps A(∞) and A(0), respectively. Then, there exists a matrix
1
Wr,s ∈ Γ+
ns (q)[1/p] whose action on P (Q) transposes r and s if and only if one of
the following three conditions is met:
(i)
s20 − εr02
− 2
∈ hp2 i
2
s∞ − εr∞
(ii)
q | r∞ r 0 s∞ s0 ,
q | (r∞ − s0 )(s∞ − r0 ) and
159
s20 − εr02
∈ hp2 i
2
s2∞ − εr∞
(iii)
q ∤ r∞ r 0 s∞ s0 ,
−
r 0 s0
∈ hp2 i and
r ∞ s∞
ε≡±
s∞ s0
(mod q).
r∞ r0
Proof. Note that if W = Wr,s exists, then we have
W A (0) = A(∞) and
W A (∞) = A(0)
which is equivalent to
A−1 W A (0) = ∞ and
A−1 W A (∞) = 0,
so the matrix A−1 W A swaps 0 and ∞. The only matrices which swap 0 and ∞ have
the form
0 ∗
,
∗ 0
and if they lie in SL2 (Z[1/p]), then we have the equality
α
±p
0
A−1 W A =
∓p−α
0
for some α ∈ Z. Solving for W , we obtain
2 α
±r02 p−α ± r∞
p
±pα −1 ∓r0 s0 p−α ∓ r∞ s∞ pα
0
W = A
A
=
.
2 −α
2 α
−α
α
−α
∓s0 p ∓ s∞ p
±r0 s0 p ± r∞ s∞ p
∓p
0
Note that the trace of W is 0. If the diagonal entries are not divisible by q, then
+
ε
W ∈ Γ+
ns (q)[1/p] if and only if W ∈ Γns (q)[1/p] − Γns (q)[1/p], as q is odd. Thus, we
160
have that W ∈ Γ+
ns (q)[1/p] if and only if
2 α
±s20 p−α ± s2∞ pα ≡ (±r02 p−α ± r∞
p )ε
2 2α
p )ε
s20 + s2∞ p2α ≡ (r02 + r∞
−
s20 − εr02
≡ p2α
2
s2∞ − εr∞
(mod q)
(mod q)
(mod q),
whence (i).
The diagonal entries are divisible by q if and only if
r0 s0 ≡ −r∞ s∞ p2α
(mod q).
(5.7)
If this is the case, then W ∈ Γ+
ns (q)[1/p] if and only if
2 α
±(±s20 p−α ± s2∞ pα ) ≡ (±r02 p−α ± r∞
p )ε
2 2α
±(s20 + s2∞ p2α ) ≡ (r02 + r∞
p )ε
(mod q)
(mod q).
(5.8)
Now, we have two cases, depending on whether or not q divides r∞ r0 s∞ s0 . If
q | r∞ r0 s∞ s0 , then Equation (5.7) and r∞ s0 − r0 s∞ = 1 imply q | (r∞ − s0 )(s∞ − r0 ).
If r∞ ≡ s0 ≡ 0 (mod q), Equation (5.8) transforms into
±s2∞ p2α ≡ r02 ε
±
εr02
≡ p2α
s2∞
(mod q)
(mod q).
The RHS is a square, so ±ε has to be a square as well which is only possible if we
have the negative sign, and the condition becomes equivalent to
s20 − εr02
≡ p2α
2
2
s∞ − εr∞
161
(mod q).
If r0 ≡ s∞ ≡ 0 (mod q), Equation (5.8) transforms into
±s20 ≡ r∞ p2α ε
±
s20
≡ p2α
2
εr∞
(mod q)
(mod q).
Like before, we must choose the negative sign and again the condition translates into
s20 − εr02
≡ p2α
2
s2∞ − εr∞
(mod q),
whence (ii).
Finally, if q ∤ r∞ r0 s∞ s0 , from Equation (5.7) we obtain
−
r0 s0
≡ p2α
r∞ s∞
(mod q).
Then, Equation (5.8) transforms into
±(s20 − r0 s0 s∞ /r∞ ) ≡ (r02 − r∞ r0 s0 /s∞ )ε
(mod q).
Clearing denominators and factoring we obtain
±(r∞ s0 − r0 s∞ )s0 s∞ ≡ −(r∞ s0 − r0 s∞ )r∞ r0 ε
whence
ε≡±
s∞ s0
r∞ r0
which is precisely (iii).
162
(mod q),
(mod q),
Corollary 5.17. Suppose that hp2 i = ((Z/qZ)× )2 . Let
r∞ r0
A=
∈ SL2 (Z).
s∞ s0
Denote by r and s the cusps A(∞) and A(0), respectively. Then, there exists a matrix
1
Wr,s ∈ Γ+
ns (q)[1/p] whose action on P (Q) transposes r and s if and only if one of
the following three conditions is met:
(i)
ε − (s∞ s0 − εr∞ r0 )2
q
=1
(ii)
q | r∞ r0 s∞ s0 ,
q | (r∞ − s0 )(s∞ − r0 ) and
(iii)
q ∤ r ∞ r 0 s∞ s0 ,
−r∞ r0 s∞ s0
q
= 1 and
(s∞ s0 − εr∞ r0 )2 − ε
q
ε≡±
s∞ s0
(mod q).
r∞ r0
Proof. Modulo squares, the elements
s20 − εr02
2
s2∞ − εr∞
and
2
(s2∞ − εr∞
)(s20 − εr02 )
163
=1
are the same. Notice that
2 2
2 2
2
s0 + ε2 r∞
r0
(s2∞ − εr∞
)(s20 − εr02 ) = s2∞ s20 − εr02 s2∞ − εr∞
2 2
2 2
= s2∞ s20 + ε2 r∞
r0 − ε(r∞
s0 + r02 s2∞ )
2 2
2 2
= (s2∞ s20 + 2s∞ s0 εr∞ r0 + ε2 r∞
r0 ) − ε(r∞
s0 − 2r∞ s0 r0 s∞ + r02 s2∞ )
= (s∞ s0 + εr∞ r0 )2 − ε(r∞ s0 − r0 s∞ )2
= (s∞ s0 + εr∞ r0 )2 − ε.
Likewise, modulo squares the elements
−
r 0 s0
r ∞ s∞
and
− r ∞ r 0 s∞ s0
are the same. The corollary follows directly from Proposition 5.16.
According to Corollary 5.17, these matrices do not exist very often, as the conditions are very restrictive. Heuristically, half of the elements are squares, and (i)
covers the majority of the cases, indicating that we expect roughly a proportion of
just over 1/2 of representatives to yield elements in Γ+
ns (q)[1/p] transposing the two
cusps attached to them (the representatives).
In order to avoid this conundrum, we adapt an algorithm presented in [GM15].
Furthermore, this approach will allow us to dispose of the requirement that g be an
eigenform with eigenvalue 1 for the Atkin-Lerner involution at q. Let us introduce
some notation.
Let F be a number field and S a finite set of places of F , containing the archimedian ones. Let OS denote the S-integers of F (the elements of F with non-negative
valuation for every place not in S).
164
For an ideal N in OS let
1 ∗
Γ1 (N ) = γ ∈ SL2 (OS ) : γ ≡
0 1
(mod N ) .
Lemma 5.18. Let γ = ( ac db ) ∈ Γ1 (N ). Suppose that c = u+ta for some unit u ∈ OS×
and some t ∈ OS . There exists x ∈ OS such that
−1
1
0 1 x
1
0 1 −u
γ=
.
0 1
u(1 − a) 1
0
1
c + t(1 − a) 1
Proof. See Lemma 2.1 in [GM15], or with slightly different language, Lemma
2.2(b) in [BMS67].
Notice that since a − 1, c ∈ N , the product in Lemma 5.18 consists of matrices
in Γ1 (N ) that stabilize either 0 or ∞ when γ acts on P1 (Q). These matrices are
precisely the elementary matrices of determinant 1. The following theorem allows us
to remove the restriction imposed over a and c.
Theorem 5.19. Let γ = ( ac db ) ∈ Γ1 (N ). Assuming GRH, the following algorithm
terminates and computes an expression of γ as a product of elementary matrices of
determinant 1 in Γ1 (N ).
1. Iterate over the elements λ in the ring of integers of F to find λ such that
a′ = a + λc generates a prime ideal and the reduction map
OS× −→ (OS /a′ OS )×
is surjective.
2. Set γ ′ = ( 10 λ1 ) γ and let γ ′ =
a′ b′
c′ d′
.
165
3. Iterate over the elements u ∈ OS× until finding
c′ ≡ u
(mod a′ ).
4. Use Lemma 5.18 to find an expression of γ ′ as a product of elementary matrices.
Proof. See Theorem 2.3 in [GM15].
Corollary 5.20. Assume GRH. Every matrix in Γ1 (q 2 )[1/p] can be expressed as a
product of at most five elementary matrices in SL2 (Z[1/p]).
Proof. It follows directly from Theorem 5.19 applied to F = Q, S = {p, ∞} and
N = (q 2 ).
We are interested in adapting Corollary 5.20 to find factorizations of a similar
nature to matrices in Γεns (q)[1/p]. We need a couple of lemmas for that.
Lemma 5.21. Let
a b
2
†
Γ (q)[1/p] =
∈
Γ(q)[1/p]
:
a
≡
1
(mod
q
)
c d
q 0
and let Aq =
. The inner automorphism of GL2 (Z[1/p]) induced by Aq
0 1
induces an isomorphism between Γ1 (q 2 )[1/p] and Γ† (q)[1/p].
Proof. If
a b
A=
c d
then
a bq
Aq AA−1
.
q =
c/q d
166
Hence
A ∈ Γ1 (q 2 )[1/p] ⇐⇒ c, a − 1, d − 1 ∈ q 2 Z[1/p] and b ∈ Z[1/p]
⇐⇒ bq, c/q ∈ qZ[1/p] and a − 1, d − 1 ∈ q 2 Z[1/p]
†
⇐⇒ Aq AA−1
q ∈ Γ (q)[1/p],
showing the desired isomorphism.
†
For a matrix A ∈ Γ1 (q 2 ), denote by à = Aq AA−1
q its counterpart in Γ (q)[1/p].
Proposition 5.22. Let à ∈ Γ† (q)[1/p]. Then there exists a factorization
à = Ũ1 L̃1 Ũ2 L̃2 Ũ3 ,
where
1 x i
Ũi =
0 1
and
with xi , yi ∈ qZ[1/p].
1 0
L̃i =
,
yi 1
Proof. By Corollary 5.20, there exist elementary matrices of determinant 1 Ui
and Li in Γ1 (q 2 )[1/p], with Ui upper triangular and Li lower triangular, such that
A = U1 L1 U2 L2 U3 .
By Lemma 5.21, after conjugating by Aq , we obtain
à = Aq U1 L1 U2 L2 U3 A−1
q = Ũ1 L̃1 Ũ2 L̃2 Ũ3 .
Since Ui is upper triangular, so is Ũi and it lies in Γ(q)[1/p]. Since Li is lower triangular and lies in Γ1 (q 2 )[1/p], L̃i is lower triangular and lies in Γ(q)[1/p].
167
By definition, Γ(q 2 )[1/p] ⊆ Γ† (q)[1/p] ⊆ Γ(q)[1/p]. The reduction map
SL2 (Z[1/p]) −→ SL2 (Z/nZ)
(where gcd(n, p) = 1) is surjective (as the map with source SL2 (Z) is) and has as
kernel Γ(n)[1/p], so the index
[SL2 (Z[1/p]) : Γ(n)[1/p]] = #(SL2 (Z/nZ)) = n
3
Y
ℓ|n
1
1− 2
ℓ
,
(5.9)
where the product is taken over the prime divisors ℓ of n. Substituting n = q and
n = q 2 in Equation (5.9), we obtain
[Γ(q)[1/p] : Γ(q 2 )[1/p]] =
q 6 (1 − 1/q 2 )
= q3.
q 3 (1 − 1/q 2 )
Γ† (q)[1/p], lying strictly between these two groups, must have index q or q 2 .
Lemma 5.23. Γ† (q)[1/p] E Γ(q)[1/p]. Moreover,
Γ(q)[1/p]/Γ† (q)[1/p] ∼
= Z/qZ.
Proof. Consider the map
From
πa : Γ(q)[1/p] −→ (Z/q 2 Z)×
a b
7−→ a.
c d
a b A B aA + bC aB + bD
,
=
cA + dC cB + dD
C D
c d
168
when b ≡ C ≡ 0 (mod q), aA + bC ≡ aA (mod q 2 ). It follows that if γ1 , γ2 ∈
Γ(q)[1/p], then
πa (γ1 )πa (γ2 ) = πa (γ1 γ2 ),
showing πa is a homormorphism. By definition, ker(πa ) = Γ† (q)[1/p], showing it is a
normal subgroup. The image of πa is the subset of residues modulo q 2 which are 1
modulo q, which has q elements. From the first isomorphism theorem and the fact
that the only group of order q is cyclic, the isomorphism sought holds.
Lemma 5.24. For any matrix γ ∈ Γ(q)[1/p], γ q ∈ Γ† (q)[1/p].
Proof. This is a trivial consequence of Lemma 5.23.
Lemma 5.25. There exists M ∈ SL2 (Z) such that M (0) and M (∞) are equivalent
under the action of Γ.
Proof. If ∞ and 0 are already equivalent cusps (i.e., if −ε ∈ hp2 i) we can just
let M be the identity.
By Proposition 5.4, if
it suffices to show that
a b
M =
∈ SL2 (Z),
c d
a2 − c2 ε−1 ≡ b2 − d2 ε−1
(mod q)
in order to show that M (∞) = a/c and M (0) = b/d are equivalent under the action
of Γ. Let us show how to construct such M according to the following two cases:
- Case 1: q ≡ ±1 (mod 8).
169
In this case, 2 is a quadratic residue, so let ρ be a residue modulo q such that
ρ2 ≡ 2−1 (mod q). Let M ∈ SL2 (Z) be such that
a b ρ −ρ
M =
(mod q),
≡
ρ ρ
c d
which exists because the determinant is ρ2 + ρ2 ≡ 2ρ2 ≡ 1 (mod q). In this
case,
a2 − c2 ε−1 ≡ ρ2 (1 − ε−1 ) ≡ b2 − d2 ε−1
(mod q),
showing that M has the desired property.
- Case 2: q ≡ ±3 (mod 8).
Now, 2 is a quadratic nonresidue, so 2ε is a quadratic residue. Let ρ be a
residue modulo q such that ρ2 ≡ (2ε)−1 (mod q). Let M ∈ SL2 (Z) be such
that
ρ
a b ρ
M =
,
≡
−ρε ρε
c d
which exists because the determinant is ρ2 ε + ρ2 ε ≡ 2ερ2 ≡ 1 (mod q). In this
case,
a2 − c2 ε−1 ≡ ρ2 (1 − ε) ≡ b2 − d2 ε−1
(mod q),
showing again that M has the desired property.
Now we have everything we need at hand. Lemma 5.25 provides us with a
matrix M in SL2 (Z) such that M (0) and M (∞) are equivalent under the action of
Γεns (q)[1/p]. Let υ = M (∞) and let Pτ,υ be the Stark-Heegner point associated to τ
170
at υ. In order to compute this point, we need to compute the semi-indefinite integral
Z τZ
Jτ = ×
γτ υ
eΓ ωf .
υ
Proposition 5.26. There exists a divisor d of q(q + 1) such that Jτd can be expressed
as a product of double-integrals.
Proof. Note that for any positive integer k we have
Jτk
Z τ Z
= ×
γτ υ
eΓ ωf
υ
k
=
k−1
YZ τ
×
j=0
Z
γτj+1 υ
γτj υ
Z τZ
eΓ ωf = ×
γτk υ
eΓ ωf ,
υ
where the second equality comes from the independence of base-point in the orbit
of υ and the third equality from the properties of the semi-indefinite integral. Since
−I ∈ Γεns (q)[1/p], we also have
Z τZ
×
υ
−γτk υ
Z −Iτ Z
eΓ ωf = ×
−Iγτk υ
−Iυ
Z τZ
eΓ ωf = ×
γτk υ
eΓ ωf ,
υ
as −I acts trivially on Hp and on P1 (Q).
By Lemma 5.2, since γτ ∈ Γεns (q)[1/p], making k = q + 1 above, makes γτk be
in Γ(q)[1/p], which is normal in SL2 (Z[1/p]) meaning that M −1 γτq+1 M is also in
Γ(q)[1/p]. By Lemma 5.24, if we further raise M −1 γτq+1 M to the q-th power, we find
q(q+1)
that M −1 γτ
M ∈ Γ† (q)[1/p]. Let d be the smallest positive integer such that
either M −1 γτd M or −M −1 γτd M is in Γ† (q)[1/p]. It is clear that d | q(q + 1). Let
γ = ±M −1 γτd M ∈ Γ† (q)[1/p].
Proposition 5.22 yields matrices Ui and Li in Γ(q)[1/p] such that
γ = Ũ1 L̃1 Ũ2 L̃2 Ũ3 ,
171
where
1 x i
Ui =
0 1
and
1 0
Li =
.
yi 1
For a matrix A, denote by Ā = M AM −1 . Notice that if A ∈ Γ(q)[1/p], then Ā ∈
Γ(q)[1/p], as it is normal. Thus, we obtain
±γτd = γ̄ = Ū1 L̄1 Ū2 L̄2 Ū3 ,
where Ūi , L̄i ∈ Γ(q)[1/p], Ūi M (∞) = M Ũi M −1 M (∞) = M Ũi (∞) = M (∞) and
L̄i M (0) = M L̃i M −1 M (0) = M L̃i (0) = M (0). This is, Ūi and L̄i are matrices in
Γεns (q)[1/p] which fix υ = M (∞) and υ ′ = M (0), respectively.
To alleviate reading, denote by
γ2 = L̃1 Ũ2 L̃2 Ũ3 ,
τ1 = Ū1−1 τ,
γ3 = Ũ2 L̃2 Ũ3 ,
τ2 = L̄−1
1 τ1 ,
γ4 = L̃2 Ũ3 ,
τ3 = Ū2−1 τ2 ,
172
τ4 = L̄−1
2 τ3 .
Using the properties of the semi-indefinite integral, following Equations (3.4) and
(3.5) in [GM15], we have
Z τZ
×
γ̄υ
υ
Z Ū1 τ1 Z
eΓ ωf = ×
Z τ1 Z
=×
Ū1 γ2 υ
Ū1 υ
υ′
Z τ1 Z
eΓ ωf = ×
Z τ1 Z
eΓ ωf × ×
γ2 υ
eΓ ωf
υ
γ2 υ
eΓ ωf
υ′
υ
Z τ1 Z υ
Z L̄1 τ2 Z L̄1 γ3 υ
eΓ ωf ÷ ×
eΓ ωf
=×
L̄1 υ ′
υ′
Z τ2 Z γ3 υ
Z τ1 Z υ
=×
eΓ ωf ÷ ×
eΓ ωf
υ′
υ′
Z τ1 Z υ
Z τ2 Z υ
Z τ2 Z γ3 υ
eΓ ωf
eΓ ωf ÷ ×
=×
eΓ ωf × ×
υ′
υ′
υ
Z τ2 Z υ eΓ Z τ2 Z γ3 υ
eΓ ωf
××
ωf
= ×
υ
υ′
τ1
and exacly the same computation yields
Z τ2 Z
×
υ
γ3 υ
Z Ū2 τ3 Z
eΓ ωf = ×
Z τ3 Z
=×
Ū3 υ
υ′
υ
Z L̄2 τ4 Z
=×
Z τ4 Z
=×
Ū3 γ4 υ
Z τ3 Z
eΓ ωf = ×
Z τ3 Z
eΓ ωf × ×
γ4 υ
eΓ ωf
υ
γ4 υ
eΓ ωf
υ′
L̄2 Ū3 υ
L̄2 υ ′
Ū3 υ
υ′
τ4 Z υ
Z τ3 Z
eΓ ωf ÷ ×
Z τ3 Z
eΓ ωf ÷ ×
υ
eΓ ωf
υ′
υ
eΓ ωf
υ′
Ū3 υ
Z τ3 Z υ
Z
Z τ4 Z
eΓ ωf ÷ ×
eΓ ωf
=×
eΓ ωf × ×
υ′
υ′
υ
Z τ4 Z υ eΓ
Z τ4 Z υ eΓ Z τ4 Z Ū3 υ
.
ωf
eΓ ωf = ×
××
ωf
= ×
τ3
Ū3 υ
υ′
173
τ3
υ′
Putting both equations together, we obtain
Jτd
Z τZ
=×
υ
γ̄υ
Z τ2 Z
eΓ ωf = ×
τ1
υ
υ′
Z τ4 Z
ωf × ×
τ3
υ
υ′
ωf
eΓ
.
(5.10)
Note the importance of choosing υ the way we chose it in Proposition 5.26, as
the semi-indefinite integrals were split in such a way that we needed to evaluate them
at the pair of cusps υ and υ ′ , which are equivalent under the action of Γεns (q)[1/p] precisely by our choice of υ. Also, the factorization is heavily exploited, as the elements
lie in the Cartan Non-split and they fix the cusps υ and υ ′ .
If there is a nice relationship between p and q, we can reduce even further the
exponent d in Proposition 5.26 as follows.
Corollary 5.27. Assume that pq−1 6≡ 1 (mod q 2 ) and that −ε ∈ hp2 i. There exists
a divisor d of q + 1 such that Jτd can be expressed as a product of double-integrals.
Proof. Let Let h′ be the order of p modulo q 2 . By Euler’s Theorem, h′ divides
q(q − 1). By our assumption on p, h′ does not divide q − 1, implying h′ = qh for
some h.
′
Note that ph ≡ 1 (mod q), as ph ≡ (ph )q ≡ ph ≡ 1 (mod q). This implies that
for k = 0, 1, . . . , q − 1, the elements phk are all distinct modulo q 2 (the order is hq)
and they are all congruent to 1 modulo q.
Let d be the smallest positive integer such that either γτd or −γτd lie in Γ(q)[1/p],
and denote this matrix by γ = ( ac db ). By Lemma 5.2, γτq+1 ∈ Γ(q)[1/p], so d is a
174
divisor of q + 1. Let α be an integer such that p−hα ≡ a (mod q 2 ). Then
hα
hα
0
bphα
p
ap
γ
=
0 p−hα
cp−hα dp−hα
lies in Γ† (q)[1/p]. From here, we can continue as in Proposition 5.22, since the matrix
we used to tweak γ fixes the equivalent cusps 0 and ∞.
175
Chapter 6
Further directions
The nature of this work is conjectural so a natural step from here is to perform
computations corroborating the veracity of Conjecture 5.15, which would provide a
supply of algebraic points on elliptic curves with additive reduction (of conductor
pq 2 ) over ray class fields attached to real quadratic fields. We append a few partially
computed examples in such a way that the only remaining task would be to compute the multiplicative double integrals. The code used to find the factorizations in
Theorem 5.19 was written by Marc Masdeu and Xevi Guitart.
1. Let D = 17 and E = 99a1, with conductor 99 = 11 · 32 , so p = 11 and q = 3,
√
which are inert in K = Q( 17). The eigenvalue of the newform corresponding
to E at 3 is 1, and the only possible choice for ε is −1. The generator for O17
is ω17 =
√
17+ 17
,
2
so in order to find an embedding we need a matrix with deter-
minant 68 and trace 17, which lies in the Cartan. We can take the embedding
given by
ω17
10 −1
7−→
.
−2 7
176
The fundamental unit is given by 4 +
√
×
33 + 8 17, generates (O17
)1 .
√
√
33 + 8 17 = −103 + 16ω17
17, which has norm −1. Its square,
57 −16
−→ γτ =
,
−32 9
whose fixed point is a root of √
the polynomial −32τ 2 − 48τ + 16 = 0, or,
−3 + 17
2τ 2 + 3τ − 1 = 0. Let τ =
be one such root.
4
We need to compute the integral
Z τZ
×
γτ ∞
∞
Z τZ
eΓ ωf = ×
−57/32
eΓ ωf .
∞
The continued fraction expansion yields convergents −2/1, −7/4, −9/5, −16/9
and −57/32, so we need to compute the product of the integrals
Z τZ
×
−2
∞
Z τZ
eΓ ωf , ×
−7/4
−2
Z τZ
×
−16/9
−9/5
Z τZ
eΓ ωf , ×
−9/5
eΓ ωf ,
−7/4
Z τZ
eΓ ωf , ×
−57/32
eΓ ωf .
−16/9
These convergents give us the matrices
9 −7
−7 −2
2 1
M0 =
,
, M2 =
, M1 =
−5 4
4
1
−1 0
57 −16
−16 −9
M3 =
,
, M4 =
−32 9
9
5
177
which yield representatives
1 0
r0 = r2 =
2 1
and
and
23 −7
−7 −2
0 1
γ0 =
,
, γ2 =
, γ1 =
−13 4
4
1
−1 0
57 −16
−16 −9
γ3 =
.
, γ4 =
−32 9
9
5
We let
τ0 =
1 0
r1 = r3 = r4 =
0 1
γ0−1 τ
√
√
√
−3 − 17
3 − 17
9 − 17
−1
−1
=
, τ1 = γ1 τ =
, τ2 = γ2 τ =
,
2
4
16
√
√
3 − 17
−3 + 17
−1
−1
τ3 = γ3 τ =
, τ4 = γ4 τ =
,
2
4
and
τ0′
√
√
√
3 + 17 ′
9 + 17
9 + 17 ′
, τ1 = W0,∞ τ1 =
, τ2 = W0,1/2 τ2 =
,
= W0,1/2 τ0 =
32
2
16
√
√
3 + 17 ′
−3 − 17
′
τ3 = W0,∞ τ3 =
, τ4 = W0,∞ τ4 =
.
4
2
According to (5.4), we now need to compute the product of
Z τ0 Z
×
0
1/2
Z τ1 Z
eΓ ωf , ×
0
Z τ3 Z
×
0
∞
∞
Z τ2 Z
eΓ ωf , ×
0
Z τ4 Z
eΓ ωf , ×
0
178
1/2
∞
eΓ ωf .
eΓ ωf ,
Putting this together with (5.6), we obtain
Z τZ
×
−57/32
eΓ ωf
∞
!2
Z τ0 Z
= ×
τ0′
∞
Z τ1 Z
ωf × ×
τ1′
0
Z τ3 Z
×
τ3′
1/2
0
Z τ4 Z
ωf × ×
τ4′
0
∞
0
∞
ωf
Z τ2 Z
ωf × ×
!eΓ
τ2′
1/2
ωf
0
.
2. Let us use exactly the same setting as before, but let us apply the algorithm
suggested for general values of q. The cusps 0 and ∞ are equivalent, so we take
M to be the identity matrix. From the computations above, we have
57 −16
γτ =
,
−32 9
and we can see that
−3761 1056
†
γ = −γτ2 =
∈ Γ(3)[1/11] and Γ (3)[1/11].
2112 −593
Proposition 5.22 yields the factorization
γ = Ū1 L̄1 Ū2 L̄2 Ū3 ,
where
0
1
L̄1 =
,
4902/11 1
1 −24/11
Ū1 =
,
0
1
L̄2 =
1
−282 · 113
0
,
1
179
3
1 −3/11
Ū2 =
,
0
1
5
1 −45219/11
Ū3
.
0
1
Let
τ1 =
τ3 =
Ū1−1 τ
Ū2−1 τ2
√
17
63
+
,
=
44
4
τ2 =
√
−25690863 + 14641 17
=
,
11411473384
√
664029
17
−1
τ4 = L̄2 τ3 = −
+
.
664204
4
L̄−1
1 τ1
√
39881499 + 14641 17
=
,
11411473384
Equation (5.10) yields
Z τZ
×
−57/32
∞
eΓ ωf
!2
Z τ2 Z
= ×
τ1
0
∞
Z τ4 Z
ωf × ×
τ3
∞
ωf
0
eΓ
.
3. Let D = 5 and E = 147c1, with conductor 147 = 3 · 72 , so p = 3 and q = 7,
√
√
which are inert in K = Q( 5). Let ε = −1. The generator for O5 is ω5 = 5+2 5 ,
so in order to find an embedding we need a matrix with determinant 5 and
trace 5, which lies in the Cartan. We can take the embedding given by
−15 −61
ω5 7−→
.
5
20
The fundamental unit is given by
√
1+ 5
,
2
which has norm −1. Its square,
√
3+ 5
,
2
generates (O5× )1 .
1+ 5
−16 −61
= −1 + ω5 −→ γτ =
,
2
5
19
√
whose fixed point is a root of the polynomial 5τ 2 +35τ +61 = 0. Let τ =
√
−35+ 17
10
be one such root. The cusps 0 and ∞ are equivalent, so we take M to be the
180
identity matrix. Let
344 1281
†
γ = −γτ4 =
∈ Γ(7)[1/3] and Γ (7)[1/3].
−105 −391
Proposition 5.22 yields the factorization
γ = Ū1 L̄1 Ū2 L̄2 Ū3 ,
where
0
1
L̄1 =
,
−7/3 1
1 −7/3
Ū1 =
,
0
1
Let
L̄2 =
1
−12 · 32
0
,
1
√
7
5
=− +
,
6
10
2
1 7/3
Ū2 =
,
0
1
4
1 −301/3
Ū3
.
0
1
√
1533 + 81 5
τ1 =
τ2 =
=
,
2182
√
√
1477
5
5
81
35
τ3 = Ū2−1 τ2 = −
+
, τ4 = L̄−1
+
.
2 τ3 =
19638
2182
162
10
Ū1−1 τ
L̄−1
1 τ1
Equation (5.10) yields
Z τZ
×
−16/5
∞
eΓ ωf
!4
Z τ2 Z
= ×
τ1
181
0
∞
Z τ4 Z
ωf × ×
τ3
0
∞
ωf
eΓ
.
References
[AL70]
A. O. L. Atkin and J. Lehner. Hecke operators on Γ0 (m). Math. Ann.,
185:134–160, 1970.
[BB12]
Aurèlien Bajolet and Yuri Bilu. Computing integral points on x+
ns (p).
arXiv:1212.0665v1, 2012.
[BCDT01] Christophe Breuil, Brian Conrad, Fred Diamond, and Richard Taylor. On
the modularity of elliptic curves over Q: wild 3-adic exercies. J. Amer.
Math. Soc., 14:843–939, 2001.
[BMS67]
H. Bass, J. Milnor, and J.-P. Serre. Solution of the congruence subgroups
problem for SLn (n ≥ 3) and Sp2n (n ≥ 2). Inst. Hautes Études Sci.
Publ. Math., 33:59–137, 1967.
[Che98]
Imin Chen. The Jacobians of non-split Cartan modular curves. Proc.
London Math. Soc. (3), 77(1):1–38, 1998.
[CL05]
Denis Charles and Kristin Lauter. Computing modular polynomials. LMS
J. Comput. Math., 8:195–204, 2005.
[Cox13]
David A. Cox. Primes of the Form x2 +ny 2 : Fermat, Class Field Theory,
and Complex Multiplication. Pure and Applied Mathematics: A Wiley
Series of Texts, Monographs, and Tracts. Wiley, 2 edition, 2013.
[Cre92]
John E. Cremona. Algorithms for Modular Elliptic Cruves. Cambridge
University Press, 2 edition, 1997, 1992.
[Dar01]
Henri Darmon. Integration on Hp ×H and arithmetic applications. Annals
of Mathematics, pages 589–639, 2001.
[Dar04]
Henri Darmon. Rational points on modular elliptic curves, volume 101
of CBMS Regional Conference Series in Mathematics. Published for the
Conference Board of the Mathematical Sciences, Washington, DC; by the
American Mathematical Society, Providence, RI, 2004.
182
183
[DP06]
Henri Darmon and Robert Pollack. Efficient calculation of Stark-Heegner
points via overconvergent modular symbols. Israel Journal of Mathematics, 153(1):319–354, 2006.
[DS05]
Fred Diamond and Jerry Shurman. A First Course in Modular Forms,
volume 228 of Graduate Texts in Mathematics. Springer-Verlag, New
York, 2005.
[dSE00]
Bart de Smit and Bas Edixhoven. Sur un résultat d’Imin Chen. Math.
Res. Lett., 7(2-3):147–153, 2000.
[Edi96]
Bas Edixhoven. On a result of Imin Chen. arXiv:alg-geom/9604008, 1996.
[Elk98]
Noam Elkies. Elliptic and modular curves over finite fields and related
computational issues. AMS/IP Stud. Adv. Math., 7:195–204, 1998.
[Fla89]
Daniel E. Flath. Introduction to Number Theory. Wiley Classics Library.
Wiley-Interscience, 1989.
[GH94]
Phillip Griffiths and Joseph Harris. Principles of Algebraic Geometry.
Wiley-Interscience, 1994.
[GM15]
Xavier Guitart and Marc Masdeu. Elementary matrix decomposition and
the computation of Darmon points with higher conductor. Mathematics
of Computation, 84(292):875–893, 2015.
[Gro84]
Benedict H. Gross. Heegner points on X0 (N ). In Modular forms
(Durham, 1983), Ellis Horwood Ser. Math. Appl.: Statist. Oper. Res.,
pages 87–105. Horwood, Chichester, 1984.
[GS93]
Ralph Greenberg and Glenn Stevens. p-adic l-functions and p-adic periods
of modular forms. Invent. Math., 111(2):407–447, 1993.
[GZ86]
Benedict H. Gross and Don B. Zagier. Heegner points and derivatives of
L-series. Invent. Math., 84(2):225–320, 1986.
[Har77]
Robin Hartshorne. Algebraic Geometry, volume 52 of Graduate Texts in
Mathematics. Springer-Verlag, New York, 1977.
[Kna92]
Anthony W. Knapp. Elliptic Curves. Princeton University Press, 1992.
[KP14]
Daniel Kohen and Ariel Pacetti. Heegner points on Cartan non-split
curves. arXiv:1403.7801v2, 2014.
184
[Men67]
J. Mennicke. On Ihara’s modular group. Invent. Math., 4:202–228, 1967.
[MTT86]
B. Mazur, J. Tate, and J. Teitelbaum. On p-adic analogues of the conjectures of Birch and Swinnerton-Dyer. Invent. Math., 84(1):1–48, 1986.
[RW14]
Marusia Rebolledo and Christian Wuthrich. A moduli interpretation for
the non-split cartan modular curve. arXiv:1402.3498, 2014.
[Ser70]
Jean-Pierre Serre. Le problème des groupes de congruence pour SL2 .
Ann. of Math., 92(2):489–527, 1970.
[Ser73]
Jean-Pierre Serre. A Course in Arithmetic, volume 7 of Graduate Texts
in Mathematics. Springer-Verlag, New York, 1973.
[Ser97]
Jean-Pierre Serre. Lectures on the Mordell-Weil theorem. Aspects of
Mathematics. Friedr. Vieweg & Sohn, Braunschweig, third edition, 1997.
Translated from the French and edited by Martin Brown from notes by
Michel Waldschmidt, With a foreword by Brown and Serre.
[Sil86]
Joseph H. Silverman. The Arithmetic of Elliptic Curves, volume 106 of
Graduate Texts in Mathematics. Springer-Verlag, New York, 2 edition,
2009,1986.
[Sil94]
Joseph H. Silverman. Advanced Topics in the Arithmetic of Elliptic
Curves, volume 151 of Graduate Texts in Mathematics. Springer-Verlag,
New York, 1994.
[ST92]
Joseph H. Silverman and John Tate. Rational points on elliptic curves.
Undergraduate Texts in Mathematics. Springer-Verlag, New York, 1992.
[Wil95]
Andrew Wiles. Modular Elliptic Curves and Fermat’s Last Theorem.
Ann. of Math., 141:443–551, 1995.