First publ. in: Journal of experimental
zoology
/ Part B, Molecular
and developmental
evolution,
(2007), pp. (2007)
250–258
JOURNAL OF
EXPERIMENTAL
ZOOLOGY
(MOL DEV
EVOL)308B
308B:250–258
PCR Survey of Hox Genes in the Goldfish
Carassius auratus auratus
JING LUO1,2, PETER F. STADLER2–5, SHUNPING HE2, AND AXEL MEYER1
1
Lehrstuhl für Zoologie und Evolutionsbiologie, Department of Biology,
University of Konstanz, 78457, Konstanz, Germany
2
Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, Hubei 430072,
P.R. China
3
Lehrstuhl für Bioinformatik, Institut für Informatik, Universität Leipzig,
D-04103 Leipzig, Germany
4
Institut für Theoretische Chemie und Molekulare Strukturbiologie, Universität
Wien, A-1090 Wien, Austria
5
Santa Fe Institute, Santa Fe, New Mexico 87501
ABSTRACT
A tetraploidization event took place in the cyprinid lineage leading to goldfishes
about 15 million years ago. A PCR survey for Hox genes in the goldfish Carassius auratus auratus
(Actinopterygii: Cyprinidae) was performed to assess the consequences of this genome duplication.
Not surprisingly, the genomic organization of the Hox gene clusters of goldfish is similar to that of
the closely related zebrafish (Danio rerio). However, the goldfish exhibits a much larger number of
recent pseudogenes, which are characterized by indels. These findings are consistent with the
hypothesis that dosage effects cause selection pressure to rapidly silence crucial developmental
regulators after a tetraploidization event. J. Exp. Zool. (Mol. Dev. Evol.) 308B:250– 258, 2007.
r 2007 Wiley-Liss, Inc.
How to cite this article: Luo J, Stadler PF, He S, Meyer A. 2007. PCR survey of Hox genes in
the goldfish carassius auratus auratus. J. Exp. Zool. (Mol. Dev. Evol.) 308B:250–258.
The duplications of genes and entire genomes
are thought to be important mechanisms underlying the genesis of morphological variation and
functional innovation (Ohno and Atkin, ’66; Ohno
et al., ’67; Taylor et al., 2001; Wagner, 2001;
Wagner et al., 2005; Crow et al., 2006). Extensive
comparative genomics studies have corroborated
the proposition that gnathostomes have experienced two rounds of genome duplication (2R)
(Sidow, ’96; Amores et al., ’98; Taylor et al., 2003;
Vandepoele et al., 2004; Dehal and Boore, 2005).
Moreover, another round (3R) genome duplication, the so-called ‘‘fish-specific genome duplication’’ (FSGD) took place about 320 mya ago in the
ancestral lineage of the teleosts (see e.g., Amores
et al., ’98; Wittbrodt et al., ’98; Gregory and
Hebert, ’99; Taylor et al., 2003; Hoegg et al., 2004;
Vandepoele et al., 2004; Hoegg and Meyer, 2005;
Meyer and Van de Peer, 2005; Yan et al., 2005;
Crow et al., 2006).
Fishes exhibit remarkable variation in morphological, behavioral, and physiological adaptations.
Teleosts comprise more than 97% of the approxir 2007 WILEY-LISS, INC.
mately 25,000 species of actinopterygians (rayfinned fishes) (Nelson, ’94) and are the most
species-rich and diverse group of vertebrates.
Several authors suggested that the FSGD is at
least partially responsible for the species diversity
of teleosts (e.g., Amores et al., ’98; Wittbrodt et al.,
’98; Meyer and Van de Peer, 2005; Yan et al., 2005;
but see Donoghue and Purnell, 2005).
Hox genes encode for homeodomain-containing
transcription factors which are orthologous to the
genes in the Drosophila homeotic gene clusters
(McGinnis and Krumlauf, ’92; Schubert et al.,
’93). They specify developmental cell fates along
the anterior–posterior axis in bilaterian animals.
Grant sponsors: Max-Planck-Gesellschaft, Deutsche Forschungsgemeinschaft and Natural Science Foundation of China.
Correspondence to: A. Meyer, Lehrstuhl für Zoologie und
Evolutionsbiologie, Department of Biology, University of Konstanz,
Universitätsstr. 10, 78457 Konstanz, Germany.
E-mail: axel.meyer@uni-konstanz.de
Received 19 June 2006; Revised 8 October 2006; Accepted 11
October 2006
Published online 29 January 2007 in Wiley InterScience (www.
interscience.wiley.com). DOI: 10.1002/jez.b.21144.
Konstanzer Online-Publikations-System (KOPS)
URL: http://www.ub.uni-konstanz.de/kops/volltexte/2007/3414/
URN: http://nbn-resolving.de/urn:nbn:de:bsz:352-opus-34141
GOLDFISH HOX GENE EVOLUTION
The increased complexity of body plans that has
accompanied the evolution of higher vertebrates is
a phenomenon of particular interest and importance (Martinez and Amemiya, 2002) and a
particularly intriguing problem is the understanding of the role of Hox cluster duplications in the
evolution of vertebrates. In this regard, questions
such as the characterization of evolutionary
patterns after the FSGD including Hox gene
retention/deletion, and evolution at the nucleotide
level is of particular interest (Holland et al., ’94;
Malaga-Trillo and Meyer, 2001; Wagner et al.,
2003; Prohaska and Stadler, 2004; Hoegg and
Meyer, 2005; Crow et al., 2006; Kurosawa et al.,
2006). Available DNA sequences from teleosts
have shown a variety of gene retention and
deletion patterns. These might possibly be relevant to the adaptive evolution during the initial
teleost radiation, yet the role of the cluster
duplications in vertebrate evolution is still poorly
understood, especially the evolution immediately
after the duplication (Wagner et al., 2003, 2005).
Genome duplications have been an exceedingly
rare phenomenon in animals, yet they occurred
somewhat more frequently in some vertebrate
lineages, such as fishes, than in others (Ferris and
Whitt, ’77; Allendorf, ’78; Wolfe, 2001). The shortterm impact of genome duplications is still not
well understood. How is dosage balance achieved
when the genome suddenly contains additional
copies of a gene? Such questions have been
explored in plants (Soltis and Soltis, 2000; Wendel,
2000; Chen et al., 2004; Comai, 2005). What are
the mechanisms leading to retention or loss of a
duplicated gene? A partial answer to the latter
questions is provided by the DDC model (Force
et al., ’99; Lynch et al., 2001). It is not clear,
however, whether the situation in plants carries
over to vertebrates (Furlong and Holland, 2004;
LeComber and Smith, 2004). For instance, it
remains unclear how important developmental
regulators, such as Hox genes, have evolved
initially after gene or genome duplications. More
precisely, it remains unknown how quickly and
how much silencing, neofunctionalization, or subfunctionalization takes place in the aftermath of
genome duplications. Comparative studies have
shown that gene loss from Hox gene clusters is an
ongoing process, i.e., the resolution of the postduplication redundancy is not immediate (Prohaska
and Stadler, 2004; Hoegg and Meyer, 2005).
Polyploidy occurs in several unrelated groups
of fish such as salmonids, sturgeons, and cyprinids
and might have played an important role in
251
regulatory evolution (LeComber and Smith,
2004). The importance of genome duplications in
shaping the evolution of genomes can best be
examined by comparative genomic analyses of
sequences from several closely related organisms
that vary in the number of genome duplications
their lineages have experienced. In this regard,
closely related diploid and tetraploid organisms
provide the cleanest test situation for the investigation of ‘‘post-ploidy’’ events. Those pairwise
species comparisons can be used to test the effects
of those drastic genomic events, in terms of dosage
balance or fast stabilization of duplicated genomes
via retention/exclusion of redundant genomic
DNA regions.
More than 40 species in the family Cyprinidae
(mainly in the three subfamilies Cyprininae,
Schizothoracinae, and Barbinae, but rare in the
other subfamilies), are known to have undergone
repeated genome duplication events (Chen et al.,
’84; Yu et al., ’89; Chen, ’98; Li et al., personal
communication, Luo et al., unpublished data).
Various types of polyploidy have been observed:
tetraploidy (N 5 4; goldfish and common carp;
2n 5 100), hexaploidy (N 5 6; Schizothorax
prenanti Tchang; 2n 5 148), octaploidy (N 5 8;
Carassius auratus gebelio and C.a. langsdorfi;
2n 5 2007) and, although rarely, triploidy
(N 5 3; Phoxinus eosneogaeus), as well (Dawley
and Goddard, ’88; Yu et al., ’89; Murakami et al.,
2001; He et al., personal communication).
Goldfish belong to the subfamily Cyprininae
(Howes, 1991). The ancestor of all species in this
subfamily is thought to have been tetraploid (Yu
et al., ’89). The most recent genome duplication
event(s) in this subfamily of fish are believed to
have occurred within the last 20 million years
(Risinger and Larhammar, ’93; Yang and Gui,
2004). In addition, recent and recurrent hexaploids were discovered in different lineages as well
(Luo et al., unpublished data), which makes the
goldfish and its relatives a particularly interesting
group in which to investigate the short-term
evolutionary effects of genome duplications since
their duplicated genomes are still within the half
life of duplicates as suggested by Lynch and
Conery (2000). This group thus appears to be an
excellent model with which to test hypotheses
relevant to the vertebrate genome duplication and
diversification.
In particular, the fate of extra copies of crucial
developmental regulators such as the Hox genes
can be expected to shed some light on the
mechanisms that act in duplicated genomes after
J. Exp. Zool. (Mol. Dev. Evol.) DOI 10.1002/jez.b
252
J. LUO ET AL.
a genome duplication. As a first step towards
addressing these issues, a PCR survey of Hox
genes in the goldfish C.a. auratus, a putative
young allo-tetraploid species, was performed with
the goal of determining whether a genomic
screening of Hox clusters in a goldfish BAC
genomic library (Luo et al., 2006) is likely to be
fruitful. The PCR data already provided some
interesting insights into the post-duplication
evolution of goldfish Hox gene clusters.
MATERIALS AND METHODS
One male orange specimen of C.a. auratus was
obtained commercially in Konstanz, Germany and
used for genomic DNA extraction. Hox genes were
amplified from its genomic DNA. The detailed
extraction procedure is described in Luo et al.
(2006).
Degenerate primers for the amplification of Hox
genes included the posterior Hox forward primers
for paralogous groups 9–13 [50 CGA AAG AAG
(C/A) G(N/C)GT(N/C) CC(N/C)T A(T/C)AC, anterior Hox forward primer for paralogous groups 1–9
[GAA TTC CAC TTC AAC(C/A)(G/A)(C/G) TAC
CT], and the universal reverse primer [CAT CCT
GCG GTT TTG GAA CCA NAT], as described by
Amores et al. (2004).
PCR reactions were conducted in a volume of
10 ml PCR cocktail that included 1 buffer with
0.15 mmol MgCl2 (Sigma), 0.25 mM dNTPs
(Fermantas), 0.5 U Taq DNA polymerase (Sigma),
and 12–15 ng genomic DNA. Following a 2 min
denaturing period at 941C, the following PCR
conditions were used for the experiments both
with HOX1–9 primer1universal reverse primer
and HOX9–13 primer1universal reverse primer:
35 cycles at 941C for 30 sec, 501C for 30 sec, and
721C for 60 sec, followed by a final extension at
721C for 5 min.
Resulting PCR products were purified with
purification columns (Qiagen). Then the purification products were ligated and cloned with the TA
cloning kit (Invitrogen). Each clone was amplified
again with the universal M13 forward and reverse
primers and sequenced with an ABI 3100 automatic sequencer, by using an ABI PRISM BigDye
Terminator Cycle Sequencing Ready Reaction Kit
(with AmpliTaq DNA polymerase FS, Applied
Biosystems).
Hox sequences have been deposited in Genbank
with accession numbers DQ630465–DQ630513 for
all unique sequences. Some clones shared the
J. Exp. Zool. (Mol. Dev. Evol.) DOI 10.1002/jez.b
same sequence, and the clone number information
was included in the Genbank entry.
The sequences of 73 PCR clones and their
reverse complements were aligned using ClustalW
(Thompson et al., ’94) to identify identical clones.
Blastn searches in Genbank were used to reconfirm that the sampled sequences were not contaminations. The DNA sequences were then
translated in all six possible reading frames and
compared to known homeodomain-containing proteins to obtain the putative Hox protein sequence
fragments. Several PCR products had one or two
deletions, a single clone (H01 56 15) had two
inserted nucleotides relative to the standard
homeobox sequence. Based on peptide sequence
identity with known Hox genes, DNA sequences
could be assigned unambiguously to paralog
groups (PG) 1, 3, 9, 11, 13, as well as to a more
poorly resolved ‘‘middle group’’ gene (paralog
groups 4–7). The automated phylogenetic key for
classifying homeoboxes described by Sarkar et al.
(2002) unfortunately is not applicable since it
requires homeobox fragments that are longer than
27 amino acids.
11 PCR clones did not correspond to Hox genes,
and were tentatively identified as two copies of the
HB9-type homeobox gene hlxb9la. The remaining
62 Hox homeobox sequences were analyzed
further. In addition, we re-analyzed the 11
previously published Hox homeobox fragments
by Levine and Schechter (’93) (see Table 1).
Due to the relatively close phylogenetic relationship between C.a. auratus and Danio rerio, most
but not all PCR products could be unambiguously
identified as orthologs of one of the 49 zebrafish
Hox genes. To this end, we measured the
Hamming distance D of the goldfish Hox sequences (as well as the sequences from Levine
and Schechter (’93)) to each of the zebrafish
sequences. For comparison, we determined for
each zebrafish Hox fragment the Hamming distance D to the closest other Hox fragment. If a
C.a. auratus sequence x was significantly closer to
a unique D. rerio x0 sequence than this D. rerio
sequence x0 was to any other D. rerio sequence y0 ,
we could conclude that x was the true ortholog of
x0 . More formally, we assumed that goldfish
sequence x was homologous to zebrafish sequence
x0 if D 5 d(x, x0 )oominy,0 x0 d(x0 , y0 ) 5 D. For the
computation of the Hamming distance we treat
gaps as fifth character. Since gaps only occur in
the pseudogenes, this simple choice does not
affect the assignment of genes to paralog
groups. Through this method we could identify
GOLDFISH HOX GENE EVOLUTION
253
TABLE 1. Summary of C.a. auratus PCR fragments
PG
1
2
3
4
5
6
7
9
10
13
Previous work1
pg
D
D
A1a
B1b
A2b
A2a
B3a?
A3a?
C4a
A5a
A5a?
B5a
B5b
C5a
B6b
C6a
B7a
A9b
A10b
9
7–8
7
22 L09698
18
18 L09691
L09690
15 L09697
19
20
18 L09688
10–13
14–15
7
4–8
2
4–5
4
3
10
10–12
4
6–9
11
B10a 7
C10a 0
D10a 7
A13a 8
11
A13b 3
13
13
13
15
19
21
19
17
22
16
15
15
16
23
16
L09686
L09685/9
L09687
L09694
L09693
Copies
—
2
—
—
6
2
2
1
1
—
—
2
—
2
2
1
6
4
4
—
1
1
1
6
B13a
5
3
22
22
6
11
C13a
5–6
4
29
16
11
10
7
25
30
10
1
1
D13a 16
?
Original clone IDs
C040 270 06, F010 60 11
Pseudogene in D. rerio
H0 05400 15, B040 260 04, E050 370 09, F020 140 12, F030 220 11, G040 310 14
C050 350 05, G010 70 13
A030 170 01, C010 30 05
B030 180 03
C030 190 05 artifact?
B060 420 04, G030 230 13
A050 330 01, H040 320 16
D050 360 07, H030 240 15
D040 760 08
A040 730 02, H020 640 16, A030 650 01, F050 380 12, F040 780 12, D030 200 07
[long] A040 730 02, H020 640 160 , A030 650 01, F040 780 12
A040 250 02, A060 410 02, B010 20 03, A020 90 02
B050 340 03
G040 790 14
[long] -00 H060 480 16, E010 530 09, F030 700 11, G010 550 13
C040 750 06, E030 690 09
[long] -00 A020 570 02, D010 520 07, G030 710 13, A010 490 01
D030 680 07, F020 620 12, E020 610 10, G020 630 14
H030 720 15, E040 770 10, F010 540 11
[long] -00 C060 430 06, D060 440 08, F060 460 12, B010 500 03
B020 580 04, B040 740 04, C020 590 06, D020 600 08
C010 510 05, H040 800 16
[long] -00 [long] two extra A inserted H010 560 15
G060 470 14
PG, number of Hox paraglogs; pg, assigned Hox paraglog; *, putative pseudogene in C.a. auratus; ?, uncertain assignment of gene identity from
Hamming distances; [long], 114nt sequences instead of 81nt sequences for the posterior PGs; D, Hamming distance between goldfish PCR
fragment and nearest zebrafish gene; D, Hamming distance between most closely related zebrafish genes.
1
Levine and Schechter (’93).
the orthology relationships of most of the PCR
fragments (Table 1).
The identities of the PCR fragments were also
investigated by reconstructing phylogenetic trees
using different methods following the procedure
outlined by Prohaska and Stadler (2004). Neighbor-joining (Saitou and Nei, ’87), parsimony, and
maximum likelihood analysis for each PG was
performed using the Phylip package (Felsenstein,
’89), using other teleost Hox homeobox fragments
for comparison (D. rerio, Tetraodon nigroviridis,
Takifugu rubripes, Oryzias latipes, and Fundulus
heteroclitus). In most cases goldfish sequences are
grouped with a unique zebrafish sequence in these
trees. Bootstrap support for this grouping is at
least 70% in all cases not marked by ‘‘?’’ in
Table 1. Furthermore, all assignments derived
from the trees are consistent with those listed in
Table 1. Phylogenetic trees as well as neighbor
nets (Bryant and Moulton, 2004) (computed using
SplitsTree of Huson (’98), based on the Jukes–
Cantor distance) are compiled in the electronic
supplement (URL: http://www.bioinf.uni-leipzig.
de/Publications/SUPPLEMENTS/06-007/). We use
the neighbor-net approach in addition to classical
phylogenetic methods as it provides a convenient
graphical representation of noise and ambiguities
in the data. Quartet Mapping (Nieselt-Struwe and
J. Exp. Zool. (Mol. Dev. Evol.) DOI 10.1002/jez.b
254
J. LUO ET AL.
von Haeseler, 2001), as implemented in the
quartm 0.2 program (Stadler et al., 2004), was
used to provide further corroborating evidence
(data not shown), again with consistent results. In
this method, we treated gaps as missing data: for
each quadruple we ignored all those columns in
the alignment in which at least one gap was
contained in one of the four sequences under
consideration.
RESULTS
The combined dataset of our PCR products and
the sequences reported in Levine and Schechter
(’93) yield 26 distinct Hox gene fragments in the
goldfish C.a. auratus. The orthology relationships
of all but two genes could be identified with
confidence (Table 1). We observe sequence variations from different clones that were apparently
amplified from a single orthologous Hox gene (also
Table 1). Due to the short sequence length it is not
possible to determine whether this variation is due
to allelic variants, divergence of duplicate genes
(i.e., paralogous genes) following the recent tetraploidization, or whether it might be due to
potential sequencing artifacts. Sequence variation
is concentrated, however, in a small number of
sequence groups, suggesting relaxed selection in at
least one copy of the duplicated genes.
Two groups of Hox3 sequences seem to be recent
pseudogenes as judged by the existence of indels in
the homeobox sequences. An alignment of the
putative HoxB3a sequences is shown in Figure 1.
On the other hand, expression of L09697 (G11-4),
which is indistinguishable from the HoxB3a
PCR products, has been reported in the goldfish
brain (Levine and Schechter, ’93). The discrepancy is that only one copy of HoxB3a in the
tetraploid goldfish might have turned into a
pseudogene.
Similarly, various deletions suggest that HoxB10a is a pseudogene in goldfish and for the
HoxA3a gene we found only two sequences with a
deletion at the same position (not shown). A single
clone similar to zebrafish HoxD13a has two
inserted A residues as well as a conspicuous run
of T nucleotides, strongly suggesting that it is
either a PCR artifact or a real pseudogene.
Since we consistently find indels in independent
amplifications belonging to the same Hox genes,
we conclude that PCR or cloning-induced errors
are an unlikely explanation for the observed
‘‘corrupt’’ homeobox sequences. Rather, at least
both Hox3 paralogs as well as HoxB10a are most
likely true pseudogenes.
It has been reported previously that homeobox
fragment L09690 (G5-1) corresponds to a rare
1.4 kb transcript that is expressed in the goldfish
brain (Levine and Schechter, ’93). Comparison
with other Hox2 sequences (Fig. 2) shows that this
sequence is homologous to the HoxA2a pseudogene of zebrafish. This observation implies that
HoxA2a was lost in zebrafish after the split of the
Danio and Carassius lineages.
We are thus left with two single PCR clones for
which no detailed assignment can be made. One of
them belongs to PG13. By comparison with the
Hox gene complement of the zebrafish, it is
tempting to speculate that this is the ‘‘missing’’
HoxC13b gene. The second sequence belongs to
the middle group and differs by 13 nucleotides
from HoxA5a.
Fig. 1. Frequent indels suggest that HoxB3a and HoxB10a have recently turned into pseudogenes. Names refer to original
clone numbers in Table 1.
J. Exp. Zool. (Mol. Dev. Evol.) DOI 10.1002/jez.b
GOLDFISH HOX GENE EVOLUTION
255
Fig. 2. Neighbor-net analysis of Hox2 genes strongly suggests that L09690 (Levine and Schechter, ’93) is orthologous to the
zebrafish HoxA2a pseudogene. Dr-Danio rerio, Hf-Heterodontus francisci, Lm-Latineria menadonesis.
Fig. 3. Goldfish Hox genes in comparison to zebrafish. Our data are largely consistent with the expectation that goldfish and
zebrafish should have a similar organization of Hox clusters. Differences are essentially restricted to recent pseudo-gene
formations of HoxA2a in Danio but not in Carassius, while HoxB10a, HoxB3a, and likely also HoxA3a have turned
into pseudogenes in the goldfish. Green boxes represent Hox genes of Danio rerio, and ‘‘E’’ refers to Evx (even-skipped related
gene).
DISCUSSION
The results of the PCR survey data and the
published sequences from Levine and Schechter
(’93) are summarized in Figure 3 in comparison
to the Hox cluster complement of the zebrafish.
D. rerio has 49 Hox genes in seven tightly linked
clusters, all of which are expressed in the adult
fish (Corredor-Ad’amez et al., 2005). In addition,
there are at least two pseudogenes, HoxA2ac and
HoxA10ac in the zebrafish Hox clusters. Not
surprisingly, the homeobox fragments reported
here indicate that the Hox cluster organization of
goldfish is similar, but not identical to that of
zebrafish.
Surprisingly, the pseudogenes found in the Hox
clusters of zebrafish and goldfish are different.
While at least one of the zebrafish pseudogenes,
the pseudogene of HoxA2a, is expressed in gold-
fish, the PCR results strongly suggest that several
intact zebrafish genes: HoxB10a, Hox3a, and
possibly also HoxA3a might have evolved into
pseudogenes in the goldfish lineage. In the Hox10
paralog group, zebrafish has lost HoxA10a completely, while in the goldfish HoxB10a has recently
turned into a pseudogene. Even taking into
account that our PCR survey covers only 19 (or
25, if we include the data from Levine and
Schechter (’93)) of the expected roughly 50 Hox
genes, we already detect strong indications that
pseudogene formation is much more prevalent in
goldfish (3–5 of 25 PCR fragments) than in
zebrafish (2 of 51 genes).
Dosage effects might plausibly cause selection
pressure to lose redundant Hox genes through
the formation of pseudogenes. In this context, it is
intriguing that our examples of pseudogenes are
recognizable mostly because they contain indels in
J. Exp. Zool. (Mol. Dev. Evol.) DOI 10.1002/jez.b
256
J. LUO ET AL.
the homeobox, which would almost certainly
render the gene non-functional even if it was still
expressed.
Pseudogenes are formed by random mutations
that create either stop codons which prematurely
terminate the full-length functional expression
product, or by insertion/deletions causing a shift of
the reading frame, thereby rendering the translated protein non-functional. Although the substitution pattern in pseudogenes is not completely
random, pseudogenes are under nearly selective
neutrality (Gojobori and Li, ’82; Li et al., ’84; Li,
’97). In theory, the pseudogene formation happens
as likely via both stop codon creation as indels. Yet
in our study of the goldfish Hox pseudogenes,
indels seem to be dominant. This biased pattern
might arise from the selective pressure of removing the gene product soon after the duplication
event, a potential effect that will require further
investigation.
Moghadam et al. (2005a,b) have indicated there
were at least 14 Hox gene clusters in trout and
salmon (family Salmonidae). This group of fish
was thought to have evolved from an ancestor in
which an autotetraploidization event occurred
25–100 Mya (Ohno, ’70; Allendorf and Thorgaard,
’84). If goldfish has retained most of its duplicated
genome following the recent tetraploidization, we
would expect 14–16 Hox gene clusters, a similar
number to the salmon fishes (Risinger and
Larhammar, ’93; Amores et al., 2004; Yang and
Gui, 2004). Also, due to the recent tetraploidization event we expect that only a few mutations
accumulated between the homeobox sequences of
the paralogs. As a consequence, we cannot
distinguish with certainty recent paralogs from
allelic variants, and hence cannot determine a
minimum number of retained clusters at present.
Luo et al. (2006) identified two copies of the
recombinase-activating gene 1 (RAG1) from a
goldfish genomic BAC library, while in zebrafish
only a single-copy gene exists. Divergence analysis
of
RAG1
dated
the
gene
duplication
14.2–14.5 Mya, in agreement with previous dating
of the goldfish-specific gene duplication (Risinger
and Larhammar, ’93; Yang and Gui, 2004), it is
likely that the duplication of RAG1 was caused
by the genome duplication event in the goldfish
lineage. This suggests that the initially functional
genes were turned into pseudogenes within the
last 15 Mya. This age estimation is consistent
with the relative small branch lengths that still
allow a confident classification of the goldfish
Hox genes.
J. Exp. Zool. (Mol. Dev. Evol.) DOI 10.1002/jez.b
In summary, our PCR survey demonstrates that
there is a high rate of pseudogene formation
dominated by indels in the pseudogenes of goldfish
Hox genes. This is consistent with the expectation
that there are more Hox clusters in the goldfish
genome which have lost functional Hox genes,
thereby reducing redundancy following the recent
tetraploidization event. Due to the small evolutionary distance, however, extensive sequencing of
BAC clones of one individual containing goldfish
Hox clusters will be necessary to obtain a more
complete understanding of the processes involved
in this secondary gene loss and immediate evolution after duplication.
ACKNOWLEDGMENTS
We thank Kathrin Hoffmann for technical
assistance and Simone Hoegg, Nicol Siegel, Ingo
Braasch, Walter Salzburger, Elke Hespeler, and
Sonja J. Prohaska for valuable discussions. PFS
thanks the CAS/MPG Partner Institute at the
SIBS in Shanghai for its hospitality in March
2006, where part of the analysis was performed.
This work has been funded, in part, by the
Deutsche
Forschungsgemeinschaft,
through
grants to A.M. and the DFG Bioinformatics
Initiative (P.F.S.), by the Max-Planck-Society
(A.M., J.L.), and the Natural Science Foundation
of China (S.H., J.L.).
LITERATURE CITED
Allendorf FW. 1978. Protein polymorphism and the rate of loss
of duplicate gene expression. Nature 272:76–78.
Allendorf FW, Thorgaard GH. 1984. Tetraploidy and the
evolution of salmonid fish. In: Turner JB, editor, Evolutionary genetics of fish. New York: Plenum Press. p 1–53.
Amores A, Force A, Yan YL, Joly L, Amemiya C, Fritz A, Ho
RK, Langeland J, Prince V, Wang YL, Westerfield M, Ekker
M, Postlethwait JH. 1998. Zebrafish hox clusters and
vertebrate genome evolution. Science 282:1711–1714.
Amores A, Suzuki T, Yan YL, Pomeroy J, Singer A, Amemiya
C, Postlethwait J. 2004. Developmental roles of pufferfish
Hox clusters and genome evolution in ray-fin fish. Genome
Res 14:1–10.
Bryant D, Moulton V. 2004. Neighbor-net: an agglomerative
method for the construction of phylogenetic networks. Mol
Biol Evol 21:255–265.
Chen XL, Yue PQ, Lin RR. 1984. Major groups within the
family Cyrinidae and their Phylogenetic relationships. Acta
Zootaxon Sin 4:424–440.
Chen YY. 1998. Preface. In: Chen Yiyu et al., editors, Fauna
Sinica, Osteichthyes, Cypriniformes II. Beijing: Science
Press. p 1–18.
Chen ZJ, Wang J, Tian L, Lee HS, Wang JJ, Chen M, Lee JJ,
Josefsson C, Madlung A, Watson B, Pires JC, Lippman Z,
Vaughn MW, Colot V, Birchler JA, Doerge RW, Martienssen
R, Comai L, Osborn T. 2004. The development of an
GOLDFISH HOX GENE EVOLUTION
Arabidopsis model system for genome-wide analysis of
polyploidy effects. Biol J Linn Soc 82:689–700.
Comai L. 2005. The advantages and disadvantages of being
polyploid. Nat Rev Genet 6:836–846.
Corredor-Ad’amez M, Welten MC, Spaink HP, Jeffery JE,
Schoon RT, de Bakker MA, Bagowski CP, Meijer AH,
Verbeek FJ, Richardson MK. 2005. Genomic annotation and
transcriptome analysis of the zebrafish (Danio rerio) hox
complex with description of a novel member, HoxB13a. Evol
Dev 7:362–375.
Crow KD, Stadler PF, Lynch VJ, Amemiya CT, Wagner GP.
2006. The fish specific Hox cluster duplication is coincident
with the origin of teleosts. Mol Biol Evol 23:121–136.
Dawley RM, Goddard KA. 1988. Diploid–triploid mosaics
among unisexual hybrids of the minnows Phoxinus eos
and Phoxinus neogaeus. Evolution 42:649–659.
Dehal P, Boore JL. 2005. Two rounds of whole genome
duplication in the ancestral vertebrate. PLoS Biol 3:
e314.
Donoghue PC, Purnell MA. 2005. Genome duplication,
extinction and vertebrate evolution. Trends Ecol Evol 20:
312–319.
Felsenstein J. 1989. Phylip–phylogeny inference package
(version 3.2). Cladistics 5:164–166.
Ferris SD, Whitt GS. 1977. Loss of duplicate gene-expression
after poly-ploidization. Nature 265:258–260.
Force A, Lynch M, Pickett FB, Amores A, Yan Yl, Postlethwait
J. 1999. Preservation of duplicate genes by complementary,
degenerative mutations. Genetics 151:1531–1545.
Furlong RF, Holland PWH. 2004. Polyploidy in vertebrate
ancestry: Ohno and beyond. Biol J Linn Soc 82:425–430.
Gojobori T, Li WH. 1982. Patterns of nucleotide substitution
in pseudogenes and functional genes. J Mol Evol 18:
360–369.
Gregory TR, Hebert PDN. 1999. The modulation of DNA
content: proximate causes and ultimate consequences.
Genome Res 9:317–324.
Hoegg S, Meyer A. 2005. Hox clusters as models for vertebrate
genome evolution. Trends Genet 21:421–424.
Hoegg S, Brinkmann H, Taylor JS, Meyer A. 2004. Phylogenetic timing of the fish-specific genome duplication correlates with the diversification of teleost fish. J Mol Evol 59:
190–203.
Holland PWH, Garcia-Ferńandez J, Williams NA, Sidow A
1994. Gene duplication and the origins of vertebrate
development. Development (Suppl.):125–133.
Howes GJ. 1991. Systematics and biogeography: an overview.
In: Winfield IJ, Nelson JS, editors. Cyprinid fishes,
systematics, biology and exploitation. London: Chapman
and Hall. p 1–33.
Huson DH. 1998. Splitstree: analyzing and visualizing evolutionary data. Bioinformatics 14:68–73.
Kurosawa G, Takamatsu N, Takahashi M, Sumitomo M,
Sanaka E, Yamada K, Nishii K, Matsuda M, Asakawa S,
Ishiguro H, Miura K, Kurosawa Y, Shimizu N, Kohara Y,
Hori H. 2006. Organization and structure of hox gene loci in
medaka genome and comparison with those of pufferfish
and zebrafish genomes. Gene 370:75–82.
LeComber S, Smith C. 2004. Polyploidy in fishes. Biol J Linn
Soc 82:432–442.
Levine EM, Schechter N. 1993. Homeobox genes expressed in
the retina and brain of adult goldfish. Proc Natl Acad Sci
USA 90:2729–2733.
257
Li WH. 1997. Gene structure, genetic codes, and mutation. In:
Li WH, editor, Molecular evolution, Sunderland MA:
Sinauer Associates. p 31–33.
Li WH, Wu C, Luo C. 1984. Nonrandomness of point
mutations as reflected in nucleotide substitutions in
pseudogenes and its evolutionary implications. J Mol Evol
21:58–71.
Luo J, Lang M, Salzburger W, Siegel N, Stoelting K,
Meyer A. 2006. A BAC li-brary for the goldfish Carassius
auratus auratus (Cyprinidae, Cypriniformes). J Exp Zool B,
in press.
Lynch M, Conery JS. 2000. The evolutionary fate
and consequences of duplicate genes. Science 290:
1151–1155.
Lynch M, O’Hely M, Walsh B, Force A. 2001. The probability
of preservation of a newly arisen gene duplicate. Genetics
159:1789–1804.
Malaga-Trillo E, Meyer A. 2001. Genome duplications and
accelerated evolution of Hox genes and cluster architecture
in teleost fishes. Am Zool 41:676–686.
Martinez P, Amemiya CT. 2002. Genomics of the HOX gene
cluster. Comp Biochem Physiol B: Biochem Mol Biol 133:
571–580.
McGinnis W, Krumlauf R. 1992. Homeobox genes and axial
patterning. Cell 68:283–302.
Meyer A, Van de Peer Y. 2005. From 2R to 3R: evidence for a
fish-specific genome duplication (FSGD). Bioessays 27:
937–945.
Moghadam HK, Ferguson MM, Danzmann RG. 2005a. Evolution of Hox clusters in Salmonidae: a comparative analysis
between Atlantic salmon (Salmo salar) and rainbow trout
(Oncorhynchus mykiss). J Mol Evol 61:636–649.
Moghadam HK, Ferguson MM, Danzmann RG. 2005b.
Evidence for Hox gene duplication in rainbow trout
(Oncorhynchus mykiss): a tetraploid model species. J Mol
Evol 61:804–818.
Murakami M, Matsuba C, Fujitani H. 2001. The maternal
origins of the triploid ginbuna (Carassius auratus langsdorfi): phylogenetic relationships within the C. auratus taxa by
partial mitochondrial d-loop sequencing. Genes Genet Syst
76:25–32.
Nelson J. 1994. Fishes of the world. New York: Wiley.
Nieselt-Struwe K, von Haeseler A. 2001. Quartet-mapping, a
generalization of the likelihood mapping procedure. Mol
Biol Evol 18:1204–1219.
Ohno S. 1970. Evolution by gene duplication. New York:
Springer-Verlag.
Ohno S, Atkin NB. 1966. Comparative DNA values and
chromosome complements of eight species of fishes. Chromosoma 18:455–466.
Ohno S, Muramoto J, Christian L, Atkin NB. 1967. Diploid–tetraploid relationship among old-world members of the fish
family cyprinidae. Chromosoma 23:1–9.
Prohaska SJ, Stadler PF. 2004. The duplication of the hox
gene clusters in teleost fishes. Th Biosci 123:89–110.
Risinger C, Larhammar D. 1993. Multiple loci for synapse
protein SNAP-25 in the tetraploid goldfish. Proc Natl Acad
Sci USA 90:10598–10602.
Saitou N, Nei M. 1987. The neighbor-joining method: a new
method for reconstructing phylogenetic trees. Mol Biol Evol
4:406–425.
Sarkar IN, Thornton JW, Planet PJ, Figurski DH, Schierwater B, DeSalle R. 2002. An automated phylogenetic key
J. Exp. Zool. (Mol. Dev. Evol.) DOI 10.1002/jez.b
258
J. LUO ET AL.
for classifying homeoboxes. Mol Phylogenet Evol 24:
388–399.
Schubert FR, Nieselt-Struwe K, Gruss P. 1993. The antennapedia-type home-obox genes have evolved from three
precursors separated early in metazoan evolution. Proc
Natl Acad Sci USA 90:143–147.
Sidow A. 1996. Gen(om)e duplications in the evolution of early
vertebrates. Curr Opin Genet Dev 6:715–722.
Soltis P, Soltis DE. 2000. The role of genetic and genomic
attributes in the success of polyploids. Proc Natl Acad Sci
USA 97:7051–7057.
Stadler PF, Fried C, Prohaska SJ, Bailey WJ, Misof BY,
Ruddle FH, Wagner GP. 2004. Evidence for independent
Hox gene duplications in the hagfish lineage: A PCR-based
gene inventory of Eptatretus stoutii. Mol Phylogenet Evol
32:686–692.
Taylor J, Braasch I, Frickey T, Meyer A, Van De Peer Y. 2003.
Genome duplication, a trait shared by 22,000 species of rayfinned fish. Genome Res 13:382–390.
Taylor JS, Van de Peer Y, Braasch I, Meyer A. 2001.
Comparative genomics provides evidence for an ancient
genome duplication event in fish. Philos Trans R Soc Lond
Ser B 356:1661–1679.
Thompson JD, Higgs DG, Gibson TJ. 1994. CLUSTALW:
improving the sensitivity of progressive multiple sequence
alignment through sequence weighting, position specific gap
penalties, and weight matrix choice. Nuclei Acids Res
22:4673–4680.
Vandepoele K, De Vos W, Taylor JS, Meyer A, Van de Peer Y.
2004. Major events in the genome evolution of vertebrates:
J. Exp. Zool. (Mol. Dev. Evol.) DOI 10.1002/jez.b
Paranome age and size differ considerably between rayfinned fishes and land vertebrates. Proc Natl Acad Sci USA
101:1638–1643.
Wagner A. 2001. Birth and death of duplicated genes in
completely sequenced eukaryotes. Trends Genet 17:
237–239.
Wagner GP, Amemiya C, Ruddle F. 2003. Hox cluster
duplications and the opportunity for evolutionary novelties.
Proc Natl Acad Sci USA 100:14603–14606.
Wagner GP, Takahashi K, Lynch V, Prohaska SJ, Fried C,
Stadler PF, Amemiya C. 2005. Molecular evolution of
duplicated ray finned fish HoxA clusters: increased synonymous substitution rate and asymmetrical co-divergence of
coding and non-coding sequences. J Mol Evol 60:665–676.
Wendel JF. 2000. Genome evolution in polyploids. Plant Mol
Biol 42:225–249.
Wittbrodt J, Meyer A, Schartl M. 1998. More genes in fish?
BioEssays 20:511–515.
Wolfe K. 2001. Yesterday’s polyploidy and the mystery of
diploidization. Nat Rev Genet 2:333–341.
Yan YL, Willoughby J, Liu D, Crump JG, Wilson C, Miller CT,
Singer A, Kimmel C, Westerfield M, Postlethwait JH. 2005.
A pair of Sox: distinct and overlappig functions of zebrafish
sox9 co-orthologs in craniofacial and pectoral fin development. Development 132:1069–1083.
Yang L, Gui JF. 2004. Positive selection on multiple antique
allelic lineages of transferrin in the polyploid Carassius
auratus. Mol Biol Evol 21:1264–1277.
Yu XJ, Zhou T, Li YC, Li K, Zhou M. 1989. Chromosomes of
Chinese fresh-water fishes. Beijing: Science Press.