Divergent paralogues of ribosomal DNA in eucalypts (Myrtaceae)

https://doi.org/10.1016/j.ympev.2006.10.027Get rights and content

Abstract

The presence of divergent paralogues of nuclear ribosomal DNA, from the 18S–5.8S–26S cistron, is reported in members of Eucalyptus subg. Eucalyptus. These paralogues, which include non-functional pseudogenes, probably diverged prior to the differentiation of species groups in subg. Eucalyptus. When compared with presumably functional sequences, the pseudogenes show greater sequence variation between species, particularly in the 5.8S gene. They are also characterised by reduced GC content, associated with a reduced number of CpG and CpNpG methylation sites, and an increase in the inferred number of methylation-induced substitutions. Some pseudogenes also lack motifs that are usually conserved in plants, both in ITS1 and the 5.8S gene. Two main lineages of pseudogenes are identified, one isolated from a group of western Australian species, one from a group of eastern Australian species. It is not clear whether these two lineages of pseudogenes are orthologous, or represent independent divergences from functional sequence types. The presence of divergent rDNA paralogues highlights the need for caution when interpreting eucalypt phylogenies based on ITS sequences.

Introduction

The internal transcribed spacer (ITS) regions of ribosomal DNA (rDNA) are used extensively in phylogenetic studies of flowering plants, especially for analyses of relationships within genera, or among closely related genera. Plant genomes include multiple copies (paralogues) of these rDNA regions, in the order of thousands of copies per cell, arranged in one to several arrays of tandem repeats. Relative homogeneity of sequences across multiple copies of rDNA is maintained through concerted evolution, which involves such processes as gene conversion and unequal crossing over (Arnheim, 1983), and which recent studies have shown to be extremely efficient and rapid in some taxa (Kovarik et al., 2005). Divergence of rDNA copies will, however, occur where concerted evolution is slow, or does not act on some copies, e.g., if copies are dispersed to a different part of the genome (Childs et al., 1981). In recent years, divergent rDNA paralogues within single genomes have been reported from an increasing range of plant and fungal groups (e.g., Buckler et al., 1997, Lieckfeldt and Seifert, 2000, Hartmann et al., 2001, Mayol and Roselló, 2001, Muir et al., 2001, Bailey et al., 2003, Razafimandimbison et al., 2004, Won and Renner, 2005, Álvarez and Wendel, 2003). The presence of such paralogues can, if undetected, confound attempts at phylogenetic reconstruction (Sanderson and Doyle, 1992).

Prominent among divergent rDNA paralogues are non-functional pseudogenes. These rDNA copies, freed from functional constraints, are generally characterised by increased substitution rates in conserved regions and an increase in the number of methylation-induced substitutions, which lead, in turn, to reduced GC content and reduced stability of secondary structure (Buckler et al., 1997, Bailey et al., 2003). Methylation-induced substitutions occur because cytosines, methylated to 5-methylcytosine, frequently mutate to thymine by deamination (Vairapandi and Duker, 1994, Ng and Bird, 1999). DNA methylation in plants and, therefore, these kinds of mutations, occur chiefly at CpG and CpNpG sites (where N is any nucleotide; Gardiner-Garden et al., 1992, Bender, 2003). In contrast to pseudogenes, functional copies of rDNA maintain large numbers of CpG and CpNpG sites, presumably because of functional constraints.

In this paper, we report the occurrence of divergent paralogues of rDNA in Eucalyptus L’Hér. (Myrtaceae), the dominant tree genus over much of the Australian continent. These paralogues, which are probably non-functional pseudogenes, were discovered while trialling different primer combinations for direct sequencing of rDNA for a phylogenetic study of the monocalypt eucalypts, Eucalyptus subg. Eucalyptus, a group of 110 species (sensu Brooker, 2000) distributed in eastern and south-western Australia (Fig. 1). Properties of the divergent paralogues are described. Their phylogenetic histories are considered by comparison with previously published sequences of the eucalypt group (Steane et al., 1999, Steane et al., 2002, Udovicic and Ladiges, 2000), which, in the broad sense, includes seven genera (Allosyncarpia S.T.Blake, Angophora Cav., Arillastrum Pancher ex Baill., Corymbia K.D. Hill and L.A.S. Johnson, Eucalyptopsis C.T. White, Eucalyptus, Stockwellia D.J. Carr, S.G.M. Carr and B. Hyland) from Australia, Timor, New Guinea, New Britain, Sulawesi, Ceram, Mindanao and New Caledonia (Ladiges et al., 2003).

Section snippets

Isolation, amplification and sequencing of DNA

Leaf samples from eight species of Eucalyptus subg. Eucalyptus were used in this study: E. acies, E. globoidea, E. insularis, E. lacrimans, E. paliformis, E. sepulcralis, E. spectatrix (a taxonomic synonym of E. stricta in the treatment of Brooker, 2000) and E. triflora (see Brooker, 2000; for authorities of taxon names). These were collected at Currency Creek Arboretum, South Australia (Nicolle, 2003), and dried in silica gel; details of collections and voucher specimens are given in Table 1.

PCR results and paralogue characteristics

The primer pairs ITS5/ITS4 and S3/S5 each yielded single PCR products, seen as single bands on agarose gels, and direct sequencing of these products was unproblematic. Products amplified with ITS5/ITS4 spanned the region ITS1–5.8S–ITS2; those amplified with S3/S5 included only ITS1 and part of the 5.8S gene (358 aligned bases in total).

Based on patterns of sequence similarity in ITS1 and the 5.8S gene (Fig. 2), three classes of sequence were recognised among the eight eucalypt samples, i.e.,

PCR amplification of paralogues

PCR amplification of different paralogues by different primer pairs in this study was found by chance, and the reasons for this phenomenon have not been investigated. It is not known if the ITS4 and ITS5 primer sites are present in “divergent” sequences, since these sites are external to the recovered sequences. In the case of primers S3 and S5, however, sequence comparison shows that the primer sites are present in both “typical” and “divergent” sequences. Given that these primers amplify only

Acknowledgments

Dean Nicolle provided access to specimens at the Currency Creek Arboretum, and Emma Lewis and Carlos Parra assisted with collections made there. We thank Dan Murphy, Carlos Parra, Alison Kellow, Gareth Holmes and Ed Newbigin for helpful discussions and comments on this manuscript. This work was funded by an Australian Research Council Linkage (ARCL) grant, including financial support from the Maud Gibson Trust, RBG Melbourne and RBG Sydney.

References (51)

  • N. Arnheim

    Concerted evolution in multigene families

  • B.G. Baldwin

    Molecular phylogenetics of Calycadenia (Compositae) based on ITS sequences of nuclear ribosomal DNA: chromosomal and morphological evolution reexamined

    Am. J. Bot.

    (1993)
  • B.G. Baldwin et al.

    The ITS region of nuclear ribosomal DNA: a valuable source of evidence on angiosperm phylogeny

    Ann. Mo. Bot. Garden

    (1995)
  • J. Bender

    DNA methylation and epigenetics

    Ann. Rev. Plant Biol.

    (2003)
  • A. Bohte et al.

    Floral development and systematic position of Arillastrum, Allosyncarpia, Stockwellia and Eucalyptopsis (Myrtaceae)

    Plant Syst. Evol.

    (2005)
  • M.I.H. Brooker

    A new classification of genus Eucalyptus L’Hér (Myrtaceae)

    Aust. Syst. Bot.

    (2000)
  • E.S. Buckler et al.

    The evolution of ribosomal DNA: divergent paralogues and phylogenetic implications

    Genetics

    (1997)
  • M. Crisp et al.

    Radiation of the Australian flora: what can comparisons of molecular phylogenies across multiple taxa tell us about the evolution of diversity in present-day communities? Phil

    Trans. R. Soc. Lond. B

    (2004)
  • J. Felsenstein

    Cases in which parsimony and compatibility methods will be positively misleading

    Syst. Zool.

    (1978)
  • M. Gardiner-Garden et al.

    Methylation sites in angiosperm genes

    J. Mol. Evol.

    (1992)
  • S. Hartmann et al.

    Extensive ribosomal DNA genic variation in the columnar cactus Lophocereus

    J. Mol. Evol.

    (2001)
  • M.A. Hershkovitz et al.

    Deep-level diagnostic value of the rDNA-ITS region

    Mol. Biol. Evol.

    (1996)
  • M.A. Hershkovitz et al.

    Ribosomal DNA sequences and angiosperm systematics

  • K.D. Hill et al.

    Systematic studies in the eucalypts 7. A revision of the bloodwoods, genus Corymbia (Myrtaceae)

    Telopea

    (1995)
  • S.D. Hopper

    Biogeographical aspects of speciation in the southwest Australian flora

    Ann. Rev. Ecol. Syst.

    (1979)
  • Cited by (39)

    • Molecular evolution of the internal transcribed spacers in red oaks (Quercus sect. Lobatae)

      2019, Computational Biology and Chemistry
      Citation Excerpt :

      Intraspecific ITS sequence variation has also been detected in several oak species using the PCR-RFLP technique (Coutinho et al., 2015) Finally, it is important to point out that outside Quercus, non-concerted evolution has also been documented in other plant taxa such as Mammillaria, Raphia, Cycas and Lespedeza, among others (Baker et al., 2000; Razafimandimbison et al., 2004; Wissemann and Ritz, 2005; Harpke and Peterson, 2006; Bayly and Ladiges, 2007; Xiao et al., 2010; Xu et al., 2017; Barrett et al., 2018; Morales‐Briones and Tank, 2019). Unintentional inclusion of different ITS ribotypes in a phylogenetic analysis would reflect the gene history but it could erroneously be interpreted as the organismal history (Bailey et al., 2003).

    • Dispersion of rDNA loci and its implications on intragenomic variability and phylogenetic studies in Camellia

      2012, Scientia Horticulturae
      Citation Excerpt :

      Two mechanisms such as unequal crossing over and gene conversion have been attributed to concerted evolution (Dover, 1982; Arnheim, 1983; Gonzalez and Sylvester, 2001; Eickbush and Eickbush, 2007). However, failure of concerted evolution in complete homogenization of the multiple copies was noticed in a number tree species (Bayly and Ladiges, 2007; Campbell et al., 1997; Ritland et al., 1993; Vijayan et al., 2009). Several reasons have been postulated to explain this anomaly.

    • Population genetic analysis and phylogeny reconstruction in Eucalyptus (Myrtaceae) using high-throughput, genome-wide genotyping

      2011, Molecular Phylogenetics and Evolution
      Citation Excerpt :

      One problem with using sequence data from functional regions of DNA (such as ITS and ETS) comes from the functional constraints imposed on cistrons that might prevent “neutral” change of nucleotides during evolution. Furthermore, there are many copies of ribosomal RNA genes in a genome and this introduces a risk of comparing paralogous loci (Bayly and Ladiges, 2007). Despite the limitations of ribosomal and chloroplast DNA for resolution of species-level relationships, Gibbs et al. (2009) successfully used ITS, ETS and cpDNA sequence data in combination with morphological characters to resolve relationships among species within subgenus Eudesmia.

    View all citing articles on Scopus
    View full text