Zebrafish transposable elements show extensive diversification in age, genomic distribution, and developmental expression

  1. Juan M. Vaquerizas2,3,4
  1. 1Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York 14850, USA;
  2. 2Max Planck Institute for Molecular Biomedicine, 48149 Muenster, Germany;
  3. 3MRC London Institute of Medical Sciences, London W12 0NN, United Kingdom;
  4. 4Institute of Clinical Sciences, Faculty of Medicine, Imperial College London, London W12 0NN, United Kingdom
  1. 5 These authors contributed equally to this work.

  • Corresponding authors: cf458{at}cornell.edu, j.vaquerizas{at}lms.mrc.ac.uk
  • Abstract

    There is considerable interest in understanding the effect of transposable elements (TEs) on embryonic development. Studies in humans and mice are limited by the difficulty of working with mammalian embryos and by the relative scarcity of active TEs in these organisms. The zebrafish is an outstanding model for the study of vertebrate development, and over half of its genome consists of diverse TEs. However, zebrafish TEs remain poorly characterized. Here we describe the demography and genomic distribution of zebrafish TEs and their expression throughout embryogenesis using bulk and single-cell RNA sequencing data. These results reveal a highly dynamic genomic ecosystem comprising nearly 2000 distinct TE families, which vary in copy number by four orders of magnitude and span a wide range of ages. Longer retroelements tend to be retained in intergenic regions, whereas short interspersed nuclear elements (SINEs) and DNA transposons are more frequently found nearby or within genes. Locus-specific mapping of TE expression reveals extensive TE transcription during development. Although two-thirds of TE transcripts are likely driven by nearby gene promoters, we still observe stage- and tissue-specific expression patterns in self-regulated TEs. Long terminal repeat (LTR) retroelements are most transcriptionally active immediately following zygotic genome activation, whereas DNA transposons are enriched among transcripts expressed in later stages of development. Single-cell analysis reveals several endogenous retroviruses expressed in specific somatic cell lineages. Overall, our study provides a valuable resource for using zebrafish as a model to study the impact of TEs on vertebrate development.

    Footnotes

    • [Supplemental material is available for this article.]

    • Article published online before print. Article, supplemental material, and publication date are at https://www.genome.org/cgi/doi/10.1101/gr.275655.121.

    • Freely available online through the Genome Research Open Access option.

    • Received April 15, 2021.
    • Accepted December 30, 2021.

    This article, published in Genome Research, is available under a Creative Commons License (Attribution 4.0 International), as described at http://creativecommons.org/licenses/by/4.0/.

    | Table of Contents
    OPEN ACCESS ARTICLE

    Preprint Server