Experimental validation of the regulated expression of large numbers of non-coding RNAs from the mouse genome

  1. Timothy Ravasi1,4,5,
  2. Harukazu Suzuki2,4,
  3. Ken C. Pang1,3,4,
  4. Shintaro Katayama2,4,
  5. Masaaki Furuno2,4,6,
  6. Rie Okunishi2,
  7. Shiro Fukuda2,
  8. Kelin Ru1,
  9. Martin C. Frith1,2,
  10. M. Milena Gongora1,
  11. Sean M. Grimmond1,
  12. David A. Hume1,
  13. Yoshihide Hayashizaki2, and
  14. John S. Mattick1,7
  1 ARC Special Research Centre for Functional and Applied Genomics, Institute for Molecular Bioscience, University of Queensland, Brisbane QLD 4072, Australia
  2 Laboratory for Genome Exploration Research Group, RIKEN Genomic Science Center, RIKEN Yokohama Institute, Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa, 230-0045, Japan
  3 T Cell Laboratory, Ludwig Institute for Cancer Research, Austin & Repatriation Medical Centre, Heidelberg VIC 3084, Australia

Abstract

Recent large-scale analyses of mainly full-length cDNA libraries generated from a variety of mouse tissues indicated that almost half of all representative cloned sequences did not contain an apparent protein-coding sequence, and were putatively derived from non-protein-coding RNA (ncRNA) genes. However, many of these clones were singletons and the majority were unspliced, raising the possibility that they may be derived from genomic DNA or unprocessed pre-mRNA contamination during library construction, or alternatively represent nonspecific “transcriptional noise.” Here we show, using reverse transcriptase-dependent PCR, microarray, and Northern blot analyses, that many of these clones were derived from genuine transcripts of unknown function whose expression appears to be regulated. The ncRNA transcripts have larger exons and fewer introns than protein-coding transcripts. Analysis of the genomic landscape around these sequences indicates that some cDNA clones were produced not from terminal poly(A) tracts but internal priming sites within longer transcripts, only a minority of which is encompassed by known genes. A significant proportion of these transcripts exhibit tissue-specific expression patterns, as well as dynamic changes in their expression in macrophages following lipopolysaccharide stimulation. Taken together, the data provide strong support for the conclusion that ncRNAs are an important, regulated component of the mammalian transcriptome.

Footnotes

  • [Supplemental material is available online at www.genome.org. The microarray data from this study have been submitted to the Gene Expression Omnibus under accession nos. GSD275 and GSE3098.]

  • Article published online ahead of print. Article and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.4200206.

  • 4 These authors contributed equally to this work.

  • 5 Present address: Department of Bioengineering, University of California-San Diego, La Jolla, CA 92093-0412, USA

  • 6 Present address: Mouse Genome Informatics Consortium, The Jackson Laboratory, Bar Harbor, ME 04609, USA

  • 7 Corresponding author. E-mail j.mattick@imb.uq.edu.au; fax 61-7-3346-2111.

    • Accepted September 7, 2005.
    • Received May 28, 2005.