Genome Annotation and Analysis

Sequence — Evolution — Function

Abstract

In the preceding chapter, we gave a brief overview of the methods that are commonly used for identification of protein-coding genes and analysis of protein sequences. Here, we turn to one of the main subjects of this book, namely how these methods are applied to the task of primary analysis of genomes, which often goes under the name of “genome annotation”. Many researchers still view genome annotation as a notoriously unreliable and inaccurate process. There are excellent reasons for this opinion: genome annotation produces a considerable number of errors and some outright ridiculous “identifications” (see ♦3.1.3 and further discussion in this chapter). These errors are highly visible, even when the error rate is quite low: because of the large number of genes in most genomes, the errors are also rather numerous. Some of the problems and challenges faced by genome annotation are an issue of quantity turning into quality: an analysis that can be easily and reliably done by a qualified researcher for one or ten protein sequences becomes difficult and error-prone for the same scientist and much more so for an automated tool when the task is scaled up to 10,000 sequences. We discuss here the performance of manual, automated and mixed approaches in genome annotation and ways to avoid some common pitfalls.

Further Reading

  1. Brenner S. 1999. Errors in genome annotation. Trends in Genetics 15: 132–133.

  2. Galperin MY, Koonin EV. 2000. Who’s your neighbor? New computational approaches for functional genomics. Nature Biotechnology 18: 609–613.

  3. Huynen MA, Snel B. 2000. Gene and context: integrative approaches to genome analysis. Advances in Protein Chemistry 54: 345–379.

  4. Huynen MA, Snel B, Lathe W, Bork P. 2000. Predicting protein function by genomic context: quantitative evaluation and qualitative inferences. Genome Research 10: 1204–1210.

  5. Wolf YI, Rogozin IB, Kondrashov AS, Koonin EV. 2001. Genome alignment, evolution of prokaryotic genome organization and prediction of gene function using genomic context. Genome Research 11: 356–372.

  6. Makarova KS, Aravind L, Grishin NV, Rogozin IB, Koonin EV. 2002. A DNA repair system specific for thermophilic Archaea and bacteria predicted by genomic context analysis. Nucleic Acids Research 30: 482–496.

  7. Ouzounis CA, Karp PD. 2002. The past, present and future of genomewide re-annotation. Genome Biology 3: COMMENT2001.

© 2003 Springer Science+Business Media Dordrecht

About this chapter

Cite this chapter

Koonin, E.V., Galperin, M.Y. (2003). Genome Annotation and Analysis. In: Sequence — Evolution — Function. Springer, Boston, MA. https://doi.org/10.1007/978-1-4757-3783-7_6

  • DOI: https://doi.org/10.1007/978-1-4757-3783-7_6

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-1-4419-5321-6

  • Online ISBN: 978-1-4757-3783-7

  • eBook Packages: Springer Book Archive
