skip to main content
10.1145/369133.369190acmconferencesArticle/Chapter ViewAbstractPublication PagesrecombConference Proceedingsconference-collections
Article

The greedy path-merging algorithm for sequence assembly

Authors Info & Claims
Published:22 April 2001Publication History

ABSTRACT

Two different approaches to determining the human genome are currently being pursued: one is the “clone-by-clone” approach, employed by the publicly-funded. Human Genome Project, and the other is the “whole genome shotgun” approach, favored by researchers at Celera Genomics. An interim strategy employed at Celera, called hierarchical assembly, makes use of preliminary data produced by both approaches. This paper introduces the Bactig Ordering Problem, which is a key problem that arises in this context, and presents an efficient heuristic called the greedy path-merginq algorithm that performs well on real data.

References

  1. BKML+00.D. A. Benson, I. Karsch-Mizrachi, D. J. Lipman, J. Ostell, B. A. Rapp, and D. L. Wheeler. Genbank. Nucleic Acids Research, 28(1):15-8, 2000.Google ScholarGoogle ScholarCross RefCross Ref
  2. GJ79.M.R. Garey and D. S. Johnson. Computers and Intractability, a guide to the theory of NP-completeness. Bell Telephone Laboratories, Inc., 1979. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. HRK+.D.H. Huson, K. Reinert, S. A. Kravitz, K. A. Remington, A. L. Delcher, I. M. Dew, A. L. Halpern, Z. Lai, G. G. Sutton, and E. W. Myers. Design and operation of an hierarchical assembler for the human genome. In preparation.Google ScholarGoogle Scholar
  4. LW88.E.S. Lander and M. S. Waterman. Genomic mapping by fingerprinting random clones: A mathematical analysis. Genomics, 2:231-239, 1988.Google ScholarGoogle ScholarCross RefCross Ref
  5. Mar99a.E. Marshall. A high-stakes gamble on genome sequencing. Science, 284(5422):1906-1909, 1999.Google ScholarGoogle ScholarCross RefCross Ref
  6. Mar99b.E. Marshall. Sequencers endorse plan for draft in 1 year. Science, 284(5419):1439-1441, 1999.Google ScholarGoogle ScholarCross RefCross Ref
  7. Mar00.E. Marshall. Human genome. Rival genome sequences celebrate a milestone together. Science, 288(5475):2294-5, 2000.Google ScholarGoogle ScholarCross RefCross Ref
  8. MSD+00.E.W. Myers, G. G. Sutton, A. L. Delcher, I. M. Dew, D. P. Fasulo, M. J. Flanigan, S. A. Kravitz, C. M. Mobarry, K. H. J. Reinert, K. A. Remington, E. L. Anson, R. A. Bolanos, H-H. Chou, C. M. Jordan, A. L. Halpern, S. Lonardi, E. M. Beasley, R. C. Brandon, L. Chen, P. J. Dunn, Z. Lai, Y. Liang, D. R. Nusskern, M. Zhan, Q. Zhang, X. Zheng, G. M. Rubin, M. D. Adams, and J. C. Venter. A whole-genome assembly of Drosophila. Science, 287:2196-2204, 2000.Google ScholarGoogle Scholar
  9. SCH+92.F. Sanger, A. R. Coulson, G. F. Hong, D. F. Hill, and G. B. Petersen. Nucleotide sequence of bacteriophage A DNA. J. Mol. Bio., 162(4):729-73, 1992.Google ScholarGoogle ScholarCross RefCross Ref
  10. SNC77.F. Sanger, S. Nicklen, and A. R. Coulson. DNA sequencing with chain-terminating inhibitors. Proceedings of the National Academy of Sciences, 74(12):5463-5467, 1977.Google ScholarGoogle Scholar
  11. UOO97.U.S. Dep. of Energy, Office of Energy Research, and Office of Biological and Environmental Research. Human genome program report. http ://www. ornl. gov/hgmis/publicat/97pr/, 1997.Google ScholarGoogle Scholar
  12. WM97.J.L. Webber and E. W. Myers. Human whole-genome shotgun sequencing. Genome Research, 7(5):401-409, 1997.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. The greedy path-merging algorithm for sequence assembly

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Conferences
            RECOMB '01: Proceedings of the fifth annual international conference on Computational biology
            April 2001
            316 pages
            ISBN:1581133537
            DOI:10.1145/369133
            • Chairman:
            • Thomas Lengauer

            Copyright © 2001 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 22 April 2001

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • Article

            Acceptance Rates

            RECOMB '01 Paper Acceptance Rate35of128submissions,27%Overall Acceptance Rate148of538submissions,28%

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader