DOI: 10.1145/335305.335351

Compressed suffix arrays and suffix trees with applications to text indexing and string matching (extended abstract)

Published: 01 May 2000

Published in: STOC '00: Proceedings of the thirty-second annual ACM symposium on Theory of computing
May 2000
756 pages
ISBN: 1581131844
DOI: 10.1145/335305

Copyright © 2000 ACM

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher: Association for Computing Machinery, New York, NY, United States


Acceptance rates: STOC '00 paper acceptance rate, 85 of 182 submissions (47%). Overall acceptance rate, 1,469 of 4,586 submissions (32%).
