skip to main content
10.1145/974614.974632acmconferencesArticle/Chapter ViewAbstractPublication PagesrecombConference Proceedingsconference-collections
Article

Structured motifs search

Authors Info & Claims
Published:27 March 2004Publication History

ABSTRACT

In this paper we describe an algorithm for the localization of structured models, i.e. sequences of (simple) motifs and distance constraints. It basically combines standard pattern matching procedures with a constraint satisfaction solver, and it has the ability, not present in similar tools, to search for partial matches. A significant feature of our approach, especially in terms of efficiency for the application context, is that the (potentially) exponentially many solutions to the considered problem are represented in compact form as a graph. Moreover, the time and space necessary to build the graph are linear in the number of occurrences of the component patterns.

References

  1. F. Cattonaro, I. Jurman, and M. Morgante. The Alisei family of Gypsy-like retrotransposons in Norway spruce (Picea abies L., Karsten) genome. In preparation.]]Google ScholarGoogle Scholar
  2. M. Crochemore and M.-F. Sagot. Motifs in sequences: localization and extraction. In A. Konopka and al., editor, Handbook of Computational Chemistry. Marcel Dekker Inc., 2002. in press.]]Google ScholarGoogle Scholar
  3. K. M. Devos, K. M. Brown, and J. L. Bennetzen. Genome Size Reduction through Illegitimate Recombination Counteracts Genome Expansion in Arabidopsis. Genome Research, 12(7):1075--1079, 2002.]]Google ScholarGoogle ScholarCross RefCross Ref
  4. K. Fredriksson and G. Navarro. Average-Optimal Multiple Approximate String Matching. In Proceedings of the 14th Annual Symposium on Combinatorial Pattern Matching (CPM 2003), volume 2676 of LNCS, pages 109--128, 2003.]]Google ScholarGoogle Scholar
  5. R. Giegerich, S. Kurtz, and J. Stoye. Efficient Implementation of Lazy Suffix Trees. In Proc. of the Third Workshop on Algorithmic Engineering (WAE99), pages 30--42. Lecture Notes in Computer Science 1668, 1999.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. D. Gusfield. Algorithms on Strings, Trees and Sequences: Computer Science and Computational Biology. Cambridge University Press, New York, 1997.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. A. Kumar and J. L. Bennetzen. Plant Retrotransposons. Annu. Rev. Genet., 33:479--532, 1999.]]Google ScholarGoogle ScholarCross RefCross Ref
  8. A. Kumar and H. Hirochika. Applications of retrotransposons as genetic tools in plant biology. Trends in Plant Sciences, 6:127--134, March 2001.]]Google ScholarGoogle ScholarCross RefCross Ref
  9. S. Kurtz, E. Ohlebusch, C. Schleiermacher, J. Stoye, and R. Giegerich. Computation and Visualization of Degenerate Repeats in Complete Genomes. In Proceedings of the International Conference on Intelligent Systems for Molecular Biology, pages 228--238, Menlo Park, CA, 2000. AAAI-Press.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. E. M. McCarthy and J. F. McDonald. LTR_STRUC: a novel search and identification program for LTR retrotransposons. Bioinformatics, 19(3):362--367, February 2003.]]Google ScholarGoogle ScholarCross RefCross Ref
  11. G. Mehldau and E. Myers. A System for Pattern Matching Applications on Biosequences. In CABIOS, volume 9(3), pages 299--314, 1993.]]Google ScholarGoogle ScholarCross RefCross Ref
  12. E. Myers. Approximate Matching of Network Expressions with Spacers. Journal of Computational Biology, 1(3):33--51, 1996.]]Google ScholarGoogle ScholarCross RefCross Ref
  13. E. W. Myers, P. Oliva, and K. Guimãraes. Reporting exact and approximate regular expression matches. In M. Farach-Colton, editor, Proceedings of the 9th Annual Symposium on Combinatorial Pattern Matching, number 1448 in LNCS, pages 91--103, Piscataway, NJ, 1998. Springer-Verlag, Berlin.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. G. Navarro and M. Raffinot. Fast and Simple Character Classes and Bounded Gaps Pattern Matching, with Application to Protein Searching. In Proc. 5th Annual International ACM Conference on Computational Molecular Biology (RECOMB'01), pages 231--240, 2001.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. A. Zuccolo and M. Morgante. Abundance, distribution and phylogenetic relationship of LTR retrotransposons in the rice genome: an in silico survey. In preparation.]]Google ScholarGoogle Scholar

Index Terms

  1. Structured motifs search

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        RECOMB '04: Proceedings of the eighth annual international conference on Research in computational molecular biology
        March 2004
        370 pages
        ISBN:1581137559
        DOI:10.1145/974614

        Copyright © 2004 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 27 March 2004

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • Article

        Acceptance Rates

        Overall Acceptance Rate148of538submissions,28%

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader