skip to main content
10.1145/1099554.1099692acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
Article

QED: a novel quaternary encoding to completely avoid re-labeling in XML updates

Published:31 October 2005Publication History

ABSTRACT

The method of assigning labels to the nodes of the XML tree is called a labeling scheme. Based on the labels only, both ordered and un-ordered queries can be processed without accessing the original XML file. One more important point for the labeling scheme is the label update cost in inserting or deleting a node into or from the XML tree. All the current labeling schemes have high update cost, therefore in this paper we propose a novel quaternary encoding approach for the labeling schemes. Based on this encoding approach, we need not re-label any existing nodes when the update is performed. Extensive experimental results on the XML datasets illustrate that our QED works much better than the existing labeling schemes on the label updates when considering either the number of nodes or the time for re-labeling.

References

  1. S. Abiteboul, H. Kaplan, and T. Milo. Compact labeling schemes for ancestor queries. In Proc. SODA, pages 547--556, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. R. Agrawal, A. Borgida, and H.V. Jagadish. Efficient Management of Transitive Relationships in Large Data and Knowledge Bases. In Proc. of SIGMOD, pages 253--262, 1989. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. T. Amagasa, M. Yoshikawa, and S. Uemura. QRS: A Robust Numbering Scheme for XML Documents. In Proc. of ICDE, pages 705--707, 2003.Google ScholarGoogle ScholarCross RefCross Ref
  4. J.A. Anderson and J.M. Bell. Number Theory with Application. Prentice-Hall, New Jersey, 1997.Google ScholarGoogle Scholar
  5. A. Berglund, S. Boag, D. Chamberlin, M. F. Fernandez, M. Kay, J. Robie, and J. Simon. XML path language (XPath) 2.0. W3C working draft 04, Apr 2005.Google ScholarGoogle Scholar
  6. S. Boag, D. Chamberlin, M. F. Fernandez, D. Florescu, J. Robie, and J. Simon. XQuery 1.0: An XML Query Language. W3C working draft 04, Apr 2005.Google ScholarGoogle Scholar
  7. T. Bray, J. Paoli, C. M. Sperberg-McQueen, E. Maler, and F. Yergeau. Extensible markup language (XML) 1.0 third edition W3C recommendation. Oct. 2000.Google ScholarGoogle Scholar
  8. E. Cohen, H. Kaplan, and T. Milo. Labeling Dynamic XML Trees. In Proc. of PODS, pages 271--281, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. M. Duong and Y. Zhang. A New Labeling Scheme for Dynamically Updating XML Data. In Proc. of ADC, pages 185--193, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. R. Goldman and J. Widom. DataGuides: Enabling Query Formulation and Optimization in Semistructured Databases. In Proc. of VLDB, pages 436--445, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. C. Li and T.W. Ling. An Improved Prefix Labeling Scheme: A Binary String Approach for Dynamic Ordered XML. In Proc. of DASFAA, pages 125--137, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. C. Li, T.W. Ling, J. Lu, and T. Yu. On Reducing Redundancy and Improving Efficiency of Labeling schemes. To appear in Proc. of CIKM, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Q. Li and B. Moon. Indexing and Querying XML Data for Regular Path Expressions. In Proc. of VLDB, pages 361--370, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. J. McHugh, S. Abiteboul, R. Goldman, D. Quass, and J. Widom. Lore: A Database Management System for Semistructured Data. SIGMOD Record, 26(3): 54--66, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. S. Nestorov, J.D. Ullman, J.L. Wiener, and S.S. Chawathe. Representative Objects: Concise Representations of Semistructured, Hierarchial Data. In Proc. of ICDE, pages 79--90, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. NIAGARA Experimental Data. Available at: http://www.cs.wisc.edu/niagara/data.htmlGoogle ScholarGoogle Scholar
  17. P.E. O'Neil, E.J. O'Neil, S. Pal, I. Cseri, G. Schaller, and N. Westbury. ORDPATHs: Insert-Friendly XML Node Labels. In Proc. of SIGMOD, pages 903--908, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. A. Silberstein, H. He, K. Yi, and J. Yang. BOXes: Efficient Maintenance of Order-Based Labeling for Dynamic XML Data. In Proc. of ICDE, pages 285--296, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. I. Tatarinov, S. Viglas, K.S. Beyer, J. Shanmugasundaram, E.J. Shekita, and C. Zhang. Storing and querying ordered XML using a relational database system. In Proc. of SIGMOD, pages 204--215, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. University of Washington XML Repository. Available at: http://www.cs.washington.edu/research/xmldatasets/Google ScholarGoogle Scholar
  21. X. Wu, M.L. Lee, and W. Hsu. A Prime Number Labeling Scheme for Dynamic Ordered XML Trees. In Proc. of ICDE, pages 66--78, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. XMark -- An XML Benchmark Project. Available at: http://monetdb.cwi.nl/xml/downloads.htmlGoogle ScholarGoogle Scholar
  23. F. Yergeau. UTF8: A Transformation Format of ISO 10646. Request for Comments (RFC) 2279, January 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. C. Zhang, et al. On Supporting Containment Queries in Relational Database Management Systems. In Proc. of SIGMOD, pages 425--436, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. QED: a novel quaternary encoding to completely avoid re-labeling in XML updates

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        CIKM '05: Proceedings of the 14th ACM international conference on Information and knowledge management
        October 2005
        854 pages
        ISBN:1595931406
        DOI:10.1145/1099554

        Copyright © 2005 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 31 October 2005

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • Article

        Acceptance Rates

        CIKM '05 Paper Acceptance Rate77of425submissions,18%Overall Acceptance Rate1,861of8,427submissions,22%

        Upcoming Conference

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader