Abstract
In this article, we address the problem of how to extend the definition of functional dependencies (FDs) in incomplete relations to XML documents (called XFDs) using the well-known strong satisfaction approach.We propose a syntactic definition of strong XFD satisfaction in an XML document and then justify it by showing that, similar to the case in relational databases, for the case of simple paths, keys in XML are a special case of XFDs. We also propose a normal form for XML documents based on our definition of XFDs and provide a formal justification for it by proving that it is a necessary and sufficient condition for the elimination of redundancy in an XML document.
- Arenas, M. and Libkin, L. 2002. A normal form for XML documents. In Proceedings of the ACM PODS Conference. 85--96. Google ScholarDigital Library
- Arenas, M. and Libkin, L. 2003. An information-theoretic approach to normal forms for relational and XML data. In Proceedings of the ACM PODS Conference. 15--26. Google ScholarDigital Library
- Atzeni, P. and DeAntonellis, V. 1993. Foundations of Databases. Benjamin Cummings, San Francisco, CA.Google Scholar
- Bray, T., Paoli, J., and Sperberg-McQueen, C. 1998a. Extensible Markup Language (XML) 1.0. Technical rep. Available online at http://www.w3.org/Tr/1998/REC-XML-19980819.Google Scholar
- Bray, T., Paoli, J., and Sperberg-McQueen, C. M. 1998b. XML-data. W3C Note, Available online at http://www.w3.org/Tr/1998/Note-XML-data.Google Scholar
- Buneman, P., Davidson, S., Fan, W., and Hara, C. 2001a. Reasoning about keys for XML. In International Workshop on Database Programming Languages. Google ScholarDigital Library
- Buneman, P., Davidson, S., Fan, W., Hara, C., and Tan, W. 2003. Reasoning about keys for XML. Inform. Syst. 28, 8, 1037--1063. Google ScholarDigital Library
- Buneman, P., Fan, W., Simeon, J., and Weinstein, S. 2001b. Constraints for semistructured data and XML. ACM SIGMOD Rec. 30, 1, 45--47. Google ScholarDigital Library
- Buneman, P., Fan, W., and Weinstein, S. 1998. Path constraints on structured and semistructured data. In Proceedings of the ACM PODS Conference. 129--138. Google ScholarDigital Library
- Cover, T. and Thomas, J. 1991. Elements of Information Theory. Wiley-Interscience, New York, NY. Google ScholarDigital Library
- Fagin, R. 1979. Normal forms and relational database operators. In ACM SIGMOD International Conference on the Management of Data. 123--134. Google ScholarDigital Library
- Fan, W. and Libkin, L. 2002. On XML integrity constraints in the presence of DTDs. J. ACM 49, 3, 368--406. Google ScholarDigital Library
- Fan, W. and Simeon, J. 2003. Integrity constraints for XML. J. Comput. Syst. Sci. 66, 1, 254--291. Google ScholarDigital Library
- LeHors, A., LeHgaret, P., Wood, L., Nicol, G., Robie, J., Champion, M., and Byrne, S. 2003. Document object model (dom) level 3 core specifications. Technical rep. Available online at http://www.w3.org/TR/2003/WD-DOM-Level-3-Core-20030609/.Google Scholar
- Levene, M. and Vincent, M. W. 2000. Justification for inclusion dependency normal form. IEEE Trans. Knowl. Data Eng. 12, 281--291. Google ScholarDigital Library
- Thompson, H. S., Beech, D., Maloney, M., and Mendelsohn, N. 2001. XML schema part 1:structures. W3C Working Draft. Available online at http://www.w3.org/Tr/1998/XMLschema-1.Google Scholar
- Vincent, M., Liu, J., and Liu, C. 2003a. The implication problem for unary functional dependencies in XML. Submitted for publication.Google Scholar
- Vincent, M., Liu, J., and Liu, C. 2003b. Strong functional dependencies and a redundancy free normal form for XML. Research rep. ACRC-03-011. School of Computer and Information Science, University of South Australia, Mawson Lakes, Adelaide, Australia. Available online at http://www.cis.unisa.edu.au/ cismwv/papers/index.html.Google Scholar
- Vincent, M. W. 1994. Semantic foundations of normal forms in relational database design. Ph.D. dessirtation. Department of Computer Science, Monash University, Melbourne, Victoria, Australia.Google Scholar
- Vincent, M. W. 1997. A corrected 5NF definition for relational database design. Theoret. Comput. Sci. 185, 379--391. Google ScholarDigital Library
- Vincent, M. W. 1998. A new redundancy free normal form for relational database design. In Database Semantics, B. Thalheim and L. Libkin, Eds. Springer Verlag, Berlin, Germany, 247--264. Google ScholarDigital Library
- Vincent, M. W. 1999. Semantic foundations of 4NF in relational database design. Acta Informatica 36, 1--41.Google ScholarCross Ref
- Widom, J. 1999. Data management for XML---research directions. IEEE Data Eng. Bull. 22, 3, 44--52.Google Scholar
Index Terms
- Strong functional dependencies and their application to normal forms in XML
Recommendations
On Defining Functional Dependency for XML
ICSC '09: Proceedings of the 2009 IEEE International Conference on Semantic ComputingFunctional dependency (FD) is one of the integrity constraints for any data model. In relational data model, FDs are well studied and are widely used in normalization theory and in key algorithm. In recent years, XML has emerged as an widely used data ...
Removing XML data redundancies using functional and equality-generating dependencies
ADC '05: Proceedings of the 16th Australasian database conference - Volume 39We study the design issues of data-centric XML documents where (1) there are no mixed contents, i.e., each element may have some subelements and attributes, or it may have a single value in the form of a character string, but not a mixture of strings and ...
The implication problem for 'closest node' functional dependencies in complete XML documents
With the growing use of XML as a format for the permanent storage of data, the study of functional dependencies in XML (XFDs) is of fundamental importance in a number of areas such as understanding how to effectively design XML databases without ...
Comments