Abstract
Applications have an increasing need to manage semistructured data (such as data encoded in XML) along with conventional structured data. We extend the structured object database model ODMG and its query language OQL with the ability to handle semistructured data based on the OEM model and Lorel language, and we implement our extensions in a system called Ozone. In our approach, structured data may contain entry points to semistructured data, and vice-versa. The unified representation and querying of such “hybrid” data is the main contribution of our work. We retain strong typing and access to all properties of structured portions of the data while allowing exible navigation of semistructured data without requiring full knowledge of structure. Ozone also enhances both ODMG/OQL and OEM/Lorel by virtue of their combination. For instance, Ozone allows OEM semantics to be applied to ODMG data, thus supporting semistructured-style navigation of structured data. Ozone also enables ODMG views of OEM data, allowing standard ODMG applications to access semistructured data without losing the benefits of structure. Ozone is implemented on top of the ODMG-compliant O2 database system, and it fully supports our extensions to the ODMG model and OQL.
This work was supported by the Air Force Rome Laboratories under DARPA Contract F30602-95-C-0119.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
S. Abiteboul. Querying semistructured data. In Proceedings of the International Conference on Database Theory, Delphi, Greece, January 1997.
S. Abiteboul, S. Cluet, V. Christophides, T. Milo, G. Moerkotte, and J. Simeon. Querying documents in object databases. International Journal on Digital Libraries, 1(1):5–19, 1997.
S. Abiteboul, D. Quass, J. McHugh, J. Widom, and J. Wiener. The Lorel query language for semistructured data. International Journal on Digital Libraries, 1(1):68–88, April 1997.
P. Buneman, S. Davidson, M. Fernandez, and D. Suciu. Adding structure to unstructured data. In Proceedings of the International Conference on Database Theory, pages 335–350, Delphi, Greece, January 1997.
P. Buneman, S. Davidson, G. Hillebrand, and D. Suciu. A query language and optimization techniques for unstructured data. In Proceedings of the ACM SIGMOD International Conference on Management of Data, pages 505–516, Montreal, Canada, June 1996.
F. Bancilhon, C. Delobel, and P. Kanellakis, editors. Building an Object-Oriented Database System: The Story of O2. Morgan Kaufmann, San Francisco, California, 1992.
P. Buneman, S. Davidson, and D. Suciu. Programming constructs for unstructured data. In Proceedings of the 1995 International Workshop on Database Programming Languages (DBPL), 1995.
P. Buneman. Semistructured data. In Proceedings of the Sixth ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, Tucson, Arizona, May 1997. Tutorial.
R.G.G. Cattell, editor. The Object Database Standard: ODMG-93. Morgan Kaufmann, San Francisco, California, 1994.
E.F. Codd. A relational model for large shared data banks. Communications of the ACM, 13(6):377–387, June 1970.
M. Fernandez, D. Florescu, A. Levy, and D. Suciu. A query language for a Web-site management system. SIGMOD Record, 26(3):4–11, September 1997.
R. Goldman and J. Widom. DataGuides: Enabling query formulation and optimization in semistructured databases. In Proceedings of the Twenty-Third International Conference on Very Large Data Bases, pages 436–445, Athens, Greece, August 1997.
R. Light and T. Bray. Presenting XML. Sams, Indianapolis, Indiana, September 1997.
J. McHugh, S. Abiteboul, R. Goldman, D. Quass, and J. Widom. Lore: A database management system for semistructured data. SIGMOD Record, 26(3):54–66, September 1997.
J. Melton and A.R. Simon. Understanding the New SQL: A Complete Guide. Morgan Kaufmann, San Francisco, California, 1993.
J. McHugh and J. Widom. Query optimization for XML. In Proceedings of the Twenty-Fifth International Conference on Very Large Databases, Edinburgh, Scotland, September 1999. To appear.
S. Nestorov, S. Abiteboul, and R. Motwani. Extracting schema from semi-structured data. In Proceedings of the ACM SIGMOD International Conference on Management of Data, Seattle, Washington, May 1998.
Y. Papakonstantinou, H. Garcia-Molina, and J. Widom. Object exchange across heterogeneous information sources. In Proceedings of the Eleventh International Conference on Data Engineering, pages 251–260, Taipei, Taiwan, March 1995.
M. Stonebraker and D. Moore. Object-Relational DBMSs: The Next Great Wave. Morgan Kaufmann, San Francisco, California, April 1996.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lahiri, T., Abiteboul, S., Widom, J. (2000). Ozone: Integrating Structured and Semistructured Data. In: Connor, R., Mendelzon, A. (eds) Research Issues in Structured and Semistructured Database Programming. DBPL 1999. Lecture Notes in Computer Science, vol 1949. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44543-9_18
Download citation
DOI: https://doi.org/10.1007/3-540-44543-9_18
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41481-0
Online ISBN: 978-3-540-44543-2
eBook Packages: Springer Book Archive