ABSTRACT
The Semantic Web envisions a World Wide Web in which data is described with rich semantics and applications can pose complex queries. To this point, researchers have defined new languages for specifying meanings for concepts and developed techniques for reasoning about them, using RDF as the data model. To flourish, the Semantic Web needs to be able to accommodate the huge amounts of existing data and the applications operating on them. To achieve this, we are faced with two problems. First, most of the world's data is available not in RDF but in XML; XML and the applications consuming it rely not only on the domain structure of the data, but also on its document structure. Hence, to provide interoperability between such sources, we must map between both their domain structures and their document structures. Second, data management practitioners often prefer to exchange data through local point-to-point data translations, rather than mapping to common mediated schemas or ontologies.
This paper describes the Piazza system, which addresses these challenges. Piazza offers a language for mediating between data sources on the Semantic Web, which maps both the domain structure and document structure. Piazza also enables interoperation of XML data with RDF data that is accompanied by rich OWL ontologies. Mappings in Piazza are provided at a local scale between small sets of nodes, and our query answering algorithm is able to chain sets of mappings together to obtain relevant data from across the Piazza network. We also describe an implemented scenario in Piazza and the lessons we learned from it.
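The idea of chaining local mappings can be illustrated with a toy sketch. This is not Piazza's algorithm, which reformulates queries over mappings between XML and RDF sources; the sketch only assumes that each pairwise mapping is a simple field renaming and that a chain is found by breadth-first search over the peers. All peer names, field names, and function names below are hypothetical.

```python
from collections import deque

# Hypothetical point-to-point mappings: each entry renames fields from one
# peer's schema to a neighboring peer's schema. All names are invented.
MAPPINGS = {
    ("peerA", "peerB"): {"title": "name", "author": "creator"},
    ("peerB", "peerC"): {"name": "label", "creator": "writer"},
}

def find_mapping_chain(source, target, mappings):
    """Breadth-first search for a shortest chain of mappings from source to target."""
    queue = deque([(source, [])])
    seen = {source}
    while queue:
        peer, chain = queue.popleft()
        if peer == target:
            return chain
        for (src, dst) in mappings:
            if src == peer and dst not in seen:
                seen.add(dst)
                queue.append((dst, chain + [(src, dst)]))
    return None  # no chain of mappings connects the two peers

def translate(record, chain, mappings):
    """Apply each mapping in the chain, dropping fields with no counterpart."""
    for edge in chain:
        rename = mappings[edge]
        record = {rename[k]: v for k, v in record.items() if k in rename}
    return record

chain = find_mapping_chain("peerA", "peerC", MAPPINGS)
result = translate({"title": "Piazza", "author": "Halevy"}, chain, MAPPINGS)
print(chain)
print(result)
```

No direct mapping from peerA to peerC is declared here; the record reaches peerC's vocabulary only by composing the two local mappings, which is the property the abstract attributes to mapping chains.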