skip to main content
research-article

Obi-Wan: ontology-based RDF integration of heterogeneous data

Published:01 August 2020Publication History
Skip Abstract Section

Abstract

We consider the problem of integrating heterogeneous data (relational, JSON, key-values, graphs etc.) and querying it efficiently. Traditional data integration systems fall into two classes: data warehousing, where all data source content is materialized in a single repository, and mediation, where data remains in their original stores and all data can be queried through a mediator.

We propose to demonstrate Obi-Wan, a novel mediator following the Ontology-Based Data access (OBDA) paradigm. Obi-Wan integrates data sources of many data models under an interface based on RDF graphs and ontologies (classes, properties, and relations between them). The novelty of Obi-Wan is to combine maximum integration power (GLAV mappings, see below) with the highest query answering power supported by an RDF mediator: RDF queries not only over the data but also over the integration ontologies. This makes it more flexible and powerful than comparable systems.

References

  1. N. Abdallah, F. Goasdoué, and M. Rousset. DL-LITER in the light of propositional logic for decentralized data management. In IJCAI, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. R. Alotaibi, D. Bursztyn, A. Deutsch, I. Manolescu, and S. Zampetakis. Towards Scalable Hybrid Stores: Constraint-Based Rewriting to the Rescue. In SIGMOD, June 2019. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. J.-F. Baget, M. Leclère, M. Mugnier, S. Rocher, and C. Sipieter. Graal: A toolkit for query answering with existential rules. In RuleML, 2015.Google ScholarGoogle ScholarCross RefCross Ref
  4. R. Bonaque, T. D. Cao, et al. Mixed-instance querying: A lightweight integration architecture for data journalism. PVLDB, 9(13):1513--1516, 2016. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. M. Buron, F. Goasdoué, I. Manolescu, and M. Mugnier. Reformulation-based query answering for RDF graphs with RDFS ontologies. In ESWC, 2019.Google ScholarGoogle ScholarCross RefCross Ref
  6. M. Buron, F. Goasdoué, I. Manolescu, and M. Mugnier. Ontology-based RDF integration of heterogeneous data. In EDBT, 2020.Google ScholarGoogle Scholar
  7. D. Calvanese, B. Cogrel, S. Komla-Ebri, R. Kontchakov, D. Lanti, M. Rezk, M. Rodriguez-Muro, and G. Xiao. Ontop: Answering SPARQL queries over relational databases. Semantic Web, 8(3), 2017.Google ScholarGoogle Scholar
  8. D. Calvanese, G. De Giacomo, D. Lembo, et al. The MASTRO system for ontology-based data access. Semantic Web, 2(1), 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. D. Calvanese, G. De Giacomo, D. Lembo, M. Lenzerini, R. Rosati, and M. Ruzzi. Using owl in data integration. In Semantic Web Information Management. 2009.Google ScholarGoogle Scholar
  10. D. Calvanese, G. De Giacomo, M. Lenzerini, and M. Y. Vardi. Query processing under GLAV mappings for relational and graph databases. PVLDB, 6(2):61--72, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. G. De Giacomo, D. Lembo, M. Lenzerini, A. Poggi, and R. Rosati. Using Ontologies for Semantic Data Integration. 2018.Google ScholarGoogle Scholar
  12. A. Deutsch and V. Tannen. MARS: A system for publishing XML from mixed and redundant storage. In PVLDB, pages 201--212, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. A. Doan, A. Halevy, and Z. G. Ives. Principles of Data Integration. Morgan Kaufmann, Waltham, MA, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. J. Duggan, A. J. Elmore, M. Stonebraker, et al. The BigDAWG polystore system. SIGMOD, 44(2), 2015. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. H. Garcia-Molina, Y. Papakonstantinou, D. Quass, et al. The TSIMMIS approach to mediation: Data models and languages. JIIS, 8(2), 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. F. Goasdoué, V. Lattès, and M. Rousset. The use of CARIN language and algorithms for information integration: The PICSEL system. IJCIS, 2000.Google ScholarGoogle ScholarCross RefCross Ref
  17. I. Manolescu, D. Florescu, and D. Kossmann. Answering XML queries on heterogeneous data sources. In VLDB, pages 241--250, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. S. Nadal, K. Rabbani, O. Romero, and S. Tadesse. ODIN: A Dataspace Management System. 2019.Google ScholarGoogle Scholar
  19. A. Poggi, D. Lembo, D. Calvanese, G. De Giacomo, M. Lenzerini, and R. Rosati. Linking data to ontologies. J. Data Semantics, 10, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. M. Rodriguez-Muro, R. Kontchakov, and M. Zakharyaschev. Ontology-based data access: Ontop of databases. In ISWC, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. J. F. Sequeda, M. Arenas, and D. P. Miranker. OBDA: query rewriting or materialization? in practice, both! In ISWC, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. G. Smits, O. Pivert, H. Jaudoin, and F. Paulus. AGGREGO SEARCH: interactive keyword query construction. In EDBT, 2014.Google ScholarGoogle Scholar

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in

Full Access

  • Published in

    cover image Proceedings of the VLDB Endowment
    Proceedings of the VLDB Endowment  Volume 13, Issue 12
    August 2020
    1710 pages
    ISSN:2150-8097
    Issue’s Table of Contents

    Publisher

    VLDB Endowment

    Publication History

    • Published: 1 August 2020
    Published in pvldb Volume 13, Issue 12

    Qualifiers

    • research-article

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader