ABSTRACT
Materialisation is often used in RDF systems as a preprocessing step to derive all facts implied by given RDF triples and rules. Although widely used, materialisation considers all possible rule applications and can use a lot of memory for storing the derived facts, which can hinder performance. We present a novel materialisation technique that compresses the RDF triples so that the rules can sometimes be applied to multiple facts at once, and the derived facts can be represented using structure sharing. Our technique can thus require less space, as well as skip certain rule applications. Our experiments show that our technique can be very effective: when the rules are relatively simple, our system is both faster and requires less memory than prominent state-of-the-art RDF systems.
- Daniel J. Abadi. 2008. Query Execution in Column-oriented Database Systems. Ph.D. Dissertation. MIT, Cambridge, MA, USA. AAI0820132.Google Scholar
- D. J. Abadi, S. Madden, and M. Ferreira. 2006. Integrating Compression and Execution in Column-Oriented Database Systems. In Proc. SIGMOD. 671--682.Google Scholar
- D. J. Abadi, A. Marcus, S. Madden, and K. Hollenbach. 2009. SW-Store: a vertically partitioned DBMS for Semantic Web data management . VLDB Journal , Vol. 18, 2 (2009), 385--406.Google ScholarDigital Library
- S. Abiteboul, R. Hull, and V. Vianu. 1995. Foundations of Databases. Addison Wesley.Google Scholar
- B. Bishop, A. Kiryakov, D. Ognyanoff, I. Peikov, Z. Tashev, and R. Velkov. 2011. OWLIM: A family of scalable semantic repositories . Semantic Web , Vol. 2, 1 (2011), 33--42.Google ScholarCross Ref
- D. Croft, A.F. Mundo, R. Haw, M. Milacic, J. Weiser, G. Wu, M. Caudy, P. Garapati, M. Gillespie, M.R. Kamdar, et almbox. 2013. The Reactome pathway knowledgebase. Nucleic acids research , Vol. 42, D1 (2013), D472--D477.Google Scholar
- B. N. Grosof, I. Horrocks, R. Volz, and S. Decker. 2003. Description Logic Programs: Combining Logic Programs with Description Logic. In Proc. WWW . 48--57.Google Scholar
- Y. Guo, Z. Pan, and J. Heflin. 2005. LUBM: A benchmark for OWL knowledge base systems . Journal of Web Semantics , Vol. 3, 2--3 (2005), 158--182.Google ScholarDigital Library
- Pan Hu, Jacopo Urbani, Boris Motik, and Ian Horrocks. 2019. Datalog Reasoning over Compressed RDF Knowledge Bases . CoRR , Vol. abs/1908.10177 (2019).Google Scholar
- S. Idreos, F. Groffen, N. Nes, S. Manegold, K. S. Mullender, and M. L. Kersten. 2012. MonetDB: Two Decades of Research in Column-oriented Database Architectures . IEEE Data Engineering Bulletin , Vol. 35, 1 (2012), 40--45.Google Scholar
- Graham Klyne, Jeremy J. Carroll, and Brian McBride. 2014. RDF 1.1: Concepts and Abstract Syntax . W3C Recommendation.Google Scholar
- A. Lamb, M. Fuller, R. Varadarajan, N. Tran, B. Vandier, L. Doshi, and C. Bear. 2012. The Vertica Analytic Database: C-Store 7 Years Later . PVLDB , Vol. 5, 12 (2012), 1790--1801.Google ScholarDigital Library
- Stefan Manegold, Peter A. Boncz, and Niels Nes. 2004. Cache-Conscious Radix-Decluster Projections. In Proc. VLDB. 684--695.Google ScholarCross Ref
- B. Motik, Y. Nenov, R. Piro, and I. Horrocks. 2015. Incremental update of datalog materialisation: the backward/forward algorithm. In Proc. AAAI . 1560--1568.Google Scholar
- B. Motik, Y. Nenov, R. Piro, I. Horrocks, and D. Olteanu. 2014. Parallel Materialisation of Datalog Programs in Centralised, Main-Memory RDF Systems. In Proc. AAAI. 129--137.Google Scholar
- Sebastian Rahtz, Alexander Dutton, Donna Kurtz, Graham Klyne, Andrew Zisserman, and Relja Arandjelovic. 2011. CLAROS--Collaborating on Delivering the Future of the Past. In Proc. DH . 355--357.Google Scholar
- Jacopo Urbani, Ceriel J. H. Jacobs, and Markus Krö tzsch. 2016. Column-Oriented Datalog Materialization for Large Knowledge Graphs. In Proc. AAAI . 258--264.Google Scholar
- Zhe Wu, George Eadon, Souripriya Das, Eugene Inseok Chong, Vladimir Kolovski, Melliyal Annamalai, and Jagannathan Srinivasan. 2008. Implementing an inference engine for RDFS/OWL constructs and user-defined rules in Oracle. In Proc. ICDE. 1239--1248.Google ScholarDigital Library
Index Terms
- Datalog Reasoning over Compressed RDF Knowledge Bases
Recommendations
Modular materialisation of Datalog programs
AbstractAnswering queries over large datasets extended with Datalog rules plays a key role in numerous data management applications, and it has been implemented in several highly optimised Datalog systems in both academic and commercial ...
Datalog±: a unified approach to ontologies and integrity constraints
ICDT '09: Proceedings of the 12th International Conference on Database TheoryWe report on a recently introduced family of expressive extensions of Datalog, called Datalog±, which is a new framework for representing ontological axioms in form of integrity constraints, and for query answering under such constraints. Datalog± is ...
A general Datalog-based framework for tractable query answering over ontologies
Ontologies and rules play a central role in the development of the Semantic Web. Recent research in this context focuses especially on highly scalable formalisms for the Web of Data, which may highly benefit from exploiting database technologies. In ...
Comments