skip to main content
10.1145/1807167.1807212acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
research-article
Results Reproduced / v1.1

Active knowledge: dynamically enriching RDF knowledge bases by web services

Published:06 June 2010Publication History

ABSTRACT

The proliferation of knowledge-sharing communities and the advances in information extraction have enabled the construction of large knowledge bases using the RDF data model to represent entities and relationships. However, as the Web and its latently embedded facts evolve, a knowledge base can never be complete and up-to-date. On the other hand, a rapidly increasing suite of Web services provide access to timely and high-quality information, but this is encapsulated by the service interface. We propose to leverage the information that could be dynamically obtained from Web services in order to enrich RDF knowledge bases on the fly whenever the knowledge base does not suffice to answer a user query.

To this end, we develop a sound framework for appropriately generating queries to encapsulated Web services and efficient algorithms for query execution and result integration. The query generator composes sequences of function calls based on the available service interfaces. As Web service calls are expensive, our method aims to reduce the number of calls in order to retrieve results with sufficient recall. Our approach is fully implemented in a complete prototype system named ANGIE1. The user can query and browse the RDF knowledge base as if it already contained all the facts from the Web services. This data, however, is gathered and integrated on the fly, transparently to the user. We demonstrate the viability and efficiency of our approach in experiments based on real-life data provided by popular Web services.

References

  1. S. Abiteboul, O. Benjelloun, and T. Milo. The Active XML project: an overview. VLDB J., 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. S. Abiteboul, R. Hull, and V. Vianu. Foundations of Databases. Addison-Wesley, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. B. Amann, I. Fundulaki, M. Scholl, C. Beeri, and A.-M. Vercoustre. Mapping XML fragments to community Web ontologies. In WebDB, 2001.Google ScholarGoogle Scholar
  4. A. Arasu and R. Kaushik. A grammar-based entity representation framework for data cleaning. In SIGMOD Conference, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. S. Auer, C. Bizer, G. Kobilarov, J. Lehmann, R. Cyganiak, and Z. Ives. DBpedia: A nucleus for a Web of Open Data. The Semantic Web, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. M. Banko, M. J. Cafarella, S. Soderland, M. Broadhead, and O. Etzioni. Open Information Extraction from the Web. In IJCAI, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. D. Berardi, D. Calvanese, G. D. Giacomo, R. Hull, and M. Mecella. Automatic composition of transition-based semantic web services with messaging. In VLDB, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. C. Bizer, T. Heath, K. Idehen, and T. Berners-Lee. Linked data on the web (LDOW2008). In WWW, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. A. Calì, G. Gottlob, and T. Lukasiewicz. A general datalog-based framework for tractable query answering over ontologies. In PODS, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. B. Cautis, A. Deutsch, and N. Onose. Querying data sources that export infinite sets of views. In ICDT, pages 84--97, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. K. C.-C. Chang, B. He, and Z. Zhang. Toward large scale integration: Building a metaquerier over databases on the web. In CIDR, 2005.Google ScholarGoogle Scholar
  12. A. Deutsch, L. Sui, and V. Vianu. Specification and verification of data-driven web services. In PODS, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. X. L. Dong, L. Berti-Equille, and D. Srivastava. Truth discovery and copying detection in a dynamic world. PVLDB, 2(1), 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. O. M. Duschka and M. R. Genesereth. Answering recursive queries using views. In PODS, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. O. M. Duschka, M. R. Genesereth, and A. Y. Levy. Recursive query plans for data integration. J. Log. Program., 43(1), 2000.Google ScholarGoogle Scholar
  16. R. Fagin, L. M. Haas, M. A. Hernández, R. J. Miller, L. Popa, and Y. Velegrakis. Clio: Schema mapping creation and data exchange. In Conceptual Modeling: Foundations and Applications, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. D. Freitag and N. Kushmerick. Boosted wrapper induction. In AAAI/IAAI, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. M. Friedman and D. S. Weld. Efficiently executing information-gathering plans. In IJCAI (1), 1997.Google ScholarGoogle Scholar
  19. H. Garcia-Molina, Y. Papakonstantinou, D. Quass, A. Rajaraman, Y. Sagiv, J. D. Ullman, V. Vassalos, and J. Widom. The tsimmis approach to mediation: Data models and languages. J. Intell. Inf. Syst., 8(2), 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. A. Y. Halevy. Answering queries using views: A survey. VLDB J., 10(4), 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. M. Jarrar and M. D. Dikaiakos. MashQL: a query-by-diagram topping SPARQL. In ONISW, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. S. Kambhampati, E. Lambrecht, U. Nambiar, Z. Nie, and G. Senthil. Optimizing recursive information gathering plans in emerac. J. Intell. Inf. Syst., 22(2), 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. G. Kasneci, F. M. Suchanek, G. Ifrim, M. Ramanath, and G. Weikum. NAGA: Searching and Ranking Knowledge. In ICDE, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. I. Koffina, G. Serfiotis, V. Christophides, and V. Tannen. Mediating rdf/s queries to relational and xml sources. Int. J. Semantic Web Inf. Syst., 2(4), 2006.Google ScholarGoogle Scholar
  25. C. T. Kwok and D. S. Weld. Planning to gather information. In AAAI/IAAI, Vol. 1, 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. A. Y. Levy, A. Rajaraman, and J. J. Ordille. Querying heterogeneous information sources using source descriptions. In VLDB, 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. T. Neumann and G. Weikum. RDF-3X: a RISC-style engine for RDF. PVLDB, 1(1), 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. A. Polleres. From SPARQL to rules (and back). In WWW, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. R. Pottinger and A. Y. Levy. A scalable algorithm for answering queries using views. In VLDB, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. N. Preda, F. M. Suchanek, G. Kasneci, T. Neumann, M. Ramanath, and G. Weikum. ANGIE: Active knowledge for interactive exploration. PVLDB, 2(2), 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. K. Q. Pu, V. Hristidis, and N. Koudas. Syntactic rule based approach to Web service composition. In ICDE, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. A. Rajaraman, Y. Sagiv, and J. D. Ullman. Answering queries using templates with binding patterns. In PODS, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. P. Senellart, A. Mittal, D. Muschick, R. Gilleron, and M. Tommasi. Automatic wrapper induction from hidden-Web sources with domain knowledge. In WIDM, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. D. E. Simmen, M. Altinel, V. Markl, S. Padmanabhan, and A. Singh. Damia: data mashups for intranet applications. In SIGMOD, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. F. M. Suchanek, G. Kasneci, and G. Weikum. YAGO: A Core of Semantic Knowledge. In 16th international World Wide Web conference (WWW 2007), New York, NY, USA, 2007. ACM Press. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. M. Technologies. The freebase project. http://freebase.com.Google ScholarGoogle Scholar
  37. S. Thakkar, J. L. Ambite, and C. A. Knoblock. Composing, optimizing, and executing plans for bioinformatics web services. VLDB J., 14(3), 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. V. Vassalos and Y. Papakonstantinou. Describing and using query capabilities of heterogeneous sources. In VLDB, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. Word Wide Web Consortium. RDF Vocabulary Description Language 1.0: RDF Schema. W3C Recommendation 2004-02-10.Google ScholarGoogle Scholar
  40. Word Wide Web Consortium. XSL Transformations (XSLT). W3C Recommendation 1999-11-16.Google ScholarGoogle Scholar
  41. World Wide Web Consortium. SPARQL Query Language for RDF (W3C Recommendation 2008-01-15), 2008. http://www.w3.org/TR/rdf-sparql-query/.Google ScholarGoogle Scholar
  42. F. Wu and D. S. Weld. Automatically refining the Wikipedia infobox ontology. In Proc. of the Int. WWW Conf., 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  43. V. Zadorozhny, L. Raschid, M.-E. Vidal, T. Urhan, and L. Bright. Efficient evaluation of queries in a mediator for websources. In SIGMOD Conference, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Active knowledge: dynamically enriching RDF knowledge bases by web services

            Recommendations

            Comments

            Login options

            Check if you have access through your login credentials or your institution to get full access on this article.

            Sign in
            • Published in

              cover image ACM Conferences
              SIGMOD '10: Proceedings of the 2010 ACM SIGMOD International Conference on Management of data
              June 2010
              1286 pages
              ISBN:9781450300322
              DOI:10.1145/1807167

              Copyright © 2010 ACM

              Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

              Publisher

              Association for Computing Machinery

              New York, NY, United States

              Publication History

              • Published: 6 June 2010

              Permissions

              Request permissions about this article.

              Request Permissions

              Check for updates

              Qualifiers

              • research-article

              Acceptance Rates

              Overall Acceptance Rate785of4,003submissions,20%

            PDF Format

            View or Download as a PDF file.

            PDF

            eReader

            View online with eReader.

            eReader