ABSTRACT
The Deep Web is constituted by dynamically generated pages, usually requested through HTML forms; it is notoriously difficult to query and to search, as its pages are obviously non-indexable. Recently, Deep Web data have been made accessible through RESTful services that return information usually structured in JSON or XML format. We propose techniques to make the Deep Web available in the Linked Data Cloud, and we study algorithms for processing queries posed in a transparent way on the Linked Data, providing answers based on the underlying Deep Web sources. We present a software prototype that exposes RESTful services as Linked Data datasets thus allowing a smoother semantic integration of different structured information sources in a global data and knowledge space.
- Sanjay Agrawal, Surajit Chaudhuri, and Gautam Das. 2002. DBXplorer: A System for Keyword-Based Search over Relational Databases. In Proc. of ICDE. 5--16. Google ScholarCross Ref
- Rosa Alarcon and Erik Wilde. 2010. Linking Data from RESTful Services. In Proceedings of the WWW2010 Workshop on Linked Data on the Web, LDOW 2010, Raleigh, USA, April 27, 2010.Google Scholar
- Christian Bizer, Tom Heath, and Tim Berners-Lee. 2009. Linked Data - The Story So Far. Int. J. Semantic Web Inf. Syst 5, 3 (2009), 1--22. Google ScholarCross Ref
- Andrea Calì, Diego Calvanese, Giuseppe De Giacomo, and Maurizio Lenzerini. 2013. Data integration under integrity constraints. In Seminal Contributions to Information Systems Engineering. Springer, 335--352. Google ScholarCross Ref
- Andrea Calì and Davide Martinenghi. 2008. Conjunctive Query Containment under Access Limitations. In Proc. of ER 2008. 326--340. Google ScholarDigital Library
- Andrea Calì and Davide Martinenghi. 2008. Querying Data under Access Limitations. In Proc. of ICDE. 50--59. Google ScholarDigital Library
- Andrea Calì, Davide Martinenghi, and Riccardo Torlone. 2016. Keyword Queries over the Deep Web. In Proc. of ER 2016. 260--268. Google ScholarCross Ref
- Kevin Chen-Chuan Chang, Bin He, and Zhen Zhang. 2005. Toward Large Scale Integration: Building a MetaQuerier over Databases on the Web. In Proc. of CIDR. 44--55.Google Scholar
- Roy T. Fielding and Richard N. Taylor. 2002. Principled Design of the Modern Web Architecture. ACM Transactions on Internet Technology 2, 2 (2002), 115--150. Google ScholarDigital Library
- Vagelis Hristidis and Yannis Papakonstantinou. 2002. Discover: Keyword Search in Relational Databases. In Proc. of VLDB. Google ScholarCross Ref
- Govind Kabra, Zhen Zhang, and Kevin Chen-Chuan Chang. 2007. Dewex: An Exploration Facility for Enabling the Deep Web Integration. In Proc. of ICDE. 1511--1512. Google ScholarCross Ref
- Jayant Madhavan, Loredana Afanasiev, Lyublena Antova, and Alon Y. Halevy. 2009. Harnessing the Deep Web: Present and Future. In Proc. of CIDR.Google Scholar
- Kevin R. Page, David C. De Roure, and Kirk Martinez. 2011. REST and Linked Data: A Match Made for Domain Driven Development?. In Proceedings of the Second International Workshop on RESTful Design (WS-REST '11). ACM, 22--25. Google ScholarDigital Library
- Ahmet Soylu, Felix M dritscher, Fridolin Wild, Patrick De Causmaecker, and Piet Desmet. 2012. Mashups by orchestration and widget-based personal environments: Key challenges, solution strategies, and an application. Program 46, 4 (2012), 383--428. Google ScholarCross Ref
- Steffen Stadtmüller and Andreas Harth. 2012. Towards Data-driven Programming for RESTful Linked Data. In Proceedings of the ISWC 2012 workshop on Programming the Semantic Web. -.Google Scholar
- Steffen Stadtmüller, Sebastian Speiser, Andreas Harth, and Rudi Studer. 2013. Data-Fu: A Language and an Interpreter for Interaction with Read/Write Linked Data. In Proceedings of the 22Nd International Conference on World Wide Web (WWW '13). ACM, 1225--1236.Google ScholarDigital Library
Index Terms
- Querying deep web data sources as linked data
Recommendations
Using the relation ontology Metarel for modelling Linked Data as multi-digraphs
Linked Data for Health Care and the Life SciencesThe Semantic Web standards OWL and RDF are often used to represent biomedical information as Linked Data; however, the OWL/RDF syntax, which combines both, was never optimised for querying. By combining two formal paradigms for modelling Linked Data, ...
Querying semantic web data with SPARQL
PODS '11: Proceedings of the thirtieth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systemsThe Semantic Web is the initiative of the W3C to make information on the Web readable not only by humans but also by machines. RDF is the data model for Semantic Web data, and SPARQL is the standard query language for this data model. In the last ten ...
Toward the Semantic Deep Web
The Semantic Deep Web fuses aspects of the Semantic Web with the use of ontology-aware browsers to extract information from the Deep Web.
Comments