skip to main content
10.1145/1376616.1376707acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
research-article

A graph method for keyword-based selection of the top-K databases

Authors Info & Claims
Published:09 June 2008Publication History

ABSTRACT

While database management systems offer a comprehensive solution to data storage, they require deep knowledge of the schema, as well as the data manipulation language, in order to perform effective retrieval. Since these requirements pose a problem to lay or occasional users, several methods incorporate keyword search (KS) into relational databases. However, most of the existing techniques focus on querying a single DBMS. On the other hand, the proliferation of distributed databases in several conventional and emerging applications necessitates the support for keyword-based data sharing and querying over multiple DMBSs. In order to avoid the high cost of searching in numerous, potentially irrelevant, databases in such systems, we propose G-KS, a novel method for selecting the top-K candidates based on their potential to contain results for a given query. G-KSsummarizes each database by a keyword relationship graph, where nodes represent terms and edges describe relationships between them. Keyword relationship graphs are utilized for computing the similarity between each database and a KS query, so that, during query processing, only the most promising databases are searched. An extensive experimental evaluation demonstrates that G-KS outperforms the current state-of-the-art technique on all aspects, including precision, recall, efficiency, space overhead and flexibility of accommodating different semantics.

References

  1. S. Agrawal, S. Chaudhuri, and G. Das. DBXplorer: A system for keyword-based search over relational databases. In Proceedings of ICDE, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. BestPeer. http://www.bestpeer.com.Google ScholarGoogle Scholar
  3. G. Bhalotia, A. Hulgeri, C. Nakhe, S. Chakrabarti, and S. Sudarshan. Keyword searching and browsing in databases using banks. In Proceedings of ICDE, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. J. P. Callan, Z. Lu, and W. B. Croft. Searching distributed collections with inference networks. In Proceedings of SIGIR, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. G. Cao, J.-Y. Nie, and J. Bai. Integrating word relationships into language models. In Proceedings of SIGIR, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. S. Cohen. XSEarch: A semantic search engine for XML. In Proceedings of VLDB, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. DBLP. http://dblp.uni-trier.de.Google ScholarGoogle Scholar
  8. C. Fellbaum, editor. Wordnet: An Electronic Lexical Database. MIT Press, 1998.Google ScholarGoogle ScholarCross RefCross Ref
  9. J. Gao, J.-Y. Nie, G. Wu, and G. Cao. Dependence language model for information retrieval. In Proceedings of SIGIR, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. L. Gravano, H. Garcia-Molina, and A. Tomasic. GlOSS: Text-source discovery over the internet. ACM Transactions on Database Systems (TODS), 24(2):229--264, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. L. Guo, F. Shao, C. Botev, and J. Shanmugasundaram. XRANK: Ranked keyword search over XML documents. In Proceedings of SIGMOD, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. H. He, H. Wang, J. Yang, and P. S. Yu. BLINKS: Ranked keyword searched on graphs. In Proceedings of SIGMOD, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. V. Hristidis, L. Gravano, and Y. Papakonstantinou. Efficient IR-style keyword search over relational databases. In Proceedings of VLDB, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. V. Hristidis and Y. Papakonstantinou. DISCOVER: Keyword search in relational databases. In Proceedings of VLDB, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. V. Kacholia, S. Pandit, S. Chakrabarti, S. Sudarshan, R. Desai, and H. Karambelkar. Bidirectional expansion for keyword search on graph databases. In Proceedings of VLDB, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Y. Li, C. Yu, and H. V. Jagadish. Schema-Free XQuery. In Proceedings of VLDB, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. F. Liu, C. Yu, W. Meng, and A. Chowdhury. Effective keyword search in relational databases. In Proceedings of SIGMOD, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Z. Liu and Y. Chen. Identifying meaningful return information for XML keyword search. In Proceedings of SIGMOD, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Y. Luo, X. Lin, W. Wang, and X. Zhou. SPARK: Top-k keyword query in relational databases. In Proceedings of SIGMOD, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. A. Markowetz, Y. Yang, and D. Papadias. Keyword search on relational data streams. In Proceedings of SIGMOD, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. R. Nallapati and J. Allan. Capturing term dependencies using a language model based on sentence trees. In Proceedings of CIKM, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. S3: Scalable, Shareable and Secure P2P Based Data Management System. http://www.comp.nus.edu.sg/~s3p2p.Google ScholarGoogle Scholar
  23. G. Salton, A. Wong, and C. S. Yang. A vector space model for automatic indexing. Communications of the ACM, 18(11):613--620, 1975. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. M. Sayyadian, H. LeKhac, A. Doan, and L. Gravano. Efficient keyword search across heterogeneous relational databases. In Proceedings of ICDE, 2007.Google ScholarGoogle ScholarCross RefCross Ref
  25. S. K. M. Wong, W. Ziarko, V. V. Raghavan, and P. C. N. Wong. On modeling of information retrieval concepts in vector spaces. ACM Transactions on Database Systems (TODS), 12(3):299--321, 1987. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Y. Xu and Y. Papakonstantinou. Efficient keyword search for smallest LCAs in XML databases. In Proceedings of SIGMOD, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. B. Yu, G. Li, K. Sollins, and A. K. H. Tung. Effective keyword-based selection of relational databases. In Proceedings of SIGMOD, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. B. Yuwono and D. L. Lee. Server ranking for distributed text retrieval systems on the internet. In Proceedings of DASFAA, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. A graph method for keyword-based selection of the top-K databases

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          SIGMOD '08: Proceedings of the 2008 ACM SIGMOD international conference on Management of data
          June 2008
          1396 pages
          ISBN:9781605581026
          DOI:10.1145/1376616

          Copyright © 2008 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 9 June 2008

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

          Acceptance Rates

          Overall Acceptance Rate785of4,003submissions,20%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader