skip to main content
10.1145/1142473.1142536acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
Article

Effective keyword search in relational databases

Published:27 June 2006Publication History

ABSTRACT

With the amount of available text data in relational databases growing rapidly, the need for ordinary users to search such information is dramatically increasing. Even though the major RDBMSs have provided full-text search capabilities, they still require users to have knowledge of the database schemas and use a structured query language to search information. This search model is complicated for most ordinary users. Inspired by the big success of information retrieval (IR) style keyword search on the web, keyword search in relational databases has recently emerged as a new research topic. The differences between text databases and relational databases result in three new challenges: (1) Answers needed by users are not limited to individual tuples, but results assembled from joining tuples from multiple tables are used to form answers in the form of tuple trees. (2) A single score for each answer (i.e. a tuple tree) is needed to estimate its relevance to a given query. These scores are used to rank the most relevant answers as high as possible. (3) Relational databases have much richer structures than text databases. Existing IR strategies to rank relational outputs are not adequate. In this paper, we propose a novel IR ranking strategy for effective keyword search. We are the first that conducts comprehensive experiments on search effectiveness using a real world database and a set of keyword queries collected by a major search company. Experimental results show that our strategy is significantly better than existing strategies. Our approach can be used both at the application level and be incorporated into a RDBMS to support keyword-based search in relational databases.

References

  1. S Agrawal, S Chaudhuri, G Das: DBXplorer: A system for keyword-based search over relational databases. ICDE 2002Google ScholarGoogle Scholar
  2. G. Bhalotia, A. Hulgeri, C. Nakhey, S. Chakrabarti, and S. Sudarshan. Keyword searching and browsing in databases using BANKS. ICDE 2002 Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. A. Balmin, V. Hristidis,Y. Papakonstantinou: Authority Based Keyword Queries in Databases using ObjectRank. VLDB 2004Google ScholarGoogle Scholar
  4. S. Brin and L. Page. The anatomy of a large-scale hypertextual web search engine. WWW 1998 Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. S. Cohen, Jonathan Mamou, Yaron Kanza, Yehoshua Sagiv: XSEarch: A Semantic Search Engine for XML. VLDB 2003 Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. D. Florescu, I. Manolescu, and D. Kossmann. Integrating keyword search into XML query processing. WWW 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. R. Goldman, N. Shivakumar, S. Venkatasubramanian, and H. Garcia-Molina. Proximity Search in Databases. VLDB 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. L. Guo, F. Shao, C. Botev, and J. Shanmugasundaram. XRANK: Ranked keyword search over XML documents. SIGMOD 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. L. Guo,J. Shanmugasundaram, K. Beyer, E. Shekita:Efficient Inverted Lists and Query Algorithms for Structured Value Ranking in Update-Intensive Relational Databases. ICDE 2005 Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. V. Hristidis and Y. Papakonstantinou. DISCOVER: Keyword search in relational databases. VLDB 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. V. Hristidis,L. Gravano,Y. Papakonstantinou:Efficient IR-Style Keyword Search over Relational Databases.VLDB 2003Google ScholarGoogle Scholar
  12. R. Kaushik, R. Krishnamurthy, J. F. Naughton, and R. Ramakrishnan. On the integration of structure indexes and inverted lists. SIGMOD 2004 Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Y. Li, Cong Yu, H. V. Jagadish: Schema-Free XQuery. VLDB 2004Google ScholarGoogle Scholar
  14. S. Liu, F. Liu, C. T. Yu, Weiyi Meng: An effective approach to document retrieval via utilizing WordNet and recognizing phrases. SIGIR 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. L. Page, S. Brin, R. Motwani and T. Winograd. The PageRank Citation Ranking: Bringing Order to the Web, Technical Report, 1998Google ScholarGoogle Scholar
  16. R. Sacks-Davis, Tuong Dao, James A. Thom, Justin Zobel Indexing documents for queries on structure, content and attributes. ISDM 1997Google ScholarGoogle Scholar
  17. A. Singhal, Chris Buckley, Mandar Mitra: Pivoted Document Length Normalization. SIGIR 1996 Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. A. Singhal. Modern information retrieval: A brief overview. IEEE Data Eng. Bull. 24(4), 2001Google ScholarGoogle Scholar
  19. E. M. Voorhees. Overview of the TREC-9 Question Answering Track. TREC 2000Google ScholarGoogle Scholar
  20. Pew Internet & American Life Project Report: Search Engine Users, 2005. www.pewinternet.org/pdfs/PIP_Searchengine_users.pdGoogle ScholarGoogle Scholar
  21. Google. www.google.com/ 2005Google ScholarGoogle Scholar
  22. DB2 Text Information Extender. 2005 http://www.ibm.com/software/data/db2/extenders/textinformation/index.htmlGoogle ScholarGoogle Scholar
  23. Micorsoft SQL Server 2000. www.microsoft.com/sql/ 2005Google ScholarGoogle Scholar
  24. MySQL. dev.mysql.com/doc/mysql/en/Fulltext_Search.html.Google ScholarGoogle Scholar
  25. G. A. Miller. WordNet: A lexical database for English. CACM, 38(11):39--41, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. G. Salton and M. McGill. Introduction to Modern Information Retrieval. McGraw-Hill, 1983 Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. D. Grossman and O. Frieder, Information Retrieval: Algorithms and Heuristics, Springer Publishers, 2nd Edition 2004 Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Effective keyword search in relational databases

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      SIGMOD '06: Proceedings of the 2006 ACM SIGMOD international conference on Management of data
      June 2006
      830 pages
      ISBN:1595934340
      DOI:10.1145/1142473

      Copyright © 2006 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 27 June 2006

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • Article

      Acceptance Rates

      Overall Acceptance Rate785of4,003submissions,20%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader