Research article · DOI: 10.1145/2911451.2911536

When a Knowledge Base Is Not Enough: Question Answering over Knowledge Bases with External Text Data

Published: 07 July 2016

ABSTRACT

One of the major challenges for automated question answering over Knowledge Bases (KBQA) is translating a natural language question into Knowledge Base (KB) entities and predicates. Previous systems have used a limited amount of training data to learn a lexicon that is later used for question answering. This approach does not make use of other potentially relevant text data outside the KB, which could supplement the available information. We introduce a new system, Text2KB, that enriches question answering over a knowledge base by using external text data. Specifically, we revisit different phases of the KBQA process and demonstrate that text resources improve question interpretation, candidate generation, and ranking. Building on a state-of-the-art traditional KBQA system, Text2KB utilizes web search results, community question answering data, and a general text document collection to detect question topic entities, map question phrases to KB predicates, and enrich the features of the candidates derived from the KB. Text2KB significantly improves performance over the baseline KBQA method, as measured on the popular WebQuestions dataset. The results and insights developed in this work can guide future efforts on combining textual and structured KB data for question answering.
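To make the pipeline described above concrete, the following is a minimal, hypothetical Python sketch of the general idea: detect a topic entity for the question, generate candidate answers from the KB around that entity, and re-rank the candidates using textual evidence from web search snippets. The function names (search_web, kb_candidates, text_evidence_score), the placeholder data, and the term-overlap scoring are illustrative assumptions only, not the actual Text2KB implementation described in the paper.

```python
# Hypothetical sketch: enrich KBQA candidate ranking with textual
# evidence from external web text, as motivated in the abstract.
from collections import Counter


def search_web(question):
    """Placeholder: return search-result snippets for the question.
    A real system would call a web search API here."""
    return [
        "Barack Obama graduated from Columbia University and Harvard Law School.",
        "Obama attended Occidental College before transferring to Columbia.",
    ]


def kb_candidates(topic_entity):
    """Placeholder: return (predicate, answer) pairs retrieved from the KB
    neighborhood of the detected topic entity."""
    return [
        ("education.institution", "Harvard Law School"),
        ("education.institution", "Columbia University"),
        ("people.person.spouse", "Michelle Obama"),
    ]


def text_evidence_score(answer, snippets, topic_entity):
    """Crude proxy for textual support: how often the answer's terms
    (excluding the topic entity's own terms) appear in the snippets."""
    snippet_terms = Counter(
        term.lower().strip(".,") for s in snippets for term in s.split()
    )
    topic_terms = {t.lower() for t in topic_entity.split()}
    answer_terms = [t.lower() for t in answer.split() if t.lower() not in topic_terms]
    if not answer_terms:
        return 0.0
    return sum(snippet_terms[t] for t in answer_terms) / len(answer_terms)


def answer(question, topic_entity):
    snippets = search_web(question)
    candidates = kb_candidates(topic_entity)
    # Rank KB-derived candidates by their support in external text.
    return sorted(
        candidates,
        key=lambda pa: text_evidence_score(pa[1], snippets, topic_entity),
        reverse=True,
    )


if __name__ == "__main__":
    for pred, ans in answer("where did barack obama go to college?", "Barack Obama"):
        print(pred, "->", ans)
```

In this toy run, KB candidates that are well supported by the web snippets (the educational institutions) rank above a candidate reached through an unrelated predicate, illustrating how external text can act as an additional ranking signal on top of KB-derived features.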

Published in

SIGIR '16: Proceedings of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval
July 2016, 1296 pages
ISBN: 9781450340694
DOI: 10.1145/2911451

Copyright © 2016 ACM

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org.

Publisher: Association for Computing Machinery, New York, NY, United States

Acceptance rates: SIGIR '16 paper acceptance rate: 62 of 341 submissions (18%). Overall acceptance rate: 792 of 3,983 submissions (20%).
