article

Acquiring causal knowledge from text using the connective marker tame

Authors:
Takashi Inui

Research Fellow of the Japan Society for the Promotion of Science, Yamato, Japan

Research Fellow of the Japan Society for the Promotion of Science, Yamato, Japan
View Profile

,
Kentaro Inui

School of Information Science, Nara Institute of Science and Technology, Nara, Japan

School of Information Science, Nara Institute of Science and Technology, Nara, Japan
View Profile

,
Yuji Matsumoto

School of Information Science, Nara Institute of Science and Technology, Nara, Japan

School of Information Science, Nara Institute of Science and Technology, Nara, Japan
View Profile

ACM Transactions on Asian Language Information Processing Volume 4 Issue 4pp 435–474https://doi.org/10.1145/1113308.1113313

Published:01 December 2005Publication History

ACM Transactions on Asian Language Information Processing

Abstract

In this paper, we deal with automatic knowledge acquisition from text, specifically the acquisition of causal relations. A causal relation is the relation existing between two events such that one event causes (or enables) the other event, such as “hard rain causes flooding” or “taking a train requires buying a ticket.” In previous work these relations have been classified into several types based on a variety of points of view. In this work, we consider four types of causal relations---cause, effect, precond(ition) and means---mainly based on agents' volitionality, as proposed in the research field of discourse understanding. The idea behind knowledge acquisition is to use resultative connective markers, such as “because,” “but,” and “if” as linguistic cues. However, there is no guarantee that a given connective marker always signals the same type of causal relation. Therefore, we need to create a computational model that is able to classify samples according to the causal relation. To examine how accurately we can automatically acquire causal knowledge, we attempted an experiment using Japanese newspaper articles, focusing on the resultative connective “tame.” By using machine-learning techniques, we achieved 80% recall with over 95% precision for the cause, precond, and means relations, and 30% recall with 90% precision for the effect relation. Furthermore, the classification results suggest that one can expect to acquire over 27,000 instances of causal relations from 1 year of Japanese newspaper articles.

References

Allen, J. F. 1983. Recognizing intentions from natural language utterances. In M. Brady and R.C. Berwick (Eds.), Computational models of discourse. MIT Press, Cambridge, MA.]]Google Scholar
Allen, J. F. 1995. Natural Language Understanding. Benjamin/Cumming, New York.]] Google Scholar
Altenberg, B. 1984. Causal linking in spoken and written English. Studia Linguistica 38, 1.]]Google Scholar
Britannica. 1998. Britannica CD98 multimedia edition.]]Google Scholar
Carberry, S. 1990. Plan Recognition in Natural Language Dialogue. MIT Press, Cambridge, MA.]] Google Scholar
Cohen, J. 1960. A coefficient of agreement for nominal scales. Educational and Psychological Measurement 20, 37--46.]]Google Scholar
Fellbaum, C. 1998. WordNet: An Electronic Lexical Database. MIT Press, Cambridge, MA.]]Google Scholar
Garcia, D. 1997. COATIS, an NLP system to locate expressions of actions connected by causality links. In Proc. of The 10th European Knowledge Acquisition Workshop. 347--352.]] Google Scholar
Girju, R. and Moldovan, D. 2002. Mining answers for causation questions. In Proc. The AAAI Spring Symposium on Mining Answers from Texts and Knowledge Bases.]]Google Scholar
Harabagiu, S. M. and Moldovan, D. I. 1997. Textnet---a text-based intelligent system. Natural Language Engineering 3, 171--190.]] Google Scholar
Heckerman, D., Meek, C., and Cooper, G. 1997. A Bayesian approach to causal discovery. Tech. rep., Microsoft Research Advanced Technology Division, Microsoft Corporation, Technical Report MSR-TR-97-05.]]Google Scholar
Hobbs, J. R. 1979. Coherence and co-reference. Cognitive Science 1, 67--82.]]Google Scholar
Hobbs, J. R. 1985. On the coherence and structure of discourse. Tech. rep., Technical Report CSLI-85-37, Center for The Study of Language and Information.]]Google Scholar
Hobbs, J. R., Stickel, M., Appelt, D., and Martion, P. 1993. Interpretation as abduction. Artificial Intelligence 63, 69--142.]] Google Scholar
Ichikawa, T. 1978. Introduction to tyle theory for Japanese education. Education (in Japan).]]Google Scholar
Ikehara, S., Miyazaki, M., Shirai, S., Yokoo, A., Nakaiwa, H., Ogura, K., Ooyama, Y., and Hayashi, Y. 1997. Goi-Taikei---A Japanese Lexicon, Iwanami Shoten.]]Google Scholar
Ikehara, S., Shirai, S., Yokoo, A., and Nakaiwa, H. 1991. Toward an MT system without pre-editing---effects of new methods in ALT-J/E-. In Proc. of the Third Machine Translation Summit: MT Summit III, Washington DC. 101--106.]]Google Scholar
Iwanska, L. M. and Shapiro, S. C. 2000. Natural Language Processing and Knowledge Representation---Language for Knowledge and Knowledge for Language. MIT Press, Cambridge, MA.]] Google Scholar
Joachims, T. 1998. Text categorization with support vector machines: learning with many relevant features. In Proceedings of ECML-98, 10th European Conference on Machine Learning, C. Nédellec and C. Rouveirol, Eds. Number 1398. Springer Verlag, New York. 137--142.]] Google Scholar
Jonsson, K. 2000. Robust correlation and support vector machines for face identification. Ph.D. thesis, University of Surrey.]]Google Scholar
Khoo, C. S. G., Chan, S., and Niu, Y. 2000. Extracting causal knowledge from a medical database using graphical patterns. In Proc. of The 38th. Annual Meeting of The Association for Computational Linguistics (ACL2000). 336--343.]] Google Scholar
Krippendorf, K. 1980. Content analysis: An introduction to its methodology. Sage, Thousand Oaks, CA.]]Google Scholar
Kudo, T. and Matsumoto, Y. 2003. Japanese dependency analysis using cascaded chunking. In Proc. of The 6th. Conference on Natural Language Learning (CoNLL).]] Google Scholar
Lenat, D. 1995. Cyc: A large-scale investment in knowledge infrastructure. Communications of the ACM 38, 11.]] Google Scholar
Litman, D. J. and Allen, J. F. 1987. A plan recognition model for subdialogues in conversations. Cognitive Science 11, 163--200.]]Google Scholar
Liu, H., Lieberman, H., and Selker, T. 2003. A model of textual affect sensing using real-world knowledge. In Proc. of The International Conference on Intelligent User Interfaces. 125--132.]] Google Scholar
Low, B. T., Chan, K., Choi, L. L., Chin, M. Y., and Lay, S. L. 2001. Semantic expectation-based causation knowledge extraction: A study on Hong Kong stock movement analysis. In Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD). 114--123.]] Google Scholar
Mann, W. C. and Thompson, S. A. 1987. Rhetorical structure theory: A theory of text organization. In USC Information Sciences Institute, Technical Report ISI/RS-87-190.]]Google Scholar
Marcu, D. 1997. The rhetorical parsing of natural language texts. In Proc. of ACL97/EACL97. 96--103.]] Google Scholar
Marcu, D. 2002. An unsupervised approach to recognizing discourse relations. In Proc. of The 40th. Annual Meeting of The Association for Computational Linguistics (ACL2002). 368--375.]] Google Scholar
Masuoka, T. 1997. Complex sentence. Kuroshio (in Japan).]]Google Scholar
Masuoka, T. and Takubo, Y. 1992. Fundamental Japanese grammar(revised version). Kuroshio. (in Japan).]]Google Scholar
Matsumoto, Y., Kitauchi, A., Yamashita, T., Hirano, Y., Matsuda, H., and Asahara, M. 1999. Japanese Morphological Analyzer ChaSen Users Manual version 2.0. Technical Report NAIST-IS-TR990123, Nara Institute of Science and Technology Technical Report.]]Google Scholar
MEDLINE. 2001. The MEDLINE database. See also, http://www.ncbi.nlm.nih.gov/PubMed/.]]Google Scholar
Nagano, M. 1986. A reveiw of style theory. Asakura (in Japan).]]Google Scholar
Nakaiwa, H. and Ikehara, S. 1995. Intrasentential resolution of Japanese zero pronouns in a machine translation system using semantic and pragmatic constraints. In Proc. of The 6th TMI. 96--105.]]Google Scholar
Nikkei. 1990. Nihon Keizai Shimbun CD-ROM version.]]Google Scholar
Pearl, J. 1988. Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference. Morgan Kaufmann.]] Google Scholar
Pearl, J. 2000. Causality: Models, Reasoning, and Inference. Cambridge Universiy Press, London.]] Google Scholar
Pei, J., Han, J., Mortazavi-Asi, B., Pinto, H., Chen, Q., Dayal, U., and Hsu, M. C. 2001. Prefix Span: mining sequential patterns efficiently by prefix projected pattern growth. In Proc. of 1st. Conference of Data Enginnering (ICDE2001). 215--226.]] Google Scholar
Rifkin, R. and Klautau, A. 2004. In defense of one-vs-all classification. Journal of Machine Learning Research 5, 101--141.]] Google Scholar
RWC. 1998. RWC text corpus 2nd edition, Iwanami Japanese dictionary tagged/morphological data, 5th edn.]]Google Scholar
Satou, H., Kasahara, K., and Matsuzawa, K. 1999. Rertrieval {sic} of simplified causal knowledge in text and its application. In Technical report of IEICE, Thought and Language. (in Japan).]]Google Scholar
Satou, S. and Nagao, M. 1990. Toward memory-based translation. In Proc. of The 13th. International Conference on Computational Linguistics (COLING90). 247--252.]] Google Scholar
Schank, R. and Abelson, R. 1977. Scripts, Plans, Goals and Understanding: An Inquiry into Human Knowledge Structures. Lawrence Erlbaum Assoc., Mahwah, NJ.]]Google Scholar
Schank, R. and Riesbeck, C. 1981. Inside Computer Understanding: Five Programs Plus Minitures. Lawrence Erlbaum Assoc., Mahwah, NJ.]] Google Scholar
Singh, P., Lin, T., Mueller, E. T., Lim, G., Perkins, T., and Zhu, W. L. 2002. Open Mind Common Sense: Knowledge acquisition from the general public. In Proc. of The 1st. International Conference on Ontologies, Databases and Applications of Semantics for Large Scale Information Systems.]] Google Scholar
Stork, D. G. 1999. Character and document research in the Open Mind Initiative. In Proc. of International Conference on Document Analysis and Recognition. 1--12.]] Google Scholar
Suyama, A. 2005. Sentence identification in HTML documents with machine learning. M.S. thesis, Tokyo Institute of Technology (in Japan).]]Google Scholar
Terada, A. 2003. A study of text mining techniques using natural language processing. Ph.D. thesis, Tokyo Institute of Technology (in Japan).]]Google Scholar
Torisawa, K. 2003. Automatic extraction of ‘commonsense’ inference rules from corpora. In Proc. of The 9th Annual Meeting of The Association for Natural Language Processing. 318--321 (in Japan).]]Google Scholar
Vapnik, V. N. 1995. The Nature of Statistical Learning Theory. Springer, New York.]] Google Scholar
Vert, J. 2002. Support vector machine prediction of signal peptide cleavage site using a new class of kernels for strings. In Proceedings of the Pacific Symposium on Biocomputing 2002. 649--660.]]Google Scholar
Voorhees, E. M. and Harman, D. K. 2001. The Ninth Text Retrieval Conference (TREC9). See also, http://trec.nist.gov.]]Google Scholar
Yokoi, T. 1995. The EDR electronic dictionary. Communnications of the ACM 38, 11, 42--44.]] Google Scholar

Index Terms

Acquiring causal knowledge from text using the connective marker tame
1. Computing methodologies
  1. Artificial intelligence
    1. Natural language processing
      1. Language resources
  2. Machine learning
    1. Learning settings
2. Information systems
  1. Information retrieval
    1. Document representation
      1. Content analysis and feature selection

Recommendations

Causal relation of queries from temporal logs
WWW '07: Proceedings of the 16th international conference on World Wide Web

In this paper, we study a new problem of mining causal relation of queries in search engine query logs. Causal relation between two queries means event on one query is the causation of some event on the other. We first detect events in query logs by ...
Read More
The Study of Causal Nouns in Mandarin Chinese: From the Perspective of Syntactic Realization and Pragmatic Function
Chinese Lexical Semantics
Abstract
Causal nouns can express two types of information, namely, the cause and the effect. These two types of information can be realized in different syntactic positions in the sentence and discourse where causal nouns appear. This paper first explains ...
Read More
Mining causality knowledge from textual data
AIA'06: Proceedings of the 24th IASTED international conference on Artificial intelligence and applications

Mining causality knowledge will induce a knowledge of reasoning that is beneficial for our daily use in diagnosis. Then, this framework is for discovering causality existing between causative antecedent and effective consequent discourse units. There ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

Published in

ACM Transactions on Asian Language Information Processing Volume 4, Issue 4
December 2005
129 pages
ISSN:1530-0226
EISSN:1558-3430
DOI:10.1145/1113308
Issue’s Table of Contents

Copyright © 2005 ACM
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 1 December 2005
Published in talip Volume 4, Issue 4

Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
Causal relation
connective marker
volitionality
Qualifiers
- article
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 19
  Total Citations
  View Citations
- 785
  Total Downloads
- Downloads (Last 12 months)5
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Acquiring causal knowledge from text using the connective marker tame

ACM Transactions on Asian Language Information Processing

Abstract

References

Cited By

Index Terms

Recommendations

Causal relation of queries from temporal logs

The Study of Causal Nouns in Mandarin Chinese: From the Perspective of Syntactic Realization and Pragmatic Function

Mining causality knowledge from textual data

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Acquiring causal knowledge from text using the connective marker tame

ACM Transactions on Asian Language Information Processing

Abstract

References

Cited By

Index Terms

Recommendations

Causal relation of queries from temporal logs

The Study of Causal Nouns in Mandarin Chinese: From the Perspective of Syntactic Realization and Pragmatic Function

Mining causality knowledge from textual data

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media