Abstract
In this paper, we deal with automatic knowledge acquisition from text, specifically the acquisition of causal relations. A causal relation is the relation existing between two events such that one event causes (or enables) the other event, such as “hard rain causes flooding” or “taking a train requires buying a ticket.” In previous work these relations have been classified into several types based on a variety of points of view. In this work, we consider four types of causal relations---cause, effect, precond(ition) and means---mainly based on agents' volitionality, as proposed in the research field of discourse understanding. The idea behind knowledge acquisition is to use resultative connective markers, such as “because,” “but,” and “if” as linguistic cues. However, there is no guarantee that a given connective marker always signals the same type of causal relation. Therefore, we need to create a computational model that is able to classify samples according to the causal relation. To examine how accurately we can automatically acquire causal knowledge, we attempted an experiment using Japanese newspaper articles, focusing on the resultative connective “tame.” By using machine-learning techniques, we achieved 80% recall with over 95% precision for the cause, precond, and means relations, and 30% recall with 90% precision for the effect relation. Furthermore, the classification results suggest that one can expect to acquire over 27,000 instances of causal relations from 1 year of Japanese newspaper articles.
- Allen, J. F. 1983. Recognizing intentions from natural language utterances. In M. Brady and R.C. Berwick (Eds.), Computational models of discourse. MIT Press, Cambridge, MA.]]Google Scholar
- Allen, J. F. 1995. Natural Language Understanding. Benjamin/Cumming, New York.]] Google Scholar
- Altenberg, B. 1984. Causal linking in spoken and written English. Studia Linguistica 38, 1.]]Google Scholar
- Britannica. 1998. Britannica CD98 multimedia edition.]]Google Scholar
- Carberry, S. 1990. Plan Recognition in Natural Language Dialogue. MIT Press, Cambridge, MA.]] Google Scholar
- Cohen, J. 1960. A coefficient of agreement for nominal scales. Educational and Psychological Measurement 20, 37--46.]]Google Scholar
- Fellbaum, C. 1998. WordNet: An Electronic Lexical Database. MIT Press, Cambridge, MA.]]Google Scholar
- Garcia, D. 1997. COATIS, an NLP system to locate expressions of actions connected by causality links. In Proc. of The 10th European Knowledge Acquisition Workshop. 347--352.]] Google Scholar
- Girju, R. and Moldovan, D. 2002. Mining answers for causation questions. In Proc. The AAAI Spring Symposium on Mining Answers from Texts and Knowledge Bases.]]Google Scholar
- Harabagiu, S. M. and Moldovan, D. I. 1997. Textnet---a text-based intelligent system. Natural Language Engineering 3, 171--190.]] Google Scholar
- Heckerman, D., Meek, C., and Cooper, G. 1997. A Bayesian approach to causal discovery. Tech. rep., Microsoft Research Advanced Technology Division, Microsoft Corporation, Technical Report MSR-TR-97-05.]]Google Scholar
- Hobbs, J. R. 1979. Coherence and co-reference. Cognitive Science 1, 67--82.]]Google Scholar
- Hobbs, J. R. 1985. On the coherence and structure of discourse. Tech. rep., Technical Report CSLI-85-37, Center for The Study of Language and Information.]]Google Scholar
- Hobbs, J. R., Stickel, M., Appelt, D., and Martion, P. 1993. Interpretation as abduction. Artificial Intelligence 63, 69--142.]] Google Scholar
- Ichikawa, T. 1978. Introduction to tyle theory for Japanese education. Education (in Japan).]]Google Scholar
- Ikehara, S., Miyazaki, M., Shirai, S., Yokoo, A., Nakaiwa, H., Ogura, K., Ooyama, Y., and Hayashi, Y. 1997. Goi-Taikei---A Japanese Lexicon, Iwanami Shoten.]]Google Scholar
- Ikehara, S., Shirai, S., Yokoo, A., and Nakaiwa, H. 1991. Toward an MT system without pre-editing---effects of new methods in ALT-J/E-. In Proc. of the Third Machine Translation Summit: MT Summit III, Washington DC. 101--106.]]Google Scholar
- Iwanska, L. M. and Shapiro, S. C. 2000. Natural Language Processing and Knowledge Representation---Language for Knowledge and Knowledge for Language. MIT Press, Cambridge, MA.]] Google Scholar
- Joachims, T. 1998. Text categorization with support vector machines: learning with many relevant features. In Proceedings of ECML-98, 10th European Conference on Machine Learning, C. Nédellec and C. Rouveirol, Eds. Number 1398. Springer Verlag, New York. 137--142.]] Google Scholar
- Jonsson, K. 2000. Robust correlation and support vector machines for face identification. Ph.D. thesis, University of Surrey.]]Google Scholar
- Khoo, C. S. G., Chan, S., and Niu, Y. 2000. Extracting causal knowledge from a medical database using graphical patterns. In Proc. of The 38th. Annual Meeting of The Association for Computational Linguistics (ACL2000). 336--343.]] Google Scholar
- Krippendorf, K. 1980. Content analysis: An introduction to its methodology. Sage, Thousand Oaks, CA.]]Google Scholar
- Kudo, T. and Matsumoto, Y. 2003. Japanese dependency analysis using cascaded chunking. In Proc. of The 6th. Conference on Natural Language Learning (CoNLL).]] Google Scholar
- Lenat, D. 1995. Cyc: A large-scale investment in knowledge infrastructure. Communications of the ACM 38, 11.]] Google Scholar
- Litman, D. J. and Allen, J. F. 1987. A plan recognition model for subdialogues in conversations. Cognitive Science 11, 163--200.]]Google Scholar
- Liu, H., Lieberman, H., and Selker, T. 2003. A model of textual affect sensing using real-world knowledge. In Proc. of The International Conference on Intelligent User Interfaces. 125--132.]] Google Scholar
- Low, B. T., Chan, K., Choi, L. L., Chin, M. Y., and Lay, S. L. 2001. Semantic expectation-based causation knowledge extraction: A study on Hong Kong stock movement analysis. In Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD). 114--123.]] Google Scholar
- Mann, W. C. and Thompson, S. A. 1987. Rhetorical structure theory: A theory of text organization. In USC Information Sciences Institute, Technical Report ISI/RS-87-190.]]Google Scholar
- Marcu, D. 1997. The rhetorical parsing of natural language texts. In Proc. of ACL97/EACL97. 96--103.]] Google Scholar
- Marcu, D. 2002. An unsupervised approach to recognizing discourse relations. In Proc. of The 40th. Annual Meeting of The Association for Computational Linguistics (ACL2002). 368--375.]] Google Scholar
- Masuoka, T. 1997. Complex sentence. Kuroshio (in Japan).]]Google Scholar
- Masuoka, T. and Takubo, Y. 1992. Fundamental Japanese grammar(revised version). Kuroshio. (in Japan).]]Google Scholar
- Matsumoto, Y., Kitauchi, A., Yamashita, T., Hirano, Y., Matsuda, H., and Asahara, M. 1999. Japanese Morphological Analyzer ChaSen Users Manual version 2.0. Technical Report NAIST-IS-TR990123, Nara Institute of Science and Technology Technical Report.]]Google Scholar
- MEDLINE. 2001. The MEDLINE database. See also, http://www.ncbi.nlm.nih.gov/PubMed/.]]Google Scholar
- Nagano, M. 1986. A reveiw of style theory. Asakura (in Japan).]]Google Scholar
- Nakaiwa, H. and Ikehara, S. 1995. Intrasentential resolution of Japanese zero pronouns in a machine translation system using semantic and pragmatic constraints. In Proc. of The 6th TMI. 96--105.]]Google Scholar
- Nikkei. 1990. Nihon Keizai Shimbun CD-ROM version.]]Google Scholar
- Pearl, J. 1988. Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference. Morgan Kaufmann.]] Google Scholar
- Pearl, J. 2000. Causality: Models, Reasoning, and Inference. Cambridge Universiy Press, London.]] Google Scholar
- Pei, J., Han, J., Mortazavi-Asi, B., Pinto, H., Chen, Q., Dayal, U., and Hsu, M. C. 2001. Prefix Span: mining sequential patterns efficiently by prefix projected pattern growth. In Proc. of 1st. Conference of Data Enginnering (ICDE2001). 215--226.]] Google Scholar
- Rifkin, R. and Klautau, A. 2004. In defense of one-vs-all classification. Journal of Machine Learning Research 5, 101--141.]] Google Scholar
- RWC. 1998. RWC text corpus 2nd edition, Iwanami Japanese dictionary tagged/morphological data, 5th edn.]]Google Scholar
- Satou, H., Kasahara, K., and Matsuzawa, K. 1999. Rertrieval {sic} of simplified causal knowledge in text and its application. In Technical report of IEICE, Thought and Language. (in Japan).]]Google Scholar
- Satou, S. and Nagao, M. 1990. Toward memory-based translation. In Proc. of The 13th. International Conference on Computational Linguistics (COLING90). 247--252.]] Google Scholar
- Schank, R. and Abelson, R. 1977. Scripts, Plans, Goals and Understanding: An Inquiry into Human Knowledge Structures. Lawrence Erlbaum Assoc., Mahwah, NJ.]]Google Scholar
- Schank, R. and Riesbeck, C. 1981. Inside Computer Understanding: Five Programs Plus Minitures. Lawrence Erlbaum Assoc., Mahwah, NJ.]] Google Scholar
- Singh, P., Lin, T., Mueller, E. T., Lim, G., Perkins, T., and Zhu, W. L. 2002. Open Mind Common Sense: Knowledge acquisition from the general public. In Proc. of The 1st. International Conference on Ontologies, Databases and Applications of Semantics for Large Scale Information Systems.]] Google Scholar
- Stork, D. G. 1999. Character and document research in the Open Mind Initiative. In Proc. of International Conference on Document Analysis and Recognition. 1--12.]] Google Scholar
- Suyama, A. 2005. Sentence identification in HTML documents with machine learning. M.S. thesis, Tokyo Institute of Technology (in Japan).]]Google Scholar
- Terada, A. 2003. A study of text mining techniques using natural language processing. Ph.D. thesis, Tokyo Institute of Technology (in Japan).]]Google Scholar
- Torisawa, K. 2003. Automatic extraction of ‘commonsense’ inference rules from corpora. In Proc. of The 9th Annual Meeting of The Association for Natural Language Processing. 318--321 (in Japan).]]Google Scholar
- Vapnik, V. N. 1995. The Nature of Statistical Learning Theory. Springer, New York.]] Google Scholar
- Vert, J. 2002. Support vector machine prediction of signal peptide cleavage site using a new class of kernels for strings. In Proceedings of the Pacific Symposium on Biocomputing 2002. 649--660.]]Google Scholar
- Voorhees, E. M. and Harman, D. K. 2001. The Ninth Text Retrieval Conference (TREC9). See also, http://trec.nist.gov.]]Google Scholar
- Yokoi, T. 1995. The EDR electronic dictionary. Communnications of the ACM 38, 11, 42--44.]] Google Scholar
Index Terms
- Acquiring causal knowledge from text using the connective marker tame
Recommendations
Causal relation of queries from temporal logs
WWW '07: Proceedings of the 16th international conference on World Wide WebIn this paper, we study a new problem of mining causal relation of queries in search engine query logs. Causal relation between two queries means event on one query is the causation of some event on the other. We first detect events in query logs by ...
The Study of Causal Nouns in Mandarin Chinese: From the Perspective of Syntactic Realization and Pragmatic Function
Chinese Lexical SemanticsAbstractCausal nouns can express two types of information, namely, the cause and the effect. These two types of information can be realized in different syntactic positions in the sentence and discourse where causal nouns appear. This paper first explains ...
Mining causality knowledge from textual data
AIA'06: Proceedings of the 24th IASTED international conference on Artificial intelligence and applicationsMining causality knowledge will induce a knowledge of reasoning that is beneficial for our daily use in diagnosis. Then, this framework is for discovering causality existing between causative antecedent and effective consequent discourse units. There ...
Comments