skip to main content
article

Acquiring causal knowledge from text using the connective marker tame

Authors Info & Claims
Published:01 December 2005Publication History
Skip Abstract Section

Abstract

In this paper, we deal with automatic knowledge acquisition from text, specifically the acquisition of causal relations. A causal relation is the relation existing between two events such that one event causes (or enables) the other event, such as “hard rain causes flooding” or “taking a train requires buying a ticket.” In previous work these relations have been classified into several types based on a variety of points of view. In this work, we consider four types of causal relations---cause, effect, precond(ition) and means---mainly based on agents' volitionality, as proposed in the research field of discourse understanding. The idea behind knowledge acquisition is to use resultative connective markers, such as “because,” “but,” and “if” as linguistic cues. However, there is no guarantee that a given connective marker always signals the same type of causal relation. Therefore, we need to create a computational model that is able to classify samples according to the causal relation. To examine how accurately we can automatically acquire causal knowledge, we attempted an experiment using Japanese newspaper articles, focusing on the resultative connective “tame.” By using machine-learning techniques, we achieved 80% recall with over 95% precision for the cause, precond, and means relations, and 30% recall with 90% precision for the effect relation. Furthermore, the classification results suggest that one can expect to acquire over 27,000 instances of causal relations from 1 year of Japanese newspaper articles.

References

  1. Allen, J. F. 1983. Recognizing intentions from natural language utterances. In M. Brady and R.C. Berwick (Eds.), Computational models of discourse. MIT Press, Cambridge, MA.]]Google ScholarGoogle Scholar
  2. Allen, J. F. 1995. Natural Language Understanding. Benjamin/Cumming, New York.]] Google ScholarGoogle Scholar
  3. Altenberg, B. 1984. Causal linking in spoken and written English. Studia Linguistica 38, 1.]]Google ScholarGoogle Scholar
  4. Britannica. 1998. Britannica CD98 multimedia edition.]]Google ScholarGoogle Scholar
  5. Carberry, S. 1990. Plan Recognition in Natural Language Dialogue. MIT Press, Cambridge, MA.]] Google ScholarGoogle Scholar
  6. Cohen, J. 1960. A coefficient of agreement for nominal scales. Educational and Psychological Measurement 20, 37--46.]]Google ScholarGoogle Scholar
  7. Fellbaum, C. 1998. WordNet: An Electronic Lexical Database. MIT Press, Cambridge, MA.]]Google ScholarGoogle Scholar
  8. Garcia, D. 1997. COATIS, an NLP system to locate expressions of actions connected by causality links. In Proc. of The 10th European Knowledge Acquisition Workshop. 347--352.]] Google ScholarGoogle Scholar
  9. Girju, R. and Moldovan, D. 2002. Mining answers for causation questions. In Proc. The AAAI Spring Symposium on Mining Answers from Texts and Knowledge Bases.]]Google ScholarGoogle Scholar
  10. Harabagiu, S. M. and Moldovan, D. I. 1997. Textnet---a text-based intelligent system. Natural Language Engineering 3, 171--190.]] Google ScholarGoogle Scholar
  11. Heckerman, D., Meek, C., and Cooper, G. 1997. A Bayesian approach to causal discovery. Tech. rep., Microsoft Research Advanced Technology Division, Microsoft Corporation, Technical Report MSR-TR-97-05.]]Google ScholarGoogle Scholar
  12. Hobbs, J. R. 1979. Coherence and co-reference. Cognitive Science 1, 67--82.]]Google ScholarGoogle Scholar
  13. Hobbs, J. R. 1985. On the coherence and structure of discourse. Tech. rep., Technical Report CSLI-85-37, Center for The Study of Language and Information.]]Google ScholarGoogle Scholar
  14. Hobbs, J. R., Stickel, M., Appelt, D., and Martion, P. 1993. Interpretation as abduction. Artificial Intelligence 63, 69--142.]] Google ScholarGoogle Scholar
  15. Ichikawa, T. 1978. Introduction to tyle theory for Japanese education. Education (in Japan).]]Google ScholarGoogle Scholar
  16. Ikehara, S., Miyazaki, M., Shirai, S., Yokoo, A., Nakaiwa, H., Ogura, K., Ooyama, Y., and Hayashi, Y. 1997. Goi-Taikei---A Japanese Lexicon, Iwanami Shoten.]]Google ScholarGoogle Scholar
  17. Ikehara, S., Shirai, S., Yokoo, A., and Nakaiwa, H. 1991. Toward an MT system without pre-editing---effects of new methods in ALT-J/E-. In Proc. of the Third Machine Translation Summit: MT Summit III, Washington DC. 101--106.]]Google ScholarGoogle Scholar
  18. Iwanska, L. M. and Shapiro, S. C. 2000. Natural Language Processing and Knowledge Representation---Language for Knowledge and Knowledge for Language. MIT Press, Cambridge, MA.]] Google ScholarGoogle Scholar
  19. Joachims, T. 1998. Text categorization with support vector machines: learning with many relevant features. In Proceedings of ECML-98, 10th European Conference on Machine Learning, C. Nédellec and C. Rouveirol, Eds. Number 1398. Springer Verlag, New York. 137--142.]] Google ScholarGoogle Scholar
  20. Jonsson, K. 2000. Robust correlation and support vector machines for face identification. Ph.D. thesis, University of Surrey.]]Google ScholarGoogle Scholar
  21. Khoo, C. S. G., Chan, S., and Niu, Y. 2000. Extracting causal knowledge from a medical database using graphical patterns. In Proc. of The 38th. Annual Meeting of The Association for Computational Linguistics (ACL2000). 336--343.]] Google ScholarGoogle Scholar
  22. Krippendorf, K. 1980. Content analysis: An introduction to its methodology. Sage, Thousand Oaks, CA.]]Google ScholarGoogle Scholar
  23. Kudo, T. and Matsumoto, Y. 2003. Japanese dependency analysis using cascaded chunking. In Proc. of The 6th. Conference on Natural Language Learning (CoNLL).]] Google ScholarGoogle Scholar
  24. Lenat, D. 1995. Cyc: A large-scale investment in knowledge infrastructure. Communications of the ACM 38, 11.]] Google ScholarGoogle Scholar
  25. Litman, D. J. and Allen, J. F. 1987. A plan recognition model for subdialogues in conversations. Cognitive Science 11, 163--200.]]Google ScholarGoogle Scholar
  26. Liu, H., Lieberman, H., and Selker, T. 2003. A model of textual affect sensing using real-world knowledge. In Proc. of The International Conference on Intelligent User Interfaces. 125--132.]] Google ScholarGoogle Scholar
  27. Low, B. T., Chan, K., Choi, L. L., Chin, M. Y., and Lay, S. L. 2001. Semantic expectation-based causation knowledge extraction: A study on Hong Kong stock movement analysis. In Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD). 114--123.]] Google ScholarGoogle Scholar
  28. Mann, W. C. and Thompson, S. A. 1987. Rhetorical structure theory: A theory of text organization. In USC Information Sciences Institute, Technical Report ISI/RS-87-190.]]Google ScholarGoogle Scholar
  29. Marcu, D. 1997. The rhetorical parsing of natural language texts. In Proc. of ACL97/EACL97. 96--103.]] Google ScholarGoogle Scholar
  30. Marcu, D. 2002. An unsupervised approach to recognizing discourse relations. In Proc. of The 40th. Annual Meeting of The Association for Computational Linguistics (ACL2002). 368--375.]] Google ScholarGoogle Scholar
  31. Masuoka, T. 1997. Complex sentence. Kuroshio (in Japan).]]Google ScholarGoogle Scholar
  32. Masuoka, T. and Takubo, Y. 1992. Fundamental Japanese grammar(revised version). Kuroshio. (in Japan).]]Google ScholarGoogle Scholar
  33. Matsumoto, Y., Kitauchi, A., Yamashita, T., Hirano, Y., Matsuda, H., and Asahara, M. 1999. Japanese Morphological Analyzer ChaSen Users Manual version 2.0. Technical Report NAIST-IS-TR990123, Nara Institute of Science and Technology Technical Report.]]Google ScholarGoogle Scholar
  34. MEDLINE. 2001. The MEDLINE database. See also, http://www.ncbi.nlm.nih.gov/PubMed/.]]Google ScholarGoogle Scholar
  35. Nagano, M. 1986. A reveiw of style theory. Asakura (in Japan).]]Google ScholarGoogle Scholar
  36. Nakaiwa, H. and Ikehara, S. 1995. Intrasentential resolution of Japanese zero pronouns in a machine translation system using semantic and pragmatic constraints. In Proc. of The 6th TMI. 96--105.]]Google ScholarGoogle Scholar
  37. Nikkei. 1990. Nihon Keizai Shimbun CD-ROM version.]]Google ScholarGoogle Scholar
  38. Pearl, J. 1988. Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference. Morgan Kaufmann.]] Google ScholarGoogle Scholar
  39. Pearl, J. 2000. Causality: Models, Reasoning, and Inference. Cambridge Universiy Press, London.]] Google ScholarGoogle Scholar
  40. Pei, J., Han, J., Mortazavi-Asi, B., Pinto, H., Chen, Q., Dayal, U., and Hsu, M. C. 2001. Prefix Span: mining sequential patterns efficiently by prefix projected pattern growth. In Proc. of 1st. Conference of Data Enginnering (ICDE2001). 215--226.]] Google ScholarGoogle Scholar
  41. Rifkin, R. and Klautau, A. 2004. In defense of one-vs-all classification. Journal of Machine Learning Research 5, 101--141.]] Google ScholarGoogle Scholar
  42. RWC. 1998. RWC text corpus 2nd edition, Iwanami Japanese dictionary tagged/morphological data, 5th edn.]]Google ScholarGoogle Scholar
  43. Satou, H., Kasahara, K., and Matsuzawa, K. 1999. Rertrieval {sic} of simplified causal knowledge in text and its application. In Technical report of IEICE, Thought and Language. (in Japan).]]Google ScholarGoogle Scholar
  44. Satou, S. and Nagao, M. 1990. Toward memory-based translation. In Proc. of The 13th. International Conference on Computational Linguistics (COLING90). 247--252.]] Google ScholarGoogle Scholar
  45. Schank, R. and Abelson, R. 1977. Scripts, Plans, Goals and Understanding: An Inquiry into Human Knowledge Structures. Lawrence Erlbaum Assoc., Mahwah, NJ.]]Google ScholarGoogle Scholar
  46. Schank, R. and Riesbeck, C. 1981. Inside Computer Understanding: Five Programs Plus Minitures. Lawrence Erlbaum Assoc., Mahwah, NJ.]] Google ScholarGoogle Scholar
  47. Singh, P., Lin, T., Mueller, E. T., Lim, G., Perkins, T., and Zhu, W. L. 2002. Open Mind Common Sense: Knowledge acquisition from the general public. In Proc. of The 1st. International Conference on Ontologies, Databases and Applications of Semantics for Large Scale Information Systems.]] Google ScholarGoogle Scholar
  48. Stork, D. G. 1999. Character and document research in the Open Mind Initiative. In Proc. of International Conference on Document Analysis and Recognition. 1--12.]] Google ScholarGoogle Scholar
  49. Suyama, A. 2005. Sentence identification in HTML documents with machine learning. M.S. thesis, Tokyo Institute of Technology (in Japan).]]Google ScholarGoogle Scholar
  50. Terada, A. 2003. A study of text mining techniques using natural language processing. Ph.D. thesis, Tokyo Institute of Technology (in Japan).]]Google ScholarGoogle Scholar
  51. Torisawa, K. 2003. Automatic extraction of ‘commonsense’ inference rules from corpora. In Proc. of The 9th Annual Meeting of The Association for Natural Language Processing. 318--321 (in Japan).]]Google ScholarGoogle Scholar
  52. Vapnik, V. N. 1995. The Nature of Statistical Learning Theory. Springer, New York.]] Google ScholarGoogle Scholar
  53. Vert, J. 2002. Support vector machine prediction of signal peptide cleavage site using a new class of kernels for strings. In Proceedings of the Pacific Symposium on Biocomputing 2002. 649--660.]]Google ScholarGoogle Scholar
  54. Voorhees, E. M. and Harman, D. K. 2001. The Ninth Text Retrieval Conference (TREC9). See also, http://trec.nist.gov.]]Google ScholarGoogle Scholar
  55. Yokoi, T. 1995. The EDR electronic dictionary. Communnications of the ACM 38, 11, 42--44.]] Google ScholarGoogle Scholar

Index Terms

  1. Acquiring causal knowledge from text using the connective marker tame

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in

        Full Access

        • Published in

          cover image ACM Transactions on Asian Language Information Processing
          ACM Transactions on Asian Language Information Processing  Volume 4, Issue 4
          December 2005
          129 pages
          ISSN:1530-0226
          EISSN:1558-3430
          DOI:10.1145/1113308
          Issue’s Table of Contents

          Copyright © 2005 ACM

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 1 December 2005
          Published in talip Volume 4, Issue 4

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • article

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader