skip to main content
10.1145/2187836.2187958acmotherconferencesArticle/Chapter ViewAbstractPublication PageswwwConference Proceedingsconference-collections
research-article

Learning causality for news events prediction

Published:16 April 2012Publication History

ABSTRACT

The problem we tackle in this work is, given a present news event, to generate a plausible future event that can be caused by the given event. We present a new methodology for modeling and predicting such future news events using machine learning and data mining techniques. Our Pundit algorithm generalizes examples of causality pairs to infer a causality predictor. To obtain precise labeled causality examples, we mine 150 years of news articles, and apply semantic natural language modeling techniques to titles containing certain predefined causality patterns. For generalization, the model uses a vast amount of world knowledge ontologies mined from LinkedData, containing ~200 datasets with approximately 20 billion relations. Empirical evaluation on real news articles shows that our Pundit algorithm reaches a human-level performance.

References

  1. E. Agichtein and L. Gravano. Snowball: extracting relations from large plain-text collections. In Proc. of JCDL, pages 85--94, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Amr Ahmed, Qirong Ho, Jacob Eisenstein, Eric P. Xing, Alexander J. Smola, and Choon Hui Teo. Unified analysis of streaming news. In Proc. of WWW, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. M. Banko and O. Etzioni. The tradeoffs between open and traditional relation extraction. In Proc. of ACL-08: HLT, 2008.Google ScholarGoogle Scholar
  4. C. Bizer, T. Heath, and T. Berners-Lee. Linked data -- the story so far. IJSWIS, 2009.Google ScholarGoogle ScholarCross RefCross Ref
  5. E. Blanco, N. Castell, and D. Moldovan. Causal Relation Extraction. In Proc. of LREC, 2008.Google ScholarGoogle Scholar
  6. A. Carlson, J. Betteridge, B. Kisiel, B. Settles, E.R. Hruschka Jr., and T.M. Mitchell. Toward an architecture for never-ending language learning. In Proc. of AAAI, 2010.Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. N. Chambers and D. Jurafsky. Template-Based Information Extraction without the Templates. In Proc. of ACL, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. N. Chambers, S Wang, and D. Jurafsky. Classifying temporal relations between events. In Proc. of ACL (Poster), 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. K. Chan and W. Lam. Extracting causation knowledge from natural language texts. IJIS, 20:327--358, 05. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Q. Do, Y. Chan, and D. Roth. Minimally supervised event causality identification. In EMNLP, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. M. B. Eisen, P. T. Spellman, P. O. Brown, and D. Botstein. Cluster analysis and display of genome-wide expression patterns.PNAS, 95:14863--14868, 1998.Google ScholarGoogle ScholarCross RefCross Ref
  12. D. Garcia. Coatis, an nlp system to locate expressions of actions connected by causality links. In Proc. of EKAW, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. R. Girju and D. Moldovan. Text mining for causal relations. InProc. of FLAIRS, pages 360--364, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. O. Glickman, I. Dagan, and M. Koppel. A probabilistic classification approach for lexical textual entailment. InProc. of AAAI, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. M. Palmer H. Dang, K. Kipper and J. Rosenzweig. Investigating regular sense extensions based on intersective levin classes. In Proc. of Coling-ACL, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. A. Jatowt and C.M Yeung. Extracting collective expectations about the future from large text collections. In Proc. of CIKM, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. N. Ryant K. Kipper, A. Korhonen and M. Palmer. Extending verbnet with novel verb classes. In Proc. of LREC, 2006.Google ScholarGoogle Scholar
  18. R. Kaplan and G. Berry-Rogghe. Knowledge-based acquisition of causal relationships in text. Knowledge Acquisition, 3:317--337, 1991. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. C. Khoo, S. Chan, and Y. Niu. Extracting causal knowledge from a medical database using graphical patterns. InProc. of ACL, pages 336--343, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. J. Kim. Supervenience and mind. Selected Philosophical Essays, 1993.Google ScholarGoogle Scholar
  21. M. Lapata and A. Lascarides. Learning sentence-internal temporal relations. JAIR, 27:85--117, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Douglas B. Lenat and R. V. Guha. Building Large Knowledge-Based Systems: Representation and Inference in the Cyc Project. Addison-Wesley, 1990. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. B. Levin and M. Rappaport Hovav. A preliminary analysis of causative verbs in english. Lingua, 92:35--77, 1994.Google ScholarGoogle ScholarCross RefCross Ref
  24. D. Lin and P. Pantel. Dirt-discovery of inference rules from text. In Proc. of KDD, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. X. Ling and D. Weld. Temporal information extraction. In Proc. of AAAI, 2010.Google ScholarGoogle Scholar
  26. H. Liu and P. Singh. Conceptnet: A practical commonsense reasoning toolkit.BT Technology Journal, 22, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. I. Mani, B. Schiffman, and J. Zhang. Inferring temporal ordering of events in news. In Proc. of HLT-NAACL 2003, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. M. Marneffe, B. MacCartney, and C.D Manning. Generating typed dependency parses from phrase structure parses. In Proc. of LREC, 2006.Google ScholarGoogle Scholar
  29. J. Michel, Y.K Shen, A. Aiden, A. Veres, M. Gray, Google Books Team, J. Pickett, D. Hoiberg, D. Clancy, P. Norvig, J. Orwant, S. Pinker, M. Nowak, and E. Aiden. Quantitative analysis of culture using millions of digitized books.Science, 331:176--182, 2011.Google ScholarGoogle ScholarCross RefCross Ref
  30. G. Miller. Wordnet: A lexical database for english. CACM, 38:39--41, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. H. Mili E. Bicknell Rada, R. and M. Blettner. Development and application of a metric to semantic nets.IEEE Transactions on Systems, Man and Cybernetics, 19(1):17--30, 1989.Google ScholarGoogle ScholarCross RefCross Ref
  32. E. Riloff. Automatically Generating Extraction Patterns from Untagged Text. In Proc. of AAAI, 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. E. Riloff and R. Jones. Learning dictionaries for information extraction by multi-level bootstrapping. In Proc. of AAAI, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. L. Schubert. Can we derive general world knowledge from texts? In Proc. of HLT 2002, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. D. Shahaf and C. Guestrin. Connecting the dots between news articles. In Proc. of KDD, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. A. Sil, F. Huang, and A. Yates. Extracting action and event semantics from web text. In Proc. of AAAI Fall Symposium on Commonsense Knowledge, 2010.Google ScholarGoogle Scholar
  37. Michael Strube and Simone Paolo Ponzetto. Wikirelate! computing semantic relatedness using wikipedia. In Proc. of AAAI, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. F. Suchanek, G Kasneci, and G. Weikum. Yago: a core of semantic knowledge. InProc. of WWW, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. M. Tatu and M. Srikanth. Experiments with reasoning for temporal relations between events. In Proc. of COLING, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. P. Wolff, G. Song, and D. Driscoll. Models of causation and causal verbs. InProc. of ACL, 2002.Google ScholarGoogle Scholar
  41. K. Yoshikawa, S. Riedel, M. Asahara, and Y. Matsumoto. Jointly identifying temporal relations with markov logic. In Proc. of ACL-IJCNLP, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Learning causality for news events prediction

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Other conferences
        WWW '12: Proceedings of the 21st international conference on World Wide Web
        April 2012
        1078 pages
        ISBN:9781450312295
        DOI:10.1145/2187836

        Copyright © 2012 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 16 April 2012

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        Overall Acceptance Rate1,899of8,196submissions,23%

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader