skip to main content
10.1145/3233547.3233555acmconferencesArticle/Chapter ViewAbstractPublication PagesbcbConference Proceedingsconference-collections
research-article

CausalTriad: Toward Pseudo Causal Relation Discovery and Hypotheses Generation from Medical Text Data

Authors Info & Claims
Published:15 August 2018Publication History

ABSTRACT

Deriving pseudo causal relations from medical text data lies at the heart of medical literature mining. Existing studies have utilized extraction models to find pseudo causal relation from single sentences, while the knowledge created by causation transitivity - often spanning multiple sentences - has not been considered. Furthermore, we observe that many pseudo causal relations follow the rule of causation transitivity, which makes it possible to discover unseen casual relations and generate new causal relation hypotheses. In this paper, we address these two issues by proposing a factor graph model to incorporate three clues to discover causation expressions in the text data. We propose four types of triad structures to represent the rules of causation transitivity among causal relations. Our proposed model, called CausalTriad, uses textual and structural knowledge to infer pseudo causal relations from the triad structures. Experimental results on two datasets demonstrate that (a) CausalTriad is effective for pseudo causal relation discovery within and across sentences; (b) CausalTriad is highly capable at recognizing implicit pseudo causal relations; (c) CausalTriad can infer missing/new pseudo causal relations from text data.

References

  1. David Arbour, Dan Garant, and David Jensen. 2016. Inferring Network Effects from Observational Data. In KDD'16. 715--724. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Michael Bada, Miriam Eckert, Donald Evans, Kristin Garcia, Krista Shipley, Dmitry Sitnikov, William A Baumgartner, K Bretonnel Cohen, Karin Verspoor, Judith A Blake, et al. 2012. Concept annotation in the CRAFT corpus. BMC bioinformatics 13, 1 (2012), 1.Google ScholarGoogle Scholar
  3. S. Siegel Castellan. 1988. Non-Parametric Statistics for The Behavioral Sciences. McGraw-Hill,. 3 pages.Google ScholarGoogle Scholar
  4. Wanxiang Che, Zhenghua Li, and Ting Liu. 2010. Ltp: A chinese language technology platform. In Coling'10. 13--16. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Angela M Coventry. 2006. Hume's Theory of Causation. A&C Black.Google ScholarGoogle Scholar
  6. Quang Xuan Do, Yee Seng Chan, and Dan Roth. 2011. Minimally Supervised Event Causality Identification. In EMNLP'11. 294-- 303. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Samuel G Finlayson, Paea LePendu, and Nigam H Shah. 2014. Building the graph of medicine from millions of clinical narratives. Scientific data 1 (2014).Google ScholarGoogle Scholar
  8. Chikara Hashimoto, Kentaro Torisawa, Julien Kloetzer, and Jong Hoon Oh. 2015. Generating event causality hypotheses through semantic relations. In AAAI'15. 2396--2403. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Christopher Hidey and Kathleen McKeown. 2016. Identifying Causal Relations Using Parallel Wikipedia Articles. In ACL'16. 1424--1433.Google ScholarGoogle Scholar
  10. Patrik O Hoyer, Dominik Janzing, Joris M Mooij, Jonas Peters, and Bernhard Sch¨olkopf. 2009. Nonlinear causal discovery with additive noise models. In NIPS'09. 689--696. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Sarvnaz Karimi, Chen Wang, Alejandro Metke-Jimenez, Raj Gaire, and Cecile Paris. 2015. Text and data mining techniques in adverse drug reaction detection. Comput. Surveys 47, 4 (2015), 56. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Paul L. A. and Hall Ned. 2013. Causation: A User's Guide. Oxford University Press.Google ScholarGoogle Scholar
  13. Claudia Leacock and Martin Chodorow. 1998. Combining local context and WordNet similarity for word sense identification. WordNet: An electronic lexical database 49, 2 (1998), 265--283.Google ScholarGoogle Scholar
  14. Christopher D. Manning, Mihai Surdeanu, John Bauer, Jenny Finkel, Steven J. Bethard, and David Mcclosky. 2014. The Stanford CoreNLP Natural Language Processing Toolkit. In ACL'14.Google ScholarGoogle Scholar
  15. Alejandro Metke-Jimenez and Sarvnaz Karimi. 2015. Concept Extraction to Identify Adverse Drug Reactions in Medical Forums: A Comparison of Algorithms. CoRR (2015).Google ScholarGoogle Scholar
  16. Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg S Corrado, and Jeff Dean. {n. d.}. Distributed representations of words and phrases and their compositionality. In NIPS'13. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Joris M Mooij, Jonas Peters, Dominik Janzing, Jakob Zscheischler, and Bernhard Sch¨olkopf. 2016. Distinguishing cause from effect using observational data: methods and benchmarks. JMLR 17, 32 (2016), 1--102. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Aur´elie N´ev´eol, Sonya E Shooshan, Susanne M Humphrey, James G Mork, and Alan R Aronson. 2009. A recent advance in the automatic indexing of the biomedical literature. Journal of biomedical informatics 42, 5 (2009), 814--823. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Thien Huu Nguyen and Ralph Grishman. 2015. Relation Extraction: Perspective from Convolutional Neural Networks. In NAACL'15. 39--48.Google ScholarGoogle ScholarCross RefCross Ref
  20. Nanyun Peng, Hoifung Poon, Chris Quirk, Kristina Toutanova, and Wen-tau Yih. 2017. Cross-Sentence N-ary Relation Extraction with Graph LSTMs. Transactions of the Association for Computational Linguistics 5 (2017), 101--115.Google ScholarGoogle ScholarCross RefCross Ref
  21. Jeffrey Pennington, Richard Socher, and Christopher Manning. 2014. Glove: Global Vectors for Word Representation. In EMNLP'14. 1532--1543.Google ScholarGoogle Scholar
  22. Chris Quirk, Pallavi Choudhury, Jianfeng Gao, Hisami Suzuki, Kristina Toutanova, Michael Gamon, Wen Tau Yih, Lucy Vanderwende, and Colin Cherry. 2012. MSR SPLAT, a language analysis toolkit. In NAACL'12. 21--24. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Kira Radinsky, Sagie Davidovich, and Shaul Markovitch. 2012. Learning Causality for News Events Prediction. In WWW. 909-- 918. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Mehwish Riaz and Roxana Girju. 2014. Recognizing causality in verb-noun pairs via noun and verb semantics. In EACL'14.Google ScholarGoogle ScholarCross RefCross Ref
  25. Patrick Suppes. 1970. A probabilistic theory of causality. NorthHolland Publishing Company Amsterdam.Google ScholarGoogle Scholar
  26. Phillip Wolff and Grace Song. 2003. Models of causation and the semantics of causal verbs. Cognitive Psychology 47, 3 (2003), 276--332.Google ScholarGoogle ScholarCross RefCross Ref
  27. R. Xue, Z. Fang, M. Zhang, Z. Yi, C. Wen, and T. Shi. 2013. TCMID: Traditional Chinese Medicine integrative database for herb molecular mechanism analysis. Nucleic Acids Research 41 (2013).Google ScholarGoogle Scholar
  28. Hong Yu and Eugene Agichtein. 2003. Extracting synonymous gene and protein terms from biological literature. Bioinformatics 19 (2003), 1340--1349.Google ScholarGoogle ScholarCross RefCross Ref
  29. Ce Zhang. 2015. DeepDive: a data management system for automatic knowledge base construction. Ph.D. Dissertation. Citeseer.Google ScholarGoogle Scholar
  30. Sendong Zhao, Quan Wang, Sean Massung, Bing Qin, Ting Liu, Bin Wang, and ChengXiang Zhai. 2017. Constructing and Embedding Abstract Event Causality Networks from Text Snippets. In WSDM'17. 335--344. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. CausalTriad: Toward Pseudo Causal Relation Discovery and Hypotheses Generation from Medical Text Data

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          BCB '18: Proceedings of the 2018 ACM International Conference on Bioinformatics, Computational Biology, and Health Informatics
          August 2018
          727 pages
          ISBN:9781450357944
          DOI:10.1145/3233547

          Copyright © 2018 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 15 August 2018

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

          Acceptance Rates

          BCB '18 Paper Acceptance Rate46of148submissions,31%Overall Acceptance Rate254of885submissions,29%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader