ABSTRACT
As computing systems are more frequently and more actively intervening to improve people's work and daily lives, it is critical to correctly predict and understand the causal effects of these interventions. Conventional machine learning methods, built on pattern recognition and correlational analyses, are insufficient for causal analysis. This tutorial will introduce participants to concepts in causal inference and counterfactual reasoning, drawing from a broad literature on the topic from statistics, social sciences and machine learning. We will motivate the use of causal inference through examples in domains such as recommender systems, social media datasets, health, education and governance. To tackle such questions, we will introduce the key ingredient that causal analysis depends on---counterfactual reasoning---and describe the two most popular frameworks based on Bayesian graphical models and potential outcomes. Based on this, we will cover methods suitable for doing causal inference with large-scale online data, including randomized experiments, observational methods like matching and stratification, and natural experiment-based methods such as instrumental variables and regression discontinuity. We will also focus on best practices for evaluation and validation of causal inference techniques, drawing from our own experiences. We will show application of these techniques using DoWhy, a Python library for causal inference. Throughout, the emphasis will be on considerations of working with large-scale data, such as logs of user interactions or social data.
- Judea Pearl. Causality. Cambridge University Press, 2009.Google Scholar
- Peter Spirtes, Clark N Glymour, and Richard Scheines. Causation, prediction, and search. MIT Press, 2000.Google Scholar
- Guido W Imbens and Donald B Rubin. Causal inference in statistics, social, and biomedical sciences. Cambridge University Press, 2015.Google ScholarCross Ref
- Thad Dunning. Natural experiments in the social sciences: a design-based approach. Cambridge University Press, 2012.Google ScholarCross Ref
- Paul R Rosenbaum. Observational studies. Springer, 2002.Google ScholarCross Ref
- Joshua D Angrist and Jörn-Steffen Pischke. Mostly harmless econometrics: An empiricist's companion. Princeton University Press, 2008.Google ScholarCross Ref
- Elizabeth A Stuart. Matching methods for causal inference: A review and a look forward. Statistical Science, 25(1):1, 2010.Google ScholarCross Ref
- Léon Bottou, Jonas Peters, Joaquin Quiñonero-Candela, Denis X Charles, D Max Chickering, Elon Portugaly, Dipankar Ray, Patrice Simard, and Ed Snelson. Counterfactual reasoning and learning systems: The example of computational advertising. The Journal of Machine Learning Research, 14(1):3207--3260, 2013.Google ScholarDigital Library
- J. Peters, D. Janzing, and B. Schölkopf. Elements of Causal Inference - Foundations and Learning Algorithms. Adaptive Computation and Machine Learning Series. The MIT Press, Cambridge, MA, USA, 2017.Google ScholarDigital Library
- Daniel N. Hill, Robert Moakler, Alan E. Hubbard, Vadim Tsemekhman, Foster Provost, and Kiril Tsemekhman. Measuring causal impact of online actions via natural experiments: Application to display advertising. In Proc. KDD, 2015.Google ScholarDigital Library
- Amit Sharma, Jake M Hofman, and Duncan J Watts. Split-door criterion: Identification of causal effects through auxiliary outcomes. The Annals of Applied Statistics, 12(4):2699--2733, 2018.Google ScholarCross Ref
- Dean Eckles, Brian Karrer, and Johan Ugander. Design and analysis of experiments in networks: Reducing bias from interference. Journal of Causal Inference, 2017.Google ScholarCross Ref
- Fredrik Johansson, Uri Shalit, and David Sontag. Learning representations for counterfactual inference. In Proc. ICML, pages 3020--3029, 2016.Google Scholar
- Causal Inference and Counterfactual Reasoning
Recommendations
Causal Inference and Counterfactual Reasoning (3hr Tutorial)
WSDM '19: Proceedings of the Twelfth ACM International Conference on Web Search and Data MiningAs computing systems are more frequently and more actively intervening to improve people's work and daily lives, it is critical to correctly predict and understand the causal effects of these interventions. Conventional machine learning methods, built ...
Causal Reasoning and Meno’s Paradox
AbstractCausal reasoning is an aspect of learning, reasoning, and decision-making that involves the cognitive ability to discover relationships between causal relata, learn and understand these causal relationships, and make use of this causal knowledge ...
Causal inference and causal explanation with background knowledge
UAI'95: Proceedings of the Eleventh conference on Uncertainty in artificial intelligenceThis paper presents correct algorithms for answering the following two questions; (i) Does there exist a causal explanation consistent with a set of background knowledge which explains all of the observed independence facts in a sample? (ii) Given that ...
Comments