[go: up one dir, main page]

skip to main content
research-article
Open access

The seven tools of causal inference, with reflections on machine learning

Published: 21 February 2019 Publication History

Abstract

The kind of causal inference seen in natural human thought can be "algorithmitized" to help produce human-level machine intelligence.

References

[1]
Balke, A. and Pearl, J. Probabilistic evaluation of counterfactual queries. In Proceedings of the 12<sup>th</sup> National Conference on Artificial Intelligence (Seattle, WA, July 31-Aug. 4). MIT Press, Menlo Park, CA, 1994, 230--237.
[2]
Bareinboim, E. and Pearl, J. Causal inference by surrogate experiments: z-identifiability. In Proceedings of the 28<sup>th</sup> Conference on Uncertainty in Artificial Intelligence, N. de Freitas and K. Murphy, Eds. (Catalina Island, CA, Aug. 14--18). AUAI Press, Corvallis, OR, 2012, 113--120.
[3]
Bareinboim, E. and Pearl, J. Causal inference and the data-fusion problem. Proceedings of the National Academy of Sciences 113, 27 (2016), 7345--7352.
[4]
Chen, Z. and Liu, B. Lifelong Machine Learning. Morgan and Claypool Publishers, San Rafael, CA, 2016.
[5]
Darwiche, A. Human-Level Intelligence or Animal-Like Abilities? Technical Report. Department of Computer Science, University of California, Los Angeles, CA, 2017; https://arxiv.org/pdf/1707.04327.pdf
[6]
Graham, J. Missing Data: Analysis and Design (Statistics for Social and Behavioral Sciences). Springer, 2012.
[7]
Halpern, J.H. and Pearl, J. Causes and explanations: A structural-model approach: Part I: Causes. British Journal of Philosophy of Science 56 (2005), 843--887.
[8]
Hutson, M. AI researchers allege that machine learning is alchemy. Science (May 3, 2018); https://www.sciencemag.org/news/2018/05/ai-researchers-allege-machine-learning-alchemy
[9]
Jaber, A., Zhang, J.J., and Bareinboim, E. Causal identification under Markov equivalence. In Proceedings of the 34<sup>th</sup> Conference on Uncertainty in Artificial Intelligence, A. Globerson and R. Silva, Eds. (Monterey, CA, Aug. 6--10). AUAI Press, Corvallis, OR, 2018, 978--987.
[10]
Lake, B.M., Salakhutdinov, R., and Tenenbaum, J.B. Human-level concept learning through probabilistic program induction. Science 350, 6266 (Dec. 2015), 1332--1338.
[11]
Marcus, G. Deep Learning: A Critical Appraisal. Technical Report. Departments of Psychology and Neural Science, New York University, New York, 2018; https://arxiv.org/pdf/1801.00631.pdf
[12]
Mohan, K. and Pearl, J. Graphical Models for Processing Missing Data. Technical Report R-473. Department of Computer Science, University of California, Los Angeles, CA, 2018; forthcoming, Journal of American Statistical Association; http://ftp.cs.ucla.edu/pub/stat_ser/r473.pdf
[13]
Mohan, K., Pearl, J., and Tian, J. Graphical models for inference with missing data. In Advances in Neural Information Processing Systems 26, C.J.C. Burges, L. Bottou, M. Welling, Z. Ghahramani, and K.Q. Weinberger, Eds. Curran Associates, Inc., Red Hook, NY, 2013, 1277--1285; http://papers.nips.cc/paper/4899-graphical-models-for-inference-with-missing-data.pdf
[14]
Morgan, S.L. and Winship, C. Counterfactuals and Causal Inference: Methods and Principles for Social Research (Analytical Methods for Social Research), Second Edition. Cambridge University Press, New York, 2015.
[15]
Pearl, J. Probabilistic Reasoning in Intelligent Systems. Morgan Kaufmann, San Mateo, CA, 1988.
[16]
Pearl, J. Comment: Graphical models, causality, and intervention. Statistical Science 8, 3 (1993), 266--269.
[17]
Pearl, J. Causal diagrams for empirical research. Biometrika 82, 4 (Dec. 1995), 669--710.
[18]
Pearl, J. Causality: Models, Reasoning, and Inference. Cambridge University Press, New York, 2000; Second Edition, 2009.
[19]
Pearl, J. Direct and indirect effects. In Proceedings of the 17<sup>th</sup> Conference on Uncertainty in Artificial Intelligence (Seattle, WA, Aug. 2--5). Morgan Kaufmann, San Francisco, CA, 2001, 411--420.
[20]
Pearl, J. Causes of effects and effects of causes. Journal of Sociological Methods and Research 44, 1 (2015a), 149--164.
[21]
Pearl, J. Trygve Haavelmo and the emergence of causal calculus. Econometric Theory 31, 1 (2015b), 152--179; special issue on Haavelmo centennial
[22]
Pearl, J. and Bareinboim, E. External validity: From do-calculus to transportability across populations. Statistical Science 29, 4 (2014), 579--595.
[23]
Pearl, J. and Mackenzie, D. The Book of Why: The New Science of Cause and Effect. Basic Books, New York, 2018.
[24]
Peters, J., Janzing, D. and Schölkopf, B. Elements of Causal Inference: Foundations and Learning Algorithms. MIT Press, Cambridge, MA, 2017.
[25]
Porta, M. The deconstruction of paradoxes in epidemiology. OUPblog, Oct. 17, 2014; https://blog.oup.com/2014/10/deconstruction-paradoxes-sociology-epidemiology/
[26]
Ribeiro, M.T., Singh, S., and Guestrin, C. Why should I trust you?: Explaining the predictions of any classifier. In Proceedings of the 22<sup>nd</sup> ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (San Francisco, CA, Aug. 13--17). ACM Press, New York, 2016, 1135--1144.
[27]
Robins, J.M. and Greenland, S. Identifiability and exchangeability for direct and indirect effects. Epidemiology 3, 2 (Mar. 1992), 143--155.
[28]
Rosenbaum, P. and Rubin, D. The central role of propensity score in observational studies for causal effects. Biometrika 70, 1 (Apr. 1983), 41--55.
[29]
Shimizu, S., Hoyer, P.O., Hyvärinen, A., and Kerminen, A.J. A linear non-Gaussian acyclic model for causal discovery. Journal of the Machine Learning Research 7 (Oct. 2006), 2003--2030.
[30]
Shpitser, I. and Pearl, J. Complete identification methods for the causal hierarchy. Journal of Machine Learning Research 9 (2008), 1941--1979.
[31]
Spirtes, P., Glymour, C.N., and Scheines, R. Causation, Prediction, and Search, Second Edition. MIT Press, Cambridge, MA, 2000.
[32]
Tian, J. and Pearl, J. A general identification condition for causal effects. In Proceedings of the 18<sup>th</sup> National Conference on Artificial Intelligence (Edmonton, AB, Canada, July 28-Aug. 1). AAAI Press/MIT Press, Menlo Park, CA, 2002, 567--573.
[33]
van der Laan, M.J. and Rose, S. Targeted Learning: Causal Inference for Observational and Experimental Data. Springer, New York, 2011.
[34]
VanderWeele, T.J. Explanation in Causal Inference: Methods for Mediation and Interaction. Oxford University Press, New York, 2015.
[35]
Zhang, J. and Bareinboim, E. Transfer learning in multi-armed bandits: A causal approach. In Proceedings of the 26<sup>th</sup> International Joint Conference on Artificial Intelligence (Melbourne, Australia, Aug. 19--25). AAAI Press, Menlo Park, CA, 2017, 1340--1346.

Cited By

View all
  • (2025)Causal program dependence analysisScience of Computer Programming10.1016/j.scico.2024.103208240(103208)Online publication date: Feb-2025
  • (2025)The role of artificial intelligence and machine learning in clinical trialsArtificial Intelligence for Drug Product Lifecycle Applications10.1016/B978-0-323-91819-0.00008-7(205-234)Online publication date: 2025
  • (2024)The Role of Big Data in Sustainable SolutionsDesigning Sustainable Internet of Things Solutions for Smart Industries10.4018/979-8-3693-5498-8.ch007(169-208)Online publication date: 22-Nov-2024
  • Show More Cited By

Index Terms

  1. The seven tools of causal inference, with reflections on machine learning

          Recommendations

          Reviews

          Jonathan P. E. Hodgson

          There are three obstacles to meeting the increasing expectations for artificial intelligence (AI), according to this article: the lack of adaptability or robustness; the lack of explainability; and "the lack of understanding of cause-effect connections." This article is mainly concerned with the latter. The author asserts that an intelligent system should be able to answer such questions as "What would have happened if I had acted differently " The author's claim is that all three obstacles and the answering of such questions can be overcome by using causal reasoning. The article gives an overview of causal reasoning and structured causal models (SCM), emphasizing a three-level hierarchy in which each level is capable of answering different types of questions. The first level manages association-questions such as "What does a symptom tell me about a disease " The second level, "intervention," manages questions like "What if we ban cigarettes " The third level manages counterfactuals, dealing with questions like "Would Kennedy be alive had Oswald not shot him " The tools used for causal reasoning include graphical models that show the causal relationships between variables and the " do -calculus," which simulates physical interventions where the distribution resulting from a specific action is predicted. The article's extended example illustrates the process and presents "a bird's-eye view of seven tasks accomplished through the SCM framework." A critical property of Pearl's system is that it is effectively computable. The article provides a lucid introduction to the ideas and is recommended to all AI workers.

          Access critical reviews of Computing literature here

          Become a reviewer for Computing Reviews.

          Comments

          Information & Contributors

          Information

          Published In

          cover image Communications of the ACM
          Communications of the ACM  Volume 62, Issue 3
          March 2019
          109 pages
          ISSN:0001-0782
          EISSN:1557-7317
          DOI:10.1145/3314328
          Issue’s Table of Contents
          Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          Published: 21 February 2019
          Published in CACM Volume 62, Issue 3

          Check for updates

          Qualifiers

          • Research-article
          • Popular
          • Refereed

          Contributors

          Other Metrics

          Bibliometrics & Citations

          Bibliometrics

          Article Metrics

          • Downloads (Last 12 months)4,246
          • Downloads (Last 6 weeks)368
          Reflects downloads up to 14 Oct 2024

          Other Metrics

          Citations

          Cited By

          View all
          • (2025)Causal program dependence analysisScience of Computer Programming10.1016/j.scico.2024.103208240(103208)Online publication date: Feb-2025
          • (2025)The role of artificial intelligence and machine learning in clinical trialsArtificial Intelligence for Drug Product Lifecycle Applications10.1016/B978-0-323-91819-0.00008-7(205-234)Online publication date: 2025
          • (2024)The Role of Big Data in Sustainable SolutionsDesigning Sustainable Internet of Things Solutions for Smart Industries10.4018/979-8-3693-5498-8.ch007(169-208)Online publication date: 22-Nov-2024
          • (2024)Next Generation Computing and Communication Hub for First Responders in Smart CitiesSensors10.3390/s2407236624:7(2366)Online publication date: 8-Apr-2024
          • (2024)Predicting the Liquid Steel End-Point Temperature during the Vacuum Tank Degassing Process Using Machine Learning ModelingProcesses10.3390/pr1207141412:7(1414)Online publication date: 6-Jul-2024
          • (2024)Implications of Minimum Description Length for Adversarial Attack in Natural Language ProcessingEntropy10.3390/e2605035426:5(354)Online publication date: 24-Apr-2024
          • (2024)Causal Economic Machine Learning (CEML): “Human AI”AI10.3390/ai50400945:4(1893-1917)Online publication date: 11-Oct-2024
          • (2024)Brain organoids and organoid intelligence from ethical, legal, and social points of viewFrontiers in Artificial Intelligence10.3389/frai.2023.13076136Online publication date: 5-Jan-2024
          • (2024)Understanding the impact of radiotherapy fractionation on overall survival in a large head and neck squamous cell carcinoma dataset: a comprehensive approach combining mechanistic and machine learning modelsFrontiers in Oncology10.3389/fonc.2024.142221114Online publication date: 13-Aug-2024
          • (2024)Causality and tractable probabilistic modelsFrontiers in Computer Science10.3389/fcomp.2023.12633865Online publication date: 8-Jan-2024
          • Show More Cited By

          View Options

          View options

          PDF

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader

          Digital Edition

          View this article in digital edition.

          Digital Edition

          Magazine Site

          View this article on the magazine site (external)

          Magazine Site

          Get Access

          Login options

          Full Access

          Media

          Figures

          Other

          Tables

          Share

          Share

          Share this Publication link

          Share on social media