research-article

Open access

The seven tools of causal inference, with reflections on machine learning

Author:

Judea PearlAuthors Info & Claims

Communications of the ACM, Volume 62, Issue 3

Pages 54 - 60

https://doi.org/10.1145/3241036

Published: 21 February 2019 Publication History

All formats PDF

Abstract

The kind of causal inference seen in natural human thought can be "algorithmitized" to help produce human-level machine intelligence.

References

[1]

Balke, A. and Pearl, J. Probabilistic evaluation of counterfactual queries. In Proceedings of the 12th National Conference on Artificial Intelligence (Seattle, WA, July 31-Aug. 4). MIT Press, Menlo Park, CA, 1994, 230--237.

Digital Library

Google Scholar

[2]

Bareinboim, E. and Pearl, J. Causal inference by surrogate experiments: z-identifiability. In Proceedings of the 28th Conference on Uncertainty in Artificial Intelligence, N. de Freitas and K. Murphy, Eds. (Catalina Island, CA, Aug. 14--18). AUAI Press, Corvallis, OR, 2012, 113--120.

Digital Library

Google Scholar

[3]

Bareinboim, E. and Pearl, J. Causal inference and the data-fusion problem. Proceedings of the National Academy of Sciences 113, 27 (2016), 7345--7352.

Crossref

Google Scholar

[4]

Chen, Z. and Liu, B. Lifelong Machine Learning. Morgan and Claypool Publishers, San Rafael, CA, 2016.

Digital Library

Google Scholar

[5]

Darwiche, A. Human-Level Intelligence or Animal-Like Abilities? Technical Report. Department of Computer Science, University of California, Los Angeles, CA, 2017; https://arxiv.org/pdf/1707.04327.pdf

Google Scholar

[6]

Graham, J. Missing Data: Analysis and Design (Statistics for Social and Behavioral Sciences). Springer, 2012.

Crossref

Google Scholar

[7]

Halpern, J.H. and Pearl, J. Causes and explanations: A structural-model approach: Part I: Causes. British Journal of Philosophy of Science 56 (2005), 843--887.

Crossref

Google Scholar

[8]

Hutson, M. AI researchers allege that machine learning is alchemy. Science (May 3, 2018); https://www.sciencemag.org/news/2018/05/ai-researchers-allege-machine-learning-alchemy

Google Scholar

[9]

Jaber, A., Zhang, J.J., and Bareinboim, E. Causal identification under Markov equivalence. In Proceedings of the 34th Conference on Uncertainty in Artificial Intelligence, A. Globerson and R. Silva, Eds. (Monterey, CA, Aug. 6--10). AUAI Press, Corvallis, OR, 2018, 978--987.

Google Scholar

[10]

Lake, B.M., Salakhutdinov, R., and Tenenbaum, J.B. Human-level concept learning through probabilistic program induction. Science 350, 6266 (Dec. 2015), 1332--1338.

Crossref

Google Scholar

[11]

Marcus, G. Deep Learning: A Critical Appraisal. Technical Report. Departments of Psychology and Neural Science, New York University, New York, 2018; https://arxiv.org/pdf/1801.00631.pdf

Google Scholar

[12]

Mohan, K. and Pearl, J. Graphical Models for Processing Missing Data. Technical Report R-473. Department of Computer Science, University of California, Los Angeles, CA, 2018; forthcoming, Journal of American Statistical Association; http://ftp.cs.ucla.edu/pub/stat_ser/r473.pdf

Google Scholar

[13]

Mohan, K., Pearl, J., and Tian, J. Graphical models for inference with missing data. In Advances in Neural Information Processing Systems 26, C.J.C. Burges, L. Bottou, M. Welling, Z. Ghahramani, and K.Q. Weinberger, Eds. Curran Associates, Inc., Red Hook, NY, 2013, 1277--1285; http://papers.nips.cc/paper/4899-graphical-models-for-inference-with-missing-data.pdf

Digital Library

Google Scholar

[14]

Morgan, S.L. and Winship, C. Counterfactuals and Causal Inference: Methods and Principles for Social Research (Analytical Methods for Social Research), Second Edition. Cambridge University Press, New York, 2015.

Google Scholar

[15]

Pearl, J. Probabilistic Reasoning in Intelligent Systems. Morgan Kaufmann, San Mateo, CA, 1988.

Digital Library

Google Scholar

[16]

Pearl, J. Comment: Graphical models, causality, and intervention. Statistical Science 8, 3 (1993), 266--269.

Crossref

Google Scholar

[17]

Pearl, J. Causal diagrams for empirical research. Biometrika 82, 4 (Dec. 1995), 669--710.

Crossref

Google Scholar

[18]

Pearl, J. Causality: Models, Reasoning, and Inference. Cambridge University Press, New York, 2000; Second Edition, 2009.

Digital Library

Google Scholar

[19]

Pearl, J. Direct and indirect effects. In Proceedings of the 17th Conference on Uncertainty in Artificial Intelligence (Seattle, WA, Aug. 2--5). Morgan Kaufmann, San Francisco, CA, 2001, 411--420.

Digital Library

Google Scholar

[20]

Pearl, J. Causes of effects and effects of causes. Journal of Sociological Methods and Research 44, 1 (2015a), 149--164.

Google Scholar

[21]

Pearl, J. Trygve Haavelmo and the emergence of causal calculus. Econometric Theory 31, 1 (2015b), 152--179; special issue on Haavelmo centennial

Crossref

Google Scholar

[22]

Pearl, J. and Bareinboim, E. External validity: From do-calculus to transportability across populations. Statistical Science 29, 4 (2014), 579--595.

Crossref

Google Scholar

[23]

Pearl, J. and Mackenzie, D. The Book of Why: The New Science of Cause and Effect. Basic Books, New York, 2018.

Digital Library

Google Scholar

[24]

Peters, J., Janzing, D. and Schölkopf, B. Elements of Causal Inference: Foundations and Learning Algorithms. MIT Press, Cambridge, MA, 2017.

Digital Library

Google Scholar

[25]

Porta, M. The deconstruction of paradoxes in epidemiology. OUPblog, Oct. 17, 2014; https://blog.oup.com/2014/10/deconstruction-paradoxes-sociology-epidemiology/

Google Scholar

[26]

Ribeiro, M.T., Singh, S., and Guestrin, C. Why should I trust you?: Explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (San Francisco, CA, Aug. 13--17). ACM Press, New York, 2016, 1135--1144.

Digital Library

Google Scholar

[27]

Robins, J.M. and Greenland, S. Identifiability and exchangeability for direct and indirect effects. Epidemiology 3, 2 (Mar. 1992), 143--155.

Crossref

Google Scholar

[28]

Rosenbaum, P. and Rubin, D. The central role of propensity score in observational studies for causal effects. Biometrika 70, 1 (Apr. 1983), 41--55.

Crossref

Google Scholar

[29]

Shimizu, S., Hoyer, P.O., Hyvärinen, A., and Kerminen, A.J. A linear non-Gaussian acyclic model for causal discovery. Journal of the Machine Learning Research 7 (Oct. 2006), 2003--2030.

Digital Library

Google Scholar

[30]

Shpitser, I. and Pearl, J. Complete identification methods for the causal hierarchy. Journal of Machine Learning Research 9 (2008), 1941--1979.

Digital Library

Google Scholar

[31]

Spirtes, P., Glymour, C.N., and Scheines, R. Causation, Prediction, and Search, Second Edition. MIT Press, Cambridge, MA, 2000.

Google Scholar

[32]

Tian, J. and Pearl, J. A general identification condition for causal effects. In Proceedings of the 18th National Conference on Artificial Intelligence (Edmonton, AB, Canada, July 28-Aug. 1). AAAI Press/MIT Press, Menlo Park, CA, 2002, 567--573.

Digital Library

Google Scholar

[33]

van der Laan, M.J. and Rose, S. Targeted Learning: Causal Inference for Observational and Experimental Data. Springer, New York, 2011.

Crossref

Google Scholar

[34]

VanderWeele, T.J. Explanation in Causal Inference: Methods for Mediation and Interaction. Oxford University Press, New York, 2015.

Google Scholar

[35]

Zhang, J. and Bareinboim, E. Transfer learning in multi-armed bandits: A causal approach. In Proceedings of the 26th International Joint Conference on Artificial Intelligence (Melbourne, Australia, Aug. 19--25). AAAI Press, Menlo Park, CA, 2017, 1340--1346.

Digital Library

Google Scholar

Cited By

View all

Lee SBinkley DFeldt RGold NYoo S(2025)Causal program dependence analysisScience of Computer Programming10.1016/j.scico.2024.103208240(103208)Online publication date: Feb-2025
https://doi.org/10.1016/j.scico.2024.103208
Dri DMassella MCarafa MMarianecci C(2025)The role of artificial intelligence and machine learning in clinical trialsArtificial Intelligence for Drug Product Lifecycle Applications10.1016/B978-0-323-91819-0.00008-7(205-234)Online publication date: 2025
https://doi.org/10.1016/B978-0-323-91819-0.00008-7
Igoche IAyem G(2024)The Role of Big Data in Sustainable SolutionsDesigning Sustainable Internet of Things Solutions for Smart Industries10.4018/979-8-3693-5498-8.ch007(169-208)Online publication date: 22-Nov-2024
https://doi.org/10.4018/979-8-3693-5498-8.ch007
Show More Cited By

Index Terms

The seven tools of causal inference, with reflections on machine learning

Recommendations

Causal Inference Meets Machine Learning
KDD '20: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining

Causal inference has numerous real-world applications in many domains such as health care, marketing, political science and online advertising. Treatment effect estimation, a fundamental problem in causal inference, has been extensively studied in ...
Causal Learning with Occam’s Razor
Abstract
Occam’s razor directs us to adopt the simplest hypothesis consistent with the evidence. Learning theory provides a precise definition of the inductive simplicity of a hypothesis for a given learning problem. This definition specifies a learning ...
Causal Inference and Causal Machine Learning with Practical Applications: The paper highlights the concepts of Causal Inference and Causal ML along with different implementation techniques
CODS-COMAD '23: Proceedings of the 6th Joint International Conference on Data Science & Management of Data (10th ACM IKDD CODS and 28th COMAD)

One of the most important research areas in Machine Learning is to build prescriptive models. This requires understanding and measurement of the causal impact of any proposed treatment, followed by designing optimal strategy based on such causal ...

Reviews

Reviewer: Jonathan P. E. Hodgson

There are three obstacles to meeting the increasing expectations for artificial intelligence (AI), according to this article: the lack of adaptability or robustness; the lack of explainability; and "the lack of understanding of cause-effect connections." This article is mainly concerned with the latter. The author asserts that an intelligent system should be able to answer such questions as "What would have happened if I had acted differently " The author's claim is that all three obstacles and the answering of such questions can be overcome by using causal reasoning. The article gives an overview of causal reasoning and structured causal models (SCM), emphasizing a three-level hierarchy in which each level is capable of answering different types of questions. The first level manages association-questions such as "What does a symptom tell me about a disease " The second level, "intervention," manages questions like "What if we ban cigarettes " The third level manages counterfactuals, dealing with questions like "Would Kennedy be alive had Oswald not shot him " The tools used for causal reasoning include graphical models that show the causal relationships between variables and the " do -calculus," which simulates physical interventions where the distribution resulting from a specific action is predicted. The article's extended example illustrates the process and presents "a bird's-eye view of seven tasks accomplished through the SCM framework." A critical property of Pearl's system is that it is effectively computable. The article provides a lucid introduction to the ideas and is recommended to all AI workers.

Access critical reviews of Computing literature here

Become a reviewer for Computing Reviews.

Comments

Information & Contributors

Information

Published In

Communications of the ACM Volume 62, Issue 3

March 2019

109 pages

ISSN:0001-0782

EISSN:1557-7317

DOI:10.1145/3314328

Editor:
Andrew A. Chien
Association for Computing Machinery, New York, NY

Issue’s Table of Contents

Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 21 February 2019

Published in CACM Volume 62, Issue 3

Check for updates

Qualifiers

Research-article
Popular
Refereed

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

322
Total Citations
View Citations
63,564
Total Downloads

Downloads (Last 12 months)4,246
Downloads (Last 6 weeks)368

Reflects downloads up to 14 Oct 2024

Other Metrics

View Author Metrics

Citations

Cited By

View all

Lee SBinkley DFeldt RGold NYoo S(2025)Causal program dependence analysisScience of Computer Programming10.1016/j.scico.2024.103208240(103208)Online publication date: Feb-2025
https://doi.org/10.1016/j.scico.2024.103208
Dri DMassella MCarafa MMarianecci C(2025)The role of artificial intelligence and machine learning in clinical trialsArtificial Intelligence for Drug Product Lifecycle Applications10.1016/B978-0-323-91819-0.00008-7(205-234)Online publication date: 2025
https://doi.org/10.1016/B978-0-323-91819-0.00008-7
Igoche IAyem G(2024)The Role of Big Data in Sustainable SolutionsDesigning Sustainable Internet of Things Solutions for Smart Industries10.4018/979-8-3693-5498-8.ch007(169-208)Online publication date: 22-Nov-2024
https://doi.org/10.4018/979-8-3693-5498-8.ch007
Shaposhnyk OLai KWolbring GShmerko VYanushkevich S(2024)Next Generation Computing and Communication Hub for First Responders in Smart CitiesSensors10.3390/s2407236624:7(2366)Online publication date: 8-Apr-2024
https://doi.org/10.3390/s24072366
Vita RCarlsson LSamuelsson P(2024)Predicting the Liquid Steel End-Point Temperature during the Vacuum Tank Degassing Process Using Machine Learning ModelingProcesses10.3390/pr1207141412:7(1414)Online publication date: 6-Jul-2024
https://doi.org/10.3390/pr12071414
Tiwari KZhang L(2024)Implications of Minimum Description Length for Adversarial Attack in Natural Language ProcessingEntropy10.3390/e2605035426:5(354)Online publication date: 24-Apr-2024
https://doi.org/10.3390/e26050354
Horton A(2024)Causal Economic Machine Learning (CEML): “Human AI”AI10.3390/ai50400945:4(1893-1917)Online publication date: 11-Oct-2024
https://doi.org/10.3390/ai5040094
Hartung TMorales Pantoja ISmirnova L(2024)Brain organoids and organoid intelligence from ethical, legal, and social points of viewFrontiers in Artificial Intelligence10.3389/frai.2023.13076136Online publication date: 5-Jan-2024
https://doi.org/10.3389/frai.2023.1307613
Shuryak IWang EBrenner D(2024)Understanding the impact of radiotherapy fractionation on overall survival in a large head and neck squamous cell carcinoma dataset: a comprehensive approach combining mechanistic and machine learning modelsFrontiers in Oncology10.3389/fonc.2024.142221114Online publication date: 13-Aug-2024
https://doi.org/10.3389/fonc.2024.1422211
Cruz DBatista J(2024)Causality and tractable probabilistic modelsFrontiers in Computer Science10.3389/fcomp.2023.12633865Online publication date: 8-Jan-2024
https://doi.org/10.3389/fcomp.2023.1263386
Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Digital Edition

View this article in digital edition.

Digital Edition

Magazine Site

View this article on the magazine site (external)

Magazine Site

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Abstract

References

Cited By

Index Terms

Recommendations

Causal Inference Meets Machine Learning

Causal Learning with Occam’s Razor

Causal Inference and Causal Machine Learning with Practical Applications: The paper highlights the concepts of Causal Inference and Causal ML along with different implementation techniques

Reviews

Access critical reviews of Computing literature here

Comments

Information

Published In

Publisher

Publication History

Check for updates

Qualifiers

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

View options

PDF

eReader

Digital Edition

Magazine Site

Get Access

Login options

Full Access

Figures

Other

Share

Share this Publication link

Share on social media

Affiliations