Abstract
We present the Mambo algorithm for the discovery of association rules. Mambo is driven by conditional independence relations between the variables instead of the minimum support restrictions of algorithms like Apriori. We argue that making use of conditional independencies is an intuitively appealing way to restrict the set of association rules considered. Since we only have a finite sample from the probability distribution of interest, we have to deal with uncertainty concerning the conditional independencies present. Bayesian methods are used to quantify this uncertainty, and the posterior probabilities of conditional independence relations are estimated with the Markov Chain Monte Carlo technique. We analyse an insurance data set with Mambo and illustrate the differences in results compared to Apriori.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Agrawal, R., Imilinski, T., Swami, A.: Mining Association Rules between Sets of Items in Large Databases. In Proc. ACM SIGMOD Conf. Management of Data. (1993) 207–216
Agrawal, R., Srikant, R.: Fast Algorithms for Mining Association Rules. In Proc. 20th Int’l Conf Very Large Databases. (1994)
Bay, S. D.: The UCI KDD Archive [http://kdd.ics.uci.edu]. Irvine, CA: University of California, Department of Information and Computer Science. (1999)
Chung, K.L.: Markov Chains with Stationary Transition Probabilities (2nd ed). Springer-Verlag (1967)
Cohen, E., Datar, M., Funjiwara, S., Gionis, A., Indyk, P., Motwani, R., Ullman, J.D., Yang, C.: Finding Interesting Associations without Support Pruning. In IEEE Transactions on Knowledge and Data Engineering. 13(1) (2001) 64–78
Giudici, P. and Castelo, R.: Improving Markov Chain Monte Carlo model search for data mining. Technical Report 117, Department of Economics and Quantitative Methods, University of Pavia, Italy. (2000)
Dawid, A.P. and Lauritzen, S.L.: Hyper Markov laws in the statistical analysis of decomposable graphical models. Annals of Statistics. 21 (1993) 1272–1317
Madigan, D. and York, J.: Bayesian graphical models for discrete data. International Statistical Review. 63 (1995) 215–232
Pearl, J.: Probabilistic Reasoning in intelligent systems. Morgan Kaufmann (1988)
Silverstein, C., Brin, S., Motwani, R.: Beyond Market Baskets: Generalizing Association Rules to Dependence Rules. Data Mining and Knowledge Discovery. 2 (1998) 69–96
Lauritzen, S.L. and Spiegelhalter, D.J.: Local computations with probabilities on graphical structures and their application to expert systems (with discussion). Journal of the Royal Statistical Society, Series B. 50 (1988) 157–224
Tanner, M.A.: Tools for Statistical Inference 3ed. Springer (1996)
Whittaker, J.: Graphical Models in Applied Mathematical Multivariate Statistics. Wiley (1990)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2001 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Castelo, R., Feelders, A., Siebes, A. (2001). MAMBO: Discovering Association Rules Based on Conditional Independencies. In: Hoffmann, F., Hand, D.J., Adams, N., Fisher, D., Guimaraes, G. (eds) Advances in Intelligent Data Analysis. IDA 2001. Lecture Notes in Computer Science, vol 2189. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44816-0_29
Download citation
DOI: https://doi.org/10.1007/3-540-44816-0_29
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-42581-6
Online ISBN: 978-3-540-44816-7
eBook Packages: Springer Book Archive