Performance Evaluation of Approximate Pattern Mining Based on Probabilistic and Statistical Techniques

Unil Yun⁵,
Gwangbum Pyun⁵ &
Sung-Jin Kim⁵

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 215))

985 Accesses

Abstract

Approximate frequent pattern mining is to find approximate patterns, not exact frequent patterns with tolerable variations for more efficiency. As the size of database increases, much faster mining techniques are needed to deal with huge databases. Moreover, it is more difficult to discover exact results of mining patterns due to inherent noise or data diversity. In these cases, by mining approximate frequent patterns, more efficient mining can be performed in terms of runtime, memory usage and scalability. In this paper, we benchmark efficient algorithms of mining approximate frequent patterns based on statistical and probabilistic methods. We study the characteristics of approximate mining algorithms, and perform performance evaluations of the state of the art approximate mining algorithms. Finally, we analyze the test results for more improvement.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Comparative evaluation of pattern mining techniques: an empirical study

Article Open access 11 November 2020

Performance and characteristic analysis of maximal frequent pattern mining methods using additional factors

Article 25 September 2017

Analysis of Recent Maximal Frequent Pattern Mining Approaches

References

Chen C, Yan X, Zhu F, Han J (2007) gApprox: mining frequent approximate patterns from a massive network. ICDM, pp 445–450
Google Scholar
Chi R, Wai A (2006) Mining top-K frequent itemsets from data streams. Data Min Knowl Discov 13(2):197–217
Google Scholar
Han J, Pei J, Yin Y, Mao R (2004) Mining frequent patterns without candidate generation: a frequent pattern tree approach. Data Min Knowl Disc 8:53–87
Article MathSciNet Google Scholar
Han J, Cheng H, Xin D, Yan X (2007) Frequent pattern mining: current status and future directions. Data Min Knowl Discov (DMKD) l.15(1):55–86
Google Scholar
Manku G, Motwani R (2002) Approximate frequency counts over data streams. VLDB
Google Scholar
Wong RC, Fu AW (2006) Mining top-K frequent itemsets from data streams. Data Min Knowl Discov 13:193–217
Article MathSciNet Google Scholar
Wong P, Chan T, Wong MH, Leung K (2012) Predicting approximate protein-DNA binding cores using association rule mining, ICDE pp 965–976
Google Scholar
Yun U, Ryu K (2011) Approximate weight frequent pattern mining with/without noisy environments. Knowl Based Syst 24(1):73–82
Article Google Scholar
Zhao Y, Zhang C, Zhang S (2006) Efficient frequent itemsets mining by sampling, advances in intelligent IT. Active Media Technology, pp 112–117
Google Scholar
Zhu F, Yan X, Han J, Yu PS (2007) Efficient discovery of frequent approximate sequential patterns. In: International conference on data mining (ICDM), pp 751–756
Google Scholar

Download references

Acknowledgments

This research was supported by the National Research Foundation of Korea (NRF) funded by the Ministry of Education, Science and Technology (NRF No. 2012-0003740 and 2012-0000478).

Author information

Authors and Affiliations

Department of Computer Engineering, Chungbuk National University, 410, Gaesin-dong, Heungdeok-gu, Cheongju, Republic of Korea
Unil Yun, Gwangbum Pyun & Sung-Jin Kim

Authors

Unil Yun
View author publications
You can also search for this author in PubMed Google Scholar
Gwangbum Pyun
View author publications
You can also search for this author in PubMed Google Scholar
Sung-Jin Kim
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Sung-Jin Kim .

Editor information

Editors and Affiliations

Convergence Security, Kyoung-gi University, Suwon, Gyeonggi-do, Korea, Republic of (South Korea)
Kuinam J. Kim
Dept. of Computer Information Engineerin, Sangji University, Wonju-si Gangwon-do, Korea, Republic of (South Korea)
Kyung-Yong Chung

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Yun, U., Pyun, G., Kim, SJ. (2013). Performance Evaluation of Approximate Pattern Mining Based on Probabilistic and Statistical Techniques. In: Kim, K., Chung, KY. (eds) IT Convergence and Security 2012. Lecture Notes in Electrical Engineering, vol 215. Springer, Dordrecht. https://doi.org/10.1007/978-94-007-5860-5_115

Download citation

DOI: https://doi.org/10.1007/978-94-007-5860-5_115
Published: 11 December 2012
Publisher Name: Springer, Dordrecht
Print ISBN: 978-94-007-5859-9
Online ISBN: 978-94-007-5860-5
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics

Performance Evaluation of Approximate Pattern Mining Based on Probabilistic and Statistical Techniques

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Comparative evaluation of pattern mining techniques: an empirical study

Performance and characteristic analysis of maximal frequent pattern mining methods using additional factors

Analysis of Recent Maximal Frequent Pattern Mining Approaches

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Performance Evaluation of Approximate Pattern Mining Based on Probabilistic and Statistical Techniques

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Comparative evaluation of pattern mining techniques: an empirical study

Performance and characteristic analysis of maximal frequent pattern mining methods using additional factors

Analysis of Recent Maximal Frequent Pattern Mining Approaches

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation