FP-Growth Algorithm and FP-Tree Construction
November 28, 2024
1 Introduction
The FP-Growth algorithm is an efficient method for frequent pattern mining in
large datasets, which avoids the candidate generation step used in algorithms
like Apriori. It uses a compact data structure called the FP-tree (Frequent
Pattern tree) to represent the database in a compressed form. This allows for
efficient mining of frequent itemsets.
2 Step 1: Calculate Item Frequencies
Before building the FP-tree, we need to calculate the frequency (support) of
each item in the dataset. Support refers to the number of transactions that
contain the item.
2.1 Example Dataset
Consider the following transactions:
Transaction ID Items Purchased
T1 A, B, D, E
T2 B, C, D, E
T3 A, B, D, E, F
T4 B, C, D, E
T5 A, B, D, E, F
2.2 Item Frequency Calculation
The frequency (support) of each item is calculated as follows:
• A: Appears in T1, T3, T5 (3 times)
• B: Appears in T1, T2, T3, T4, T5 (5 times)
• C: Appears in T2, T4 (2 times)
• D: Appears in all transactions (5 times)
• E: Appears in all transactions (5 times)
• F: Appears in T3, T5 (2 times)
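This first counting pass can be sketched in Python using the example transactions above (a minimal sketch; variable names are illustrative):

```python
from collections import Counter

# The five example transactions from the table above.
transactions = [
    ["A", "B", "D", "E"],
    ["B", "C", "D", "E"],
    ["A", "B", "D", "E", "F"],
    ["B", "C", "D", "E"],
    ["A", "B", "D", "E", "F"],
]

# Count how many transactions contain each item (its support count).
# Items never repeat within a transaction here, so a flat count works.
support = Counter(item for t in transactions for item in t)
print(support["B"], support["A"], support["F"])  # 5 3 2
```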
3 Step 2: Build the FP-Tree
Once the item frequencies are calculated, items are sorted in descending order of
their frequencies. Items whose support falls below the minimum support threshold
are removed before constructing the tree. In this example every item has support
of at least 2, so with a minimum support of 2 nothing is pruned.
3.1 Sort Items by Frequency
The items are sorted in descending order of frequency:
B : 5, D : 5, E : 5, A : 3, C : 2, F : 2
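The ordering step can be sketched as follows. The alphabetical tie-break among items with equal support is an assumption made here for determinism; any fixed tie-break rule works:

```python
# Support counts from Step 1.
support = {"A": 3, "B": 5, "C": 2, "D": 5, "E": 5, "F": 2}

# Sort by descending support, breaking ties alphabetically.
order = sorted(support, key=lambda item: (-support[item], item))
print(order)  # ['B', 'D', 'E', 'A', 'C', 'F']
```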
3.2 Insert Transactions into the FP-Tree
Now, we insert each transaction into the FP-tree, sorting the items according
to the frequency list and building the tree. We either increment the count of an
existing node or create a new node.
3.2.1 Inserting Transactions
• T1: A, B, D, E → Sorted: B, D, E, A
• T2: B, C, D, E → Sorted: B, D, E, C
• T3: A, B, D, E, F → Sorted: B, D, E, A, F
• T4: B, C, D, E → Sorted: B, D, E, C
• T5: A, B, D, E, F → Sorted: B, D, E, A, F
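Reordering each transaction against the global frequency order can be sketched like this (the `rank` mapping is a hypothetical helper, not part of any canonical API):

```python
# Global item order from Step 2, mapped to ranks for sorting.
rank = {item: i for i, item in enumerate(["B", "D", "E", "A", "C", "F"])}

def reorder(transaction):
    # Drop items pruned by the support threshold (none in this example),
    # then sort the rest by their global rank.
    return sorted((i for i in transaction if i in rank), key=rank.get)

print(reorder(["A", "B", "D", "E", "F"]))  # ['B', 'D', 'E', 'A', 'F']
```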
3.3 FP-Tree Structure
After inserting all transactions, the FP-tree looks like this:
[ROOT]
   |
 (B:5)
   |
 (D:5)
   |
 (E:5)
   |
 +-------+-------+
 |               |
(A:3)          (C:2)
 |
(F:2)
Here:
• ROOT is the root node.
• Every sorted transaction begins with B, D, E, so each node on that path
carries a count of 5.
• Below E the tree branches: T1, T3, and T5 continue with A (count 3), and
of those, T3 and T5 continue further with F (count 2).
• T2 and T4 continue with C (count 2).
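The insertion procedure from Section 3.2 can be sketched as a small Python class. The class and method names are illustrative, not canonical:

```python
class Node:
    """A minimal FP-tree node: an item label, a count, a parent link,
    and a dictionary of children keyed by item."""
    def __init__(self, item, parent):
        self.item = item
        self.count = 0
        self.parent = parent
        self.children = {}

def insert(root, transaction):
    # Walk down from the root: reuse an existing child when the item
    # matches, otherwise create a new node; increment counts as we go.
    node = root
    for item in transaction:
        if item not in node.children:
            node.children[item] = Node(item, node)
        node = node.children[item]
        node.count += 1

root = Node(None, None)
sorted_transactions = [
    ["B", "D", "E", "A"],
    ["B", "D", "E", "C"],
    ["B", "D", "E", "A", "F"],
    ["B", "D", "E", "C"],
    ["B", "D", "E", "A", "F"],
]
for t in sorted_transactions:
    insert(root, t)

b = root.children["B"]
print(b.count)  # 5
```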
4 Step 3: Header Table
The header table is an essential part of the FP-tree. It stores the items and
their frequencies and provides links to the nodes in the tree. The header table
facilitates efficient mining by allowing easy traversal of the FP-tree.
4.1 Header Table Example
For the FP-tree above, the header table looks like this:
Item Frequency Linked Nodes
B 5 (B : 5)
D 5 (D : 5)
E 5 (E : 5)
A 3 (A : 3)
C 2 (C : 2)
F 2 (F : 2)
In this particular tree each item happens to occur at exactly one node, so
each header entry links to a single node. In general, an item can occur at
several nodes, and the header entry chains all of them together through
node-link pointers.
5 Step 4: Mining the FP-Tree
The mining process involves recursively extracting frequent itemsets from the
FP-tree by examining the header table and conditional pattern bases.
5.1 Conditional Pattern Base for Item B
To mine the patterns related to item B, we look at all paths that contain B and
trace back to the root. The conditional pattern base for B consists of all items
appearing with B in the transactions.
Transactions containing B:
T 1 : B, D, E, A
T 2 : B, D, E, C
T 3 : B, D, E, A, F
T 4 : B, D, E, C
T 5 : B, D, E, A, F
The conditional pattern base for B is:
{D, E, A, F }, {D, E, C}
We then recursively build a conditional FP-tree for this pattern base and
continue mining for further frequent itemsets.
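Collecting a conditional pattern base can be sketched as follows. The node structure (`.item`, `.count`, `.parent` attributes) is an assumption matching the earlier tree sketch; the single-path example mirrors item A in the tree above:

```python
class Node:
    """Minimal node with a parent link, as used in the tree sketch."""
    def __init__(self, item, count, parent):
        self.item, self.count, self.parent = item, count, parent

def conditional_pattern_base(nodes):
    # For each node of the target item, walk parent links up to the
    # root, collecting the prefix path weighted by the node's count.
    base = []
    for node in nodes:
        path, p = [], node.parent
        while p is not None and p.item is not None:
            path.append(p.item)
            p = p.parent
        base.append((list(reversed(path)), node.count))
    return base

# Rebuild just the path ROOT -> B -> D -> E -> A from the example tree.
root = Node(None, 0, None)
b = Node("B", 5, root)
d = Node("D", 5, b)
e = Node("E", 5, d)
a = Node("A", 3, e)
print(conditional_pattern_base([a]))  # [(['B', 'D', 'E'], 3)]
```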
6 Step 5: Recursive Mining and Frequent Itemsets
The mining process continues recursively for each item in the header table, and
we extract frequent itemsets. The frequent itemsets for the given dataset could
include:
{B} : Support = 5
{D} : Support = 5
{E} : Support = 5
{A} : Support = 3
{B, D} : Support = 5
{B, E} : Support = 5
{B, D, E} : Support = 5
{A, B} : Support = 3
{A, D} : Support = 3
{A, B, D} : Support = 3
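The supports above can be double-checked by brute force against the raw transactions, which is a useful sanity test when implementing FP-Growth:

```python
# The five example transactions, as sets for fast subset tests.
transactions = [
    {"A", "B", "D", "E"},
    {"B", "C", "D", "E"},
    {"A", "B", "D", "E", "F"},
    {"B", "C", "D", "E"},
    {"A", "B", "D", "E", "F"},
]

def support(itemset):
    # Count the transactions that contain every item of the itemset.
    return sum(1 for t in transactions if set(itemset) <= t)

print(support({"B", "D", "E"}))  # 5
print(support({"A", "B", "D"}))  # 3
```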
7 Advantages of FP-Growth
• No Candidate Generation: Unlike Apriori, FP-Growth does not generate
candidate itemsets, which reduces computation time.
• Efficient Memory Use: The FP-tree is a compressed representation of
the dataset, saving memory.
• Scalability: FP-Growth scales well with large datasets due to its efficient
use of memory and reduced I/O operations.
8 Conclusion
The FP-Growth algorithm is a powerful tool for frequent pattern mining. By
using the FP-tree and header table, FP-Growth efficiently mines frequent itemsets
without the need for candidate generation, making it faster and more scalable
than other algorithms like Apriori.