# Write a program to implement Huffman Encoding using a greedy strategy.
import heapq
from collections import Counter
# Node class to represent each character and frequency
class Node:
    def __init__(self, char, freq):
        self.char = char # Character
        self.freq = freq # Frequency of the character
        self.left = None # Left child in the tree
        self.right = None # Right child in the tree
    # Define comparison for priority queue
    def __lt__(self, other):
        return self.freq < other.freq
# Function to build the Huffman Tree
def build_huffman_tree(char_freq):
    # Step 1: Create a min-heap (priority queue) with all characters and their frequencies
    heap = [Node(char, freq) for char, freq in char_freq.items()]
    heapq.heapify(heap)
    # Step 2: Combine nodes until we have the final Huffman Tree
    while len(heap) > 1:
        left = heapq.heappop(heap) # Smallest frequency node
        right = heapq.heappop(heap) # Second smallest frequency node
        # Create a new node with combined frequency of the two nodes
        merged = Node(None, left.freq + right.freq)
        merged.left = left
        merged.right = right
        # Add this new node back to the heap
        heapq.heappush(heap, merged)
    # The last remaining node is the root of the Huffman Tree
    return heap[0]
# Function to generate Huffman codes for each character
def build_huffman_codes(root):
    huffman_codes = {}
    # Helper function to generate codes by traversing the tree
    def encode(node, code):
        if node:
            if node.char is not None: # Leaf node
                # For a single-character input the root is itself a leaf;
                # fall back to "0" so its code is not the empty string
                huffman_codes[node.char] = code or "0"
            encode(node.left, code + "0") # Traverse left
            encode(node.right, code + "1") # Traverse right
    encode(root, "")
    return huffman_codes
# Function to encode the input data using the Huffman codes
def huffman_encoding(data):
    # Calculate frequency of each character in the input string
    char_freq = Counter(data)
    # Build the Huffman Tree based on character frequencies
    root = build_huffman_tree(char_freq)
    # Generate the Huffman codes from the tree
    huffman_codes = build_huffman_codes(root)
    # Encode the data by replacing each character with its code
    encoded_data = "".join(huffman_codes[char] for char in data)
    return encoded_data, huffman_codes
# Function to decode the encoded string back to original text
def huffman_decoding(encoded_data, huffman_codes):
    # Reverse the huffman_codes dictionary to get codes as keys
    code_to_char = {code: char for char, code in huffman_codes.items()}
    # Decode the encoded data bit by bit
    current_code = ""
    decoded_data = []
    for bit in encoded_data:
        current_code += bit
        if current_code in code_to_char:
            decoded_data.append(code_to_char[current_code])
            current_code = ""
    return "".join(decoded_data)
# Main program
data = input("Enter a string to encode: ")
encoded_data, huffman_codes = huffman_encoding(data)
print("Encoded data:", encoded_data)
print("Huffman Codes:", huffman_codes)
# Decode the encoded data to verify correctness
decoded_data = huffman_decoding(encoded_data, huffman_codes)
print("Decoded data:", decoded_data)
# Time and Space Complexity Analysis
#   Time Complexity:
#   Building the tree: O(n log n)
#     Inserting a node into the heap and extracting the two smallest each take
#     O(log n), and this happens n - 1 times.
#   Generating codes: O(n)
#     A DFS traversal of the tree takes linear time.
#   Overall Time Complexity: O(n log n)
#   Space Complexity:
#   Heap storage: O(n) for storing all nodes.
#   Tree storage: O(n) for the internal nodes and leaves.
#   Huffman codes: O(n) for storing the codes.
#   Overall Space Complexity: O(n)
# Huffman encoding is a popular algorithm for data compression, especially for
# text. It uses a **greedy strategy** to assign shorter codes to more frequent
# characters, thereby reducing the overall storage size of the data. Here is a
# step-by-step explanation of how Huffman encoding works, guided by the greedy
# approach:
# ### 1. Analyze Character Frequency
# The first step in Huffman encoding is to determine the frequency of each
# character in the text or dataset you want to compress. The frequency count
# helps prioritize which characters should have shorter codes.
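# The frequency analysis in step 1 can be sketched with collections.Counter,
# mirroring what huffman_encoding does above; the sample string "AAABBC" is
# purely illustrative.
from collections import Counter

sample_freq = Counter("AAABBC")
# Counter({'A': 3, 'B': 2, 'C': 1}) -- 'A' is the most frequent character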
# ### 2. Build a Priority Queue (Min-Heap)
# Using a min-heap, create a priority queue where each node contains:
#    - A character from the dataset (or, for internal nodes, no specific
#      character).
#    - The frequency of the character.
# In a greedy approach, nodes with lower frequencies are prioritized, ensuring
# that characters with higher frequencies end up closer to the root of the tree
# and get shorter codes.
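# A minimal sketch of the min-heap from step 2, using (frequency, character)
# tuples instead of Node objects; heapq always pops the lowest frequency first.
import heapq

pq = [(3, "A"), (2, "B"), (1, "C")]
heapq.heapify(pq)
lowest = heapq.heappop(pq)  # (1, "C") -- the least frequent character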
# ### 3. Construct the Huffman Tree
#    - **While there are at least two nodes in the queue**:
#      - Remove the two nodes with the lowest frequency from the priority queue.
#      - Create a new internal node with a frequency equal to the sum of these
#        two nodes' frequencies.
#      - Make the two nodes the left and right children of this new node.
#      - Insert the new node back into the priority queue.
#    This merging process follows the greedy principle of always combining the
#    least frequent nodes first, ensuring minimal "cost" for more frequent
#    characters.
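# The merging loop of step 3 can be traced with plain (frequency, label) tuples,
# a simplified stand-in for the Node-based build_huffman_tree above; "*" marks
# an internal node.
import heapq

demo_heap = [(3, "A"), (2, "B"), (1, "C")]
heapq.heapify(demo_heap)
while len(demo_heap) > 1:
    f1, _ = heapq.heappop(demo_heap)  # smallest frequency
    f2, _ = heapq.heappop(demo_heap)  # second smallest frequency
    heapq.heappush(demo_heap, (f1 + f2, "*"))
root_freq, _ = demo_heap[0]  # the root frequency equals the total count, 6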
# ### 4. Assign Binary Codes
# Once the Huffman tree is complete, assign binary codes to each character:
#    - Traverse the tree from the root to each leaf node (representing a
#      character).
#    - Assign a `0` for each left edge and a `1` for each right edge.
#    - The path taken to reach each character forms its Huffman code.
# Since more frequent characters are closer to the root, their binary codes are
# shorter, achieving efficient compression.
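# Huffman codes built this way are prefix-free: no code is a prefix of another,
# which is what makes bit-by-bit decoding unambiguous. A small sanity check,
# using the illustrative codes A=0, B=10, C=11:
def is_prefix_free(codes):
    values = list(codes.values())
    return not any(a != b and b.startswith(a) for a in values for b in values)

assert is_prefix_free({"A": "0", "B": "10", "C": "11"})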
# ### 5. Encode the Data
# Replace each character in the original data with its Huffman code to get the
# compressed binary string.
# ### Example
#   Consider the string "AAABBC":
#   1. Character frequencies: A = 3, B = 2, C = 1.
#   2. Build initial nodes and use a priority queue to construct the Huffman tree:
#      - Merge B and C (smallest frequencies) into a node with frequency 3.
#      - Merge the new node (frequency 3) with A (frequency 3).
#   3. Assign codes: A might get `0`, B `10`, and C `11`.
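# With those example codes, encoding "AAABBC" takes 3*1 + 2*2 + 1*2 = 9 bits,
# versus 6 * 8 = 48 bits of plain 8-bit ASCII.
example_codes = {"A": "0", "B": "10", "C": "11"}
example_encoded = "".join(example_codes[ch] for ch in "AAABBC")
# example_encoded == "000101011", 9 bits long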
# Thus, Huffman encoding is efficient due to its greedy nature, always making
# local choices (combining the lowest-frequency nodes) that lead to a globally
# optimal tree for encoding.