ENSC 424 - Multimedia
Communications Engineering
Topic 6: Arithmetic Coding 1
Jie Liang
Engineering Science
Simon Fraser University
JieL@sfu.ca
J. Liang: SFU ENSC 424 9/20/2005 1
Outline
Introduction
Basic Encoding and Decoding
Scaling and Incremental Coding
Integer Implementation
Adaptive Arithmetic Coding
Binary Arithmetic Coding
Applications
JBIG, H.264, JPEG 2000
Huffman Coding: The Retired Champion
Replaces each input symbol with a codeword
Needs a probability distribution
Hard to adapt to changing statistics
Needs to store the codeword table
Minimum codeword length is 1 bit
Arithmetic Coding: The Rising Star
Replaces the entire input with a single floating-point number
Does not need the probability distribution in advance
Adaptive coding is very easy
No need to keep and send a codeword table
Fractional codeword length
History of Arithmetic Coding
Claude Shannon: 1916-2001
A distant relative of Thomas Edison
1932: Went to University of Michigan.
1937: Master's thesis at MIT became the foundation of digital circuit design:
“The most important, and also the most famous, master's thesis of the century”
1940: PhD, MIT
1940-1956: Bell Lab (back to MIT after that)
1948: The birth of Information Theory
A mathematical theory of communication, Bell System Technical Journal.
Earliest idea of arithmetic coding
Robert Fano: 1917-
Shannon-Fano code: proved to be sub-optimal by Huffman
1952: First Information Theory class. Students included:
David Huffman: Huffman Coding
Peter Elias: Recursive implementation of arithmetic coding
Frederick Jelinek
Also Fano’s student: PhD MIT 1962 (now at Johns Hopkins)
1968: Further development of arithmetic coding
1976: Rediscovered by Pasco and Rissanen
Practical implementation: since 1980’s
Bell Lab for Sale: http://www.spectrum.ieee.org/sep05/1683
Introduction
Recall table look-up decoding of Huffman code:
  N: alphabet size
  L: max codeword length
  Divide [0, 2^L) into N intervals, one interval for one symbol
  Interval size is roughly proportional to symbol prob.
  [Figure: codewords 1, 00, 010, 011 over the intervals 000, 010, 011, 100]
Arithmetic coding applies this idea recursively:
  Normalizes the range [0, 2^L) to [0, 1)
  Maps an input sequence to a unique tag in [0, 1)
  [Figure: sequences abcd….., dcba….. mapped to points in [0, 1)]
Arithmetic Coding
Disjoint and complete partition of the range [0, 1):
  [0, 0.8), [0.8, 0.82), [0.82, 1)
Each interval corresponds to one symbol (a, b, c)
Interval size is proportional to symbol probability
The first symbol restricts the tag position to be in one of the intervals
The reduced interval is partitioned recursively as more symbols are processed
[Figure: successive refinement of [0, 1) as each symbol is processed]
Observation: once the tag falls into an interval, it never gets out of it
Some Questions to think about:
Why is compression achieved this way?
How to implement it efficiently?
How to decode the sequence?
Why is it better than Huffman code?
Possible Ways to Terminate Encoding
1. Define an end of file (EOF) symbol in the alphabet and assign a
   probability to it: partition [0, 1) into intervals for a, b, c, EOF.
2. Encode the lower end of the final range.
3. If the number of symbols is known to the decoder, encode any
   convenient number in the final range.
Example:
  Symbol   Prob.
    1      0.8
    2      0.02
    3      0.18
Map to the real line range [0, 1): partition points 0, 0.8, 0.82, 1.0
  Order does not matter, but the decoder needs to use the same order
Disjoint but complete partition:
  1: [0, 0.8):      0 to 0.799999…9
  2: [0.8, 0.82):   0.8 to 0.819999…9
  3: [0.82, 1):     0.82 to 0.999999…9
Encoding Input sequence: “1321”
  Start:   partition [0 | 0.8 | 0.82 | 1.0),                          range 1
  Input 1: interval [0, 0.8), partition [0 | 0.64 | 0.656 | 0.8),     range 0.8
  Input 3: interval [0.656, 0.8),
           partition [0.656 | 0.7712 | 0.77408 | 0.8),                range 0.144
  Input 2: interval [0.7712, 0.77408),
           partition [0.7712 | 0.773504 | 0.7735616 | 0.77408),      range 0.00288
  Input 1: interval [0.7712, 0.773504)
Termination: Encode the lower end (0.7712) to signal the end.
Difficulties:
  1. Shrinking of the interval requires very high precision for long sequences.
  2. No output is generated until the entire sequence has been processed.
Cumulative Distribution Function (CDF)
For a continuous distribution:
  F_X(x) = P(X ≤ x) = ∫_{−∞}^{x} p(t) dt
For a discrete distribution:
  F_X(i) = P(X ≤ i) = ∑_{k=−∞}^{i} P(X = k)
[Figure: example probability mass function over symbols 1–4 and the
 corresponding staircase CDF rising to 1.0]
Properties:
  Non-decreasing
  Piece-wise constant
  Each segment is closed at the lower end.
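The discrete CDF can be tabulated directly from the symbol probabilities. A minimal sketch, using the probabilities from the running example (the variable names are this illustration's own):

```python
from itertools import accumulate

# Symbol probabilities from the running example: P(1)=0.8, P(2)=0.02, P(3)=0.18
probs = [0.8, 0.02, 0.18]

# cdf[n] = F_X(n) = P(X <= n); prepend F_X(0) = 0 so that
# symbol n owns the half-open interval [cdf[n-1], cdf[n])
cdf = [0.0] + list(accumulate(probs))

print(cdf)  # approximately [0.0, 0.8, 0.82, 1.0]
```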
Encoder Pseudo Code
Keep track of LOW, HIGH, RANGE. Any two are sufficient, e.g., LOW and RANGE.

  LOW = 0.0;  HIGH = 1.0;
  while (not EOF) {
      n = ReadSymbol();
      RANGE = HIGH - LOW;
      HIGH  = LOW + RANGE * CDF(n);
      LOW   = LOW + RANGE * CDF(n-1);
  }
  output LOW;

  Input    HIGH                               LOW                               RANGE
  Initial  1.0                                0.0                               1.0
  1        0.0 + 1.0*0.8      = 0.8           0.0 + 1.0*0        = 0.0          0.8
  3        0.0 + 0.8*1        = 0.8           0.0 + 0.8*0.82     = 0.656        0.144
  2        0.656 + 0.144*0.82 = 0.77408       0.656 + 0.144*0.8  = 0.7712       0.00288
  1        0.7712 + 0.00288*0.8 = 0.773504    0.7712 + 0.00288*0 = 0.7712       0.002304
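The pseudo code above translates almost line for line into Python. This sketch hard-codes the example CDF and ignores the precision problem (plain floats):

```python
# Direct Python sketch of the encoder pseudo code, for the example
# alphabet {1, 2, 3} with P(1)=0.8, P(2)=0.02, P(3)=0.18.
CDF = [0.0, 0.8, 0.82, 1.0]  # CDF[n] = P(X <= n), so CDF[0] = 0

def encode(symbols):
    low, high = 0.0, 1.0
    for n in symbols:            # n is a 1-based symbol index
        rng = high - low
        high = low + rng * CDF[n]
        low  = low + rng * CDF[n - 1]
    return low                   # termination: send the lower end

tag = encode([1, 3, 2, 1])
print(tag)  # ~0.7712, matching the table above
```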
Decoding: Receive 0.7712
  0.7712 ∈ [0, 0.8)          in partition [0 | 0.8 | 0.82 | 1.0)                → Decode 1
  0.7712 ∈ [0.656, 0.8)      in partition [0 | 0.64 | 0.656 | 0.8)              → Decode 3
  0.7712 ∈ [0.7712, 0.77408) in partition [0.656 | 0.7712 | 0.77408 | 0.8)      → Decode 2
  0.7712 ∈ [0.7712, 0.773504) in partition [0.7712 | 0.773504 | 0.7735616 | 0.77408) → Decode 1
Drawback: need to recalculate all the thresholds each time.
Simplified Decoding
Normalize RANGE to [0, 1) each time: x ← (x − low) / range
No need to recalculate the thresholds.
  Receive 0.7712:                   0.7712 ∈ [0, 0.8)    → Decode 1
  x = (0.7712 − 0) / 0.8 = 0.964:   0.964 ∈ [0.82, 1)    → Decode 3
  x = (0.964 − 0.82) / 0.18 = 0.8:  0.8 ∈ [0.8, 0.82)    → Decode 2
  x = (0.8 − 0.8) / 0.02 = 0:       0 ∈ [0, 0.8)         → Decode 1. Stop.
Decoder Pseudo Code
  LOW = 0.0;  HIGH = 1.0;
  x = GetEncodedNumber();
  while (x ≠ LOW) {
      n = DecodeOneSymbol(x);
      output symbol n;
      x = (x - CDF(n-1)) / (CDF(n) - CDF(n-1));
  }
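A Python sketch of the simplified decoder. Two choices here are this illustration's own: exact rationals (`Fraction`) are used so interval boundaries are not blurred by floating-point rounding, and the symbol count is assumed known to the decoder (termination option 3 on an earlier slide), since testing `x == low` with floats is fragile:

```python
from fractions import Fraction

# Example CDF as exact rationals: P(1)=0.8, P(2)=0.02, P(3)=0.18
CDF = [Fraction(0), Fraction(8, 10), Fraction(82, 100), Fraction(1)]

def decode(x, num_symbols):
    out = []
    for _ in range(num_symbols):
        # Find the symbol n whose interval [CDF[n-1], CDF[n]) contains x
        n = next(k for k in range(1, len(CDF)) if x < CDF[k])
        out.append(n)
        # Normalize the chosen interval back to [0, 1): x <- (x - low)/range
        x = (x - CDF[n - 1]) / (CDF[n] - CDF[n - 1])
    return out

print(decode(Fraction(7712, 10000), 4))  # [1, 3, 2, 1]
```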
Outline
Introduction
Basic Encoding and Decoding
Scaling and Incremental Coding
Integer Implementation
Adaptive Arithmetic Coding
Binary Arithmetic Coding
Applications
JBIG, H.264, JPEG 2000
Scaling and Incremental Coding
Problems of the previous examples:
  Need high precision
  No output is generated until the entire sequence is encoded
Key observation:
  As the RANGE reduces, many MSBs of LOW and HIGH become identical:
    Example: binary forms of 0.7712 and 0.773504:
      0.1100010..., 0.1100011...
  We can output the identical MSBs and re-scale the rest:
    Incremental encoding
  This also allows us to achieve infinite precision with finite-precision
  integers.
Three kinds of scaling: E1, E2, E3
E1 and E2 Scaling
E1: [LOW, HIGH) in [0, 0.5)
  LOW:  0.0xxxxxxx (binary)
  HIGH: 0.0xxxxxxx
  Output 0, then shift left by 1 bit
  [0, 0.5) → [0, 1): E1(x) = 2x
E2: [LOW, HIGH) in [0.5, 1)
  LOW:  0.1xxxxxxx
  HIGH: 0.1xxxxxxx
  Output 1, subtract 0.5, then shift left by 1 bit
  [0.5, 1) → [0, 1): E2(x) = 2(x − 0.5)
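The two rescaling maps can be sketched as small helper functions acting on an interval [low, high). The names `e1`/`e2` and the mutable `bits` list are this illustration's own:

```python
def e1(low, high, bits):
    """E1: [0, 0.5) -> [0, 1). Emit 0, then double."""
    bits.append(0)
    return 2 * low, 2 * high

def e2(low, high, bits):
    """E2: [0.5, 1) -> [0, 1). Emit 1, subtract 0.5, then double."""
    bits.append(1)
    return 2 * (low - 0.5), 2 * (high - 0.5)

bits = []
low, high = e2(0.656, 0.8, bits)  # the [0.656, 0.8) interval from the example
print(bits, low, high)            # bits=[1], interval ≈ [0.312, 0.6)
```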
Encoding with E1 and E2
  Symbol   Prob.
    1      0.8
    2      0.02
    3      0.18
Input 1: [0, 0.8)
Input 3: [0.656, 0.8)
  E2 (output 1, 2(x − 0.5)):  [0.312, 0.6)
Input 2: [0.5424, 0.54816)
  E2 (output 1):       [0.0848, 0.09632)
  E1 (output 0, 2x):   [0.1696, 0.19264)
  E1 (output 0):       [0.3392, 0.38528)
  E1 (output 0):       [0.6784, 0.77056)
  E2 (output 1):       [0.3568, 0.54112)
Input 1: [0.3568, 0.504256)
Termination: encode any value in the tag interval, e.g., 0.5 → output 1
All outputs: 1100011
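The steps above can be sketched as a complete incremental encoder. It follows the slide's termination (after all scalings, LOW < 0.5 < HIGH, so emitting a single 1 encodes the value 0.5 inside the final interval); E3 scaling is not handled here, and plain floats stand in for the integer arithmetic introduced later:

```python
# Incremental encoder with E1/E2 rescaling for the example alphabet
# (P(1)=0.8, P(2)=0.02, P(3)=0.18).
CDF = [0.0, 0.8, 0.82, 1.0]

def encode_incremental(symbols):
    low, high, bits = 0.0, 1.0, []
    for n in symbols:
        rng = high - low
        high = low + rng * CDF[n]
        low  = low + rng * CDF[n - 1]
        # Complete all possible scalings before encoding the next symbol
        while high <= 0.5 or low >= 0.5:
            if high <= 0.5:                         # E1: emit 0, double
                bits.append('0'); low, high = 2 * low, 2 * high
            else:                                   # E2: emit 1, shift down, double
                bits.append('1'); low, high = 2 * (low - 0.5), 2 * (high - 0.5)
    # Terminate: low < 0.5 < high now holds, so the value 0.5 (binary 0.1)
    # lies in the final interval; a single 1 encodes it.
    bits.append('1')
    return ''.join(bits)

print(encode_incremental([1, 3, 2, 1]))  # 1100011
```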
To verify
LOW = 0.5424 (0.10001010... in binary),
HIGH = 0.54816 (0.10001100... in binary).
So we can send out 10001 (0.53125)
Equivalent to E2E1E1E1E2
After left shift by 5 bits:
LOW = (0.5424 – 0.53125) x 32 = 0.3568
HIGH = (0.54816 – 0.53125) x 32 = 0.54112
Same as the result on the previous slide.
Note: Complete all possible scalings before encoding the next symbol
  (Symbol probabilities: 1 → 0.8, 2 → 0.02, 3 → 0.18)
Comparison with Huffman:
  Input symbol 1 does not cause any output
  Input symbol 3 generates 1 bit
  Input symbol 2 generates 5 bits
  Symbols with larger probabilities generate fewer bits;
  sometimes no bit is generated at all
    An advantage over Huffman coding
  Large probabilities are desired in arithmetic coding:
    Can use context-adaptive methods to create larger probabilities
    and improve the compression ratio.
Incremental Decoding: Input 1100011
Read 6 bits. Tag: 110001 (0.765625)
  0.765625 ∈ [0, 0.8)          → Decode 1 (need ≥ 5 bits to verify)
  0.765625 ∈ [0.656, 0.8)      → Decode 3, E2 scaling. Tag: 100011 (0.546875)
  0.546875 ∈ [0.5424, 0.54816) → Decode 2, E2 scaling. Tag: 000110 (0.09375)
    E1: Tag: 001100 (0.1875)
    E1: Tag: 011000 (0.375)
    E1: Tag: 110000 (0.75)
    E2: Tag: 100000 (0.5)
  0.5 ∈ [0.3568, 0.504256)     → Decode 1
Summary: Complete all possible scalings before further decoding.
Adjust LOW, HIGH and Tag together.
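The decoding walk-through above can be sketched in Python. The 6-bit tag window mirrors this slide: each E1/E2 scaling drops the tag's MSB and shifts in the next stream bit (0 once the stream is exhausted). Assumptions of this illustration: the symbol count is known to the decoder, and plain floats stand in for LOW/HIGH:

```python
# Incremental decoder with E1/E2 rescaling for the example alphabet.
CDF = [0.0, 0.8, 0.82, 1.0]
W = 6                                     # tag window width in bits

def decode_incremental(bits, num_symbols):
    low, high = 0.0, 1.0
    tag = int(bits[:W].ljust(W, '0'), 2)  # first W bits as an integer
    pos = W
    out = []
    for _ in range(num_symbols):
        t = tag / 2**W                    # tag value in [0, 1)
        rng = high - low
        # Find symbol n with low + rng*CDF[n-1] <= t < low + rng*CDF[n]
        n = next(k for k in range(1, len(CDF)) if t < low + rng * CDF[k])
        out.append(n)
        high = low + rng * CDF[n]
        low  = low + rng * CDF[n - 1]
        # Complete all scalings, adjusting LOW, HIGH and tag together
        while high <= 0.5 or low >= 0.5:
            if high <= 0.5:               # E1: drop MSB (0), shift left
                low, high = 2 * low, 2 * high
                tag = (tag << 1) & (2**W - 1)
            else:                         # E2: drop MSB (1), shift left
                low, high = 2 * (low - 0.5), 2 * (high - 0.5)
                tag = (tag - 2**(W - 1)) << 1
            if pos < len(bits):           # shift in the next stream bit
                tag |= int(bits[pos]); pos += 1
    return out

print(decode_incremental("1100011", 4))  # [1, 3, 2, 1]
```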
Summary
Introduction
Encoding and Decoding
Scaling and Incremental Coding
E1, E2
Next:
Integer Implementation
E3 scaling
Adaptive Arithmetic Coding
Binary Arithmetic Coding
Applications
JBIG, H.264, JPEG 2000