0% found this document useful (0 votes)

11 views21 pages

L1 Part2

Uploaded by

Manoel

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

11 views21 pages

L1 Part2

Uploaded by

Manoel

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 21

Lecture 1

Introduction to lossless compression

EE 274: Data Compression - Lecture 1 1

Plan: Lecture 1-3: theory and concepts from information theory

EE 274: Data Compression - Lecture 1 2

A simple probability distribution
Consider:
Alphabet X = {A, B, C, D}
Uniform probability distribution: P (A) = P (B) = P (C) = P (D) = 14

A text file generating by independently sampling one million symbols from this distribution:
$ cat abcd.txt

ACABDADCBDDC....

What is the size of this file?

EE 274: Data Compression - Lecture 1 3
Bits and bytes
bit: a unit of information expressed as either a 0 or 1 in binary notation.
byte: a group of eight bits operated on as a unit.
1 byte (B) = 8 bits
1 kilobyte (KB) = 1000 bytes = 8000 bits
So on for MB, GB, TB, PB, EB, ...
Note: Sometimes we like to use powers of two, e.g., 1 kilobyte = 1024 bytes.

EE 274: Data Compression - Lecture 1 4

abcd.txt
Size on disk: 1 MB (1 million bytes).
Why 1 byte per letter/character?

EE 274: Data Compression - Lecture 1 5

EE 274: Data Compression - Lecture 1 6
ASCII Table
Symbol ASCII code
A 1000001
B 1000010
C 1000011
D 1000100
8 bits = 1 byte per symbol.
Can we do better?
EE 274: Data Compression - Lecture 1 7
Fixed bitwidth code
Symbol Code
A 00
B 01
C 10
D 11
Bits/symbol?
Decoding?
EE 274: Data Compression - Lecture 1 8
Fixed bitwidth code
k = ∣S∣ different symbols implies at least ⌈log2 k⌉ bits per symbol in a fixed bitwidth
code.

Can we do better? In the uniform distribution example above?

EE 274: Data Compression - Lecture 1 9

Uniform distribution
Symbol Probability
A 0.5
B 0.5
Fixed bitwidth code: 1 bit/symbol

EE 274: Data Compression - Lecture 1 10

Non-uniform distribution
Symbol Probability
A 0.49
B 0.49
C 0.01
D 0.01
Fixed bitwidth code: 2 bits/symbol
Can we do better? Closer to the previous page's 1 bit/base?
EE 274: Data Compression - Lecture 1 11
Non-uniform distribution
Symbol Probability
A 0.49
B 0.49
C 0.01
D 0.01
Solution 1: C and D are low probability, let's just lose them - Lossy Compression (not
commonly used for text/database/log data).
EE 274: Data Compression - Lecture 1 12
Non-uniform distribution
Symbol Probability
A 0.49
B 0.49
C 0.01
D 0.01
Solution 2: Variable length codes: Use fewer bits for more probable symbols.

EE 274: Data Compression - Lecture 1 13

Variable length codes
Use fewer bits for more probable symbols
Symbol Probability Code
A 0.49 0
B 0.49 10
C 0.01 110
D 0.01 111
How to evaluate coding efficiency? Expected code length.
EE 274: Data Compression - Lecture 1 14
Expected code length
"Compressed size/Uncompressed size" - often in units bits/symbol.
Also sometimes called compression rate/compression ratio.
Warning: There's some variability in notation and definitions of these terms so be
careful.
Let l(x) denote the code length for symbol x with probability P (x), where x ∈ X .
Expected code length: E[l(X)] = ∑x∈X P (x)l(x)

EE 274: Data Compression - Lecture 1 15

Expected code length
Symbol Probability Code
A 0.49 0
B 0.49 10
C 0.01 110
D 0.01 111
Expected code length: E[l(X)] = ?

EE 274: Data Compression - Lecture 1 16

Expected code length
Symbol Probability Code l(x)
A 0.49 0 1
B 0.49 10 2
C 0.01 110 3
D 0.01 111 3
E[l(X)] = 0.49 × 1 + 0.49 × 2 + 0.01 × 3 + 0.01 × 3 = 1.53 bits/symbol

EE 274: Data Compression - Lecture 1 17

Thoughts and conclusion
Is the code above lossless? Can you decode it? <- homework for next lecture!

EE 274: Data Compression - Lecture 1 18

EE 274: Data Compression - Lecture 1 19

Thoughts and conclusion
Is the code above lossless? Can you decode it? <- homework for next lecture!
The non-uniform distribution above seems "worse" but "similar" to the uniform
distribution on just A and B.
In the next few lectures, we will learn how to compute the optimal compression rate
and how we can get close to 1.14 bits/symbol for the above distribution (and no
better).

EE 274: Data Compression - Lecture 1 20

Thank you!

EE 274: Data Compression - Lecture 1 21

Ec8093-Digital Image Processing: Dr.K.Kalaivani Associate Professor Dept. of EIE Easwari Engineering College
No ratings yet
Ec8093-Digital Image Processing: Dr.K.Kalaivani Associate Professor Dept. of EIE Easwari Engineering College
37 pages
Digital Coding for ELEC1010 Students
No ratings yet
Digital Coding for ELEC1010 Students
72 pages
Why Needed?: Without Compression, These Applications Would Not Be Feasible
No ratings yet
Why Needed?: Without Compression, These Applications Would Not Be Feasible
11 pages
Information Theory Notes
No ratings yet
Information Theory Notes
4 pages
Chapter 7
No ratings yet
Chapter 7
70 pages
ECEVSP L03 Compression2
No ratings yet
ECEVSP L03 Compression2
40 pages
Chapter 4 - Introduction To Source Coding PDF
No ratings yet
Chapter 4 - Introduction To Source Coding PDF
72 pages
Agenda For The Lecture: C Himanshu Tyagi. Feel Free To Use With Acknowledgement
No ratings yet
Agenda For The Lecture: C Himanshu Tyagi. Feel Free To Use With Acknowledgement
7 pages
CSEP 590 Data Compression: Course Policies Introduction To Data Compression Entropy Variable Length Codes
No ratings yet
CSEP 590 Data Compression: Course Policies Introduction To Data Compression Entropy Variable Length Codes
93 pages
Module IV
No ratings yet
Module IV
37 pages
01 EntropyLosslessCoding PDF
No ratings yet
01 EntropyLosslessCoding PDF
29 pages
Dce Easy Solution
0% (1)
Dce Easy Solution
87 pages
Intro To ICT 11
No ratings yet
Intro To ICT 11
31 pages
Chapter 2-Compression Techniques
No ratings yet
Chapter 2-Compression Techniques
63 pages
Chapter 3 Multimedia Data Compression
100% (2)
Chapter 3 Multimedia Data Compression
23 pages
Tutorial 8
No ratings yet
Tutorial 8
20 pages
Unit 1 Data Compression
No ratings yet
Unit 1 Data Compression
30 pages
L15 Compression
No ratings yet
L15 Compression
63 pages
BTETPE405B Data Compression and Encryption
No ratings yet
BTETPE405B Data Compression and Encryption
3 pages
09 Basic Compression
No ratings yet
09 Basic Compression
81 pages
Session 4
No ratings yet
Session 4
66 pages
Algorithms in The Real World: Data Compression: Lectures 1 and 2
No ratings yet
Algorithms in The Real World: Data Compression: Lectures 1 and 2
55 pages
DC Sessional I
No ratings yet
DC Sessional I
1 page
1-Data Compression-2022
No ratings yet
1-Data Compression-2022
24 pages
TEOI InformationOfDataSources
No ratings yet
TEOI InformationOfDataSources
55 pages
Image and Video Compression: Lecture 12, April 27, 2009 Lexing Xie
No ratings yet
Image and Video Compression: Lecture 12, April 27, 2009 Lexing Xie
77 pages
Mpeg Coding Principles
No ratings yet
Mpeg Coding Principles
23 pages
Data Compression
No ratings yet
Data Compression
22 pages
Advanced Multimedia Compression
No ratings yet
Advanced Multimedia Compression
32 pages
Data Compression Explained
No ratings yet
Data Compression Explained
110 pages
Lecture 1
No ratings yet
Lecture 1
35 pages
cp467 12 Lecture14 Compression1
No ratings yet
cp467 12 Lecture14 Compression1
146 pages
Data Compression Unit-1 - 1
No ratings yet
Data Compression Unit-1 - 1
21 pages
ECE359 - Image Compression
No ratings yet
ECE359 - Image Compression
42 pages
Communication Systems Engineering
No ratings yet
Communication Systems Engineering
25 pages
Quantization and Compression PDF
No ratings yet
Quantization and Compression PDF
220 pages
Digital Communication System
No ratings yet
Digital Communication System
10 pages
Chapter 2
No ratings yet
Chapter 2
13 pages
Data Compression Techniques Guide
No ratings yet
Data Compression Techniques Guide
87 pages
Chapter10 Part1 Huffman
No ratings yet
Chapter10 Part1 Huffman
17 pages
Data Compression
No ratings yet
Data Compression
113 pages
Chapter Five Lossless Compression
No ratings yet
Chapter Five Lossless Compression
49 pages
Answers
No ratings yet
Answers
2 pages
UNIT-5 Entropy Encoding
No ratings yet
UNIT-5 Entropy Encoding
8 pages
Data Compression Unit-5
No ratings yet
Data Compression Unit-5
17 pages
Multimedia Communication - ECE - VTU - 8th Sem - Unit 3 - Text and Image Compression, Ramisuniverse
83% (6)
Multimedia Communication - ECE - VTU - 8th Sem - Unit 3 - Text and Image Compression, Ramisuniverse
30 pages
Compression For Sending and Storing Information: Text, Audio, Images, Videos
No ratings yet
Compression For Sending and Storing Information: Text, Audio, Images, Videos
28 pages
Chapter 1: Lossless Data Compression
No ratings yet
Chapter 1: Lossless Data Compression
4 pages
HTCS501 Unit 4
No ratings yet
HTCS501 Unit 4
17 pages
Source Coding
No ratings yet
Source Coding
29 pages
3 Chapter Text and Image Compression
No ratings yet
3 Chapter Text and Image Compression
132 pages
Entropy Coding - Wikipedia
No ratings yet
Entropy Coding - Wikipedia
2 pages
Data Compression Chapter 7
No ratings yet
Data Compression Chapter 7
40 pages
Lecture I: Data Compression Data Encoding: Efficient Information Encoding To
No ratings yet
Lecture I: Data Compression Data Encoding: Efficient Information Encoding To
48 pages
Sayood DataCompression
No ratings yet
Sayood DataCompression
22 pages
Lec-2 Source Coding v3.0
No ratings yet
Lec-2 Source Coding v3.0
10 pages
Data Compression 1
No ratings yet
Data Compression 1
25 pages
Microprocessor Systems Overview
No ratings yet
Microprocessor Systems Overview
56 pages
SGK Complete Ielts 5.5 - 6.5-Pages
No ratings yet
SGK Complete Ielts 5.5 - 6.5-Pages
4 pages
Xilsem
No ratings yet
Xilsem
42 pages
Discuss The Components and Characteristics of Maximization and Minimization Model
No ratings yet
Discuss The Components and Characteristics of Maximization and Minimization Model
5 pages
What A Beautiful Name-chords-D
No ratings yet
What A Beautiful Name-chords-D
2 pages
Definite Integration - JEE Main 2023 April Chapterwise PYQ - MathonGo
No ratings yet
Definite Integration - JEE Main 2023 April Chapterwise PYQ - MathonGo
8 pages
Module 2 App Testing Tools
No ratings yet
Module 2 App Testing Tools
21 pages
Literacy Rate Analysis Project File
50% (2)
Literacy Rate Analysis Project File
41 pages
Premium Upgrade Procedure PDF
No ratings yet
Premium Upgrade Procedure PDF
70 pages
Third Conditional
No ratings yet
Third Conditional
2 pages
Religious Education Guide: Key Teachings
No ratings yet
Religious Education Guide: Key Teachings
3 pages
Sources of Citation Juben Odal
No ratings yet
Sources of Citation Juben Odal
3 pages
Deity Worship & Bhakti Yoga Exam
No ratings yet
Deity Worship & Bhakti Yoga Exam
6 pages
Free Will
No ratings yet
Free Will
2 pages
Preview-9781453900024 A34467571
No ratings yet
Preview-9781453900024 A34467571
40 pages
Calculus Lecture Series
No ratings yet
Calculus Lecture Series
134 pages
4.00 Impartation Anointing Mantles-Handout
No ratings yet
4.00 Impartation Anointing Mantles-Handout
6 pages
Polar Coordinates for Math Students
No ratings yet
Polar Coordinates for Math Students
3 pages
Trabajo Final Ingles II Final
No ratings yet
Trabajo Final Ingles II Final
6 pages
Gentoo Linux AMD64 Handbook
No ratings yet
Gentoo Linux AMD64 Handbook
95 pages
Weinzierl KNX Over IP en
No ratings yet
Weinzierl KNX Over IP en
12 pages
Lesson 3 Collocations
No ratings yet
Lesson 3 Collocations
3 pages
El Kah-Anoual-Publications-17-08-2022-11-08-19-34
No ratings yet
El Kah-Anoual-Publications-17-08-2022-11-08-19-34
10 pages
Case Ih Tractor Precision Air 2230 2280 2330 3380 3430 Air Cart Complete Service Manual 84329233
100% (4)
Case Ih Tractor Precision Air 2230 2280 2330 3380 3430 Air Cart Complete Service Manual 84329233
22 pages
Contiki NG Cheat Sheet
No ratings yet
Contiki NG Cheat Sheet
1 page
PISO Verilog PDF
No ratings yet
PISO Verilog PDF
5 pages
Topic 1-3 Formative Test Year 4 Get Smart Plus
No ratings yet
Topic 1-3 Formative Test Year 4 Get Smart Plus
4 pages
CS6660
No ratings yet
CS6660
2 pages
Configdb 1
No ratings yet
Configdb 1
2 pages
Sikhs in The Eighteenth Century-1
No ratings yet
Sikhs in The Eighteenth Century-1
427 pages

L1 Part2

Uploaded by

L1 Part2

Uploaded by

Lecture 1

Introduction to lossless compression

EE 274: Data Compression - Lecture 1 1

EE 274: Data Compression - Lecture 1 2

What is the size of this file?

EE 274: Data Compression - Lecture 1 4

EE 274: Data Compression - Lecture 1 5

Can we do better? In the uniform distribution example above?

EE 274: Data Compression - Lecture 1 9

EE 274: Data Compression - Lecture 1 10

EE 274: Data Compression - Lecture 1 13

EE 274: Data Compression - Lecture 1 15

EE 274: Data Compression - Lecture 1 16

EE 274: Data Compression - Lecture 1 17

EE 274: Data Compression - Lecture 1 18

EE 274: Data Compression - Lecture 1 19

EE 274: Data Compression - Lecture 1 20

EE 274: Data Compression - Lecture 1 21

You might also like