Shannon-Fano coding
This online calculator produces Shannon–Fano coding for a set of symbols given their probabilities. A bit of theory can be found below the calculator.
Shannon–Fano coding
Symbol probability table

Symbol   Probability, %
A        25
B        25
C        20
D        15
E        10
F        5
Calculation precision
Digits after the decimal point: 2
Weighted path length: 2.45
Shannon entropy: 2.42
Symbol Encoding
B 01
A 00
C 10
D 110
E 1110
F 1111
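The two results above follow directly from the tables: the weighted path length is the probability-weighted average code length, and the Shannon entropy is the theoretical lower bound on it. A minimal Python sketch (not the calculator's source code) that reproduces both numbers, assuming the table values are percentages:

```python
from math import log2

# symbol: (probability, code) -- taken from the tables above, with percentages divided by 100
table = {
    "A": (0.25, "00"),
    "B": (0.25, "01"),
    "C": (0.20, "10"),
    "D": (0.15, "110"),
    "E": (0.10, "1110"),
    "F": (0.05, "1111"),
}

# Weighted path length: sum of probability * code length
weighted_path_length = sum(p * len(code) for p, code in table.values())

# Shannon entropy: -sum of p * log2(p)
entropy = -sum(p * log2(p) for p, _ in table.values())

print(round(weighted_path_length, 2))  # 2.45
print(round(entropy, 2))               # 2.42
```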
Shannon-Fano coding
In the field of data compression, Shannon–Fano coding, named after Claude Shannon and Robert
Fano, is a technique for constructing a prefix code based on a set of symbols and their probabilities
(estimated or measured). It is suboptimal in the sense that, unlike Huffman coding, it does not always achieve the lowest possible expected code word length.
In Shannon–Fano coding, the symbols are arranged in order from most probable to least probable,
and then divided into two sets whose total probabilities are as close as possible to being equal. All
symbols then have the first digits of their codes assigned; symbols in the first set receive "0" and
symbols in the second set receive "1". As long as any sets with more than one member remain, the
same process is repeated on those sets to determine successive digits of their codes. When a set has been reduced to one symbol, that symbol's code is complete and will not form the prefix of any other symbol's code.
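A minimal Python sketch of this recursive partitioning (an illustrative implementation, not the calculator's source) reproduces the code table above:

```python
def shannon_fano(symbols):
    """symbols: list of (symbol, probability) pairs; returns {symbol: code string}."""
    # Arrange the symbols from most probable to least probable
    symbols = sorted(symbols, key=lambda sp: sp[1], reverse=True)
    codes = {s: "" for s, _ in symbols}

    def split(group):
        if len(group) < 2:
            return  # a single symbol: its code is complete
        total = sum(p for _, p in group)
        # Choose the split point that makes the two subtotals as close to equal as possible
        running, best_i, best_diff = 0.0, 1, float("inf")
        for i in range(1, len(group)):
            running += group[i - 1][1]
            diff = abs(total - 2 * running)  # |left total - right total|
            if diff < best_diff:
                best_i, best_diff = i, diff
        left, right = group[:best_i], group[best_i:]
        for s, _ in left:
            codes[s] += "0"   # symbols in the first set receive "0"
        for s, _ in right:
            codes[s] += "1"   # symbols in the second set receive "1"
        split(left)
        split(right)

    split(symbols)
    return codes


print(shannon_fano([("A", 25), ("B", 25), ("C", 20), ("D", 15), ("E", 10), ("F", 5)]))
# {'A': '00', 'B': '01', 'C': '10', 'D': '110', 'E': '1110', 'F': '1111'}
```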
The algorithm produces fairly efficient variable-length encodings; when the two smaller sets
produced by a partitioning are in fact of equal probability, the one bit of information used to
distinguish them is used most efficiently. Unfortunately, Shannon–Fano does not always produce
optimal prefix codes; the set of probabilities {0.35, 0.17, 0.17, 0.16, 0.15} is an example of one that
will be assigned non-optimal codes by Shannon–Fano coding.
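Running the sketch above on that set gives code lengths 2, 2, 2, 3, 3, i.e. an expected length of 2.31 bits per symbol, while a Huffman code assigns lengths 1, 3, 3, 3, 3 for 2.30 bits. A small self-contained check, with a minimal heap-based Huffman length computation added only for comparison (again a sketch, not a reference implementation):

```python
import heapq

def huffman_lengths(probs):
    """Code lengths of an optimal (Huffman) prefix code for the given probabilities."""
    # Heap entries: (subtree probability, tie-breaker, indices of the leaves in that subtree)
    heap = [(p, i, [i]) for i, p in enumerate(probs)]
    heapq.heapify(heap)
    lengths = [0] * len(probs)
    while len(heap) > 1:
        p1, _, leaves1 = heapq.heappop(heap)
        p2, tie, leaves2 = heapq.heappop(heap)
        for leaf in leaves1 + leaves2:
            lengths[leaf] += 1            # each merge adds one bit above these leaves
        heapq.heappush(heap, (p1 + p2, tie, leaves1 + leaves2))
    return lengths

probs = [0.35, 0.17, 0.17, 0.16, 0.15]
shannon_fano_lengths = [2, 2, 2, 3, 3]   # produced by the Shannon-Fano sketch above for this set
print(round(sum(p * l for p, l in zip(probs, shannon_fano_lengths)), 2))   # 2.31
print(round(sum(p * l for p, l in zip(probs, huffman_lengths(probs))), 2)) # 2.3
```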
For this reason, Shannon–Fano coding is almost never used; Huffman coding is almost as computationally simple and produces prefix codes that always achieve the lowest expected code word length, under the constraint that each symbol is represented by a code formed of an integral number of bits.