Advertisement

Huffman Codes

  • Henk C. A. van Tilborg
Part of the The Kluwer International Series in Engineering and Computer Science book series (SECS, volume 52)

Abstract

It is clear from the previous chapter (in particular from (4.5) and (4.6)) that the security of a cryptosystem can be increased by reducing the redundancy in the plaintext. In Example 4.1 such a reduction has been demonstrated. In this chapter we shall describe a general method for what is called data compression or source coding. Let a (plaintext) source S output independently chosen symbols from the set {m1,m2, …, mn}, with respective probabilities p1, p2, …, pn. A symbol mi, 1 ≤ in, will be encoded into a binary string ci of length li.The set {c1,c2, …, cn} is called a code C for the source S. The idea of data compression is to use such a code that the expected value of the length of the encoded plaintext is minimal. Since the output symbols are independent we have to minimize the expected length per symbol
$$ L = \sum\limits_i -1^n p_il_i$$
(5.1)
over all possible codes C for the source S. There is however an additional constraint. One has to be able to retrieve the individual messages from the concatenation of the succesive codewords. Not every code has this property. Indeed let C = {0,01, 10}. The sequence 010 can be made in two ways: 0 followed by 10 and 01 followed by 0. This ambiguity has to be avoided.

Keywords

Binary String Huffman Code Splitting Process Output Symbol Source Symbol 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Copyright information

© Kluwer Academic Publishers 1988

Authors and Affiliations

  • Henk C. A. van Tilborg
    • 1
  1. 1.Eindhoven University of TechnologyThe Netherlands

Personalised recommendations