W.B.C.S. Examination Notes On - Source Coding Theorem

Posted on July 24th, 2020 in Exam Details (QP Included)>Main Exam>Optional Subject-Medical Group>Computer Science
Tags: Computer Science, Notes, WBCS

W.B.C.S. Examination Notes On – Source Coding Theorem – Computer Science Notes.

The Code produced by a discrete memoryless source, has to be efficiently represented, which is an important problem in communications. For this to happen, there are code words, which represent these source codes.Continue Reading W.B.C.S. Examination Notes On – Source Coding Theorem – Computer Science Notes.

For example, in telegraphy, we use Morse code, in which the alphabets are denoted by Marks and Spaces. If the letter E is considered, which is mostly used, it is denoted by “.” Whereas the letter Q which is rarely used, is denoted by “–.-”

Where S_k is the output of the discrete memoryless source and b_k is the output of the source encoder which is represented by 0s and 1s.

The encoded sequence is such that it is conveniently decoded at the receiver.

Let us assume that the source has an alphabet with k different symbols and that the k^th symbol S_k occurs with the probability P_k, where k = 0, 1…k-1.

Let the binary code word assigned to symbol S_k, by the encoder having length l_k, measured in bits.

Hence, we define the average code word length L of the source encoder as

L ¯¯¯¯ = \sum k = 0 k - 1 p k l k

L represents the average number of bits per source symbol

If $L_{m i n} = m i n i m u m p o s s i b l e v a l u e o f \bar{L}$

Then coding efficiency can be defined as

η = L m i n L ¯¯¯¯

With $\bar{L} \geq L_{m i n}$ we will have $η \leq 1$

However, the source encoder is considered efficient when $η = 1$

For this, the value $L_{m i n}$ has to be determined.

Let us refer to the definition, “Given a discrete memoryless source of entropy $H (δ)$ , the average code-word length L for any source encoding is bounded as $\bar{L} \geq H (δ)$ .”

code $Q U E U E i n e x a m p l e$ . Which means, the symbols in the code word are greater than or equal to the alphabets in the source code.

Hence with $L_{m i n} = H (δ)$ , the efficiency of the source encoder in terms of Entropy $H (δ)$ may be written as

η = H ( δ ) L ¯¯¯¯

This source coding theorem is called as noiseless coding theorem as it establishes an error-free encoding. It is also called as Shannon’s first theorem.

source coding theorem In communication theory, the statement that the output of any information source having entropy H units per symbol can be encoded into an alphabet having N symbols in such a way that the source symbols are represented by codewords having a weighted average length not less than H/logN

(where the base of the logarithm is consistent with the entropy units). Also, that this limit can be approached arbitrarily closely, for any source, by suitable choice of a variable-length code and the use of a sufficiently long extension of the source (see source coding).

The theorem was first expounded and proved by Claude Elwood Shannon in 1948.

Please subscribe here to get all future updates on this post/page/category/website

Recent updates

Tags

W.B.C.S. Examination Notes On – Source Coding Theorem – Computer Science Notes.

Leave a Reply Cancel reply

Recent updates

Tags

W.B.C.S. Examination Notes On – Source Coding Theorem – Computer Science Notes.

If you want to be in contact with us and get future posts and news, subscribe here...

Leave a Reply Cancel reply