EECS 229A Spring 2007 * * (a) By stationarity and the chain rule for entropy, we have

Size: px
Start display at page:

Download "EECS 229A Spring 2007 * * (a) By stationarity and the chain rule for entropy, we have"

Transcription

1 EECS 229A Spring 2007 * * Solutions to Homework 3 1. Problem 4.11 on pg. 93 of the text. Stationary processes (a) By stationarity and the chain rule for entropy, we have H(X 0 ) + H(X n X 0 ) = H(X 0, X n ) = H(X n, X 0 ) = H(X 0 ) + H(X n X 0 ) so that H(X n X 0 ) = H(X n X 0 ) is true for any stationary stochastic process. (b) This is false in general. Consider, for instance, the {0, 1}-valued stationary stochastic process defined by P ((X 0, X 1, X 2 ) = (0, 0, 1)) = P ((X 0, X 1, X 2 ) = (0, 1, 0)) = P ((X 0, X 1, X 2 ) = (1, 0, 0)) = 1/3. with X n = X n+3k for all n Z and all k Z ((X n ) can be thought of as the stationary discrete time renewal process corresponding to a renewal time of 3.) Then H(X 3 X 0 ) = 0, while H(X 2 X 0 ) = 1 3 H(X 2 X 0 = 1) H(X 2 X 0 = 0) = = 2 3 > 0. Here we have observed that conditional on X 0 = 1, we have X 2 = 0, while conditional on X 0 = 0, X 2 is either 0 or 1, equiprobably. Note however that if (X n ) is a Markov process, the claim is true, because H(X n X 0 ) H(X n X 0, X 1 ) = H(X n X 1 ) = H(X n 1 X 0 ) where the Markov property was used in the second step. 1

2 (c) This statement is true for any stationary stochastic process. We have H(X n+1 X1 n, X n+2 ) H(X n+1 X2 n, X n+2 ) = H(X n X1 n 1, X n+1 ) where the first step is because conditioning does not increase entropy, and the second step uses the stationarity of the stochastic process. (d) This statement is true for any stationary stochastic process, because H(X n+1 X n 0, X 2(n+1) n+2 ) (a) = H(X 0 X 1 (n+1), Xn+1 1 ) (b) H(X 0 X 1 n, X n 1 ) (c) = H(X n X n 1 0, X 2n n+1) where in steps (a) and (c) we have used the stationarity of the process and in step (b) we observe that conditioning reduces entropy. 2. Problem 5.27 on pp of the text. Sardinas-Patterson test for unique decodability (a) For a pair of nonempty subsets of strings over the alphabet, A and B, let D AB = {d : ad = b for some a A and some b B}. Here ad denotes the concatenation of a and d. Thus D AB is the set of dangling suffixes generated from strings in B by erasing prefixes in A. The set S can be generated from the code C by the following algorithm. To start with, let S 1 = DCC. S 1 = iff C is a prefix code, in which case, set S =. If S 1, define sequentially, starting with k = 1 : S k+1 = S k D Sk C D CSk, stopping at the first k for which S k+1 = S k. This algorithm must terminate in finitely many steps. Set S = S k for the k at which the algorithm terminates. (b) Let C be comprised of M codewords of lengths l 1,..., l M, as indicated. A simple bound on the size of S may be arrived at by reasoning as follows : each string in S is a suffix of one of the codewords. The total number of distinct suffixes of a string of length l is l. Hence M S l m. Better bounds are also welcome. 2 m=1

3 (c) i. This is a prefix code, hence S = and the code is uniquely decodable. ii. Here S = {1}, which is disjoint from C, so this code is uniquely decodable. iii. Here S = {0, 1} and 0 C so this code is not uniquely decodable. An example of a string that is not uniquely decodable is 010 which can be parsed either as 0, 10 or as 01, 0. iv. Here S = {1} which is disjoint from C, so the code is uniquely decodable. v. This is a prefix code, hence S = and the code is uniquely decodable. vi. Here S = {0}, which is disjoint from C, so the code is uniquely decodable. vii. Here S = {0}, which is disjoint from C, so the code is uniquely decodable. In each of the uniquely decodable cases above, it is instructive to think through how one would uniquely parse a given finite string of bits that admits a parsing in terms of the codewords of the code. (d) Any infinite string that has a parsing into the codewords of a prefix code has only one such parsing. If it had two such parsings, there would be a phrase in one of the parsings which has a phrase in the other parsing as a prefix, contradicting the prefix nature of the code. Thus, to solve this problem we need only consider cases (ii), (iv), (vi), and (vii) from the cases above. In case (ii) consider the string i.e., 0 followed by an infinite string of 1 s. This has two distinct parsings : 0, 11, 11, 11,... or 01, 11, 11,.... In case (iv) any infinite string that admits a parsing into phrases from the code must start with 0. If it is an infinite string of 0 s it has a unique parsing. If not, there is a first place where it is 1, and this is followed by a 0 and the portion of the string up to and including the first place where it is 1 has a unique parsing, and so one is left with parsing the rest of the string which starts with the 0 following the first 1. From this reasoning we conclude that if an infinite string has a parsing in terms of the codewords of this code, this parsing is unique. In case (vi) any infinite string that admits a parsing into phrases from the code must start with 1. If it is an infinite string of 1 s it has a unique parsing. If not, there is a first place where it is 0, and this is followed by 1, and the portion of the string up to and including the first 0 has a unique parsing and one is then left with the problem of parsing the rest of the string which starts with the 1 following the first 0. From this reasoning we conclude that if an infinite string has a parsing in terms of the codewords of this code, this parsing is unique. In case (vii) consider the infinite string i.e. 1 followed by an infinite sequence of 0 s. This has two distinct parsings : 10, 00, and 100, 00,

4 3. Problem 11.5 on pg. 401 of the text. Counting Let X = {1, 2,..., m} as in the text. If α > max x X g(x) there are no sequences x n X n satisfying 1 n ni=1 g(x i ) α. Also H =, since as defined in the text H = P : sup H(P ), x X P (x)g(x) α and the supremum over an empty set is. If α = max x X g(x), let d = {x X : g(x) = α}. The number of sequences x n X n satisfying 1 ni=1 n g(x i ) α is then d n. Also, a probability distribution P on X satisfies x X P (x)g(x) α iff P (x) = 0 for all x such that g(x) < α. Thus H = log d. In both these cases, the claim is verified, so we may now assume that α < max x X g(x). H(P ) is a continuous strictly concave function of P ranging over probability distributions on X. Thus there is a unique probability distribution P on X such that H = H(P ). By the continuity of H(P ) and the fact that the set sup P : x X P (x)g(x) α has nonempty interior, for any ɛ > 0 for all sufficiently large n there is some P n P n with x X P n(x)g(x) α and H(P n ) H ɛ. Thus, for all n sufficiently large the total number of sequences x n X n satisfying 1 n ni=1 g(x i ) α is at least as large as the cardinality of the type class of P n, i.e. at least as large as 1 (n+1) m 2 n(h ɛ). Since the sequences x n X n satisfying 1 n ni=1 g(x i ) α are precisely those whose type lies in the set sup P : x X P (x)g(x) α, the total number of such sequences is at most (n + 1)m 2 nh. The claim is now proved, by first taking the limit as n and then taking the limit as ɛ Problem on pp of the text. Sanov s theorem: There are four bullets in the suggested scheme of proof. The problem is not strictly correct as stated. The summation on the right hand side of equation (11.326) should start at np for the problem statement to be correct. For the first bullet, observe that (x 1,..., x n ) satisfies x n p (where x n is defined as 1 n ni=1 x i ) iff the total number of 1 s in (x 1,..., x n ) is np or bigger. This gives (the corrected version of) equation (11.326). For the second bullet, consider the ratio of the i + 1-st to the i-th term in the summation in equation (11.326). It is n i q n i i+1 1 q. Since i+1 1 p p when i np, this ratio is at most 1, and so the largest of the terms in this summation is the one corresponding to i = np. For the third bullet, the proof comes as a special case of the general estimate in equation (11.54) on page 354 of the text. Take Q to be the Binomial (q) distribution and P to be the Binomial ( np n ) distribution and apply this estimate. We get, for all n, ( ) 1 n (n + 1) 2 2 nd q i (1 q) n i 2 nd. np 4

5 You can get a sharper estimate, if you like, by instead applying equation (11.40) on pg. 353 of the text. For the last bullet, we get the upper bound by noting that the total number of terms in the sum in equation (11.326) is no more than n. We get the lower bound by noting that the left hand side of equation (11.326) can be lower bounded by term ( n ) np q i (1 q) n i and using the same kind of estimate as in the third bullet above. 5. Problem 13.4 on pg. 458 of the text. Arithmetic Coding (a) Under the stationary distribution the states 0 and 1 are equiprobable. We can think of 0 as owning the interval [0, 1 2 ) and 1 as owning the interval [ 1 2, 1). Then 00 owns the interval [0, 1 6 ), 01 owns the interval [ 1 6, 1 2 ), 10 owns the interval [ ), and 11 owns the interval [ 5 6, 1). The general pattern is that any interval owned by a string whose last bit is 0 is split into two intervals, the left one being 1 3-rd of its total length and the right one being 2 3-rd of its total length, and these now belong to the extensions of the parent string by 0 and 1 respectively. Likewise, any interval owned by a string whose last bit is 1 is split into two intervals, the left one being 2 3 -rd of its total length and the right one being 1 3-rd of its total length, and these now belong to the extensions of the parent string by 0 and 1 respectively. To determine the interval owned by (01110) we may proceed step by step through the intervals [0, 1 2 ), [ 1 6, 3 6 ), [ 7 18, ), [ 54, ), to [ 162, ), where we have renormalized the denominators on the fractions to make the steps transparent. In the notation of the textbook, we get F (.01110) = (b) The question should be interpreted from the point of view of the receiver. In arithmetic coding, the transmitter will transmit the bit string corresponding to smallest basic dyadic interval that contains the interval owned by the target string k (recall that a basic dyadic interval is any interval of the form [ 2, k+1 m 2 ), where m 0 k 2 m 1 and m 1; it can also be thought of as any interval owned by a binary string under the model that the individual bits are i.i.d. and equiprobable). Some calculation shows that the smallest basic dyadic interval that contains the interval [ , ) is [ 7 16, 8 16 ). Hence, having seen (01110) the transmitter would have output (0111). The receiver, on receiving a binary string, thinks of it as defining a basic dyadic interval and then looks for the smallest interval owned by one of the strings of the original source that contains this dyadic interval. The receiver can be sure that the string that owns this (containing) interval in the original source is a prefix of the original source sequence. In this example, that interval is seen to be [ 7 18, 9 18 ), which is owned by the string (011) in the original source. Thus the receiver can be sure of the first three bits of the binary fractional representation of the overall source string (01110X 6 X 7...). 5

6 6. Run length coding versus Huffman coding This is problem 3.17 of the book of Gallager. A source produces a sequence of independent bits with p(0) = 0.9 and p(1) = 0.1. We will encode the sequence in two stages, first counting the number of zeros between successive ones in the source output and then encoding their run lengths into binary code words. The first stage of encoding maps source sequences into intermediate digits by the following rule : Source sequence Intermediate digit As an example to clarify the description consider the following sequence and make sure you understand how the intermediate encoding is created : , 2, 8, 2, 0, 4 (Imagine that every finite source sequence is terminated with a 1 before the first stage so that there is no problem at the end of the sequence; after decoding this 1 can be discarded.) The final stage of the encoding assigns the bit string 1 to the intermediate digit 8 and the eight bit strings of length 4 that start with the bit 0 in some one to one way to the other intermediate digits 0,..., 7. (a) Argue that the overall code is uniquely decodable. (b) Find the average number of source digits per intermediate digit. (c) Find the average number of encoded bits per intermediate digit. (d) Based on the preceding two parts and an appeal to the law of large numbers, determine the average number of bits per source symbol used by the overall code. (e) Find a Huffman code for the source based on encoding four source digits at a time and find the average number of bits per symbol used by this Huffman code. (f) Compare your answers in the preceding two parts. Did you find this surprising? (a) If the first received bit is 1, we know that the first intermediate digit is 8 and so the source sequence starts with a string of 8 zeros. 6

7 If the first received bit is 0, we look at the first four received bits and thereby determine the first intermediate digit, which must be one of the digits from 0 through 7. From this we can determine the prefix of the source sequence. This decoding procedure can now be repeated to eventually reconstruct the entire source sequence. (b) For each intermediate digit, the length and the probability of the source string that it codes for are listed below : From this, the mean length of the source string coded for by an intermediate digit is seen to be 10(1 (0.9) 8 ) = (c) For each intermediate digit, its probability and the number of bits needed to encode it are listed below : From this, the mean number of bits needed to encode an intermediate digit is seen to be 4 3(0.9) 8 = (d) Consider a infinitely long sequence from the source. The sequence of intermediate digits that results from the first stage of coding is an i.i.d. sequence from {0, 1,..., 8} with probability distribution given as in either of the tables above. Let L 1, L 2,... be the lengths of the source strings coded for by these intermediate digits and B 1, B 2,... be the lengths of the bit strings used to encode these intermediate digits. Each of these respectively is an i.i.d sequence of random variables, with respective means and The number of bits per source bit need to encoded the portion of the source sequence that requires k intermediate digits to code is B B k L L k = B B k k L L k k. Letting k and appealing to the law of large numbers, this converges to = code bits per source bit. (e) There are 16 source strings of length 4, one with probability , four with probability , six with probability 0.081, four with probability 0.009, and one with probability Listing these in lexicographic order and constructing a Huffman code using the algorithm as described in class, with the lower branch of each binary fork getting bit 0 and the upper branch getting bit 1 results in the code : 7

8 The mean length of this Huffman code is bits, i.e code bits per source bit. (f) The Huffman code of the preceding part is not as good as the run length code described earlier. Whether this is surprising or not depends on you. The entropy of the source is Since the source is i.i.d., and the Huffman code is based on taking blocks of four source symbols, it is guaranteed to give a compression rate of between and = code bits per source bit, so it is actually doing pretty well relative to these bounds. 7. Problem on pp of the text. Tunstall coding The original source has alphabet X = {0, 1,..., m 1}. Since the strings defining any variable to fixed length code must close out all ways of escaping from the root in the m-ary tree, the size D of the alphabet of the code must satisfy D = 1 + k(m 1) for some k 1. (a) Given a variable to fixed length code, let (Y 1, Y 2,...) denote the sequence of symbols from the D-ary alphabet that results from the source sequence (X 1, X 2,...). Note that Y 1, Y 2,... are independent and identically distributed. (To verify this formally actually requires some care. You need to argue that if (Z 1, Z 2,...) denote the successive source symbols after the first phrase in the source, then (Z 1, Z 2,...) has the same distribution as (X 1, X 2,...) and is independent of the first phrase of the source. You can do this by conditioning on all possible realizations of the first phrase and noticing that the conditioning does not matter.) Since Y 1 takes values in an alphabet of size D, we have H(Y 1 ) log D. We will argue that every variable to fixed length code has the property H(Y 1 ) = H(X)E[L(A D )], completing the proof. We argue iteratively, following the construction of any variable 8

9 to fixed length code. The basic variable to fixed length code has D = m, mean length 1 and H(Y 1 ), being the same as the entropy of the distribution on the phrases, equals H(X), because each phrase in the basic variable to fixed length code is just a symbol from the source. Suppose now that we have shown the claim for some variable to fixed length code and we grow one of its leaves, which has probability say p. The mean length of the code then increases by p. Also, the entropy of the phrase distribution increases by ph(x). Thus the property holds for the grown variable to fixed length code. This verifies that the property holds for all variable to fixed length codes, as desired. (b) Let D = 1 + k(m 1) be fixed. Consider an optimal variable to fixed length code for a source with alphabet of size m, having D leaves. It suffices to show that in the m-ary tree, at every depth l 1, any node that is split has higher probability than any node at that depth that is not split. Suppose this is not the case. Then there is a node at depth l with probability p that is not split and another node at depth l with probability q < p that is split. We may move the entire subtree that has the latter node as its root to root it at the former node, getting another variable to fixed length code that has strictly larger mean length than the one we started with (in the new code the former node will be split and the latter node will not be split). This contradicts the optimality of the given code, completing the proof. (c) We can in fact show that for every ɛ > 0 there exists D with R(A D ) < H(X) + 2ɛ. Given δ > 0, Let n be large enough that the set of ɛ-weakly typical strings of length n from the source has probability at least 1 δ. In the m-ary tree, start out by marking with the color red each node at depth n that corresponds to an ɛ-weakly typical sequence. There is a total of at most 2 n(h(x)+ɛ) such nodes. Next, for each path from the root in the m-ary tree to any node at depth n that is not already marked red, mark as blue the node that is the first node encountered on path that does not lie on a path to a node marked red. The blue and the red nodes together determine the phrases of a variable to fixed length code, because they define a prefix free set of strings that closes out all paths from the root. Further, the total number of blue nodes is at most (m 1)n times the total number of red nodes; this can be seen by noting that for every blue node there is some red node such that on the path from the root to that red node there is a branch of a single step into that blue node. This gives the upper bound D ((m 1)n + 1)2 n(h(x)+ɛ) mn2 n(h(x)+ɛ), so we have log D log m + log n + n(h(x) + ɛ). The mean length of this variable to fixed length code is at least n(1 δ). For this D (whatever it might be), we therefore get R(A log m + log n + n(h(x) + ɛ) D). n(1 δ) If δ > 0 is chosen small enough and n is sufficiently large, the right hand side is bounded above by H(X) + 2ɛ, completing the proof. 9

SIGNAL COMPRESSION Lecture 7. Variable to Fix Encoding

SIGNAL COMPRESSION Lecture 7. Variable to Fix Encoding SIGNAL COMPRESSION Lecture 7 Variable to Fix Encoding 1. Tunstall codes 2. Petry codes 3. Generalized Tunstall codes for Markov sources (a presentation of the paper by I. Tabus, G. Korodi, J. Rissanen.

More information

EE376A: Homework #3 Due by 11:59pm Saturday, February 10th, 2018

EE376A: Homework #3 Due by 11:59pm Saturday, February 10th, 2018 Please submit the solutions on Gradescope. EE376A: Homework #3 Due by 11:59pm Saturday, February 10th, 2018 1. Optimal codeword lengths. Although the codeword lengths of an optimal variable length code

More information

Lecture 1 : Data Compression and Entropy

Lecture 1 : Data Compression and Entropy CPS290: Algorithmic Foundations of Data Science January 8, 207 Lecture : Data Compression and Entropy Lecturer: Kamesh Munagala Scribe: Kamesh Munagala In this lecture, we will study a simple model for

More information

Chapter 5: Data Compression

Chapter 5: Data Compression Chapter 5: Data Compression Definition. A source code C for a random variable X is a mapping from the range of X to the set of finite length strings of symbols from a D-ary alphabet. ˆX: source alphabet,

More information

10-704: Information Processing and Learning Fall Lecture 10: Oct 3

10-704: Information Processing and Learning Fall Lecture 10: Oct 3 0-704: Information Processing and Learning Fall 206 Lecturer: Aarti Singh Lecture 0: Oct 3 Note: These notes are based on scribed notes from Spring5 offering of this course. LaTeX template courtesy of

More information

EE376A: Homework #2 Solutions Due by 11:59pm Thursday, February 1st, 2018

EE376A: Homework #2 Solutions Due by 11:59pm Thursday, February 1st, 2018 Please submit the solutions on Gradescope. Some definitions that may be useful: EE376A: Homework #2 Solutions Due by 11:59pm Thursday, February 1st, 2018 Definition 1: A sequence of random variables X

More information

Entropy as a measure of surprise

Entropy as a measure of surprise Entropy as a measure of surprise Lecture 5: Sam Roweis September 26, 25 What does information do? It removes uncertainty. Information Conveyed = Uncertainty Removed = Surprise Yielded. How should we quantify

More information

Multimedia Communications. Mathematical Preliminaries for Lossless Compression

Multimedia Communications. Mathematical Preliminaries for Lossless Compression Multimedia Communications Mathematical Preliminaries for Lossless Compression What we will see in this chapter Definition of information and entropy Modeling a data source Definition of coding and when

More information

Exercises with solutions (Set B)

Exercises with solutions (Set B) Exercises with solutions (Set B) 3. A fair coin is tossed an infinite number of times. Let Y n be a random variable, with n Z, that describes the outcome of the n-th coin toss. If the outcome of the n-th

More information

3F1 Information Theory, Lecture 3

3F1 Information Theory, Lecture 3 3F1 Information Theory, Lecture 3 Jossy Sayir Department of Engineering Michaelmas 2013, 29 November 2013 Memoryless Sources Arithmetic Coding Sources with Memory Markov Example 2 / 21 Encoding the output

More information

Source Coding. Master Universitario en Ingeniería de Telecomunicación. I. Santamaría Universidad de Cantabria

Source Coding. Master Universitario en Ingeniería de Telecomunicación. I. Santamaría Universidad de Cantabria Source Coding Master Universitario en Ingeniería de Telecomunicación I. Santamaría Universidad de Cantabria Contents Introduction Asymptotic Equipartition Property Optimal Codes (Huffman Coding) Universal

More information

Motivation for Arithmetic Coding

Motivation for Arithmetic Coding Motivation for Arithmetic Coding Motivations for arithmetic coding: 1) Huffman coding algorithm can generate prefix codes with a minimum average codeword length. But this length is usually strictly greater

More information

Lecture 4 : Adaptive source coding algorithms

Lecture 4 : Adaptive source coding algorithms Lecture 4 : Adaptive source coding algorithms February 2, 28 Information Theory Outline 1. Motivation ; 2. adaptive Huffman encoding ; 3. Gallager and Knuth s method ; 4. Dictionary methods : Lempel-Ziv

More information

10-704: Information Processing and Learning Fall Lecture 9: Sept 28

10-704: Information Processing and Learning Fall Lecture 9: Sept 28 10-704: Information Processing and Learning Fall 2016 Lecturer: Siheng Chen Lecture 9: Sept 28 Note: These notes are based on scribed notes from Spring15 offering of this course. LaTeX template courtesy

More information

1 Introduction to information theory

1 Introduction to information theory 1 Introduction to information theory 1.1 Introduction In this chapter we present some of the basic concepts of information theory. The situations we have in mind involve the exchange of information through

More information

Run-length & Entropy Coding. Redundancy Removal. Sampling. Quantization. Perform inverse operations at the receiver EEE

Run-length & Entropy Coding. Redundancy Removal. Sampling. Quantization. Perform inverse operations at the receiver EEE General e Image Coder Structure Motion Video x(s 1,s 2,t) or x(s 1,s 2 ) Natural Image Sampling A form of data compression; usually lossless, but can be lossy Redundancy Removal Lossless compression: predictive

More information

4F5: Advanced Communications and Coding Handout 2: The Typical Set, Compression, Mutual Information

4F5: Advanced Communications and Coding Handout 2: The Typical Set, Compression, Mutual Information 4F5: Advanced Communications and Coding Handout 2: The Typical Set, Compression, Mutual Information Ramji Venkataramanan Signal Processing and Communications Lab Department of Engineering ramji.v@eng.cam.ac.uk

More information

lossless, optimal compressor

lossless, optimal compressor 6. Variable-length Lossless Compression The principal engineering goal of compression is to represent a given sequence a, a 2,..., a n produced by a source as a sequence of bits of minimal possible length.

More information

SIGNAL COMPRESSION Lecture Shannon-Fano-Elias Codes and Arithmetic Coding

SIGNAL COMPRESSION Lecture Shannon-Fano-Elias Codes and Arithmetic Coding SIGNAL COMPRESSION Lecture 3 4.9.2007 Shannon-Fano-Elias Codes and Arithmetic Coding 1 Shannon-Fano-Elias Coding We discuss how to encode the symbols {a 1, a 2,..., a m }, knowing their probabilities,

More information

EE5139R: Problem Set 4 Assigned: 31/08/16, Due: 07/09/16

EE5139R: Problem Set 4 Assigned: 31/08/16, Due: 07/09/16 EE539R: Problem Set 4 Assigned: 3/08/6, Due: 07/09/6. Cover and Thomas: Problem 3.5 Sets defined by probabilities: Define the set C n (t = {x n : P X n(x n 2 nt } (a We have = P X n(x n P X n(x n 2 nt

More information

EE 229B ERROR CONTROL CODING Spring 2005

EE 229B ERROR CONTROL CODING Spring 2005 EE 229B ERROR CONTROL CODING Spring 2005 Solutions for Homework 1 1. Is there room? Prove or disprove : There is a (12,7) binary linear code with d min = 5. If there were a (12,7) binary linear code with

More information

An instantaneous code (prefix code, tree code) with the codeword lengths l 1,..., l N exists if and only if. 2 l i. i=1

An instantaneous code (prefix code, tree code) with the codeword lengths l 1,..., l N exists if and only if. 2 l i. i=1 Kraft s inequality An instantaneous code (prefix code, tree code) with the codeword lengths l 1,..., l N exists if and only if N 2 l i 1 Proof: Suppose that we have a tree code. Let l max = max{l 1,...,

More information

Information Theory CHAPTER. 5.1 Introduction. 5.2 Entropy

Information Theory CHAPTER. 5.1 Introduction. 5.2 Entropy Haykin_ch05_pp3.fm Page 207 Monday, November 26, 202 2:44 PM CHAPTER 5 Information Theory 5. Introduction As mentioned in Chapter and reiterated along the way, the purpose of a communication system is

More information

Chapter 3 Source Coding. 3.1 An Introduction to Source Coding 3.2 Optimal Source Codes 3.3 Shannon-Fano Code 3.4 Huffman Code

Chapter 3 Source Coding. 3.1 An Introduction to Source Coding 3.2 Optimal Source Codes 3.3 Shannon-Fano Code 3.4 Huffman Code Chapter 3 Source Coding 3. An Introduction to Source Coding 3.2 Optimal Source Codes 3.3 Shannon-Fano Code 3.4 Huffman Code 3. An Introduction to Source Coding Entropy (in bits per symbol) implies in average

More information

3F1 Information Theory, Lecture 3

3F1 Information Theory, Lecture 3 3F1 Information Theory, Lecture 3 Jossy Sayir Department of Engineering Michaelmas 2011, 28 November 2011 Memoryless Sources Arithmetic Coding Sources with Memory 2 / 19 Summary of last lecture Prefix-free

More information

Intro to Information Theory

Intro to Information Theory Intro to Information Theory Math Circle February 11, 2018 1. Random variables Let us review discrete random variables and some notation. A random variable X takes value a A with probability P (a) 0. Here

More information

Communications Theory and Engineering

Communications Theory and Engineering Communications Theory and Engineering Master's Degree in Electronic Engineering Sapienza University of Rome A.A. 2018-2019 AEP Asymptotic Equipartition Property AEP In information theory, the analog of

More information

Coding of memoryless sources 1/35

Coding of memoryless sources 1/35 Coding of memoryless sources 1/35 Outline 1. Morse coding ; 2. Definitions : encoding, encoding efficiency ; 3. fixed length codes, encoding integers ; 4. prefix condition ; 5. Kraft and Mac Millan theorems

More information

EE376A - Information Theory Midterm, Tuesday February 10th. Please start answering each question on a new page of the answer booklet.

EE376A - Information Theory Midterm, Tuesday February 10th. Please start answering each question on a new page of the answer booklet. EE376A - Information Theory Midterm, Tuesday February 10th Instructions: You have two hours, 7PM - 9PM The exam has 3 questions, totaling 100 points. Please start answering each question on a new page

More information

Lecture 1: September 25, A quick reminder about random variables and convexity

Lecture 1: September 25, A quick reminder about random variables and convexity Information and Coding Theory Autumn 207 Lecturer: Madhur Tulsiani Lecture : September 25, 207 Administrivia This course will cover some basic concepts in information and coding theory, and their applications

More information

10-704: Information Processing and Learning Spring Lecture 8: Feb 5

10-704: Information Processing and Learning Spring Lecture 8: Feb 5 10-704: Information Processing and Learning Spring 2015 Lecture 8: Feb 5 Lecturer: Aarti Singh Scribe: Siheng Chen Disclaimer: These notes have not been subjected to the usual scrutiny reserved for formal

More information

Lecture 16. Error-free variable length schemes (contd.): Shannon-Fano-Elias code, Huffman code

Lecture 16. Error-free variable length schemes (contd.): Shannon-Fano-Elias code, Huffman code Lecture 16 Agenda for the lecture Error-free variable length schemes (contd.): Shannon-Fano-Elias code, Huffman code Variable-length source codes with error 16.1 Error-free coding schemes 16.1.1 The Shannon-Fano-Elias

More information

Information Theory and Statistics Lecture 2: Source coding

Information Theory and Statistics Lecture 2: Source coding Information Theory and Statistics Lecture 2: Source coding Łukasz Dębowski ldebowsk@ipipan.waw.pl Ph. D. Programme 2013/2014 Injections and codes Definition (injection) Function f is called an injection

More information

Introduction to information theory and coding

Introduction to information theory and coding Introduction to information theory and coding Louis WEHENKEL Set of slides No 5 State of the art in data compression Stochastic processes and models for information sources First Shannon theorem : data

More information

Homework Set #2 Data Compression, Huffman code and AEP

Homework Set #2 Data Compression, Huffman code and AEP Homework Set #2 Data Compression, Huffman code and AEP 1. Huffman coding. Consider the random variable ( x1 x X = 2 x 3 x 4 x 5 x 6 x 7 0.50 0.26 0.11 0.04 0.04 0.03 0.02 (a Find a binary Huffman code

More information

Asymptotic redundancy and prolixity

Asymptotic redundancy and prolixity Asymptotic redundancy and prolixity Yuval Dagan, Yuval Filmus, and Shay Moran April 6, 2017 Abstract Gallager (1978) considered the worst-case redundancy of Huffman codes as the maximum probability tends

More information

Data Compression Techniques

Data Compression Techniques Data Compression Techniques Part 2: Text Compression Lecture 5: Context-Based Compression Juha Kärkkäinen 14.11.2017 1 / 19 Text Compression We will now look at techniques for text compression. These techniques

More information

Lecture 3. Mathematical methods in communication I. REMINDER. A. Convex Set. A set R is a convex set iff, x 1,x 2 R, θ, 0 θ 1, θx 1 + θx 2 R, (1)

Lecture 3. Mathematical methods in communication I. REMINDER. A. Convex Set. A set R is a convex set iff, x 1,x 2 R, θ, 0 θ 1, θx 1 + θx 2 R, (1) 3- Mathematical methods in communication Lecture 3 Lecturer: Haim Permuter Scribe: Yuval Carmel, Dima Khaykin, Ziv Goldfeld I. REMINDER A. Convex Set A set R is a convex set iff, x,x 2 R, θ, θ, θx + θx

More information

MARKOV CHAINS A finite state Markov chain is a sequence of discrete cv s from a finite alphabet where is a pmf on and for

MARKOV CHAINS A finite state Markov chain is a sequence of discrete cv s from a finite alphabet where is a pmf on and for MARKOV CHAINS A finite state Markov chain is a sequence S 0,S 1,... of discrete cv s from a finite alphabet S where q 0 (s) is a pmf on S 0 and for n 1, Q(s s ) = Pr(S n =s S n 1 =s ) = Pr(S n =s S n 1

More information

ECE Advanced Communication Theory, Spring 2009 Homework #1 (INCOMPLETE)

ECE Advanced Communication Theory, Spring 2009 Homework #1 (INCOMPLETE) ECE 74 - Advanced Communication Theory, Spring 2009 Homework #1 (INCOMPLETE) 1. A Huffman code finds the optimal codeword to assign to a given block of source symbols. (a) Show that cannot be a Huffman

More information

Information Theory: Entropy, Markov Chains, and Huffman Coding

Information Theory: Entropy, Markov Chains, and Huffman Coding The University of Notre Dame A senior thesis submitted to the Department of Mathematics and the Glynn Family Honors Program Information Theory: Entropy, Markov Chains, and Huffman Coding Patrick LeBlanc

More information

Chapter 2: Source coding

Chapter 2: Source coding Chapter 2: meghdadi@ensil.unilim.fr University of Limoges Chapter 2: Entropy of Markov Source Chapter 2: Entropy of Markov Source Markov model for information sources Given the present, the future is independent

More information

PART III. Outline. Codes and Cryptography. Sources. Optimal Codes (I) Jorge L. Villar. MAMME, Fall 2015

PART III. Outline. Codes and Cryptography. Sources. Optimal Codes (I) Jorge L. Villar. MAMME, Fall 2015 Outline Codes and Cryptography 1 Information Sources and Optimal Codes 2 Building Optimal Codes: Huffman Codes MAMME, Fall 2015 3 Shannon Entropy and Mutual Information PART III Sources Information source:

More information

COS597D: Information Theory in Computer Science October 19, Lecture 10

COS597D: Information Theory in Computer Science October 19, Lecture 10 COS597D: Information Theory in Computer Science October 9, 20 Lecture 0 Lecturer: Mark Braverman Scribe: Andrej Risteski Kolmogorov Complexity In the previous lectures, we became acquainted with the concept

More information

Chapter 2 Date Compression: Source Coding. 2.1 An Introduction to Source Coding 2.2 Optimal Source Codes 2.3 Huffman Code

Chapter 2 Date Compression: Source Coding. 2.1 An Introduction to Source Coding 2.2 Optimal Source Codes 2.3 Huffman Code Chapter 2 Date Compression: Source Coding 2.1 An Introduction to Source Coding 2.2 Optimal Source Codes 2.3 Huffman Code 2.1 An Introduction to Source Coding Source coding can be seen as an efficient way

More information

Notes for Lecture Notes 2

Notes for Lecture Notes 2 Stanford University CS254: Computational Complexity Notes 2 Luca Trevisan January 11, 2012 Notes for Lecture Notes 2 In this lecture we define NP, we state the P versus NP problem, we prove that its formulation

More information

Information Theory, Statistics, and Decision Trees

Information Theory, Statistics, and Decision Trees Information Theory, Statistics, and Decision Trees Léon Bottou COS 424 4/6/2010 Summary 1. Basic information theory. 2. Decision trees. 3. Information theory and statistics. Léon Bottou 2/31 COS 424 4/6/2010

More information

(Classical) Information Theory II: Source coding

(Classical) Information Theory II: Source coding (Classical) Information Theory II: Source coding Sibasish Ghosh The Institute of Mathematical Sciences CIT Campus, Taramani, Chennai 600 113, India. p. 1 Abstract The information content of a random variable

More information

Bandwidth: Communicate large complex & highly detailed 3D models through lowbandwidth connection (e.g. VRML over the Internet)

Bandwidth: Communicate large complex & highly detailed 3D models through lowbandwidth connection (e.g. VRML over the Internet) Compression Motivation Bandwidth: Communicate large complex & highly detailed 3D models through lowbandwidth connection (e.g. VRML over the Internet) Storage: Store large & complex 3D models (e.g. 3D scanner

More information

Lecture 5: Asymptotic Equipartition Property

Lecture 5: Asymptotic Equipartition Property Lecture 5: Asymptotic Equipartition Property Law of large number for product of random variables AEP and consequences Dr. Yao Xie, ECE587, Information Theory, Duke University Stock market Initial investment

More information

CS 229r Information Theory in Computer Science Feb 12, Lecture 5

CS 229r Information Theory in Computer Science Feb 12, Lecture 5 CS 229r Information Theory in Computer Science Feb 12, 2019 Lecture 5 Instructor: Madhu Sudan Scribe: Pranay Tankala 1 Overview A universal compression algorithm is a single compression algorithm applicable

More information

Further discussion of Turing machines

Further discussion of Turing machines Further discussion of Turing machines In this lecture we will discuss various aspects of decidable and Turing-recognizable languages that were not mentioned in previous lectures. In particular, we will

More information

Data Compression Techniques

Data Compression Techniques Data Compression Techniques Part 1: Entropy Coding Lecture 4: Asymmetric Numeral Systems Juha Kärkkäinen 08.11.2017 1 / 19 Asymmetric Numeral Systems Asymmetric numeral systems (ANS) is a recent entropy

More information

Lecture 6: Kraft-McMillan Inequality and Huffman Coding

Lecture 6: Kraft-McMillan Inequality and Huffman Coding EE376A/STATS376A Information Theory Lecture 6-0/25/208 Lecture 6: Kraft-McMillan Inequality and Huffman Coding Lecturer: Tsachy Weissman Scribe: Akhil Prakash, Kai Yee Wan In this lecture, we begin with

More information

CS6304 / Analog and Digital Communication UNIT IV - SOURCE AND ERROR CONTROL CODING PART A 1. What is the use of error control coding? The main use of error control coding is to reduce the overall probability

More information

An introduction to basic information theory. Hampus Wessman

An introduction to basic information theory. Hampus Wessman An introduction to basic information theory Hampus Wessman Abstract We give a short and simple introduction to basic information theory, by stripping away all the non-essentials. Theoretical bounds on

More information

Data Compression. Limit of Information Compression. October, Examples of codes 1

Data Compression. Limit of Information Compression. October, Examples of codes 1 Data Compression Limit of Information Compression Radu Trîmbiţaş October, 202 Outline Contents Eamples of codes 2 Kraft Inequality 4 2. Kraft Inequality............................ 4 2.2 Kraft inequality

More information

Introductory Analysis I Fall 2014 Homework #9 Due: Wednesday, November 19

Introductory Analysis I Fall 2014 Homework #9 Due: Wednesday, November 19 Introductory Analysis I Fall 204 Homework #9 Due: Wednesday, November 9 Here is an easy one, to serve as warmup Assume M is a compact metric space and N is a metric space Assume that f n : M N for each

More information

1 Computational Problems

1 Computational Problems Stanford University CS254: Computational Complexity Handout 2 Luca Trevisan March 31, 2010 Last revised 4/29/2010 In this lecture we define NP, we state the P versus NP problem, we prove that its formulation

More information

UNIT I INFORMATION THEORY. I k log 2

UNIT I INFORMATION THEORY. I k log 2 UNIT I INFORMATION THEORY Claude Shannon 1916-2001 Creator of Information Theory, lays the foundation for implementing logic in digital circuits as part of his Masters Thesis! (1939) and published a paper

More information

Tight Upper Bounds on the Redundancy of Optimal Binary AIFV Codes

Tight Upper Bounds on the Redundancy of Optimal Binary AIFV Codes Tight Upper Bounds on the Redundancy of Optimal Binary AIFV Codes Weihua Hu Dept. of Mathematical Eng. Email: weihua96@gmail.com Hirosuke Yamamoto Dept. of Complexity Sci. and Eng. Email: Hirosuke@ieee.org

More information

CSCI 2570 Introduction to Nanocomputing

CSCI 2570 Introduction to Nanocomputing CSCI 2570 Introduction to Nanocomputing Information Theory John E Savage What is Information Theory Introduced by Claude Shannon. See Wikipedia Two foci: a) data compression and b) reliable communication

More information

Shannon-Fano-Elias coding

Shannon-Fano-Elias coding Shannon-Fano-Elias coding Suppose that we have a memoryless source X t taking values in the alphabet {1, 2,..., L}. Suppose that the probabilities for all symbols are strictly positive: p(i) > 0, i. The

More information

Information Theory with Applications, Math6397 Lecture Notes from September 30, 2014 taken by Ilknur Telkes

Information Theory with Applications, Math6397 Lecture Notes from September 30, 2014 taken by Ilknur Telkes Information Theory with Applications, Math6397 Lecture Notes from September 3, 24 taken by Ilknur Telkes Last Time Kraft inequality (sep.or) prefix code Shannon Fano code Bound for average code-word length

More information

Complexity Theory VU , SS The Polynomial Hierarchy. Reinhard Pichler

Complexity Theory VU , SS The Polynomial Hierarchy. Reinhard Pichler Complexity Theory Complexity Theory VU 181.142, SS 2018 6. The Polynomial Hierarchy Reinhard Pichler Institut für Informationssysteme Arbeitsbereich DBAI Technische Universität Wien 15 May, 2018 Reinhard

More information

Outline. Complexity Theory EXACT TSP. The Class DP. Definition. Problem EXACT TSP. Complexity of EXACT TSP. Proposition VU 181.

Outline. Complexity Theory EXACT TSP. The Class DP. Definition. Problem EXACT TSP. Complexity of EXACT TSP. Proposition VU 181. Complexity Theory Complexity Theory Outline Complexity Theory VU 181.142, SS 2018 6. The Polynomial Hierarchy Reinhard Pichler Institut für Informationssysteme Arbeitsbereich DBAI Technische Universität

More information

Investigation of the Elias Product Code Construction for the Binary Erasure Channel

Investigation of the Elias Product Code Construction for the Binary Erasure Channel Investigation of the Elias Product Code Construction for the Binary Erasure Channel by D. P. Varodayan A THESIS SUBMITTED IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF BACHELOR OF APPLIED

More information

EE 121: Introduction to Digital Communication Systems. 1. Consider the following discrete-time communication system. There are two equallly likely

EE 121: Introduction to Digital Communication Systems. 1. Consider the following discrete-time communication system. There are two equallly likely EE 11: Introduction to Digital Communication Systems Midterm Solutions 1. Consider the following discrete-time communication system. There are two equallly likely messages to be transmitted, and they are

More information

ELEC 515 Information Theory. Distortionless Source Coding

ELEC 515 Information Theory. Distortionless Source Coding ELEC 515 Information Theory Distortionless Source Coding 1 Source Coding Output Alphabet Y={y 1,,y J } Source Encoder Lengths 2 Source Coding Two coding requirements The source sequence can be recovered

More information

Lecture 8: Channel and source-channel coding theorems; BEC & linear codes. 1 Intuitive justification for upper bound on channel capacity

Lecture 8: Channel and source-channel coding theorems; BEC & linear codes. 1 Intuitive justification for upper bound on channel capacity 5-859: Information Theory and Applications in TCS CMU: Spring 23 Lecture 8: Channel and source-channel coding theorems; BEC & linear codes February 7, 23 Lecturer: Venkatesan Guruswami Scribe: Dan Stahlke

More information

COMM901 Source Coding and Compression. Quiz 1

COMM901 Source Coding and Compression. Quiz 1 German University in Cairo - GUC Faculty of Information Engineering & Technology - IET Department of Communication Engineering Winter Semester 2013/2014 Students Name: Students ID: COMM901 Source Coding

More information

Solutions to Set #2 Data Compression, Huffman code and AEP

Solutions to Set #2 Data Compression, Huffman code and AEP Solutions to Set #2 Data Compression, Huffman code and AEP. Huffman coding. Consider the random variable ( ) x x X = 2 x 3 x 4 x 5 x 6 x 7 0.50 0.26 0. 0.04 0.04 0.03 0.02 (a) Find a binary Huffman code

More information

Lecture 4 Noisy Channel Coding

Lecture 4 Noisy Channel Coding Lecture 4 Noisy Channel Coding I-Hsiang Wang Department of Electrical Engineering National Taiwan University ihwang@ntu.edu.tw October 9, 2015 1 / 56 I-Hsiang Wang IT Lecture 4 The Channel Coding Problem

More information

4.8 Huffman Codes. These lecture slides are supplied by Mathijs de Weerd

4.8 Huffman Codes. These lecture slides are supplied by Mathijs de Weerd 4.8 Huffman Codes These lecture slides are supplied by Mathijs de Weerd Data Compression Q. Given a text that uses 32 symbols (26 different letters, space, and some punctuation characters), how can we

More information

Transducers for bidirectional decoding of prefix codes

Transducers for bidirectional decoding of prefix codes Transducers for bidirectional decoding of prefix codes Laura Giambruno a,1, Sabrina Mantaci a,1 a Dipartimento di Matematica ed Applicazioni - Università di Palermo - Italy Abstract We construct a transducer

More information

Information and Entropy

Information and Entropy Information and Entropy Shannon s Separation Principle Source Coding Principles Entropy Variable Length Codes Huffman Codes Joint Sources Arithmetic Codes Adaptive Codes Thomas Wiegand: Digital Image Communication

More information

Introduction to information theory and coding

Introduction to information theory and coding Introduction to information theory and coding Louis WEHENKEL Set of slides No 4 Source modeling and source coding Stochastic processes and models for information sources First Shannon theorem : data compression

More information

Coding for Discrete Source

Coding for Discrete Source EGR 544 Communication Theory 3. Coding for Discrete Sources Z. Aliyazicioglu Electrical and Computer Engineering Department Cal Poly Pomona Coding for Discrete Source Coding Represent source data effectively

More information

Lecture 1: Shannon s Theorem

Lecture 1: Shannon s Theorem Lecture 1: Shannon s Theorem Lecturer: Travis Gagie January 13th, 2015 Welcome to Data Compression! I m Travis and I ll be your instructor this week. If you haven t registered yet, don t worry, we ll work

More information

Algorithms: Lecture 2

Algorithms: Lecture 2 1 Algorithms: Lecture 2 Basic Structures: Sets, Functions, Sequences, and Sums Jinwoo Kim jwkim@jjay.cuny.edu 2.1 Sets 2 1 2.1 Sets 3 2.1 Sets 4 2 2.1 Sets 5 2.1 Sets 6 3 2.1 Sets 7 2.2 Set Operations

More information

Data Compression Techniques (Spring 2012) Model Solutions for Exercise 2

Data Compression Techniques (Spring 2012) Model Solutions for Exercise 2 582487 Data Compression Techniques (Spring 22) Model Solutions for Exercise 2 If you have any feedback or corrections, please contact nvalimak at cs.helsinki.fi.. Problem: Construct a canonical prefix

More information

Introduction to Information Theory. By Prof. S.J. Soni Asst. Professor, CE Department, SPCE, Visnagar

Introduction to Information Theory. By Prof. S.J. Soni Asst. Professor, CE Department, SPCE, Visnagar Introduction to Information Theory By Prof. S.J. Soni Asst. Professor, CE Department, SPCE, Visnagar Introduction [B.P. Lathi] Almost in all the means of communication, none produces error-free communication.

More information

1 Ex. 1 Verify that the function H(p 1,..., p n ) = k p k log 2 p k satisfies all 8 axioms on H.

1 Ex. 1 Verify that the function H(p 1,..., p n ) = k p k log 2 p k satisfies all 8 axioms on H. Problem sheet Ex. Verify that the function H(p,..., p n ) = k p k log p k satisfies all 8 axioms on H. Ex. (Not to be handed in). looking at the notes). List as many of the 8 axioms as you can, (without

More information

Capacity of a channel Shannon s second theorem. Information Theory 1/33

Capacity of a channel Shannon s second theorem. Information Theory 1/33 Capacity of a channel Shannon s second theorem Information Theory 1/33 Outline 1. Memoryless channels, examples ; 2. Capacity ; 3. Symmetric channels ; 4. Channel Coding ; 5. Shannon s second theorem,

More information

2018/5/3. YU Xiangyu

2018/5/3. YU Xiangyu 2018/5/3 YU Xiangyu yuxy@scut.edu.cn Entropy Huffman Code Entropy of Discrete Source Definition of entropy: If an information source X can generate n different messages x 1, x 2,, x i,, x n, then the

More information

Slides for CIS 675. Huffman Encoding, 1. Huffman Encoding, 2. Huffman Encoding, 3. Encoding 1. DPV Chapter 5, Part 2. Encoding 2

Slides for CIS 675. Huffman Encoding, 1. Huffman Encoding, 2. Huffman Encoding, 3. Encoding 1. DPV Chapter 5, Part 2. Encoding 2 Huffman Encoding, 1 EECS Slides for CIS 675 DPV Chapter 5, Part 2 Jim Royer October 13, 2009 A toy example: Suppose our alphabet is { A, B, C, D }. Suppose T is a text of 130 million characters. What is

More information

d(ν) = max{n N : ν dmn p n } N. p d(ν) (ν) = ρ.

d(ν) = max{n N : ν dmn p n } N. p d(ν) (ν) = ρ. 1. Trees; context free grammars. 1.1. Trees. Definition 1.1. By a tree we mean an ordered triple T = (N, ρ, p) (i) N is a finite set; (ii) ρ N ; (iii) p : N {ρ} N ; (iv) if n N + and ν dmn p n then p n

More information

Induction and recursion. Chapter 5

Induction and recursion. Chapter 5 Induction and recursion Chapter 5 Chapter Summary Mathematical Induction Strong Induction Well-Ordering Recursive Definitions Structural Induction Recursive Algorithms Mathematical Induction Section 5.1

More information

EE376A - Information Theory Final, Monday March 14th 2016 Solutions. Please start answering each question on a new page of the answer booklet.

EE376A - Information Theory Final, Monday March 14th 2016 Solutions. Please start answering each question on a new page of the answer booklet. EE376A - Information Theory Final, Monday March 14th 216 Solutions Instructions: You have three hours, 3.3PM - 6.3PM The exam has 4 questions, totaling 12 points. Please start answering each question on

More information

arxiv: v1 [cs.it] 5 Sep 2008

arxiv: v1 [cs.it] 5 Sep 2008 1 arxiv:0809.1043v1 [cs.it] 5 Sep 2008 On Unique Decodability Marco Dalai, Riccardo Leonardi Abstract In this paper we propose a revisitation of the topic of unique decodability and of some fundamental

More information

An O(N) Semi-Predictive Universal Encoder via the BWT

An O(N) Semi-Predictive Universal Encoder via the BWT An O(N) Semi-Predictive Universal Encoder via the BWT Dror Baron and Yoram Bresler Abstract We provide an O(N) algorithm for a non-sequential semi-predictive encoder whose pointwise redundancy with respect

More information

Lecture 2: August 31

Lecture 2: August 31 0-704: Information Processing and Learning Fall 206 Lecturer: Aarti Singh Lecture 2: August 3 Note: These notes are based on scribed notes from Spring5 offering of this course. LaTeX template courtesy

More information

Northwestern University Department of Electrical Engineering and Computer Science

Northwestern University Department of Electrical Engineering and Computer Science Northwestern University Department of Electrical Engineering and Computer Science EECS 454: Modeling and Analysis of Communication Networks Spring 2008 Probability Review As discussed in Lecture 1, probability

More information

Induction and Recursion

Induction and Recursion . All rights reserved. Authorized only for instructor use in the classroom. No reproduction or further distribution permitted without the prior written consent of McGraw-Hill Education. Induction and Recursion

More information

Lecture 11: Polar codes construction

Lecture 11: Polar codes construction 15-859: Information Theory and Applications in TCS CMU: Spring 2013 Lecturer: Venkatesan Guruswami Lecture 11: Polar codes construction February 26, 2013 Scribe: Dan Stahlke 1 Polar codes: recap of last

More information

Lecture 7 September 24

Lecture 7 September 24 EECS 11: Coding for Digital Communication and Beyond Fall 013 Lecture 7 September 4 Lecturer: Anant Sahai Scribe: Ankush Gupta 7.1 Overview This lecture introduces affine and linear codes. Orthogonal signalling

More information

Chapter 4. Data Transmission and Channel Capacity. Po-Ning Chen, Professor. Department of Communications Engineering. National Chiao Tung University

Chapter 4. Data Transmission and Channel Capacity. Po-Ning Chen, Professor. Department of Communications Engineering. National Chiao Tung University Chapter 4 Data Transmission and Channel Capacity Po-Ning Chen, Professor Department of Communications Engineering National Chiao Tung University Hsin Chu, Taiwan 30050, R.O.C. Principle of Data Transmission

More information

CSEP 590 Data Compression Autumn Arithmetic Coding

CSEP 590 Data Compression Autumn Arithmetic Coding CSEP 590 Data Compression Autumn 2007 Arithmetic Coding Reals in Binary Any real number x in the interval [0,1) can be represented in binary as.b 1 b 2... where b i is a bit. x 0 0 1 0 1... binary representation

More information

Homework 1 (revised) Solutions

Homework 1 (revised) Solutions Homework 1 (revised) Solutions 1. Textbook, 1.1.1, # 1.1.2 (p. 24) Let S be an ordered set. Let A be a non-empty finite subset. Then A is bounded and sup A, inf A A Solution. The hint was: Use induction,

More information

Kolmogorov complexity

Kolmogorov complexity Kolmogorov complexity In this section we study how we can define the amount of information in a bitstring. Consider the following strings: 00000000000000000000000000000000000 0000000000000000000000000000000000000000

More information