Lecture 5 Channel Coding over Continuous Channels
|
|
- Amie Watkins
- 5 years ago
- Views:
Transcription
1 Lecture 5 Channel Coding over Continuous Channels I-Hsiang Wang Department of Electrical Engineering National Taiwan University ihwang@ntu.edu.tw November 14, / 34 I-Hsiang Wang NIT Lecture 5
2 From Discrete-Valued to Continuous-Valued (1) So far we have focused on the discrete-valued (& finite-alphabet) r.v s: Entropy and mutual information for discrete-valued r.v s. Lossless source coding for discrete stationary sources. Channel coding over discrete memoryless channels. In this lecture we extend the basic principles and fundamental theorems to continuous-valued sources and channels. In particular: Mutual information for continuous-valued r.v. s. Lossy source coding for continuous stationary sources. Channel coding with input cost over continuous memoryless channels. (Main example: Gaussian channel capacity). We skip lossy source coding (rate distortion theory) in this course. 2 / 34 I-Hsiang Wang NIT Lecture 5
3 From Discrete-Valued to Continuous-Valued (2) Main technique for extending coding theorems from the discrete-valued, finite-alphabet world to the continuous-valued world: Advantages: Discretization. No need for new tools (eg., typicality) for continuous-valued r.v. s. Extends naturally to multi-terminal settings can focus on discrete memoryless networks. Outline: 1 Differential entropy 2 Channel coding with input cost over DMC 3 Gaussian channel capacity 3 / 34 I-Hsiang Wang NIT Lecture 5
4 Disclaimer: due to time constraint, we will not be 100% rigorous in deriving the results in this lecture. Instead, you can find rigorous treatment in the references. Reading: 1 Mutual information and differential entropy: Chapter 8, Cover&Thomas [2] Chapter 15, Moser [5] Chapter 2.2, El Gamal&Kim [1] 2 Gaussian channel capacity: Chapter 9, Cover&Thomas [2] Chapter 16, Moser [5] Chapter 3.3,3.4, El Gamal&Kim [1] Remark: Using discretization to derive the achievability of Gaussian channel capacity follows [1]. [2] uses weak typicality for continuous random variables; [5] uses threshold decoder, similar to weak typicality in spirit. 4 / 34 I-Hsiang Wang NIT Lecture 5
5 1 Mutual Information and Differential Entropy / 34 I-Hsiang Wang NIT Lecture 5
6 Entropy of a Continuous Random Variable Question: What is the entropy of a continuous real-valued random variable X? Suppose X has the probability density function (p.d.f.) f (x). Let us discretize X to answer this question, as follows: Partition R into length- intervals: R = k= [k, (k + 1) ). Suppose that f (x) is continuous, then by the mean-value theorem, k Z, x k [k, (k + 1) ) such that f (x k ) = 1 (k+1) f (x) dx. k Set [X] := x k if X [k, (k + 1) ), with p.m.f. p (x k ) = f (x k ). Observation: lim 0 H ([X] ) = H (X) (intuitively), while H ( ) [X] = (f (x k ) ) log (f (x k ) ) = f (x k ) log f (x k ) log k= k= f (x) log f (x) dx + = as 0 We conclude that H (X) = if [ ] f (x) log f (x) = E log 1 f(x) exists. 6 / 34 I-Hsiang Wang NIT Lecture 5
7 Differential Entropy It is quite intuitive that the entropy of a continuous random variable can be arbitrarily large, because it can take infinitely many possible values. Hence, in general it is impossible to losslessly compress a continuous source with finite rate. Instead, lossy source coding is done. Yet, for continuous r.v. s, it turns out to be useful to define the counterparts of entropy and conditional entropy, as follows: Definition 1 (Differential entropy and conditional differential entropy) The differential [ entropy] of a continuous r.v. X with p.d.f. f (x) is defined as h (X) := E log 1 f(x) if the (improper) integral exists. The conditional differential entropy of a continuous r.v. X given Y, where (X, Y) has joint p.d.f. [ f (x, y) ] and conditional p.d.f. f (x y), is 1 defined as h (X Y) := E log f(x Y) if the (improper) integral exists. 7 / 34 I-Hsiang Wang NIT Lecture 5
8 Mutual Information between Continuous Random Variables How about mutual information between two continuous real-valued random variables X and Y, with joint p.d.f. f X,Y (x, y) and marginal p.d.f. s f X (x) and f Y (y)? Again, we use discretization: Partition R 2 plane into squares: R 2 = k,j= I k I j, where Ik := [k, (k + 1) ). Suppose that f X,Y (x, y) is continuous, then by the mean-value theorem (MVT), k, j Z, (x k, y j ) Ik I j such that f X,Y (x k, y j ) = 1 f 2 Ik X,Y (x, y) dx dy. I j Set ([X], [Y] ) := k,j (x k, y j ) I { (X, Y) Ik } I j, with p.m.f. p (x k, y j ) = f X,Y (x k, y j ) 2. By MVT, k, j Z, x k Ik and ỹ j Ij such that p (x k ) := f Ik X (x) dx = f X ( x k ), p (y j ) := f Ij Y (y) dy = f Y (ỹ j ). 8 / 34 I-Hsiang Wang NIT Lecture 5
9 Mutual Information between Continuous Random Variables Observation: lim 0 I ([X] ; [Y] ) = I (X; Y) (intuitively), while I ( [X] ; [Y] ) = = k,j= k,j= = 2 Hence, I (X; Y) = E k,j= p (x k, y j ) log p (x k, y j ) p (x k ) p (y j ) ( fx,y (x k, y j ) 2) log fx,y (x k, y j) 2 [ ] log f(x,y) f(x)f(y) f X ( x k ) f Y (ỹ j ) 2 f X,Y (x k, y j ) log f X,Y (x k, y j ) f X ( x k ) f Y (ỹ j ) f X,Y (x, y) log fx,y (x, y) dx dy as 0 f X (x) f Y (y) if the improper integral exists. 9 / 34 I-Hsiang Wang NIT Lecture 5
10 Mutual Information Unlike entropy that is only well-defined for discrete random variables, in general we can define the mutual information between two real-valued random variables (no necessarily continuous or discrete) as follows. Definition 2 (Mutual information) The mutual information between two random variables X and Y is defined as I (X; Y) = sup I ( ) [X] P ; [Y] Q, P,Q where the supremum is taken over all pairs of partitions P and Q of R. We have the following theorem immediately from the previous discussion: Theorem 1 (Mutual information between two continuous r.v. s) [ ] I (X; Y) := E log f(x,y) f(x)f(y) = h (X) h (X Y). 10 / 34 I-Hsiang Wang NIT Lecture 5
11 Properties that Extend to Continuous R.V. s Proposition 1 (Chain rule) n h (X, Y) = h (X) + h (Y X), h (X n ) = h ( X i X i 1). i=1 Proposition 2 (Conditioning reduces differential entropy) h (X Y) h (X), h (X Y, Z) h (X Z). Proposition 3 (Non-negativity of mutual information) I (X; Y) 0, I (X; Y Z) / 34 I-Hsiang Wang NIT Lecture 5
12 New Properties of Differential Entropy Example 1 (Differential entropy of a uniform r.v.) For a r.v. X Unif [a, b], that is, its p.d.f. f X (x) = 1 b ai {a x b}, its differential entropy h (X) = log (b a). Differential entropy can be negative. Since b a can be made arbitrarily small, h (X) = log (b a) can be negative. Hence, the non-negative property of entropy cannot be extended to differential entropy. Scaling will change the differential entropy. Consider X Unif [0, 1]. 2X Unif [0, 2]. Hence, h (X) = log 1 = 0, h (2X) = log 2 = 1 = h (X) h (2X). This is in sharp contrast to entropy: H (X) = H (g (X)) as long as g ( ) is an invertible function. 12 / 34 I-Hsiang Wang NIT Lecture 5
13 Scaling and Translation Proposition 4 Let X be a continuous random variable with differential entropy h (X). Translation does not change the differential entropy: For a constant c, h (X + c) = h (X). Scaling shift the differential entropy: For a constant a 0, h (ax) = h (X) + log a. Proposition 5 Let X be a continuous random vector with differential entropy h (X). For a constant vector c, h (X + c) = h (X). For an invertible matrix A R n n, h (AX) = h (X) + log det A. The proof of these propositions are left as exercises. 13 / 34 I-Hsiang Wang NIT Lecture 5
14 Differential Entropy of Gaussian Random Vectors Example 2 (Differential entropy of N (0, 1)) For a r.v. X N (0, 1), that is, its p.d.f. f X (x) = 1 2π e x2 2, its differential entropy h (X) = 1 2 log (2πe). For a n-dim random vector X N (m, K), we can rewrite X as X = AW + m, where AA T = K and W consists of i.i.d. W i N (0, 1), i = 1,..., n. Hence, by the translation and scaling properties of differential entropy: n h (X) = h (W) + log det A = h (W i ) + 1 log det K 2 i=1 = n 2 log (2πe) log det K = 1 2 log (2πe)n det K 14 / 34 I-Hsiang Wang NIT Lecture 5
15 Kullback-Leibler Divergence Definition 3 (KL divergence between densities) The Kullback-Leibler divergence between two probability [ density ] functions f (x) and g (x) is defined as D (f g) := E if the log f(x) g(x) (improper) integral exists. The expectation is taken over r.v. X f (x). By Jensen s inequality, it is straightforward to see that the non-negativity of KL divergence remains. Proposition 6 (Non-negativity of KL divergence) D (f g) 0, with equality iff f = g almost everywhere (i.e., except for some points with zero probability). Note: D (f g) is finite only if the support of f (x) is contained in the support of g (x). 15 / 34 I-Hsiang Wang NIT Lecture 5
16 Maximum Differential Entropy Theorem 2 (Maximum Differential Entropy under Covariance Constraint) Let [ X be a random vector with mean m and covariance matrix E (X m) (X m) T] = K, and X G be Gaussian with the same covariance K. Then, h (X) h ( X G) = 1 2 log (2πe)n det K. pf: First, we can assume WLOG that both X and X G are zero-mean. Let the p.d.f. of X be f (x) and the p.d.f. of X G be f G (x). Hence, 0 D (f f G ) = E [log f (X)] E [log f G (X)] = h (X) E f [log f G (X)] Note: log f G (x) is a quadratic function of x, and X, X G have the same second moment. Hence, E f [log f G (X)] = E fg [log f G (X)] = h ( X G), and 0 D (f f G ) = h (X) + h ( X G) = h (X) h ( X G). 16 / 34 I-Hsiang Wang NIT Lecture 5
17 1 Mutual Information and Differential Entropy / 34 I-Hsiang Wang NIT Lecture 5
18 Input Cost Our goal is to study channel coding over continuous memoryless channel. However, without any constraint on the channel input, it is quite easy to see that infinite amount of information can be delivered over the channel. To make the problem valid, we shall impose a constraint on the input cost of the communication. Let us begin by defining the cost function. Definition 4 (Input cost function) A non-negative input cost function b : X [0, ) is defined over the input alphabet X of a DMC ( X, p Y X, Y ). We can shift b such that there exists a symbol x o X with b (x o ) = 0. WLOG assume the existence of such zero-cost symbol x o X. Average input cost constraint for channel coding: for N channel uses, the average input cost 1 N N i=1 b (x i) B. 18 / 34 I-Hsiang Wang NIT Lecture 5
19 over DMC: Problem Setup w Channel Encoder x N Noisy Channel y N Channel Decoder bw 1 A ( 2 NR, N, B ) channel code consists of an encoding function (encoder) enc N : [1 : 2 K ] X N that maps each message w to a length N codeword x N, where K := NR. The codeword follows the input cost constraint 1 N N i=1 b (x i) B. a decoding function (decoder) dec N : Y N [1 : 2 K ] { } that maps a channel output y N to a reconstructed message ŵ or an error. { } 2 The error probability is defined as P (N) e := Pr W Ŵ. 3 A rate R is said to be achievable with input cost B if there exist a sequence of ( 2 NR, N, B ) codes such that P (N) e 0 as N. The channel capacity is defined as C(B) := sup {R R : achievable}. 19 / 34 I-Hsiang Wang NIT Lecture 5
20 Channel Coding Theorem with Average Input Cost Theorem 3 (Channel Coding Theorem for DMC with Average Input Cost) The capacity of the DMC p (y x) with input cost B is given by C (B) = max I (X; Y). (1) p(x): E[b(X)] B Compared to that without input cost constraint, the additional constraint in the extremal problem (1) is on the expected cost E [b (X)]. The capacity-cost function C (B) is non-decreasing, concave, and continuous in B (exercise). 20 / 34 I-Hsiang Wang NIT Lecture 5
21 Converse Proof pf: Following the converse proof of DMC without input cost, we arrive at R ϵ N 1 N I (X k ; Y k ), where ϵ N 0 as N. N k=1 By the definition of C (B), k [1 : N], I (X k ; Y k ) C (E [b (X k )]). Let B k := E [b (X k )]. Hence, ( ) R ϵ N 1 N C (B k ) (a) 1 N (b) C B k C (B). N N k=1 (a) is due to concavity of C (B) in B. k=1 (b) is due to B k B for all k [1 : N], and the fact that C (B) is non-decreasing in B. Hence, for all R is achievable, R C (B). 21 / 34 I-Hsiang Wang NIT Lecture 5
22 Achievability Proof (1) Achievability proof mostly follows that of DMC without input cost. Twist: in random codebook generation, how to ensure all codewords satisfy the input cost constraint? Random Codebook Generation and Encoding: Generate the random codebook C i.i.d. randomly according to p X (x) = arg max I (X; Y). p(x): E[b(X)] B 1+ϵ In other words, we step back and generate random codewords with a slightly smaller average cost B 1+ϵ. If the generate x N T (N) ϵ (X), then by Problem 6(a) of Quiz #1, we have 1 N N i=1 b (x i) (1 + ϵ)e [b (X)] = (1 + ϵ) B 1+ϵ = B. If the generate x N / T ϵ (N) (X), it may violates the input constraint, and hence we send the zero-cost codeword [ ] x o x o instead. 22 / 34 I-Hsiang Wang NIT Lecture 5
23 Achievability Proof (2) Error Probability Analysis: Following the same line in that of DMC without input cost constraint, we arrive at upper bounding Pr {E W = 1} := P 1 (E), where { (X E = A c 1 ( w 1 A w ), A w := N (w), Y N) } T ϵ (N) (X, Y). Upper bounding P 1 (A w ) for w 1 remains the same: since the actual message W = 1, whether or not the codeword X N (1) violates the cost constraint is not relevant. Upper bounding P 1 (A c 1 ) requires slight modification, regarding whether or not X N (1) T ϵ (N) (X): { (X P 1 (A c 1) P N 1 (1), Y N) } / T ϵ (N) (X, Y), X N (1) T ϵ (N) (X) { } + P 1 X N (1) / T (N) (X) 2ϵ, for N sufficiently large. ϵ 23 / 34 I-Hsiang Wang NIT Lecture 5
24 Achievability Proof (3) ( ) B Hence, we conclude that for any R < C 1+ϵ, R is achievable. ( ) Finally, since C (B) is continuous in B, we can make C arbitrarily B 1+ϵ close to C (B), and hence for any R < C (B), R is achievable. Exercise 1 Show that C (B) = max I (X; Y) p(x): E[b(X)] B is non-decreasing, concave, and continuous in B. 24 / 34 I-Hsiang Wang NIT Lecture 5
25 1 Mutual Information and Differential Entropy / 34 I-Hsiang Wang NIT Lecture 5
26 Additive White Gaussian Noise (AWGN) Channel z N w Channel Encoder x N y N Channel Decoder bw 1 Input/output alphabet X = Y = R. 2 AWGN Channel: Conditional p.d.f. f Y X is given by Y = X + Z, Z N ( 0, σ 2) X. {Z k } form an i.i.d. (white) Gaussian r.p. with Z k N ( 0, σ 2), k. Memoryless: Z k ( W, X k 1, Z k 1). Without feedback: Z N X N. 3 Average input power constraint P: 1 N N k=1 x k 2 P. 26 / 34 I-Hsiang Wang NIT Lecture 5
27 Channel Coding Theorem for Gaussian Channel Theorem 4 () The capacity of the AWGN channel with input power constraint P and noise variance σ 2 is given by C = sup I (X; Y) = 1 ( f(x): E[ X 2 ] P 2 log 1 + P ) σ 2. (2) Compared (1) in DMC with input cost constraint, in (2) the max operation is replaced by the sup operation. For the AWGN channel, the supremum is actually attainable with Gaussian input distribution f (x) = 1 2πP e x2 2P, i.e., X N (0, P). The capacity-power function C (P) is non-decreasing, concave, and left-continuous in P (exercise). 27 / 34 I-Hsiang Wang NIT Lecture 5
28 Converse Proof and Evaluation of Capacity Converse proof: Following the same line of proof, we arrive at R is achievable = R sup I (X; Y). f(x): E[ X 2 ] P Capacity evaluation: It is straightforward to see that the optimizing input X should have zero mean. Note that since Z X, I (X; Y) = h (Y) h (Y X) = h (Y) h (X + Z X) = h (Y) h (Z X) = h (Y) h (Z) = h (Y) 1 log (2πe) σ2 2 (a) 1 2 log (2πe) ( P + σ 2) 1 2 log (2πe) σ2 = 1 ( 2 log 1 + P ) σ 2 (a): h (Y) 1 2 log (2πe) E [ Y 2], E [ Y 2] = E [ X 2] + E [ Z 2] P + σ / 34 I-Hsiang Wang NIT Lecture 5
29 Achievability Proof (1) The proof of achievability for this continuous-valued channel makes use of discretization, so that we can apply the result in DMC with input cost. Discretization: here we use a different way to discretize the channel. { } m N, let Q m := l m : l = 0, ±1,..., ±m be the set of quantized points. For any r R, quantize r to the closest point [r] m Q m such that [r] m r. Let us use the above quantizer to discretize the channel: Quantize the channel input X to [X] m. Let Y (m) := [X] m + Z denote the channel output corresponding to the quantized channel input [X] m. Note: Y (m) is continuous-valued. Quantize the channel output Y (m) to [ Y (m)] n. 29 / 34 I-Hsiang Wang NIT Lecture 5
30 Achievability Proof (2) Now we have a DMC with input [X] m and output [ Y (m)] n. Note that for any p.d.f. f X (x) with E [ X 2] P, the quantized [X] m also satisfies the power constraint: E [ [X] m 2] E [ X 2] P. Hence, by the achievability result of DMC with input cost constraint, any R < I ( [X] m ; [ Y (m)] n) (evaluated under fx (x) = 1 2πP e x2 2P ) is indeed achievable for the discretized channel under power constraint P. The only thing left to be shown is that, I ( [X] m ; [ Y (m)] n) can be made arbitrarily close to I (X; Y) = 1 2 log ( 1 + P σ 2 ) as m, n. 30 / 34 I-Hsiang Wang NIT Lecture 5
31 Achievability Proof (3) First, due to data processing inequality, since [X] m Y (m) [ Y (m)] n, we have I ( [X] m ; [ Y (m)] n) I ( [X]m ; Y (m)) = h ( Y (m)) h (Z). Since E [ Y (m) 2] P + σ 2, we have h ( Y (m)) 1 2 log ( 2πe(P + σ 2 ) ), and hence ( [ I [X] m ; Y (m)] ) 1 (1 n 2 log + P ) σ 2. Second, for the lower bound, Lemma 3.2. in El Gamal&Kim [1] gives the following: lim inf m ( lim I n [ [X] m ; Y (m)] n Combining the above, proof is complete. ) 1 2 log (1 + P σ 2 ). 31 / 34 I-Hsiang Wang NIT Lecture 5
32 Lesson Learned Discretization may not be the simplest way to obtain achievability results. Instead, such discretization shows how to extend the coding theorem for a DMC to a Gaussian or any other well-behaved continuous-valued channel. Similar procedures can be used to extend coding theorems for finite-alphabet multiuser channels to their Gaussian counterparts. Hence, later we may skip formal proofs of such extensions. 32 / 34 I-Hsiang Wang NIT Lecture 5
33 1 Mutual Information and Differential Entropy / 34 I-Hsiang Wang NIT Lecture 5
34 Mutual information between two[ continuous r.v. s ] X and Y with joint density f X,Y : I (X; Y) = E. log f X,Y(X,Y) f X (X)f Y (Y) Differential[ entropy and ] conditional differential [ entropy: ] 1 1 h (X) := E log f X (X), h (X Y) := E log f X Y (X Y). I (X; Y) = h (X) h (X Y) = h (Y) h (Y X). KL divergence between densities f and g: D (f g) := E f [ log f(x) g(x) Chain rule, conditioning reduces differential entropy, non-negativity of mutual information and KL divergence: remain to hold. Differential entropy can be negative; h (X) h (X, Y). Channel capacity with input cost constraint: DMC: C (B) = max p(x): E[b(X)] B I (X; Y). Continuous-valued: C (B) = sup f(x): E[b(X)] B I (X; Y). Gaussian channel capacity: C (P) = 1 2 log ( 1 + P σ 2 ). ]. 34 / 34 I-Hsiang Wang NIT Lecture 5
Lecture 4 Channel Coding
Capacity and the Weak Converse Lecture 4 Coding I-Hsiang Wang Department of Electrical Engineering National Taiwan University ihwang@ntu.edu.tw October 15, 2014 1 / 16 I-Hsiang Wang NIT Lecture 4 Capacity
More informationLecture 6 Channel Coding over Continuous Channels
Lecture 6 Channel Coding over Continuous Channels I-Hsiang Wang Department of Electrical Engineering National Taiwan University ihwang@ntu.edu.tw November 9, 015 1 / 59 I-Hsiang Wang IT Lecture 6 We have
More informationLecture 4 Noisy Channel Coding
Lecture 4 Noisy Channel Coding I-Hsiang Wang Department of Electrical Engineering National Taiwan University ihwang@ntu.edu.tw October 9, 2015 1 / 56 I-Hsiang Wang IT Lecture 4 The Channel Coding Problem
More informationECE 4400:693 - Information Theory
ECE 4400:693 - Information Theory Dr. Nghi Tran Lecture 8: Differential Entropy Dr. Nghi Tran (ECE-University of Akron) ECE 4400:693 Lecture 1 / 43 Outline 1 Review: Entropy of discrete RVs 2 Differential
More informationMidterm Exam Information Theory Fall Midterm Exam. Time: 09:10 12:10 11/23, 2016
Midterm Exam Time: 09:10 12:10 11/23, 2016 Name: Student ID: Policy: (Read before You Start to Work) The exam is closed book. However, you are allowed to bring TWO A4-size cheat sheet (single-sheet, two-sided).
More informationEE 4TM4: Digital Communications II. Channel Capacity
EE 4TM4: Digital Communications II 1 Channel Capacity I. CHANNEL CODING THEOREM Definition 1: A rater is said to be achievable if there exists a sequence of(2 nr,n) codes such thatlim n P (n) e (C) = 0.
More informationLecture 6: Gaussian Channels. Copyright G. Caire (Sample Lectures) 157
Lecture 6: Gaussian Channels Copyright G. Caire (Sample Lectures) 157 Differential entropy (1) Definition 18. The (joint) differential entropy of a continuous random vector X n p X n(x) over R is: Z h(x
More information18.2 Continuous Alphabet (discrete-time, memoryless) Channel
0-704: Information Processing and Learning Spring 0 Lecture 8: Gaussian channel, Parallel channels and Rate-distortion theory Lecturer: Aarti Singh Scribe: Danai Koutra Disclaimer: These notes have not
More informationChapter 9 Fundamental Limits in Information Theory
Chapter 9 Fundamental Limits in Information Theory Information Theory is the fundamental theory behind information manipulation, including data compression and data transmission. 9.1 Introduction o For
More informationLecture 14 February 28
EE/Stats 376A: Information Theory Winter 07 Lecture 4 February 8 Lecturer: David Tse Scribe: Sagnik M, Vivek B 4 Outline Gaussian channel and capacity Information measures for continuous random variables
More informationLecture 2. Capacity of the Gaussian channel
Spring, 207 5237S, Wireless Communications II 2. Lecture 2 Capacity of the Gaussian channel Review on basic concepts in inf. theory ( Cover&Thomas: Elements of Inf. Theory, Tse&Viswanath: Appendix B) AWGN
More informationLecture 6 I. CHANNEL CODING. X n (m) P Y X
6- Introduction to Information Theory Lecture 6 Lecturer: Haim Permuter Scribe: Yoav Eisenberg and Yakov Miron I. CHANNEL CODING We consider the following channel coding problem: m = {,2,..,2 nr} Encoder
More informationAppendix B Information theory from first principles
Appendix B Information theory from first principles This appendix discusses the information theory behind the capacity expressions used in the book. Section 8.3.4 is the only part of the book that supposes
More informationLecture 5: Channel Capacity. Copyright G. Caire (Sample Lectures) 122
Lecture 5: Channel Capacity Copyright G. Caire (Sample Lectures) 122 M Definitions and Problem Setup 2 X n Y n Encoder p(y x) Decoder ˆM Message Channel Estimate Definition 11. Discrete Memoryless Channel
More informationLecture 7 Introduction to Statistical Decision Theory
Lecture 7 Introduction to Statistical Decision Theory I-Hsiang Wang Department of Electrical Engineering National Taiwan University ihwang@ntu.edu.tw December 20, 2016 1 / 55 I-Hsiang Wang IT Lecture 7
More informationLecture 8: Channel Capacity, Continuous Random Variables
EE376A/STATS376A Information Theory Lecture 8-02/0/208 Lecture 8: Channel Capacity, Continuous Random Variables Lecturer: Tsachy Weissman Scribe: Augustine Chemparathy, Adithya Ganesh, Philip Hwang Channel
More informationExercises with solutions (Set D)
Exercises with solutions Set D. A fair die is rolled at the same time as a fair coin is tossed. Let A be the number on the upper surface of the die and let B describe the outcome of the coin toss, where
More informationChapter 4. Data Transmission and Channel Capacity. Po-Ning Chen, Professor. Department of Communications Engineering. National Chiao Tung University
Chapter 4 Data Transmission and Channel Capacity Po-Ning Chen, Professor Department of Communications Engineering National Chiao Tung University Hsin Chu, Taiwan 30050, R.O.C. Principle of Data Transmission
More informationLecture 17: Differential Entropy
Lecture 17: Differential Entropy Differential entropy AEP for differential entropy Quantization Maximum differential entropy Estimation counterpart of Fano s inequality Dr. Yao Xie, ECE587, Information
More informationLecture 2: August 31
0-704: Information Processing and Learning Fall 206 Lecturer: Aarti Singh Lecture 2: August 3 Note: These notes are based on scribed notes from Spring5 offering of this course. LaTeX template courtesy
More informationInformation Theory. Lecture 10. Network Information Theory (CT15); a focus on channel capacity results
Information Theory Lecture 10 Network Information Theory (CT15); a focus on channel capacity results The (two-user) multiple access channel (15.3) The (two-user) broadcast channel (15.6) The relay channel
More informationThe Method of Types and Its Application to Information Hiding
The Method of Types and Its Application to Information Hiding Pierre Moulin University of Illinois at Urbana-Champaign www.ifp.uiuc.edu/ moulin/talks/eusipco05-slides.pdf EUSIPCO Antalya, September 7,
More informationLecture 11: Continuous-valued signals and differential entropy
Lecture 11: Continuous-valued signals and differential entropy Biology 429 Carl Bergstrom September 20, 2008 Sources: Parts of today s lecture follow Chapter 8 from Cover and Thomas (2007). Some components
More informationLecture 22: Final Review
Lecture 22: Final Review Nuts and bolts Fundamental questions and limits Tools Practical algorithms Future topics Dr Yao Xie, ECE587, Information Theory, Duke University Basics Dr Yao Xie, ECE587, Information
More informationCapacity of AWGN channels
Chapter 3 Capacity of AWGN channels In this chapter we prove that the capacity of an AWGN channel with bandwidth W and signal-tonoise ratio SNR is W log 2 (1+SNR) bits per second (b/s). The proof that
More informationInformation Theory. Coding and Information Theory. Information Theory Textbooks. Entropy
Coding and Information Theory Chris Williams, School of Informatics, University of Edinburgh Overview What is information theory? Entropy Coding Information Theory Shannon (1948): Information theory is
More informationChapter 8: Differential entropy. University of Illinois at Chicago ECE 534, Natasha Devroye
Chapter 8: Differential entropy Chapter 8 outline Motivation Definitions Relation to discrete entropy Joint and conditional differential entropy Relative entropy and mutual information Properties AEP for
More informationLECTURE 13. Last time: Lecture outline
LECTURE 13 Last time: Strong coding theorem Revisiting channel and codes Bound on probability of error Error exponent Lecture outline Fano s Lemma revisited Fano s inequality for codewords Converse to
More informationEE/Stats 376A: Homework 7 Solutions Due on Friday March 17, 5 pm
EE/Stats 376A: Homework 7 Solutions Due on Friday March 17, 5 pm 1. Feedback does not increase the capacity. Consider a channel with feedback. We assume that all the recieved outputs are sent back immediately
More informationNational University of Singapore Department of Electrical & Computer Engineering. Examination for
National University of Singapore Department of Electrical & Computer Engineering Examination for EE5139R Information Theory for Communication Systems (Semester I, 2014/15) November/December 2014 Time Allowed:
More informationLecture 15: Conditional and Joint Typicaility
EE376A Information Theory Lecture 1-02/26/2015 Lecture 15: Conditional and Joint Typicaility Lecturer: Kartik Venkat Scribe: Max Zimet, Brian Wai, Sepehr Nezami 1 Notation We always write a sequence of
More informationPROBABILITY AND INFORMATION THEORY. Dr. Gjergji Kasneci Introduction to Information Retrieval WS
PROBABILITY AND INFORMATION THEORY Dr. Gjergji Kasneci Introduction to Information Retrieval WS 2012-13 1 Outline Intro Basics of probability and information theory Probability space Rules of probability
More informationLecture Notes 1 Probability and Random Variables. Conditional Probability and Independence. Functions of a Random Variable
Lecture Notes 1 Probability and Random Variables Probability Spaces Conditional Probability and Independence Random Variables Functions of a Random Variable Generation of a Random Variable Jointly Distributed
More informationSolutions to Homework Set #4 Differential Entropy and Gaussian Channel
Solutions to Homework Set #4 Differential Entropy and Gaussian Channel 1. Differential entropy. Evaluate the differential entropy h(x = f lnf for the following: (a Find the entropy of the exponential density
More informationSHARED INFORMATION. Prakash Narayan with. Imre Csiszár, Sirin Nitinawarat, Himanshu Tyagi, Shun Watanabe
SHARED INFORMATION Prakash Narayan with Imre Csiszár, Sirin Nitinawarat, Himanshu Tyagi, Shun Watanabe 2/40 Acknowledgement Praneeth Boda Himanshu Tyagi Shun Watanabe 3/40 Outline Two-terminal model: Mutual
More informationBlock 2: Introduction to Information Theory
Block 2: Introduction to Information Theory Francisco J. Escribano April 26, 2015 Francisco J. Escribano Block 2: Introduction to Information Theory April 26, 2015 1 / 51 Table of contents 1 Motivation
More informationELEC546 Review of Information Theory
ELEC546 Review of Information Theory Vincent Lau 1/1/004 1 Review of Information Theory Entropy: Measure of uncertainty of a random variable X. The entropy of X, H(X), is given by: If X is a discrete random
More information4F5: Advanced Communications and Coding Handout 2: The Typical Set, Compression, Mutual Information
4F5: Advanced Communications and Coding Handout 2: The Typical Set, Compression, Mutual Information Ramji Venkataramanan Signal Processing and Communications Lab Department of Engineering ramji.v@eng.cam.ac.uk
More informationECE598: Information-theoretic methods in high-dimensional statistics Spring 2016
ECE598: Information-theoretic methods in high-dimensional statistics Spring 06 Lecture : Mutual Information Method Lecturer: Yihong Wu Scribe: Jaeho Lee, Mar, 06 Ed. Mar 9 Quick review: Assouad s lemma
More informationLecture Notes 1 Probability and Random Variables. Conditional Probability and Independence. Functions of a Random Variable
Lecture Notes 1 Probability and Random Variables Probability Spaces Conditional Probability and Independence Random Variables Functions of a Random Variable Generation of a Random Variable Jointly Distributed
More information5 Mutual Information and Channel Capacity
5 Mutual Information and Channel Capacity In Section 2, we have seen the use of a quantity called entropy to measure the amount of randomness in a random variable. In this section, we introduce several
More informationECE Information theory Final
ECE 776 - Information theory Final Q1 (1 point) We would like to compress a Gaussian source with zero mean and variance 1 We consider two strategies In the first, we quantize with a step size so that the
More information( 1 k "information" I(X;Y) given by Y about X)
SUMMARY OF SHANNON DISTORTION-RATE THEORY Consider a stationary source X with f (x) as its th-order pdf. Recall the following OPTA function definitions: δ(,r) = least dist'n of -dim'l fixed-rate VQ's w.
More informationUCSD ECE 255C Handout #14 Prof. Young-Han Kim Thursday, March 9, Solutions to Homework Set #5
UCSD ECE 255C Handout #14 Prof. Young-Han Kim Thursday, March 9, 2017 Solutions to Homework Set #5 3.18 Bounds on the quadratic rate distortion function. Recall that R(D) = inf F(ˆx x):e(x ˆX)2 DI(X; ˆX).
More informationEntropies & Information Theory
Entropies & Information Theory LECTURE I Nilanjana Datta University of Cambridge,U.K. See lecture notes on: http://www.qi.damtp.cam.ac.uk/node/223 Quantum Information Theory Born out of Classical Information
More informationLecture 20: Quantization and Rate-Distortion
Lecture 20: Quantization and Rate-Distortion Quantization Introduction to rate-distortion theorem Dr. Yao Xie, ECE587, Information Theory, Duke University Approimating continuous signals... Dr. Yao Xie,
More informationInformation Theory. M1 Informatique (parcours recherche et innovation) Aline Roumy. January INRIA Rennes 1/ 73
1/ 73 Information Theory M1 Informatique (parcours recherche et innovation) Aline Roumy INRIA Rennes January 2018 Outline 2/ 73 1 Non mathematical introduction 2 Mathematical introduction: definitions
More informationLecture 10: Broadcast Channel and Superposition Coding
Lecture 10: Broadcast Channel and Superposition Coding Scribed by: Zhe Yao 1 Broadcast channel M 0M 1M P{y 1 y x} M M 01 1 M M 0 The capacity of the broadcast channel depends only on the marginal conditional
More informationComputing and Communications 2. Information Theory -Entropy
1896 1920 1987 2006 Computing and Communications 2. Information Theory -Entropy Ying Cui Department of Electronic Engineering Shanghai Jiao Tong University, China 2017, Autumn 1 Outline Entropy Joint entropy
More informationEE5139R: Problem Set 7 Assigned: 30/09/15, Due: 07/10/15
EE5139R: Problem Set 7 Assigned: 30/09/15, Due: 07/10/15 1. Cascade of Binary Symmetric Channels The conditional probability distribution py x for each of the BSCs may be expressed by the transition probability
More informationEE/Stat 376B Handout #5 Network Information Theory October, 14, Homework Set #2 Solutions
EE/Stat 376B Handout #5 Network Information Theory October, 14, 014 1. Problem.4 parts (b) and (c). Homework Set # Solutions (b) Consider h(x + Y ) h(x + Y Y ) = h(x Y ) = h(x). (c) Let ay = Y 1 + Y, where
More informationSHARED INFORMATION. Prakash Narayan with. Imre Csiszár, Sirin Nitinawarat, Himanshu Tyagi, Shun Watanabe
SHARED INFORMATION Prakash Narayan with Imre Csiszár, Sirin Nitinawarat, Himanshu Tyagi, Shun Watanabe 2/41 Outline Two-terminal model: Mutual information Operational meaning in: Channel coding: channel
More informationLecture 3: Channel Capacity
Lecture 3: Channel Capacity 1 Definitions Channel capacity is a measure of maximum information per channel usage one can get through a channel. This one of the fundamental concepts in information theory.
More informationCapacity of a channel Shannon s second theorem. Information Theory 1/33
Capacity of a channel Shannon s second theorem Information Theory 1/33 Outline 1. Memoryless channels, examples ; 2. Capacity ; 3. Symmetric channels ; 4. Channel Coding ; 5. Shannon s second theorem,
More informationEE376A - Information Theory Final, Monday March 14th 2016 Solutions. Please start answering each question on a new page of the answer booklet.
EE376A - Information Theory Final, Monday March 14th 216 Solutions Instructions: You have three hours, 3.3PM - 6.3PM The exam has 4 questions, totaling 12 points. Please start answering each question on
More informationEE 376A: Information Theory Lecture Notes. Prof. Tsachy Weissman TA: Idoia Ochoa, Kedar Tatwawadi
EE 376A: Information Theory Lecture Notes Prof. Tsachy Weissman TA: Idoia Ochoa, Kedar Tatwawadi January 6, 206 Contents Introduction. Lossless Compression.....................................2 Channel
More informationOn the Duality between Multiple-Access Codes and Computation Codes
On the Duality between Multiple-Access Codes and Computation Codes Jingge Zhu University of California, Berkeley jingge.zhu@berkeley.edu Sung Hoon Lim KIOST shlim@kiost.ac.kr Michael Gastpar EPFL michael.gastpar@epfl.ch
More informationShannon s noisy-channel theorem
Shannon s noisy-channel theorem Information theory Amon Elders Korteweg de Vries Institute for Mathematics University of Amsterdam. Tuesday, 26th of Januari Amon Elders (Korteweg de Vries Institute for
More informationELEMENT OF INFORMATION THEORY
History Table of Content ELEMENT OF INFORMATION THEORY O. Le Meur olemeur@irisa.fr Univ. of Rennes 1 http://www.irisa.fr/temics/staff/lemeur/ October 2010 1 History Table of Content VERSION: 2009-2010:
More informationNotes 3: Stochastic channels and noisy coding theorem bound. 1 Model of information communication and noisy channel
Introduction to Coding Theory CMU: Spring 2010 Notes 3: Stochastic channels and noisy coding theorem bound January 2010 Lecturer: Venkatesan Guruswami Scribe: Venkatesan Guruswami We now turn to the basic
More informationx log x, which is strictly convex, and use Jensen s Inequality:
2. Information measures: mutual information 2.1 Divergence: main inequality Theorem 2.1 (Information Inequality). D(P Q) 0 ; D(P Q) = 0 iff P = Q Proof. Let ϕ(x) x log x, which is strictly convex, and
More informationLecture 18: Gaussian Channel
Lecture 18: Gaussian Channel Gaussian channel Gaussian channel capacity Dr. Yao Xie, ECE587, Information Theory, Duke University Mona Lisa in AWGN Mona Lisa Noisy Mona Lisa 100 100 200 200 300 300 400
More informationRevision of Lecture 5
Revision of Lecture 5 Information transferring across channels Channel characteristics and binary symmetric channel Average mutual information Average mutual information tells us what happens to information
More information(Classical) Information Theory III: Noisy channel coding
(Classical) Information Theory III: Noisy channel coding Sibasish Ghosh The Institute of Mathematical Sciences CIT Campus, Taramani, Chennai 600 113, India. p. 1 Abstract What is the best possible way
More informationThe Poisson Channel with Side Information
The Poisson Channel with Side Information Shraga Bross School of Enginerring Bar-Ilan University, Israel brosss@macs.biu.ac.il Amos Lapidoth Ligong Wang Signal and Information Processing Laboratory ETH
More informationLecture 9 Polar Coding
Lecture 9 Polar Coding I-Hsiang ang Department of Electrical Engineering National Taiwan University ihwang@ntu.edu.tw December 29, 2015 1 / 25 I-Hsiang ang IT Lecture 9 In Pursuit of Shannon s Limit Since
More informationChapter I: Fundamental Information Theory
ECE-S622/T62 Notes Chapter I: Fundamental Information Theory Ruifeng Zhang Dept. of Electrical & Computer Eng. Drexel University. Information Source Information is the outcome of some physical processes.
More informationLecture 1: The Multiple Access Channel. Copyright G. Caire 12
Lecture 1: The Multiple Access Channel Copyright G. Caire 12 Outline Two-user MAC. The Gaussian case. The K-user case. Polymatroid structure and resource allocation problems. Copyright G. Caire 13 Two-user
More informationInformation Theory Primer:
Information Theory Primer: Entropy, KL Divergence, Mutual Information, Jensen s inequality Seungjin Choi Department of Computer Science and Engineering Pohang University of Science and Technology 77 Cheongam-ro,
More informationShannon s A Mathematical Theory of Communication
Shannon s A Mathematical Theory of Communication Emre Telatar EPFL Kanpur October 19, 2016 First published in two parts in the July and October 1948 issues of BSTJ. First published in two parts in the
More informationEE5585 Data Compression May 2, Lecture 27
EE5585 Data Compression May 2, 2013 Lecture 27 Instructor: Arya Mazumdar Scribe: Fangying Zhang Distributed Data Compression/Source Coding In the previous class we used a H-W table as a simple example,
More informationA Novel Asynchronous Communication Paradigm: Detection, Isolation, and Coding
A Novel Asynchronous Communication Paradigm: Detection, Isolation, and Coding The MIT Faculty has made this article openly available. Please share how this access benefits you. Your story matters. Citation
More informationInformation Dimension
Information Dimension Mina Karzand Massachusetts Institute of Technology November 16, 2011 1 / 26 2 / 26 Let X would be a real-valued random variable. For m N, the m point uniform quantized version of
More informationDistributed Lossless Compression. Distributed lossless compression system
Lecture #3 Distributed Lossless Compression (Reading: NIT 10.1 10.5, 4.4) Distributed lossless source coding Lossless source coding via random binning Time Sharing Achievability proof of the Slepian Wolf
More informationRecitation 2: Probability
Recitation 2: Probability Colin White, Kenny Marino January 23, 2018 Outline Facts about sets Definitions and facts about probability Random Variables and Joint Distributions Characteristics of distributions
More informationPROOF OF ZADOR-GERSHO THEOREM
ZADOR-GERSHO THEOREM FOR VARIABLE-RATE VQ For a stationary source and large R, the least distortion of k-dim'l VQ with nth-order entropy coding and rate R or less is δ(k,n,r) m k * σ 2 η kn 2-2R = Z(k,n,R)
More informationReliable Computation over Multiple-Access Channels
Reliable Computation over Multiple-Access Channels Bobak Nazer and Michael Gastpar Dept. of Electrical Engineering and Computer Sciences University of California, Berkeley Berkeley, CA, 94720-1770 {bobak,
More informationLECTURE 3. Last time:
LECTURE 3 Last time: Mutual Information. Convexity and concavity Jensen s inequality Information Inequality Data processing theorem Fano s Inequality Lecture outline Stochastic processes, Entropy rate
More informationFeedback Capacity of a Class of Symmetric Finite-State Markov Channels
Feedback Capacity of a Class of Symmetric Finite-State Markov Channels Nevroz Şen, Fady Alajaji and Serdar Yüksel Department of Mathematics and Statistics Queen s University Kingston, ON K7L 3N6, Canada
More information10-704: Information Processing and Learning Spring Lecture 8: Feb 5
10-704: Information Processing and Learning Spring 2015 Lecture 8: Feb 5 Lecturer: Aarti Singh Scribe: Siheng Chen Disclaimer: These notes have not been subjected to the usual scrutiny reserved for formal
More informationEE376A: Homework #3 Due by 11:59pm Saturday, February 10th, 2018
Please submit the solutions on Gradescope. EE376A: Homework #3 Due by 11:59pm Saturday, February 10th, 2018 1. Optimal codeword lengths. Although the codeword lengths of an optimal variable length code
More informationTwo Applications of the Gaussian Poincaré Inequality in the Shannon Theory
Two Applications of the Gaussian Poincaré Inequality in the Shannon Theory Vincent Y. F. Tan (Joint work with Silas L. Fong) National University of Singapore (NUS) 2016 International Zurich Seminar on
More informationLecture 8: Channel and source-channel coding theorems; BEC & linear codes. 1 Intuitive justification for upper bound on channel capacity
5-859: Information Theory and Applications in TCS CMU: Spring 23 Lecture 8: Channel and source-channel coding theorems; BEC & linear codes February 7, 23 Lecturer: Venkatesan Guruswami Scribe: Dan Stahlke
More informationShannon s Noisy-Channel Coding Theorem
Shannon s Noisy-Channel Coding Theorem Lucas Slot Sebastian Zur February 2015 Abstract In information theory, Shannon s Noisy-Channel Coding Theorem states that it is possible to communicate over a noisy
More informationLecture 8: Information Theory and Statistics
Lecture 8: Information Theory and Statistics Part II: Hypothesis Testing and I-Hsiang Wang Department of Electrical Engineering National Taiwan University ihwang@ntu.edu.tw December 23, 2015 1 / 50 I-Hsiang
More informationCOMM901 Source Coding and Compression. Quiz 1
German University in Cairo - GUC Faculty of Information Engineering & Technology - IET Department of Communication Engineering Winter Semester 2013/2014 Students Name: Students ID: COMM901 Source Coding
More informationDEEP LEARNING CHAPTER 3 PROBABILITY & INFORMATION THEORY
DEEP LEARNING CHAPTER 3 PROBABILITY & INFORMATION THEORY OUTLINE 3.1 Why Probability? 3.2 Random Variables 3.3 Probability Distributions 3.4 Marginal Probability 3.5 Conditional Probability 3.6 The Chain
More informationLECTURE 15. Last time: Feedback channel: setting up the problem. Lecture outline. Joint source and channel coding theorem
LECTURE 15 Last time: Feedback channel: setting up the problem Perfect feedback Feedback capacity Data compression Lecture outline Joint source and channel coding theorem Converse Robustness Brain teaser
More informationat Some sort of quantization is necessary to represent continuous signals in digital form
Quantization at Some sort of quantization is necessary to represent continuous signals in digital form x(n 1,n ) x(t 1,tt ) D Sampler Quantizer x q (n 1,nn ) Digitizer (A/D) Quantization is also used for
More informationLecture 11: Quantum Information III - Source Coding
CSCI5370 Quantum Computing November 25, 203 Lecture : Quantum Information III - Source Coding Lecturer: Shengyu Zhang Scribe: Hing Yin Tsang. Holevo s bound Suppose Alice has an information source X that
More informationPerformance-based Security for Encoding of Information Signals. FA ( ) Paul Cuff (Princeton University)
Performance-based Security for Encoding of Information Signals FA9550-15-1-0180 (2015-2018) Paul Cuff (Princeton University) Contributors Two students finished PhD Tiance Wang (Goldman Sachs) Eva Song
More informationSecond-Order Asymptotics in Information Theory
Second-Order Asymptotics in Information Theory Vincent Y. F. Tan (vtan@nus.edu.sg) Dept. of ECE and Dept. of Mathematics National University of Singapore (NUS) National Taiwan University November 2015
More informationChapter 5,6 Multiple RandomVariables
Chapter 5,6 Multiple RandomVariables ENCS66 - Probabilityand Stochastic Processes Concordia University Vector RandomVariables A vector r.v. is a function where is the sample space of a random experiment.
More informationRelay Networks With Delays
Relay Networks With Delays Abbas El Gamal, Navid Hassanpour, and James Mammen Department of Electrical Engineering Stanford University, Stanford, CA 94305-9510 Email: {abbas, navid, jmammen}@stanford.edu
More information10-704: Information Processing and Learning Fall Lecture 9: Sept 28
10-704: Information Processing and Learning Fall 2016 Lecturer: Siheng Chen Lecture 9: Sept 28 Note: These notes are based on scribed notes from Spring15 offering of this course. LaTeX template courtesy
More informationLecture 8: Shannon s Noise Models
Error Correcting Codes: Combinatorics, Algorithms and Applications (Fall 2007) Lecture 8: Shannon s Noise Models September 14, 2007 Lecturer: Atri Rudra Scribe: Sandipan Kundu& Atri Rudra Till now we have
More informationBasics on Probability. Jingrui He 09/11/2007
Basics on Probability Jingrui He 09/11/2007 Coin Flips You flip a coin Head with probability 0.5 You flip 100 coins How many heads would you expect Coin Flips cont. You flip a coin Head with probability
More informationExercise 1. = P(y a 1)P(a 1 )
Chapter 7 Channel Capacity Exercise 1 A source produces independent, equally probable symbols from an alphabet {a 1, a 2 } at a rate of one symbol every 3 seconds. These symbols are transmitted over a
More information19. Channel coding: energy-per-bit, continuous-time channels
9. Channel coding: energy-per-bit, continuous-time channels 9. Energy per bit Consider the additive Gaussian noise channel: Y i = X i + Z i, Z i N ( 0, ). (9.) In the last lecture, we analyzed the maximum
More informationLECTURE 10. Last time: Lecture outline
LECTURE 10 Joint AEP Coding Theorem Last time: Error Exponents Lecture outline Strong Coding Theorem Reading: Gallager, Chapter 5. Review Joint AEP A ( ɛ n) (X) A ( ɛ n) (Y ) vs. A ( ɛ n) (X, Y ) 2 nh(x)
More information