Variable Length Codes for Degraded Broadcast Channels

Stéphane Musy
School of Computer and Communication Sciences, EPFL, CH-1015 Lausanne, Switzerland
Email: stephane.musy@epfl.ch

Abstract—This paper investigates the achievable rates using variable length codes when transmitting independent information over a degraded broadcast channel. We define the transmission rates from the perspective of the receivers and allow the decoders to make their decisions at different instants of time. We give an outer bound to the region of achievable rates, as well as examples of codes that achieve this bound in some settings.

The work presented in this paper was partially supported by the National Competence Center in Research on Mobile Information and Communication Systems (NCCR-MICS), a center supported by the Swiss National Science Foundation under grant number 5005-67322.

I. INTRODUCTION

When a user wishes to send information to a single receiver, it is known that the use of a variable length code does not allow a better transmission rate than the one achieved with a fixed length code [1]. Variable length coding is rather used to improve the error exponent of the communication scheme. In the multiple-user setting, when a user wishes to transmit information to multiple receivers, a simple argument shows that, if we require the receivers to decode at the same instant of time, the set of achievable rates is again the same for variable and fixed length codes.¹

In this paper, we focus on discrete memoryless broadcast channels with two receivers and consider the transmission of independent messages to each of them. As in [2], we let the codewords be infinite sequences, define the rates from the perspective of the receivers, and capture the trade-off between the amount of information received and the timeliness of that information.² Denote by $M$ the number of possible messages for one receiver and by $E[N]$ its average decoding time; we then define the transmission rate to that receiver as $\log M / E[N]$. The expectation in the denominator is motivated by the following argument. Suppose that, to send information to a receiver, we use the transmission method a large number $n$ of times. The rate experienced by this receiver equals $n \log M / (l_1 + l_2 + \cdots + l_n)$, where $l_i$ is the duration of the $i$-th transmission. Hence, by the law of large numbers, the average transmission rate approaches $\log M / E[N]$ with probability one as $n$ gets large. The same definition is made in [3]. Notice that, contrary to the setting of [2], there is no schedule that controls the sequence of channel outputs seen by the receivers.

It was observed by T. Cover in [4] that if we allow the receivers to decode at different instants of time, the channel capacity of each link can be (simultaneously) achieved. This is done by sending the messages to each receiver in separate periods of time, and letting the ratio between the sizes of the messages grow appropriately as they go to infinity. In our setting, this requires the ratio between the average decoding times of the two receivers to be infinite.

¹ To the contrary, assume that such a code exists and let $\bar{l}$ denote its expected length. By the law of large numbers, the total length of $n$ successive transmissions is then very likely to be less than $n(\bar{l} + \epsilon)$ for any $\epsilon > 0$. Thus, a fixed length code of this length will achieve almost the same rate with small probability of error.
² We start counting time from the beginning of transmission for both receivers.
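To make the rate definition concrete, here is a minimal simulation (not from the paper; the geometric length distribution and all parameters are arbitrary choices for illustration) checking that the empirical rate $n \log M / (l_1 + \cdots + l_n)$ concentrates around $\log M / E[N]$:

```python
import math
import random

random.seed(0)

def geometric(p):
    # Sample from {1, 2, ...} with mean 1/p -- an arbitrary choice of
    # decoding-time distribution, used only to illustrate the definition.
    return 1 + int(math.log(1.0 - random.random()) / math.log(1.0 - p))

M = 1024            # messages per transmission, so each carries log2(M) = 10 bits
mean_len = 50.0     # E[N], the average decoding time
n = 100_000         # number of successive transmissions

total_time = sum(geometric(1.0 / mean_len) for _ in range(n))
print("empirical rate :", n * math.log2(M) / total_time)  # close to 0.2 bit/use
print("log(M) / E[N]  :", math.log2(M) / mean_len)        # exactly 0.2 bit/use
```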
The objective of this paper is to characterize the region of achievable rates for bounded values of this ratio. In the next section, we give the definition of a variable length code, along with the associated notions of reliability and capacity region. In Section III we derive an outer bound to the region of achievable rates. Then, in Section IV, we analyze this bound and present examples of coding schemes that can achieve it.

II. DEFINITIONS

We consider sending independent information over a memoryless degraded broadcast channel. There are two independent sources, one producing a message $W_1 \in \{1, 2, \ldots, M_1\}$ and the other producing a message $W_2 \in \{1, 2, \ldots, M_2\}$. The channel consists of an input alphabet $\mathcal{X}$, two output alphabets $\mathcal{Y}$ and $\mathcal{Z}$, and a probability transition function $p(y, z \mid x)$. By the memorylessness of the channel we have, for any $n$,
$p(y^n, z^n \mid x^n) = \prod_{i=1}^{n} p(y_i, z_i \mid x_i)$,
where $x^n \in \mathcal{X}^n$, $y^n \in \mathcal{Y}^n$ and $z^n \in \mathcal{Z}^n$.

Let $N_1$ be a stopping time w.r.t. (with respect to) $\{Y_i\}_{i \ge 1}$, the sequence of received values at the strong receiver, and let $N_2$ be a stopping time w.r.t. $\{Z_i\}_{i \ge 1}$, the sequence of received values at the weak receiver. We define an $(M_1, M_2, N_1, N_2)$ variable length code of rates $\log M_1 / E[N_1]$ and $\log M_2 / E[N_2]$ as a sequence of mappings $\{x_i(W_1, W_2)\}_{i \ge 1}$, where each $x_i$ is a function of $W_1$ and $W_2$, together with two decoding functions (two decoders) w.r.t. the decoding times $N_1$ and $N_2$,³
$g_1 : \mathcal{Y}^{N_1} \to \{1, 2, \ldots, M_1\}$
and
$g_2 : \mathcal{Z}^{N_2} \to \{1, 2, \ldots, M_2\}$.

³ For deterministic stopping rules, we can represent the set of all output sequences for which a decision is made, at each decoder, as the leaves of a complete $|\mathcal{Y}|$-ary (resp. $|\mathcal{Z}|$-ary) tree. The leaves carry labels from the set of messages. Each decoder starts climbing the tree from the root; at each time it follows the branch that corresponds to the received symbol, and when a leaf is reached, it decides on the message indicated by the label of the leaf.
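Footnote 3's tree picture of a deterministic stopping rule can be made concrete with a small sketch (purely illustrative; the toy tree, its labels, and the binary output alphabet are made up):

```python
# Illustrative decoder for a deterministic stopping rule (cf. footnote 3).
# Internal nodes are dicts keyed by the received symbol; leaves carry the
# message label. This toy tree over a binary output alphabet is made up.
tree = {0: {0: "m1", 1: "m2"},   # on 0, wait for one more symbol
        1: "m3"}                 # on 1, decide immediately

def decode(symbol_stream):
    # Walk down the tree, one branch per received symbol; the depth at
    # which a leaf is reached is the (variable) decoding time N.
    node = tree
    for time, symbol in enumerate(symbol_stream, start=1):
        node = node[symbol]
        if not isinstance(node, dict):        # reached a labeled leaf
            return node, time                 # (decoded message, decoding time)
    raise ValueError("stream ended before a decision was reached")

print(decode([0, 1]))   # -> ('m2', 2)
print(decode([1]))      # -> ('m3', 1)
```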

We define the average probability of error as the probability that the decoded messages are not equal to the transmitted messages, i.e.,
$P_e \triangleq \Pr\{ g_1(Y^{N_1}) \ne W_1 \ \text{or} \ g_2(Z^{N_2}) \ne W_2 \}$,
where $(W_1, W_2)$ are assumed to be uniformly distributed over $\{1, \ldots, M_1\} \times \{1, \ldots, M_2\}$. A rate pair $(R_1, R_2)$ is said to be achievable if for all $\epsilon > 0$ there exists an $(M_1, M_2, N_1, N_2)$ variable length code with $\log M_1 / E[N_1] \ge R_1 - \epsilon$, $\log M_2 / E[N_2] \ge R_2 - \epsilon$ and $P_e < \epsilon$. The capacity region of the broadcast channel is the closure of the set of achievable rate pairs. Notice that, with this definition, the capacity region is simply given by the rectangle $[0, C_1] \times [0, C_2]$, where $C_1 \triangleq \max_{p(x)} I(X; Y)$ is the (usual) channel capacity of the strong link, and $C_2 \triangleq \max_{p(x)} I(X; Z)$ is the channel capacity of the weak link. As mentioned in the introduction, the argument presented in [4], which demonstrates the achievability of $[0, C_1] \times [0, C_2]$, requires making $E[N_1]/E[N_2]$ approach 0 (or infinity). Therefore, in the next section, we prove a theorem that gives an outer bound on the capacity region under a restriction involving $E[N_1]$ and $E[N_2]$.

III. OUTER REGION

In order to obtain an outer bound on the capacity region we state two lemmas. First, let us define $N \triangleq \min(N_1, N_2)$. Observe that $N$ is a stopping time w.r.t. $\{(Y_i, Z_i)\}_{i \ge 1}$, but it is not (in general) a stopping time w.r.t. the individual channel outputs.

Lemma 3.1: For some joint distribution $p(u)p(x \mid u)p(y, z \mid x)$, the following inequalities hold:
$I(W_2; Z^N) \le E[N] \, I(U; Z) + \log(e E[N])$
$I(W_1; Y^N \mid W_2) \le E[N] \, I(X; Y \mid U) + \log(e E[N])$.

Proof: Let $\Delta_i \triangleq 1\{N \ge i\}$.⁴ From the chain rule for mutual information, we have
$I(W_2; Z^N) = I(W_2; \tilde{Z}_1, \Delta_1, \ldots, \tilde{Z}_n, \Delta_n, \ldots) = \sum_i \left[ I(W_2; \Delta_i \mid \tilde{Z}^{i-1}, \Delta^{i-1}) + I(W_2; \tilde{Z}_i \mid \tilde{Z}^{i-1}, \Delta^i) \right]$.
The first summation can be upper bounded as
$\sum_i I(W_2; \Delta_i \mid \tilde{Z}^{i-1}, \Delta^{i-1}) \le \sum_i H(\Delta_i \mid \Delta^{i-1}) = H(\Delta_1, \Delta_2, \ldots) = H(N) \le \log(e E[N])$,
where the last inequality is proved in [1] and [5, §3]. For the $i$-th term of the second summation, we get
$I(W_2; \tilde{Z}_i \mid \tilde{Z}^{i-1}, \Delta^i) = H(\tilde{Z}_i \mid \tilde{Z}^{i-1}, \Delta^i) - H(\tilde{Z}_i \mid \tilde{Z}^{i-1}, \Delta^i, W_2)$
$\le H(\tilde{Z}_i \mid \Delta_i) - H(\tilde{Z}_i \mid \tilde{Y}^{i-1}, \tilde{Z}^{i-1}, \Delta^i, W_2)$
$= \Pr(\Delta_i = 1) \left[ H(Z_i \mid \Delta_i = 1) - H(Z_i \mid Y^{i-1}, Z^{i-1}, \Delta_i = 1, W_2) \right]$
$= \Pr(N \ge i) \, I(U_i; Z_i \mid \Delta_i = 1)$,
where $U_i \triangleq (Y^{i-1}, Z^{i-1}, W_2)$. Observe that $p(z_i \mid x_i, \Delta_i = 1) = p(z_i \mid x_i)$; thus, with a slight abuse of notation, we can write $I(U_i; Z_i \mid \Delta_i = 1) = I(U_i; Z_i)$, with $p(u_i) \triangleq p(u_i \mid \Delta_i = 1)$ and $p(x_i \mid u_i) \triangleq p(x_i \mid u_i, \Delta_i = 1)$. Hence, we obtain
$\sum_i \Pr(N \ge i) \, I(U_i; Z_i) = E[N] \sum_i \frac{\Pr(N \ge i)}{E[N]} \, I(U_i; Z_i)$.
Now let $a_i \triangleq \Pr(N \ge i) / E[N]$; note that $a_i \ge 0$ for all $i$ and that $\sum_i a_i = 1$, since $\sum_i \Pr(N \ge i) = E[N]$. Thus, we can define an integer random variable $Q$ by setting $\Pr(Q = i) = a_i$ for all $i \in \{1, 2, \ldots\}$. Using this, the preceding expression becomes
$E[N] \sum_i \Pr(Q = i) \, I(U_Q; Z_Q \mid Q = i) = E[N] \, I(U_Q; Z_Q \mid Q) \le E[N] \, I(U; Z)$,
where $U \triangleq (U_Q, Q)$ and $Z \triangleq Z_Q$ are new random variables; the distributions of $U_Q$ and $Z_Q$ depend on $Q$ in the same way as the distributions of $U_i$ and $Z_i$ depend on $i$.

⁴ Here $1\{N \ge i\}$ equals 1 if $N \ge i$ and 0 otherwise. Also, we define $\tilde{Z}_i \triangleq Z_i^{\Delta_i}$ as being equal to $Z_i$ if $N \ge i$ and equal to 0 otherwise; $\tilde{Y}_i$ is defined analogously.

Then, consider
$I(W_1; Y^N \mid W_2) = I(W_1; \tilde{Y}_1, \Delta_1, \ldots, \tilde{Y}_n, \Delta_n, \ldots \mid W_2) = \sum_i \left[ I(W_1; \Delta_i \mid \tilde{Y}^{i-1}, \Delta^{i-1}, W_2) + I(W_1; \tilde{Y}_i \mid \tilde{Y}^{i-1}, \Delta^i, W_2) \right]$.
As previously, we may upper bound the first summation as
$\sum_i I(W_1; \Delta_i \mid \tilde{Y}^{i-1}, \Delta^{i-1}, W_2) \le \sum_i H(\Delta_i \mid \Delta^{i-1}) = H(\Delta_1, \Delta_2, \ldots) = H(N) \le \log(e E[N])$.
And the $i$-th term in the second summation is
$H(\tilde{Y}_i \mid \tilde{Y}^{i-1}, \Delta^i, W_2) - H(\tilde{Y}_i \mid \tilde{Y}^{i-1}, \Delta^i, W_1, W_2)$
$\overset{(a)}{\le} H(\tilde{Y}_i \mid \tilde{Y}^{i-1}, \tilde{Z}^{i-1}, \Delta^i, W_2) - H(\tilde{Y}_i \mid \tilde{Y}^{i-1}, \tilde{Z}^{i-1}, \Delta^i, W_1, W_2)$
$\overset{(b)}{=} H(\tilde{Y}_i \mid \tilde{Y}^{i-1}, \tilde{Z}^{i-1}, \Delta^i, W_2) - H(\tilde{Y}_i \mid \tilde{Y}^{i-1}, \tilde{Z}^{i-1}, \tilde{X}_i, \Delta^i, W_2)$
$= \Pr(\Delta_i = 1) \left[ H(Y_i \mid Y^{i-1}, Z^{i-1}, \Delta_i = 1, W_2) - H(Y_i \mid Y^{i-1}, Z^{i-1}, X_i, \Delta_i = 1, W_2) \right]$
$= \Pr(N \ge i) \, I(X_i; Y_i \mid U_i, \Delta_i = 1)$,
where in (a) we use the fact that the channel is degraded, and (b) follows since $X_i$ is a function of $(W_1, W_2, Y^{i-1}, Z^{i-1})$ and, given $X_i$, $Y_i$ is independent of $W_1$. Here, following the steps of the preceding derivation, we let $p(u_i) \triangleq p(u_i \mid \Delta_i = 1)$ and $p(x_i \mid u_i) \triangleq p(x_i \mid u_i, \Delta_i = 1)$; then, since $p(y_i \mid x_i, u_i, \Delta_i = 1) = p(y_i \mid x_i)$, we have $I(X_i; Y_i \mid U_i, \Delta_i = 1) = I(X_i; Y_i \mid U_i)$. Hence, we get
$\sum_i \Pr(N \ge i) \, I(X_i; Y_i \mid U_i) = E[N] \sum_i \frac{\Pr(N \ge i)}{E[N]} \, I(X_i; Y_i \mid U_i)$.
Now, introducing the random variable $Q$ defined earlier, we may write
$E[N] \sum_i \Pr(Q = i) \, I(X_Q; Y_Q \mid U_Q, Q = i) = E[N] \, I(X_Q; Y_Q \mid U_Q, Q) = E[N] \, I(X; Y \mid U)$,
where $U \triangleq (U_Q, Q)$, and $X \triangleq X_Q$ and $Y \triangleq Y_Q$ are new random variables whose distributions depend on $Q$ in the same way as the distributions of $X_i$ and $Y_i$ depend on $i$. Notice that $U \to X \to (Y, Z)$ forms a Markov chain. Therefore, we obtain
$I(W_2; Z^N) \le E[N] \, I(U; Z) + \log(e E[N])$
$I(W_1; Y^N \mid W_2) \le E[N] \, I(X; Y \mid U) + \log(e E[N])$,
which completes the proof.

We state the next lemma without proof, the main ideas being those presented in the previous lemma.

Lemma 3.2: We have the following inequalities:
$I(W_1; Y_{N+1}^{N_1} \mid Y^N, W_2) \le E[N_1 - N] \, C_1 + \log(e E[N_1 - N])$
$I(W_2; Z_{N+1}^{N_2} \mid Z^N) \le E[N_2 - N] \, C_2 + \log(e E[N_2 - N])$.

The following theorem gives an outer bound on the region of achievable rates.

Theorem 3.3 (Outer bound): Denote by $C_{r_1, r_2}$ the set of rate pairs achievable by variable length codes for which $E[N]/E[N_1] \ge r_1$ and $E[N]/E[N_2] \ge r_2$. Then, any rate pair $(R_1, R_2) \in C_{r_1, r_2}$ must satisfy
$R_1 \le r_1 \, I(X; Y \mid U) + (1 - r_1) \, C_1$
$R_2 \le r_2 \, I(U; Z) + (1 - r_2) \, C_2$,
for some joint distribution $p(u)p(x \mid u)p(y, z \mid x)$, with the cardinality of the auxiliary random variable bounded by $|\mathcal{U}| \le \min(|\mathcal{X}|, |\mathcal{Y}|, |\mathcal{Z}|)$.

Proof: Let $W_i$ be uniformly distributed over $\{1, 2, \ldots, M_i\}$, $i = 1, 2$. Then
$I(W_2; Z^{N_2}) = H(W_2) - H(W_2 \mid Z^{N_2}) \ge E[N_2] R_2 - H(W_2 \mid Z^{N_2})$
and
$I(W_1; Y^{N_1} \mid W_2) = H(W_1 \mid W_2) - H(W_1 \mid Y^{N_1}, W_2) \ge E[N_1] R_1 - H(W_1 \mid Y^{N_1})$.
Thus, using Fano's inequality, we have
$E[N_1](R_1 - \epsilon) \le I(W_1; Y^{N_1} \mid W_2)$
$E[N_2](R_2 - \epsilon) \le I(W_2; Z^{N_2})$,
where $\epsilon \to 0$ as $P_e \to 0$. From the chain rule for mutual information, we can write
$I(W_2; Z^{N_2}) = I(W_2; Z^N) + I(W_2; Z_{N+1}^{N_2} \mid Z^N)$
and
$I(W_1; Y^{N_1} \mid W_2) = I(W_1; Y^N \mid W_2) + I(W_1; Y_{N+1}^{N_1} \mid Y^N, W_2)$.
Then, applying Lemma 3.1 and Lemma 3.2, we get
$E[N_1](R_1 - \epsilon) \le E[N] \, I(X; Y \mid U) + E[N_1 - N] \, C_1 + \log(e E[N]) + \log(e E[N_1 - N])$
$E[N_2](R_2 - \epsilon) \le E[N] \, I(U; Z) + E[N_2 - N] \, C_2 + \log(e E[N]) + \log(e E[N_2 - N])$.
Dividing by $E[N_1]$ and letting $E[N_1] \to \infty$ in the first inequality, and dividing by $E[N_2]$ and letting $E[N_2] \to \infty$ in the second one, we obtain⁵
$R_1 \le r_1 \, I(X; Y \mid U) + (1 - r_1) \, C_1$
$R_2 \le r_2 \, I(U; Z) + (1 - r_2) \, C_2$.
The cardinality bound for the auxiliary random variable $U$ can be derived using standard methods from convex set theory.

⁵ Codes achieving low probability of error may exist without requiring $E[N_1] \to \infty$ or $E[N_2] \to \infty$, but this is needed for codes achieving arbitrarily low probability of error.

Notice that the region $C_{r_1, r_2}$ is defined for variable length coding schemes satisfying the restrictions on $E[N_1]$, $E[N_2]$ and $E[N]$ given by $r_1$ and $r_2$. The next section presents some important conclusions that can be drawn from the outer bound on $C_{r_1, r_2}$.
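Before moving on, a small numerical sketch may help visualize Theorem 3.3. The example below is not from the paper: it evaluates the outer bound for a physically degraded pair of binary symmetric channels, sweeping the standard binary auxiliary channel $U \to X$; the channel parameters and the constraints $r_1, r_2$ are made-up values.

```python
import math

def h2(p):  # binary entropy in bits
    return 0.0 if p in (0.0, 1.0) else -p*math.log2(p) - (1-p)*math.log2(1-p)

def conv(a, b):  # binary convolution: crossover of two cascaded BSCs
    return a*(1-b) + b*(1-a)

# Physically degraded pair of BSCs (parameters made up for illustration).
p1, p2 = 0.05, 0.15
C1, C2 = 1 - h2(p1), 1 - h2(p2)

# Example constraints E[N]/E[N1] >= r1 and E[N]/E[N2] >= r2 (made up).
r1, r2 = 1.0, 0.5

# For the degraded BSC, a binary auxiliary channel U -> X with crossover
# alpha gives I(X;Y|U) = h2(alpha*p1) - h2(p1) and I(U;Z) = 1 - h2(alpha*p2),
# which traces the boundary of the bound as alpha sweeps [0, 1/2].
for i in range(11):
    alpha = i / 20
    IXY_U = h2(conv(alpha, p1)) - h2(p1)
    IUZ = 1 - h2(conv(alpha, p2))
    R1 = r1*IXY_U + (1 - r1)*C1
    R2 = r2*IUZ + (1 - r2)*C2
    print(f"alpha={alpha:.2f}  R1 <= {R1:.3f}  R2 <= {R2:.3f}")
```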

IV. ANALYSIS AND CODING SCHEMES

To get a better insight into the meaning of the outer region found in the previous section, we first focus on coding schemes for which $N = N_1$.⁶ In this case the outer region is given by the set of all rate pairs $(R_1, R_2)$ satisfying⁷
$R_1 \le I(X; Y \mid U)$
$R_2 \le \frac{E[N_1]}{E[N_2]} \, I(U; Z) + \left(1 - \frac{E[N_1]}{E[N_2]}\right) C_2$.
Note that this region is composed of the usual (block code) capacity region $\mathcal{R}_{DBC}$ for degraded broadcast channels, with a scaling factor on $R_2$, plus a fraction of $C_2$. It turns out that any rate pair in this region can be achieved by choosing $E[N_1]$ and $E[N_2]$ large enough, and using a block code of length $E[N_1]$ achieving the corresponding rate pair in $\mathcal{R}_{DBC}$,⁸ followed by a capacity achieving code of length $E[N_2] - E[N_1]$ for the link to the weak receiver. Note that the number of messages for the second receiver then behaves like $\log M_2 \approx E[N_1] R_2' + (E[N_2] - E[N_1]) C_2$, where $R_2'$ is the rate at which the codewords for the weak receiver are generated during the transmission of the first block code, with $(R_1, R_2') \in \mathcal{R}_{DBC}$. This shows that, among all coding schemes with $N = N_1$, the best one is composed of two successive block codes. Thus, the possibility to employ variable length coding gives no real improvement over block coding; the gain in the achievable rates comes from the fact that the receivers decode their messages at different instants of time. The same conclusion can be drawn when $N = N_2$.

⁶ This means that the strong receiver always makes a decision before the weak one.
⁷ From now on, we omit to mention the cardinality bound on the auxiliary random variable $U$.
⁸ For a careful definition and analysis of block codes for broadcast channels, the reader is referred to [6] and the references therein.

Concerning coding schemes with an arbitrary $N$, the best outer bound is obtained by minimizing $E[N]$. We know that, for reliable decoding, $N_1 \gtrsim \log M_1 / C_1$ and $N_2 \gtrsim \log M_2 / C_2$ with high probability; thus we may write $E[N] \gtrsim \min(\log M_1 / C_1, \log M_2 / C_2)$. In the special case when $\log M_1 / C_1 = \log M_2 / C_2$, we can rewrite the outer bound as⁹
$R_1 \le \frac{C_1}{2 - I(X; Y \mid U)/C_1}$
$R_2 \le \frac{C_2}{2 - I(U; Z)/C_2}$.
Indeed, combining $\log M_1 \le E[N] \, I(X; Y \mid U) + (E[N_1] - E[N]) C_1$ with $R_1 = \log M_1 / E[N_1]$ and $E[N] \ge \log M_1 / C_1$ yields the first inequality, and similarly for the second. Observe that this outer bound is valid for codes having $E[N_1]$ and $E[N_2]$ verifying the condition on their ratio implicitly given by $\log M_1 / C_1 = \log M_2 / C_2$.

⁹ This region is not necessarily convex.
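The special-case bound is easy to evaluate numerically. Continuing the made-up BSC example from above (illustrative only), the sketch below traces it over the same binary auxiliary parametrization; note how its extreme points land on $(C_1, C_2/2)$ and $(C_1/2, C_2)$.

```python
import math

def h2(p):
    return 0.0 if p in (0.0, 1.0) else -p*math.log2(p) - (1-p)*math.log2(1-p)

def conv(a, b):
    return a*(1-b) + b*(1-a)

p1, p2 = 0.05, 0.15                     # same made-up degraded BSC pair
C1, C2 = 1 - h2(p1), 1 - h2(p2)

def special_case_bound(alpha):
    # Binary auxiliary channel U -> X with crossover alpha, as before.
    IXY_U = h2(conv(alpha, p1)) - h2(p1)
    IUZ = 1 - h2(conv(alpha, p2))
    return C1 / (2 - IXY_U / C1), C2 / (2 - IUZ / C2)

print(special_case_bound(0.5))  # X independent of U: (C1, C2/2)
print(special_case_bound(0.0))  # U = X:              (C1/2, C2)
```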
Now, consider the following random coding scheme. Generate two capacity achieving (block) codes, one code $\mathcal{C}_1$ for the strong link, of length $\log M_1 / C_1$, and one code $\mathcal{C}_2$ for the weak link, of length $\log M_2 / C_2$. Then, to transmit a message pair $(w_1, w_2) \in \mathcal{W}_1 \times \mathcal{W}_2$: with probability $p$, send the codeword in $\mathcal{C}_1$ corresponding to $w_1$, followed by the codeword in $\mathcal{C}_2$ corresponding to $w_2$; and, with probability $1 - p$, send them in the reverse order ($\mathcal{C}_2$ followed by $\mathcal{C}_1$). For $M_1$ and $M_2$ large enough, this coding scheme achieves, for any $p \in [0, 1]$, the rates
$R_1 = \frac{\log M_1}{E[N_1]} = \frac{\log M_1}{\frac{\log M_1}{C_1} + (1 - p) \frac{\log M_2}{C_2}}$
$R_2 = \frac{\log M_2}{E[N_2]} = \frac{\log M_2}{\frac{\log M_2}{C_2} + p \, \frac{\log M_1}{C_1}}$,
since the strong receiver decodes at time $\log M_1 / C_1$ when its codeword is sent first and at time $\log M_2 / C_2 + \log M_1 / C_1$ otherwise, and symmetrically for the weak receiver. Assuming that $\log M_1 / C_1 = \log M_2 / C_2$ (equal block lengths), we have that the region of all rate pairs $(R_1, R_2)$ satisfying
$R_1 \le \frac{C_1}{2 - p}$
$R_2 \le \frac{C_2}{1 + p}$,
for some $p \in [0, 1]$, is achievable.
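As a quick sanity check (same made-up BSC pair as above), the endpoints $p = 1$ and $p = 0$ of this achievable region land on the corner points of the special-case outer bound:

```python
import math

def h2(p):
    return 0.0 if p in (0.0, 1.0) else -p*math.log2(p) - (1-p)*math.log2(1-p)

p1, p2 = 0.05, 0.15                 # same made-up degraded BSC pair
C1, C2 = 1 - h2(p1), 1 - h2(p2)

for p in (0.0, 0.5, 1.0):           # probability of sending C1's codeword first
    print(f"p={p:.1f}: R1 = {C1/(2 - p):.3f}, R2 = {C2/(1 + p):.3f}")
# p=1.0 gives (C1, C2/2) and p=0.0 gives (C1/2, C2): exactly the corner
# points of the special-case outer bound computed above.
```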

This region can be related to the outer region in the special case given previously. We see that the corner points of the outer region are achieved. Furthermore, if the statistics over each link are the same, i.e., $p(y \mid x) = p(z \mid x)$, the two regions coincide. Again, in this example, the improvement brought by variable length codes appears only through the possibility of sending the messages in different periods of time.

V. REMARKS AND CONCLUSION

For variable length coding over a degraded broadcast channel, we introduced a new notion of capacity region with a receiver-centric definition of the transmission rates. We derived an outer bound on this region that captures the variability in the receiver decoding times. Through examples of coding schemes, we argued that the gain in using variable length codes essentially comes from the possibility for the receivers to decode at different instants of time.

The setup of this paper can be extended to allow immediate and noiseless feedback from the receivers to the transmitter. In the case when the degradation is physical, the outer bound remains valid.¹⁰ This gives an equivalent to the result in [7] when the encoder is able to use variable length codes.

¹⁰ Each $x_i$ then becomes dependent on $Y^{i-1}$ and $Z^{i-1}$, and the proof of Theorem 3.3 follows with minor changes.

ACKNOWLEDGMENT

The author wishes to thank Emre Telatar for insightful discussions and helpful comments.

REFERENCES

[1] I. Csiszár, "On the capacity of noisy channels with arbitrary signal costs," Problems of Control and Information Theory, vol. 2, no. 2-4, pp. 283-304, 1973.
[2] S. Shamai, E. Telatar and S. Verdú, "Fountain capacity," in Proc. IEEE Int. Symp. Inf. Theory, Seattle, WA, 2006, pp. 1881-1884.
[3] A. Tchamkerten, "Feedback communication over unknown channels," Ph.D. thesis, Information Theory Laboratory, EPFL, March 2005.
[4] T. Cover, "Comments on broadcast channels," IEEE Trans. Inf. Theory, vol. 44, no. 6, pp. 2524-2530, October 1998.
[5] I. Csiszár and J. Körner, Information Theory: Coding Theorems for Discrete Memoryless Systems. New York: Academic Press, 1981.
[6] T. Cover and J. Thomas, Elements of Information Theory. New York: Wiley, 2006.
[7] A. El Gamal, "The feedback capacity of degraded broadcast channels," IEEE Trans. Inf. Theory, vol. IT-24, no. 3, pp. 379-381, May 1978.