Entropy and Ergodic Theory Lecture 7: Rate-distortion theory
|
|
- Baldric Barnett
- 5 years ago
- Views:
Transcription
1 Entropy and Ergodic Theory Lecture 7: Rate-distortion theory 1 Coupling source coding to channel coding Let rσ, ps be a memoryless source and let ra, θ, Bs be a DMC. Here are the two coding problems we have discussed so far: A length-n block code for rσ, ps using the alphabet A may be represented by the diagram Σ n f ÝÑ C ÝÑ g Σ n, (1) where C Ď A n is the set of codewords. The objective is to make pˆn tgpfpσqq σu ă ε for some small ε. Roughly put, this is possible if ratepc q : 1 log C ą Hppq n and n is sufficiently large. (To be more precise, the gap in this inequality must be large enough, but by an amount which is o ε p1q as n ÝÑ 8.) See Lecture 3. A length-n channel code consists of a set of codewords C Ď A n and a decoder ϕ : B n ÝÑ A n. The objective is to make θ n ptϕ xu xq ă ε for some small ε, uniformly over all x P C. Roughly put, this is possible if See Lecture 6. ratepc q ă cappθq : sup Ipθ ; pq. ppprobpaq 1
2 By combining the coding theorems from Lectures 3 and 6, it follows that if Hppq ă cappθq, (2) then we may construct source and channel codes as above with the same set C for both. Then we can use the channel ra, θ, Bs to transmit random sourcewords from rσ, ps with small probability of error: given σ P Σ n, form fpσq P C, transmit it through the channel to produce a noisy output Y θ n p fpσqq, apply ϕ to get ϕpyq fpσq with probability at least 1 ε, and now apply g to recover the original sourceword, with probability at least 1 2ε. gpϕpyqq σ, 2 Rate-distortion theory What if the reverse inequality holds in (2)? Then it turns out that reliable communication, in the sense described above, is not possible, not matter how large an n we choose. Put another way: there is no way to transmit information from rσ, ps through ra, θ, Bs reliably which gives a better rate than by applying the source and channel coding theorems separately and then combining them, as described above. This is Shannon s source-channel separation theorem. (The case of equality in (2) is more complicated, and I do not know of a simple analysis.) We do not prove the separation theorem here. Instead, we focus on a new problem: what to do if (2) is reversed, but we are willing to accept some fraction of errors in our recovery of the original sourceword? To answer this, we again separate the problem into a source-coding part and a channel-coding part. We still ask for reliable transmission of codewords through the channel, as above, but now we are willing to use a source code that enables recovery of the original sourceword only with a certain fraction of errors. A more general version of the source-channel separation theorem shows that this separation into a noisy source code and a good channel code is again optimal. 2
3 Thus, set R : cappθq, and assume that Hppq ą R. Our new task is to design a source code as in (1) such that the expected distortion pˆn pσq ti 1, 2,..., n : gpfpσqq i σ i u n σpσ n is as small as possible, given that we are restricted to codes in which ratepcq ď R. This is the basic rate-distortion problem. Some more notation makes for a better understanding of (3). First, by replacing C with its image gpc q and f with g f, we reduce the rate distortion problem to the case Σ A and g id C. Next, let us write d n pa, a 1 q : ti : a i a 1 iu n for a pa 1,..., a n q, a 1 pa 1 1,..., a 1 nq P A n. This defines a metric on A n (exercise!), although we do not use this fact in the present lecture. t is called the (normalized) Hamming metric. With these simplifications, our task it to find an encoder f : A n ÝÑ A n for which fpa n q ď 2 Rn and for which the expected distortion is as small as possible when X pˆn. Erd n px, fpxqqs (3) 3 Shannon s rate-distortion theorem It turns out to be slightly easier to fix an upper bound on the expected distortion that we are willing to accept, and find the least rate bound R for which that expected distortion bound can be satisfied. This is done by the following theorem. Theorem 1 (Shannon s rate-distortion theorem). For δ ą 0 let Rpδq : inf IpX ; Y q : X, Y are A- and B-valued RVs with p X p and PpX Y q ď δ (! ) inf Ipθ ; pq : channels ra, θ, As satisfying ppaqθpa 1 aq ď δ. a a 1 in A (4) 3
4 1. (Existence.) If R 1 ą Rpδq and δ 1 ą δ, then there is a sequence of encoders f n : A n ÝÑ A n such that f n pa n q ă 2 R1n and with expected distortions less than δ 1 for all sufficiently large n. 2. (Converse.) If f : A n ÝÑ A n is any encoder with expected distortion at most δ, then f n pa n q ě 2 Rpδqn. The function δ ÞÑ Rpδq defined above is called the rate-distortion function of the source ra, ps. Thus, as with channel coding, the answer is obtained by solving a certain optimization problem for mutual information. This time, the source is fixed, and one obtains its rate-distortion function by optimizing over kernels: this is the opposite of the channel coding theorem. However, as in channel coding, the basis for Theorem 1 is the asymptotic picture of qˆn for q P ProbpA ˆ Aq that we obtained from our study of conditional entropy. Our first step is the following lemma, an obvious cousin of Feinstein s lemma. Feinstein s lemma was about packing disjoint sets V i into B n so as to mostly cover the distributions θ n p x i q. This time we need to cover A n with sets U i, not necesssarily disjoint, which all have small diameter according to d n. It suffices to do this using balls for the Hamming metric d n. Given x P A n and r ą 0, let B r pxq : ty P A n : d n px, yq ă ru. (This notation leaves the alphabet A and dimension n to the reader s understanding.) Lemma 2. Let R 1 ą Rpδq, δ 1 ą δ and ε ą 0. Then for every sufficiently large n (depending on R 1 and δ 1 ) there exist a positive integer N ă 2 R1n and elements v 1,..., v N P A n such that pˆn`b δ 1pv 1 q Y Y B δ 1pv N q ą 1 ε. (5) Proof. Choose a kernel θ from A to A so that if px, Y q p θ then PpX Y q ă δ 1 (6) and IpX ; Y q ă R 1. Then p X p, and let us write q : p θ. This joint distribution may also be written in terms of its second marginal p Y and a different kernel, say λ: we denote this representation by λ p Y. Let κ ą 0; we will specify it later. 4
5 Step 1: Setting up. For any y P B n, let and now H y : B δ 1pyq X T n,κ pλ, yq, H : ď ypt n,κppq H y ˆ tyu. Using the inclusions from the end of Lecture 5, we have H Ě T n,κ{2 pqq X tpx, yq : d n px, yq ă δ 1 u. The LLN for types gives qˆn rt n,κ{2 pqqs ÝÑ 1 as n ÝÑ 8. On the other hand, if px, Yq qˆn, then the quantity d n px, yq 1 n 1 txi Y i u is also a sum of bounded i.i.d. RVs, so the upper bound (6) and another appeal to the LLN give that qˆn phq ÝÑ 1 as n ÝÑ 8. Henceforth assume that n is large enough to satisfy qˆn phq ą 1 ε{2. (7) Step 2: Completion. Our proof actually gives pˆn ph v1 Y Y H vn q ą 1 ε (8) instead of (5). Since H y Ď B δ 1pyq, this implies (5). We now use a recursion to find a finite list of elements v i P T n,κ pp Y q such that for each i we have λ n`h vi zph v1 Y Y H vi 1 q ˇˇ vi ą ε{2. (9) We do this so that: (i) at each step, either (5) is also satisfied, or the recursion can be continued for another step; and (ii) the recursion must stop before 2 R1n steps. Thus, suppose we have already found v 1,..., v m which satisfy (9). If (8) is also satisfied with N m, then we stop the recursion. Otherwise, let U : H v1 Y Y H vm, and observe from (7) and the negation of (8) that pˆn pyqλ n`h y zu ˇˇ y qˆn phzpu ˆ B n qq ě ε{2. ypt n,κppq 5
6 So there must be some y P T n,κ pp Y q such that λ n ph y zu yq ą ε{2. Letting v m`1 : y, it follows that (9) also holds with i m ` 1. This continue the recursion. This recursion terminates only once (8) is satisfied, so it remains to show that this happens after fewer than 2 R1n steps. To see this, suppose that at step m the recursion has not yet terminated. Then on the one hand, since v i P T n,δ pp Y q, we have T n,κ pλ, v i q Ď T n,2κ pp X q, and so H v1 Y Y H vm Ď T n,2κ pp X q ùñ H v1 Y Y H vm ď T n,2κ pp X q ď 2 HpXqn` pκqn`opnq. On the other hand, from Lecture 5 we have cov 1 ε{2 pλ n p v i qq ě 2 Hpλ pv i qn opnq ě 2 HpX Y qn pκqn opnq, where the error estimates do not depend on v i provided that v i P T n,κ pp Y q. In view of (9), this implies that H vi zph v1 Y Y H vi 1 q ě 2 HpX Y qn pκqn opnq. Comparing the above inequalities gives m 2 HpX Y qn pκqn opnq ď m H vi zph v1 Y Y H vi 1 q H v1 Y Y H vm ď 2 HpXqn` pκqn`opnq and hence m ď 2 IpX ; Y qn` pκqn`opnq, where the estimates in the two error terms depend on A and λ but nothing else. Since IpX ; Y q ă R 1, we may choose κ small enough to deduce from this that N ă 2 R1n for all sufficiently large n. Proof of existence in Theorem 1. For any sufficiently large n, let v i, i 1, 2,..., N n, be given by the previous lemma. By forming set differences, we may now find subsets U i Ď B δ 1pv i q which are disjoint and still cover more than 1 ε of pˆn. Define fpxq v P U i, i 1, 2,..., N n, and extend this arbitrarily to a function f : A n ÝÑ tv 1,..., v Nn u. The lemma tells us that N n ă 2 R1n, and if X pˆn then E d n px, fpxqq ď E d n px, fpxqq1 txp Ťi U iu ` pˆn A nh`u 1 Y Y U Nn ă δ 1 ` ε. 6
7 Since δ 1 may be chosen arbitrarily close to δ and ε may be chosen arbitrarily small, this completes the proof. 4 The converse to the rate-distortion theorem One can give a counting proof of the converse to the rate-distortion theorem, similarly to the channel-coding theorem. But this time let us instead exhibit a more formula-based proof. It begins with the following lemma. Lemma 3. Given ra, ps, the rate-distortion function Rpδq is non-increasing and convex. Proof. The property of being non-increasing is obvious: if δ 1 ă δ 2, then Rpδ 2 q is defined by an infimum over a larger set of possible values than Rpδ 1 q. So now let δ i ą 0 for i 1, 2 and let 0 ă t ă 1. We need to show that Rptδ 1 ` p1 tqδ 2 q ď trpδ 1 q ` p1 tqrpδ 1 q. If either of the values Rpδ i q is infinite, then this is trivial, so suppose they are both finite. Let θ i for i 1, 2 be kernels from A to A such that Ipθ i ; pq Rpδ i q and ppaqθ i pa 1 aq ď δ i for i 1, 2. a a 1 These exist because Ipθ ; pq is continuous as a function of θ and the space of kernels is compact, so the infimum which defines Rpδq is always achieved. Define a new kernel by θpa 1 aq : tθ 1 pa 1 aq ` p1 tqθ 2 pa 1 aq. It satisfies ppaqθpa 1 aq t ppaqθ 1 pa 1 aq ` p1 tq ppaqθ 2 pa 1 aq ď tδ 1 ` p1 tqδ 2 a a 1 a a 1 a a 1 and Ipθ ; pq a ppaqh`θp aq ď a ppaq th`θ 1 p aq ` p1 tqh`θ 2 p aq trpδ 1 q ` p1 tqrpδ 2 q, by the concavity of H. So this θ witnesses that R is also concave. 7
8 Lemma 4. Let px, Yq px 1,..., X n, Y 1,..., Y n q be a RV taking values in A n ˆ B n, and assume that X 1,..., X n are independent. Then IpX ; Yq ě IpX i, Y i q. Proof. First, the definition of mutual information and the independence of X 1,..., X n give On the other hand, IpX ; Yq HpXq HpX Yq HpX Yq HpX 1,..., X n Yq ď HpX i Yq (subadditivity) Therefore IpX ; Yq ď HpX i Y i q HpX i q HpX Yq ě HpX i q HpX Yq. (data processing: Y determines Y i for each i). HpXi q HpX i Y i q IpX i ; Y i q. Proof of converse in Theorem 1. Assume that for some n we have an encoder f : A n ÝÑ A n such that if X pˆn then E d n px, fpxqq 1 n PpfpXq i X i q ď δ. (10) Let f n pxq Y py 1,..., Y n q, and let δ i : PpX i Y i q for each i. Then the above inequality may be re-written as 1 n pδ 1 ` ` δ n q ď δ. 8
9 On the other hand, since the coordinates of X are independent and X determines Y, Lemma 4 gives log 2 fpa n q ě HpYq HpYq HpY Xq IpY ; Xq ě IpY i ; X i q. By the definition of the rate-distortion function, this satisfies IpY i ; X i q ě Rpδ i q, and by Lemma 3 this right-hand side is at least 1 nr n pδ 1 ` ` δ n q ě Rpδq. Putting the above inequalities together gives fpa n q ě 2 Rpδqn. 5 Notes and remarks Sources for this lecture: [CT06, Chapter 10]. Further reading: See [CT06, Sections 7.13 and 10.5] for the source-channel separation theorem. Rate-distortion theory is easily generalized to handle different notions of similarity between a sourceword and its attempted recovery. To do this, one now considers two possibly different alphabets A and B and a general distortion function d : A ˆ B ÝÑ r0, 8q. Then the normalized Hamming metric is replaced by the function d n px, yq : 1 dpx i, y i q of x P A n, y P B n. n This is the theory covered in [CT06, Chapter 10]. See also [McE02, Chapter 3], and especially [McE02, Section 6.3] for a discussion of several further directions of research extending these results, such as the problem of explicitly constructing low-distortion codes. 9
10 Information theory analyses many other models than just finite strings over finite alphabets. For instance, a simple extension of the basic model is to allow larger collections of sources and destinations, connected by several encoders and decoders arranged in some directed network. Such a model is often called a multi-terminal communication system. The analysis of such a network (principally, finding its possible rates of data transmission subject to various contraints) is a rich and difficult subject. Some kinds of network have known complete solutions, often involving new and surprising ideas, such as fork networks which are treated by the Slepian Wolf theorem. Others do not seem to have tractable complete solutions. See [CT06, Chapter 15] for an introduction to this large area. More recent developments in information theory have increasingly been interwoven into other branches of applied mathematics and especially theoretical computer science. MacKay s book [Mac03] can serve as a good introduction to these connections. References [CT06] Thomas M. Cover and Joy A. Thomas. Elements of information theory. Wiley-Interscience [John Wiley & Sons], Hoboken, NJ, second edition, [Mac03] David J. C. MacKay. Information theory, inference and learning algorithms. Cambridge University Press, New York, [McE02] R. J. McEliece. The theory of information and coding, volume 86 of Encyclopedia of Mathematics and its Applications. Cambridge University Press, Cambridge, second edition, TIM AUSTIN COURANT INSTITUTE OF MATHEMATICAL SCIENCES, NEW YORK UNIVERSITY, 251 MERCER ST, NEW YORK NY 10012, USA tim@cims.nyu.edu URL: cims.nyu.edu/ tim 10
Entropy and Ergodic Theory Lecture 6: Channel coding
Entropy and Ergodic Theory Lecture 6: Channel coding 1 Discrete memoryless channels In information theory, a device which transmits information is called a channel. A crucial feature of real-world channels
More informationEntropy and Ergodic Theory Notes 22: The Kolmogorov Sinai entropy of a measure-preserving system
Entropy and Ergodic Theory Notes 22: The Kolmogorov Sinai entropy of a measure-preserving system 1 Joinings and channels 1.1 Joinings Definition 1. If px, µ, T q and py, ν, Sq are MPSs, then a joining
More informationEntropy and Ergodic Theory Notes 21: The entropy rate of a stationary process
Entropy and Ergodic Theory Notes 21: The entropy rate of a stationary process 1 Sources with memory In information theory, a stationary stochastic processes pξ n q npz taking values in some finite alphabet
More informationEntropy and Ergodic Theory Lecture 27: Sinai s factor theorem
Entropy and Ergodic Theory Lecture 27: Sinai s factor theorem What is special about Bernoulli shifts? Our main result in Lecture 26 is weak containment with retention of entropy. If ra Z, µs and rb Z,
More informationEntropy and Ergodic Theory Lecture 17: Transportation and concentration
Entropy and Ergodic Theory Lecture 17: Transportation and concentration 1 Concentration in terms of metric spaces In this course, a metric probability or m.p. space is a triple px, d, µq in which px, dq
More informationEntropy and Ergodic Theory Lecture 4: Conditional entropy and mutual information
Entropy and Ergodic Theory Lecture 4: Conditional entropy and mutual information 1 Conditional entropy Let (Ω, F, P) be a probability space, let X be a RV taking values in some finite set A. In this lecture
More informationEntropy and Ergodic Theory Lecture 3: The meaning of entropy in information theory
Entropy and Ergodic Theory Lecture 3: The meaning of entropy in information theory 1 The intuitive meaning of entropy Modern information theory was born in Shannon s 1948 paper A Mathematical Theory of
More informationLecture 4 Noisy Channel Coding
Lecture 4 Noisy Channel Coding I-Hsiang Wang Department of Electrical Engineering National Taiwan University ihwang@ntu.edu.tw October 9, 2015 1 / 56 I-Hsiang Wang IT Lecture 4 The Channel Coding Problem
More informationEntropy and Ergodic Theory Lecture 28: Sinai s and Ornstein s theorems, II
Entropy and Ergodic Theory Lecture 28: Sinai s and Ornstein s theorems, II 1 Proof of the update proposition Consider again an ergodic and atomless source ra Z, µs and a Bernoulli source rb Z, pˆz s such
More information(Classical) Information Theory III: Noisy channel coding
(Classical) Information Theory III: Noisy channel coding Sibasish Ghosh The Institute of Mathematical Sciences CIT Campus, Taramani, Chennai 600 113, India. p. 1 Abstract What is the best possible way
More informationChapter 4. Data Transmission and Channel Capacity. Po-Ning Chen, Professor. Department of Communications Engineering. National Chiao Tung University
Chapter 4 Data Transmission and Channel Capacity Po-Ning Chen, Professor Department of Communications Engineering National Chiao Tung University Hsin Chu, Taiwan 30050, R.O.C. Principle of Data Transmission
More informationEntropies & Information Theory
Entropies & Information Theory LECTURE I Nilanjana Datta University of Cambridge,U.K. See lecture notes on: http://www.qi.damtp.cam.ac.uk/node/223 Quantum Information Theory Born out of Classical Information
More informationECE598: Information-theoretic methods in high-dimensional statistics Spring 2016
ECE598: Information-theoretic methods in high-dimensional statistics Spring 06 Lecture : Mutual Information Method Lecturer: Yihong Wu Scribe: Jaeho Lee, Mar, 06 Ed. Mar 9 Quick review: Assouad s lemma
More informationEntropy as a measure of surprise
Entropy as a measure of surprise Lecture 5: Sam Roweis September 26, 25 What does information do? It removes uncertainty. Information Conveyed = Uncertainty Removed = Surprise Yielded. How should we quantify
More informationReliable Computation over Multiple-Access Channels
Reliable Computation over Multiple-Access Channels Bobak Nazer and Michael Gastpar Dept. of Electrical Engineering and Computer Sciences University of California, Berkeley Berkeley, CA, 94720-1770 {bobak,
More informationEntropy and Ergodic Theory Lecture 19: The ergodic theorems
Entropy and Ergodic Theory Lecture 19: The ergodic theorems 1 Some history: the ergodic hypothesis Ergodic theory takes its name from the ergodic hypothesis. This is an old idea of Boltzmann in statistical
More informationGround States are generically a periodic orbit
Gonzalo Contreras CIMAT Guanajuato, Mexico Ergodic Optimization and related topics USP, Sao Paulo December 11, 2013 Expanding map X compact metric space. T : X Ñ X an expanding map i.e. T P C 0, Dd P Z`,
More informationLecture 11: Quantum Information III - Source Coding
CSCI5370 Quantum Computing November 25, 203 Lecture : Quantum Information III - Source Coding Lecturer: Shengyu Zhang Scribe: Hing Yin Tsang. Holevo s bound Suppose Alice has an information source X that
More informationDiscrete Memoryless Channels with Memoryless Output Sequences
Discrete Memoryless Channels with Memoryless utput Sequences Marcelo S Pinho Department of Electronic Engineering Instituto Tecnologico de Aeronautica Sao Jose dos Campos, SP 12228-900, Brazil Email: mpinho@ieeeorg
More informationLecture 5 Channel Coding over Continuous Channels
Lecture 5 Channel Coding over Continuous Channels I-Hsiang Wang Department of Electrical Engineering National Taiwan University ihwang@ntu.edu.tw November 14, 2014 1 / 34 I-Hsiang Wang NIT Lecture 5 From
More informationShannon s Noisy-Channel Coding Theorem
Shannon s Noisy-Channel Coding Theorem Lucas Slot Sebastian Zur February 13, 2015 Lucas Slot, Sebastian Zur Shannon s Noisy-Channel Coding Theorem February 13, 2015 1 / 29 Outline 1 Definitions and Terminology
More informationCapacity of a channel Shannon s second theorem. Information Theory 1/33
Capacity of a channel Shannon s second theorem Information Theory 1/33 Outline 1. Memoryless channels, examples ; 2. Capacity ; 3. Symmetric channels ; 4. Channel Coding ; 5. Shannon s second theorem,
More information(each row defines a probability distribution). Given n-strings x X n, y Y n we can use the absence of memory in the channel to compute
ENEE 739C: Advanced Topics in Signal Processing: Coding Theory Instructor: Alexander Barg Lecture 6 (draft; 9/6/03. Error exponents for Discrete Memoryless Channels http://www.enee.umd.edu/ abarg/enee739c/course.html
More informationEntropy and Ergodic Theory Lecture 15: A first look at concentration
Entropy and Ergodic Theory Lecture 15: A first look at concentration 1 Introduction to concentration Let X 1, X 2,... be i.i.d. R-valued RVs with common distribution µ, and suppose for simplicity that
More informationChapter 9 Fundamental Limits in Information Theory
Chapter 9 Fundamental Limits in Information Theory Information Theory is the fundamental theory behind information manipulation, including data compression and data transmission. 9.1 Introduction o For
More informationEE376A: Homework #3 Due by 11:59pm Saturday, February 10th, 2018
Please submit the solutions on Gradescope. EE376A: Homework #3 Due by 11:59pm Saturday, February 10th, 2018 1. Optimal codeword lengths. Although the codeword lengths of an optimal variable length code
More informationInformation Theory. Coding and Information Theory. Information Theory Textbooks. Entropy
Coding and Information Theory Chris Williams, School of Informatics, University of Edinburgh Overview What is information theory? Entropy Coding Information Theory Shannon (1948): Information theory is
More informationNotes 3: Stochastic channels and noisy coding theorem bound. 1 Model of information communication and noisy channel
Introduction to Coding Theory CMU: Spring 2010 Notes 3: Stochastic channels and noisy coding theorem bound January 2010 Lecturer: Venkatesan Guruswami Scribe: Venkatesan Guruswami We now turn to the basic
More informationEntropy and Ergodic Theory Lecture 13: Introduction to statistical mechanics, II
Entropy and Ergodic Theory Lecture 13: Introduction to statistical mechanics, II 1 Recap and an example Recap: A ta 1, a 2,... u is a set of possible states for a single particle, so a state for a system
More informationarxiv: v2 [math.ca] 13 May 2015
ON THE CLOSURE OF TRANSLATION-DILATION INVARIANT LINEAR SPACES OF POLYNOMIALS arxiv:1505.02370v2 [math.ca] 13 May 2015 J. M. ALMIRA AND L. SZÉKELYHIDI Abstract. Assume that a linear space of real polynomials
More informationLecture 15: Conditional and Joint Typicaility
EE376A Information Theory Lecture 1-02/26/2015 Lecture 15: Conditional and Joint Typicaility Lecturer: Kartik Venkat Scribe: Max Zimet, Brian Wai, Sepehr Nezami 1 Notation We always write a sequence of
More informationLecture 8: Shannon s Noise Models
Error Correcting Codes: Combinatorics, Algorithms and Applications (Fall 2007) Lecture 8: Shannon s Noise Models September 14, 2007 Lecturer: Atri Rudra Scribe: Sandipan Kundu& Atri Rudra Till now we have
More informationEE 4TM4: Digital Communications II. Channel Capacity
EE 4TM4: Digital Communications II 1 Channel Capacity I. CHANNEL CODING THEOREM Definition 1: A rater is said to be achievable if there exists a sequence of(2 nr,n) codes such thatlim n P (n) e (C) = 0.
More informationErdinç Dündar, Celal Çakan
DEMONSTRATIO MATHEMATICA Vol. XLVII No 3 2014 Erdinç Dündar, Celal Çakan ROUGH I-CONVERGENCE Abstract. In this work, using the concept of I-convergence and using the concept of rough convergence, we introduced
More informationA Single-letter Upper Bound for the Sum Rate of Multiple Access Channels with Correlated Sources
A Single-letter Upper Bound for the Sum Rate of Multiple Access Channels with Correlated Sources Wei Kang Sennur Ulukus Department of Electrical and Computer Engineering University of Maryland, College
More informationIntroduction to Information Theory. Uncertainty. Entropy. Surprisal. Joint entropy. Conditional entropy. Mutual information.
L65 Dept. of Linguistics, Indiana University Fall 205 Information theory answers two fundamental questions in communication theory: What is the ultimate data compression? What is the transmission rate
More informationDept. of Linguistics, Indiana University Fall 2015
L645 Dept. of Linguistics, Indiana University Fall 2015 1 / 28 Information theory answers two fundamental questions in communication theory: What is the ultimate data compression? What is the transmission
More information1 Ex. 1 Verify that the function H(p 1,..., p n ) = k p k log 2 p k satisfies all 8 axioms on H.
Problem sheet Ex. Verify that the function H(p,..., p n ) = k p k log p k satisfies all 8 axioms on H. Ex. (Not to be handed in). looking at the notes). List as many of the 8 axioms as you can, (without
More informationMidterm Exam Information Theory Fall Midterm Exam. Time: 09:10 12:10 11/23, 2016
Midterm Exam Time: 09:10 12:10 11/23, 2016 Name: Student ID: Policy: (Read before You Start to Work) The exam is closed book. However, you are allowed to bring TWO A4-size cheat sheet (single-sheet, two-sided).
More informationADVANCE TOPICS IN ANALYSIS - REAL. 8 September September 2011
ADVANCE TOPICS IN ANALYSIS - REAL NOTES COMPILED BY KATO LA Introductions 8 September 011 15 September 011 Nested Interval Theorem: If A 1 ra 1, b 1 s, A ra, b s,, A n ra n, b n s, and A 1 Ě A Ě Ě A n
More informationLecture 2: August 31
0-704: Information Processing and Learning Fall 206 Lecturer: Aarti Singh Lecture 2: August 3 Note: These notes are based on scribed notes from Spring5 offering of this course. LaTeX template courtesy
More informationCSCI 2570 Introduction to Nanocomputing
CSCI 2570 Introduction to Nanocomputing Information Theory John E Savage What is Information Theory Introduced by Claude Shannon. See Wikipedia Two foci: a) data compression and b) reliable communication
More informationGraph Coloring and Conditional Graph Entropy
Graph Coloring and Conditional Graph Entropy Vishal Doshi, Devavrat Shah, Muriel Médard, Sidharth Jaggi Laboratory for Information and Decision Systems Massachusetts Institute of Technology Cambridge,
More informationShannon s noisy-channel theorem
Shannon s noisy-channel theorem Information theory Amon Elders Korteweg de Vries Institute for Mathematics University of Amsterdam. Tuesday, 26th of Januari Amon Elders (Korteweg de Vries Institute for
More informationDigital Communications III (ECE 154C) Introduction to Coding and Information Theory
Digital Communications III (ECE 154C) Introduction to Coding and Information Theory Tara Javidi These lecture notes were originally developed by late Prof. J. K. Wolf. UC San Diego Spring 2014 1 / 8 I
More informationComputing and Communications 2. Information Theory -Entropy
1896 1920 1987 2006 Computing and Communications 2. Information Theory -Entropy Ying Cui Department of Electronic Engineering Shanghai Jiao Tong University, China 2017, Autumn 1 Outline Entropy Joint entropy
More informationUpper Bounds on the Capacity of Binary Intermittent Communication
Upper Bounds on the Capacity of Binary Intermittent Communication Mostafa Khoshnevisan and J. Nicholas Laneman Department of Electrical Engineering University of Notre Dame Notre Dame, Indiana 46556 Email:{mhoshne,
More informationREAL ANALYSIS II TAKE HOME EXAM. T. Tao s Lecture Notes Set 5
REAL ANALYSIS II TAKE HOME EXAM CİHAN BAHRAN T. Tao s Lecture Notes Set 5 1. Suppose that te 1, e 2, e 3,... u is a countable orthonormal system in a complex Hilbert space H, and c 1, c 2,... is a sequence
More informationECE Information theory Final (Fall 2008)
ECE 776 - Information theory Final (Fall 2008) Q.1. (1 point) Consider the following bursty transmission scheme for a Gaussian channel with noise power N and average power constraint P (i.e., 1/n X n i=1
More informationRepresentation of Correlated Sources into Graphs for Transmission over Broadcast Channels
Representation of Correlated s into Graphs for Transmission over Broadcast s Suhan Choi Department of Electrical Eng. and Computer Science University of Michigan, Ann Arbor, MI 80, USA Email: suhanc@eecs.umich.edu
More informationLecture 4 Channel Coding
Capacity and the Weak Converse Lecture 4 Coding I-Hsiang Wang Department of Electrical Engineering National Taiwan University ihwang@ntu.edu.tw October 15, 2014 1 / 16 I-Hsiang Wang NIT Lecture 4 Capacity
More informationA Graph-based Framework for Transmission of Correlated Sources over Multiple Access Channels
A Graph-based Framework for Transmission of Correlated Sources over Multiple Access Channels S. Sandeep Pradhan a, Suhan Choi a and Kannan Ramchandran b, a {pradhanv,suhanc}@eecs.umich.edu, EECS Dept.,
More informationLecture 5: Channel Capacity. Copyright G. Caire (Sample Lectures) 122
Lecture 5: Channel Capacity Copyright G. Caire (Sample Lectures) 122 M Definitions and Problem Setup 2 X n Y n Encoder p(y x) Decoder ˆM Message Channel Estimate Definition 11. Discrete Memoryless Channel
More informationLECTURE 13. Last time: Lecture outline
LECTURE 13 Last time: Strong coding theorem Revisiting channel and codes Bound on probability of error Error exponent Lecture outline Fano s Lemma revisited Fano s inequality for codewords Converse to
More informationLecture 1: September 25, A quick reminder about random variables and convexity
Information and Coding Theory Autumn 207 Lecturer: Madhur Tulsiani Lecture : September 25, 207 Administrivia This course will cover some basic concepts in information and coding theory, and their applications
More informationChaos, Complexity, and Inference (36-462)
Chaos, Complexity, and Inference (36-462) Lecture 7: Information Theory Cosma Shalizi 3 February 2009 Entropy and Information Measuring randomness and dependence in bits The connection to statistics Long-run
More informationCoding into a source: an inverse rate-distortion theorem
Coding into a source: an inverse rate-distortion theorem Anant Sahai joint work with: Mukul Agarwal Sanjoy K. Mitter Wireless Foundations Department of Electrical Engineering and Computer Sciences University
More informationSimultaneous Nonunique Decoding Is Rate-Optimal
Fiftieth Annual Allerton Conference Allerton House, UIUC, Illinois, USA October 1-5, 2012 Simultaneous Nonunique Decoding Is Rate-Optimal Bernd Bandemer University of California, San Diego La Jolla, CA
More informationData Compression. Limit of Information Compression. October, Examples of codes 1
Data Compression Limit of Information Compression Radu Trîmbiţaş October, 202 Outline Contents Eamples of codes 2 Kraft Inequality 4 2. Kraft Inequality............................ 4 2.2 Kraft inequality
More information1 Background on Information Theory
Review of the book Information Theory: Coding Theorems for Discrete Memoryless Systems by Imre Csiszár and János Körner Second Edition Cambridge University Press, 2011 ISBN:978-0-521-19681-9 Review by
More informationMultiaccess Channels with State Known to One Encoder: A Case of Degraded Message Sets
Multiaccess Channels with State Known to One Encoder: A Case of Degraded Message Sets Shivaprasad Kotagiri and J. Nicholas Laneman Department of Electrical Engineering University of Notre Dame Notre Dame,
More informationRemote Source Coding with Two-Sided Information
Remote Source Coding with Two-Sided Information Basak Guler Ebrahim MolavianJazi Aylin Yener Wireless Communications and Networking Laboratory Department of Electrical Engineering The Pennsylvania State
More informationLecture 12: November 6, 2017
Information and Coding Theory Autumn 017 Lecturer: Madhur Tulsiani Lecture 1: November 6, 017 Recall: We were looking at codes of the form C : F k p F n p, where p is prime, k is the message length, and
More informationECE Advanced Communication Theory, Spring 2009 Homework #1 (INCOMPLETE)
ECE 74 - Advanced Communication Theory, Spring 2009 Homework #1 (INCOMPLETE) 1. A Huffman code finds the optimal codeword to assign to a given block of source symbols. (a) Show that cannot be a Huffman
More informationELEC546 Review of Information Theory
ELEC546 Review of Information Theory Vincent Lau 1/1/004 1 Review of Information Theory Entropy: Measure of uncertainty of a random variable X. The entropy of X, H(X), is given by: If X is a discrete random
More informationNetwork coding for multicast relation to compression and generalization of Slepian-Wolf
Network coding for multicast relation to compression and generalization of Slepian-Wolf 1 Overview Review of Slepian-Wolf Distributed network compression Error exponents Source-channel separation issues
More informationAppendix B Information theory from first principles
Appendix B Information theory from first principles This appendix discusses the information theory behind the capacity expressions used in the book. Section 8.3.4 is the only part of the book that supposes
More informationQuantum rate distortion, reverse Shannon theorems, and source-channel separation
Quantum rate distortion, reverse Shannon theorems, and source-channel separation ilanjana Datta, Min-Hsiu Hsieh, Mark Wilde (1) University of Cambridge,U.K. (2) McGill University, Montreal, Canada Classical
More information18.2 Continuous Alphabet (discrete-time, memoryless) Channel
0-704: Information Processing and Learning Spring 0 Lecture 8: Gaussian channel, Parallel channels and Rate-distortion theory Lecturer: Aarti Singh Scribe: Danai Koutra Disclaimer: These notes have not
More informationCell-Probe Lower Bounds for Prefix Sums and Matching Brackets
Cell-Probe Lower Bounds for Prefix Sums and Matching Brackets Emanuele Viola July 6, 2009 Abstract We prove that to store strings x {0, 1} n so that each prefix sum a.k.a. rank query Sumi := k i x k can
More information(Preprint of paper to appear in Proc Intl. Symp. on Info. Th. and its Applications, Waikiki, Hawaii, Nov , 1990.)
(Preprint of paper to appear in Proc. 1990 Intl. Symp. on Info. Th. and its Applications, Waikiki, Hawaii, ov. 27-30, 1990.) CAUSALITY, FEEDBACK AD DIRECTED IFORMATIO James L. Massey Institute for Signal
More informationJoint Source-Channel Coding for the Multiple-Access Relay Channel
Joint Source-Channel Coding for the Multiple-Access Relay Channel Yonathan Murin, Ron Dabora Department of Electrical and Computer Engineering Ben-Gurion University, Israel Email: moriny@bgu.ac.il, ron@ee.bgu.ac.il
More informationVariable Length Codes for Degraded Broadcast Channels
Variable Length Codes for Degraded Broadcast Channels Stéphane Musy School of Computer and Communication Sciences, EPFL CH-1015 Lausanne, Switzerland Email: stephane.musy@ep.ch Abstract This paper investigates
More informationThe Method of Types and Its Application to Information Hiding
The Method of Types and Its Application to Information Hiding Pierre Moulin University of Illinois at Urbana-Champaign www.ifp.uiuc.edu/ moulin/talks/eusipco05-slides.pdf EUSIPCO Antalya, September 7,
More informationNOTES WEEK 13 DAY 2 SCOT ADAMS
NOTES WEEK 13 DAY 2 SCOT ADAMS Recall: Let px, dq be a metric space. Then, for all S Ď X, we have p S is sequentially compact q ñ p S is closed and bounded q. DEFINITION 0.1. Let px, dq be a metric space.
More informationOn Multiple User Channels with State Information at the Transmitters
On Multiple User Channels with State Information at the Transmitters Styrmir Sigurjónsson and Young-Han Kim* Information Systems Laboratory Stanford University Stanford, CA 94305, USA Email: {styrmir,yhk}@stanford.edu
More informationLecture 4: Proof of Shannon s theorem and an explicit code
CSE 533: Error-Correcting Codes (Autumn 006 Lecture 4: Proof of Shannon s theorem and an explicit code October 11, 006 Lecturer: Venkatesan Guruswami Scribe: Atri Rudra 1 Overview Last lecture we stated
More informationLecture 22: Final Review
Lecture 22: Final Review Nuts and bolts Fundamental questions and limits Tools Practical algorithms Future topics Dr Yao Xie, ECE587, Information Theory, Duke University Basics Dr Yao Xie, ECE587, Information
More informationSuperposition Encoding and Partial Decoding Is Optimal for a Class of Z-interference Channels
Superposition Encoding and Partial Decoding Is Optimal for a Class of Z-interference Channels Nan Liu and Andrea Goldsmith Department of Electrical Engineering Stanford University, Stanford CA 94305 Email:
More informationThe Poisson Channel with Side Information
The Poisson Channel with Side Information Shraga Bross School of Enginerring Bar-Ilan University, Israel brosss@macs.biu.ac.il Amos Lapidoth Ligong Wang Signal and Information Processing Laboratory ETH
More informationShannon s Noisy-Channel Coding Theorem
Shannon s Noisy-Channel Coding Theorem Lucas Slot Sebastian Zur February 2015 Abstract In information theory, Shannon s Noisy-Channel Coding Theorem states that it is possible to communicate over a noisy
More informationInformation Theory in Intelligent Decision Making
Information Theory in Intelligent Decision Making Adaptive Systems and Algorithms Research Groups School of Computer Science University of Hertfordshire, United Kingdom June 7, 2015 Information Theory
More informationNOTES ON SOME EXERCISES OF LECTURE 5, MODULE 2
NOTES ON SOME EXERCISES OF LECTURE 5, MODULE 2 MARCO VITTURI Contents 1. Solution to exercise 5-2 1 2. Solution to exercise 5-3 2 3. Solution to exercise 5-7 4 4. Solution to exercise 5-8 6 5. Solution
More informationLecture 8: Channel and source-channel coding theorems; BEC & linear codes. 1 Intuitive justification for upper bound on channel capacity
5-859: Information Theory and Applications in TCS CMU: Spring 23 Lecture 8: Channel and source-channel coding theorems; BEC & linear codes February 7, 23 Lecturer: Venkatesan Guruswami Scribe: Dan Stahlke
More informationNoisy channel communication
Information Theory http://www.inf.ed.ac.uk/teaching/courses/it/ Week 6 Communication channels and Information Some notes on the noisy channel setup: Iain Murray, 2012 School of Informatics, University
More informationNotes 6 : First and second moment methods
Notes 6 : First and second moment methods Math 733-734: Theory of Probability Lecturer: Sebastien Roch References: [Roc, Sections 2.1-2.3]. Recall: THM 6.1 (Markov s inequality) Let X be a non-negative
More informationAPPROXIMATE HOMOMORPHISMS BETWEEN THE BOOLEAN CUBE AND GROUPS OF PRIME ORDER
APPROXIMATE HOMOMORPHISMS BETWEEN THE BOOLEAN CUBE AND GROUPS OF PRIME ORDER TOM SANDERS The purpose of this note is to highlight a question raised by Shachar Lovett [Lov], and to offer some motivation
More informationEquivalence for Networks with Adversarial State
Equivalence for Networks with Adversarial State Oliver Kosut Department of Electrical, Computer and Energy Engineering Arizona State University Tempe, AZ 85287 Email: okosut@asu.edu Jörg Kliewer Department
More informationAn introduction to basic information theory. Hampus Wessman
An introduction to basic information theory Hampus Wessman Abstract We give a short and simple introduction to basic information theory, by stripping away all the non-essentials. Theoretical bounds on
More informationLecture 2. Capacity of the Gaussian channel
Spring, 207 5237S, Wireless Communications II 2. Lecture 2 Capacity of the Gaussian channel Review on basic concepts in inf. theory ( Cover&Thomas: Elements of Inf. Theory, Tse&Viswanath: Appendix B) AWGN
More informationLecture 6 I. CHANNEL CODING. X n (m) P Y X
6- Introduction to Information Theory Lecture 6 Lecturer: Haim Permuter Scribe: Yoav Eisenberg and Yakov Miron I. CHANNEL CODING We consider the following channel coding problem: m = {,2,..,2 nr} Encoder
More informationQuantifier elimination
Quantifier elimination Definition We say that a theory T has quantifier elimination if for any formula ϕp xq and ɛ 0 there is a quantifier-free formula ψp xq such that sup ϕp xq ψp xq ɛ x holds in all
More informationFor example, p12q p2x 1 x 2 ` 5x 2 x 2 3 q 2x 2 x 1 ` 5x 1 x 2 3. (a) Let p 12x 5 1x 7 2x 4 18x 6 2x 3 ` 11x 1 x 2 x 3 x 4,
SOLUTIONS Math A4900 Homework 5 10/4/2017 1. (DF 2.2.12(a)-(d)+) Symmetric polynomials. The group S n acts on the set tx 1, x 2,..., x n u by σ x i x σpiq. That action extends to a function S n ˆ A Ñ A,
More informationExercise 1. = P(y a 1)P(a 1 )
Chapter 7 Channel Capacity Exercise 1 A source produces independent, equally probable symbols from an alphabet {a 1, a 2 } at a rate of one symbol every 3 seconds. These symbols are transmitted over a
More informationSource and Channel Coding for Correlated Sources Over Multiuser Channels
Source and Channel Coding for Correlated Sources Over Multiuser Channels Deniz Gündüz, Elza Erkip, Andrea Goldsmith, H. Vincent Poor Abstract Source and channel coding over multiuser channels in which
More informationFeedback Capacity of a Class of Symmetric Finite-State Markov Channels
Feedback Capacity of a Class of Symmetric Finite-State Markov Channels Nevroz Şen, Fady Alajaji and Serdar Yüksel Department of Mathematics and Statistics Queen s University Kingston, ON K7L 3N6, Canada
More informationTHEORY OF PROBABILITY VLADIMIR KOBZAR
THEORY OF PROBABILITY VLADIMIR KOBZAR Lecture 20 - Conditional Expectation, Inequalities, Laws of Large Numbers, Central Limit Theorem This lecture is based on the materials from the Courant Institute
More informationSolutions to Homework Set #1 Sanov s Theorem, Rate distortion
st Semester 00/ Solutions to Homework Set # Sanov s Theorem, Rate distortion. Sanov s theorem: Prove the simple version of Sanov s theorem for the binary random variables, i.e., let X,X,...,X n be a sequence
More informationConstructing Polar Codes Using Iterative Bit-Channel Upgrading. Arash Ghayoori. B.Sc., Isfahan University of Technology, 2011
Constructing Polar Codes Using Iterative Bit-Channel Upgrading by Arash Ghayoori B.Sc., Isfahan University of Technology, 011 A Thesis Submitted in Partial Fulfillment of the Requirements for the Degree
More informationECE Information theory Final
ECE 776 - Information theory Final Q1 (1 point) We would like to compress a Gaussian source with zero mean and variance 1 We consider two strategies In the first, we quantize with a step size so that the
More information