An Achievable Error Exponent for the Mismatched Multiple-Access Channel

Similar documents
Mismatched Multi-letter Successive Decoding for the Multiple-Access Channel

THE mismatched decoding problem [1] [3] seeks to characterize

Lecture 4 Noisy Channel Coding

Error Exponent Region for Gaussian Broadcast Channels

Reliable Communication Under Mismatched Decoding

Universal A Posteriori Metrics Game

A Comparison of Superposition Coding Schemes

An Alternative Proof of Channel Polarization for Channels with Arbitrary Input Alphabets

Variable Length Codes for Degraded Broadcast Channels

(each row defines a probability distribution). Given n-strings x X n, y Y n we can use the absence of memory in the channel to compute

The Method of Types and Its Application to Information Hiding

Universal Anytime Codes: An approach to uncertain channels in control

Exact Random Coding Error Exponents of Optimal Bin Index Decoding

Dispersion of the Gilbert-Elliott Channel

Achievable Rates for Probabilistic Shaping

Finding the best mismatched detector for channel coding and hypothesis testing

LECTURE 10. Last time: Lecture outline

Arimoto Channel Coding Converse and Rényi Divergence

Bit-Interleaved Coded Modulation Revisited: A Mismatched Decoding Perspective

Distributed Lossless Compression. Distributed lossless compression system

The Poisson Channel with Side Information

Lecture 1: The Multiple Access Channel. Copyright G. Caire 12

Exercise 1. = P(y a 1)P(a 1 )

Multiaccess Channels with State Known to One Encoder: A Case of Degraded Message Sets

Approaching Blokh-Zyablov Error Exponent with Linear-Time Encodable/Decodable Codes

Chapter 4. Data Transmission and Channel Capacity. Po-Ning Chen, Professor. Department of Communications Engineering. National Chiao Tung University

On Gaussian MIMO Broadcast Channels with Common and Private Messages

Error Exponent Regions for Gaussian Broadcast and Multiple Access Channels

Optimal Power Control in Decentralized Gaussian Multiple Access Channels

On Scalable Source Coding for Multiple Decoders with Side Information

Multiuser Successive Refinement and Multiple Description Coding

Capacity Region of Reversely Degraded Gaussian MIMO Broadcast Channel

On Achievable Rates for Channels with. Mismatched Decoding

for some error exponent E( R) as a function R,

A New Achievable Region for Gaussian Multiple Descriptions Based on Subset Typicality

On the Capacity Region of the Gaussian Z-channel

Subset Typicality Lemmas and Improved Achievable Regions in Multiterminal Source Coding

Simultaneous Nonunique Decoding Is Rate-Optimal

Superposition Encoding and Partial Decoding Is Optimal for a Class of Z-interference Channels

Channel Polarization and Blackwell Measures

1 Introduction to information theory

Reliable Computation over Multiple-Access Channels

A Novel Asynchronous Communication Paradigm: Detection, Isolation, and Coding

A Summary of Multiple Access Channels

Feedback Capacity of the Compound Channel

5958 IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 56, NO. 12, DECEMBER 2010

Nearest Neighbor Decoding in MIMO Block-Fading Channels With Imperfect CSIR

Bit-interleaved coded modulation revisited: a mismatched decoding perspective Martinez Vicente, A.; Guillén i Fàbregas, A.; Caire, G.; Willems, F.M.J.

An Achievable Rate Region for the 3-User-Pair Deterministic Interference Channel

Upper Bounds on the Capacity of Binary Intermittent Communication

Analytical Bounds on Maximum-Likelihood Decoded Linear Codes: An Overview

Second-Order Asymptotics for the Gaussian MAC with Degraded Message Sets

CONSIDER a joint stationary and memoryless process

Optimization in Information Theory

EE 4TM4: Digital Communications II. Channel Capacity

Tightened Upper Bounds on the ML Decoding Error Probability of Binary Linear Block Codes and Applications

Tightened Upper Bounds on the ML Decoding Error Probability of Binary Linear Block Codes and Applications

On the Capacity of the Interference Channel with a Relay

EE5139R: Problem Set 7 Assigned: 30/09/15, Due: 07/10/15

AN INTRODUCTION TO SECRECY CAPACITY. 1. Overview

Degrees of Freedom Region of the Gaussian MIMO Broadcast Channel with Common and Private Messages

Intermittent Communication

Simple Channel Coding Bounds

Bounds on Mutual Information for Simple Codes Using Information Combining

Variable-Rate Universal Slepian-Wolf Coding with Feedback

Lecture 8: Shannon s Noise Models

EE376A: Homework #3 Due by 11:59pm Saturday, February 10th, 2018

Representation of Correlated Sources into Graphs for Transmission over Broadcast Channels

Solutions to Homework Set #3 Channel and Source coding

National University of Singapore Department of Electrical & Computer Engineering. Examination for

A Proof of the Converse for the Capacity of Gaussian MIMO Broadcast Channels

Lecture 4 Channel Coding

Linear Programming Decoding of Binary Linear Codes for Symbol-Pair Read Channels

An Extended Fano s Inequality for the Finite Blocklength Coding

On Multiple User Channels with State Information at the Transmitters

Lecture 6 I. CHANNEL CODING. X n (m) P Y X

Appendix B Information theory from first principles

Information Theory. Lecture 10. Network Information Theory (CT15); a focus on channel capacity results

A Half-Duplex Cooperative Scheme with Partial Decode-Forward Relaying

SUCCESSIVE refinement of information, or scalable

Capacity of a channel Shannon s second theorem. Information Theory 1/33

The Gallager Converse

SHARED INFORMATION. Prakash Narayan with. Imre Csiszár, Sirin Nitinawarat, Himanshu Tyagi, Shun Watanabe

Lecture 5: Channel Capacity. Copyright G. Caire (Sample Lectures) 122

Lossy Distributed Source Coding

Shannon s noisy-channel theorem

Information Theory for Wireless Communications. Lecture 10 Discrete Memoryless Multiple Access Channel (DM-MAC): The Converse Theorem

A Single-letter Upper Bound for the Sum Rate of Multiple Access Channels with Correlated Sources

Equivalence for Networks with Adversarial State

Coding into a source: an inverse rate-distortion theorem

An Improved Sphere-Packing Bound for Finite-Length Codes over Symmetric Memoryless Channels

Cut-Set Bound and Dependence Balance Bound

SHARED INFORMATION. Prakash Narayan with. Imre Csiszár, Sirin Nitinawarat, Himanshu Tyagi, Shun Watanabe

Simplified Composite Coding for Index Coding

THIS paper is aimed at designing efficient decoding algorithms

Cases Where Finding the Minimum Entropy Coloring of a Characteristic Graph is a Polynomial Time Problem

An instantaneous code (prefix code, tree code) with the codeword lengths l 1,..., l N exists if and only if. 2 l i. i=1

Network coding for multicast relation to compression and generalization of Slepian-Wolf

EECS 750. Hypothesis Testing with Communication Constraints

Lecture 5 Channel Coding over Continuous Channels

Transcription:

An Achievable Error Exponent for the Mismatched Multiple-Access Channel Jonathan Scarlett University of Cambridge jms265@camacuk Albert Guillén i Fàbregas ICREA & Universitat Pompeu Fabra University of Cambridge guillen@ieeeorg Abstract This paper considers channel coding for the discrete memoryless multiple-access channel with a given possibly suboptimal decoding rule Using constant-composition random coding, an achievable error exponent is obtained which is tight with respect to the ensemble average, and positive for all rate pairs in the interior of Lapidoth s achievable rate region I INTRODUCTION The problem of channel coding with a mismatched decoding rule arises in numerous settings 1 5 For example, in practical systems the decoder may have imperfect knowledge of the channel, or implementation constraints may prohibit the use of an optimal decoder The problem of finding the mismatched capacity is an open problem in general, and most existing results are on achievable rates via random coding Of particular note is the LM rate, which can be obtained via constant-composition random coding 3, 4 or iid random coding with a cost constraint 5 The mismatched multiple-access channel MAC was first considered by Lapidoth 1, who obtained an achievable rate region and showed the surprising fact that the single-user LM rate can be improved by treating the single-user channel as a MAC As an example, Lapidoth considered the channel in Figure 1 consisting of two parallel binary symmetric channels BSCs with crossover probabilities δ 1 < 5 and δ 2 < 5 The mismatched decoder assumes that both crossover probabilities are equal to δ < 5 By treating the channel as a mismatched single-user channel from x 1, x 2 to y 1, y 2 and using random coding with a uniform distribution on the quaternary input alphabet, one can only achieve rates R satisfying δ1 + δ 2 R < 2 1 H 2 1 2 where H 2 is the binary entropy function in bits On the other hand, by treating the channel as a mismatched MAC from x 1 and x 2 to y 1, y 2 and using random coding with equiprobable input distributions on each binary input alphabet, one can achieve any sum-rate R satisfying R < 1 H 2 δ 1 + 1 H 2 δ 2 2 This is the best rate possible even under maximum-likelihood ML decoding This work has been supported in part by the European Research Council under ERC grant agreement 259663 δ 1 1 δ 1 x 1 y 1 1 1 1 δ 1 x 2 Figure 1 δ 1 δ 2 δ 2 1 δ 2 1 1 1 δ 2 y 2 Lapidoth s Parallel BSC Example A random coding converse was also given in 1, showing that the the random-coding error probability tends to one for rate pairs outside the given achievable rate region In this paper, we strengthen the results of 1 by obtaining random-coding error exponents for the constant-composition ensemble which are tight with respect to the ensemble average, and positive within the interior of the rate region given in 1 The problem of finding the best achievable error exponents for the MAC is unsolved even in the matched regime; see 6 and references therein To our knowledge, no complete results on the ensemble tightness of error exponents for the MAC have been given previously, even in the matched regime While error exponents for a given decoding rule are presented in 6, they are not tight enough to prove the achievability of Lapidoth s rate region For example, for the parallel BSC example in Figure 1 with uniform input distributions, the exponents of 6 are only positive when the sum-rate R satisfies 1, whereas the ensemble-tight exponent can be positive for all sum-rates satisfying 2 We will see that the key difference in our analysis is a refined application of the union bound see Section III-B Improving the standard use of the union bound was also a key idea in 1, but it was done differently to the present paper

A System Setup We consider a 2-user discrete memoryless MAC DM-MAC W y x 1, x 2 with input alphabets X 1 and X 2 and output alphabet Y The decoding metric is denoted by qx 1, x 2, y We write W y x 1, x 2 and qx 1, x 2, y as a shorthand for n i=1 W y i x 1,i, x 2,i and n i=1 qx 1,i, x 2,i, y i respectively, where y i is the i-th entry of y and similarly for x 1,i and x 2,i The encoders and decoder operate as follows Encoder ν ν 1, 2 selects a message m ν equiprobably from the set 1,, M ν, and transmits the corresponding codeword x mν ν from the codebook C ν = x 1 ν,, x Mν ν Upon receiving the signal y at the output of the channel, the decoder forms an estimate m 1, m 2 of the messages, given by m 1, m 2 = arg max qx i i 1,,M 1,j 1,,M 2 1, xj 2, y 3 We assume that ties are broken at random An error is said to have occurred if the estimate m 1, m 2 differs from m 1, m 2 We distinguish between the following three types of error: Type 1 m 1 m 1 and m 2 = m 2 Type 2 m 1 = m 1 and m 2 m 2 Type 12 m 1 m 1 and m 2 m 2 The probabilities of these events are denoted by p e,1, p e,2 and p e,12 respectively, and the overall error probability is denoted by p e The ensemble-average error probabilities for a given random-coding ensemble are denoted by p e,1, p e,2, p e,12 and p e respectively Clearly we have maxp e,1, p e,2, p e,12 p e p e,1 + p e,2 + p e,12 4 and similarly for p e A rate pair R 1, R 2 is said to be achievable if there exist sequences of codebooks with M 1 = expnr 1 and M 2 = expnr 2 codewords of length n for users 1 and 2 respectively such that p e We say that ER 1, R 2 is an achievable error exponent if there exist sequences of codebooks with M 1 = expnr 1 and M 2 = expnr 2 codewords of length n such that lim 1 n n log p e ER 1, R 2 5 For a given random-coding ensemble, we say that the random-coding error exponent E r R 1, R 2 exhibits ensemble tightness if B Notation lim 1 n n log p e = E r R 1, R 2 6 The set of all probability distributions on an alphabet A is denoted by PA The set of all sequences with a given type P X is denoted by T P X, and similarly for joint types We refer the reader to 7, 8 for an introduction to the method of types The main properties of types used in this paper are outlined in the Appendix The probability of an event is denoted by P The symbol means distributed as The marginals of a joint distribution P XY x, y are denoted by P X x and P Y y Similarly, P Y X y x denotes the conditional distribution induced by P XY x, y We write P X = P X to denote element-wise equality between two probability distributions on the same alphabet For a joint distribution P XY x, y, expectations are denoted by E P When the probability distribution is understood from the context, we simply write E Given a distribution Qx and a conditional distribution W y x, we write Q W to denote the joint distribution QxW y x, and similarly when there are more than two distributions For example, given Q 1 x 1, Q 2 x 2 and W y x 1, x 2 we have Q 1 Q 2 W Q 1 x 1 Q 2 x 2 W y x 1, x 2 7 Mutual information with respect to a joint distribution P XY x, y is written with a subscript, eg I P X; Y = x,y P XY x, y log P XY x, y P X xp Y y 8 For two sequences fn and gn, we write fn = gn 1 fn if lim n n log gn =, and similarly for and All logarithms have base e, and all rates are in units of nats except in the examples, where bits are used We define c + = max, c, and denote the indicator function by 1 II ERROR EXPONENT FOR CONSTANT-COMPOSITION RANDOM CODING In this section, we present the ensemble-tight randomcoding error exponent for the constant-composition ensemble, in which each codeword of a given user has the same empirical distribution Using this exponent, we prove the achievability of Lapidoth s achievable rate region 1 The derivation of the error exponent and the discussion of the analysis are postponed until Section III We let X i ν be the random variable corresponding to the i- th codeword of user ν, and let Y denote the random sequence at the output of the channel The codewords are distributed according to X i 1 M1 i=1, M 1 Xj 2 M2 i=1 i=1 M2 Q X1 x i 1 j=1 Q X2 x j 2 where Q Xν is the codeword distribution for user ν We assume without loss of generality that message 1, 1 is transmitted, and write X 1 and X 2 in place of X 1 1 and X 1 2 We write X 1 and X 2 to denote arbitrary codewords which are generated independently of X 1 and X 2 We fix Q 1 x 1 PX 1 and take Q X1 to be the uniform distribution over the type class T Q 1,n, where Q 1,n P n X 1 is the most probable type under Q 1 ; similarly for Q 2 x 2 and Q 2,n That is, Q X1 x 1 = 9 1 T Q 1,n 1 x 1 T Q 1,n 1

1 Q X2 x 2 = T Q 2,n 1 x 2 T Q 2,n 11 To ease notation, we write fq to denote a function f which depends on Q 1 and Q 2, and similarly for Q n Remark: All of the results in this paper can easily be extended to the ensemble in which the codewords are generated conditionally on a time-sharing sequence u, such that the joint type of u, x 1 is fixed for every user-1 codeword x 1, and similarly for user 2 eg see 6 However, in the mismatched setting there are some subtle differences between the performance of this ensemble and that of explicit time-sharing, and their study is beyond the scope of this paper The error exponents and achievable rates will be expressed in terms of the sets SQ = P X1X 2Y PX 1 X 2 Y : P X1 = Q 1, P X2 = Q 2 12 T 1 P X1X 2Y = P X1X 2Y PX 1 X 2 Y : P X1 = P X1, P X2Y = P X2Y, log qx 1, X 2, Y E P log qx 1, X 2, Y T 2 P X1X 2Y = P X1X 2Y PX 1 X 2 Y : P X2 = P X2, P X1Y = P X1Y, log qx 1, X 2, Y E P log qx 1, X 2, Y T 12 P X1X 2Y = P X1X 2Y PX 1 X 2, Y : 13 14 P X1 = P X1, P X2 = P X2, P Y = P Y, log qx 1, X 2, Y E P log qx 1, X 2, Y 15 The following theorem gives the random-coding error exponent for each error type Theorem 1 The random-coding error probabilities for the constant-composition ensemble in 9 11 satisfy where E r,1 Q, R 1 = p e,1 = exp ner,1 Q, R 1 16 p e,2 = exp ner,2 Q, R 2 17 p e,12 = exp ner,12 Q, R 1, R 2 18 P X1 X 2 Y SQ P X1 X 2 Y T 1P X1 X 2 Y DP X1X 2Y Q 1 Q 2 W + I P X 1 ; X 2, Y R 1 + 19 E r,2 Q, R 2 = P X1 X 2 Y SQ P X1 X 2 Y T 2P X1 X 2 Y DP X1X 2Y Q 1 Q 2 W + I P X 2 ; X 1, Y R 2 + 2 E r,12 Q, R 1, R 2 = P X1 X 2 Y SQ P X1 X 2 Y T 12P X1 X 2 Y DP X1X 2Y Q 1 Q 2 W + max ν 1,2 Ψ ν P X1X 2Y, R 1, R 2 and 21 Ψ 1 P X1X 2Y, R 1, R 2 = I P X 2 ; Y + I P X 1 ; X 2, Y R 1 + R2 + 22 Ψ 2 P X1X 2Y, R 1, R 2 = I P X 1 ; Y + I P X 2 ; X 1, Y R 2 + R1 + 23 Proof: See Section III Due to the lack of converse results in mismatched decoding, it is important to detere whether the weakness in the achievability results is due to the ensemble itself, or the bounding techniques used in the analysis Theorem 1 states that the overall error exponent E r Q, R 1, R 2 = E r,1 Q, R 1, E r,2 Q, R 2, E r,12 Q, R 1, R 2 24 is not only achievable, but it is also tight with respect to the ensemble average The following achievable rate region follows from Theorem 1 in a straightforward fashion, and coincides with the ensemble-tight achievable rate region of 1 Theorem 2 The overall error exponent E r Q, R 1, R 2 is positive for all rate pairs R 1, R 2 in the interior of R LM Q, where R LM Q is the set of all rate pairs R 1, R 2 satisfying R 1 R 2 R 1 + R 2 I P X 1 ; X 2, Y 25 P X1 X 2 Y T 1Q 1 Q 2 W I P X 2 ; X 1, Y 26 P X1 X 2 Y T 2Q 1 Q 2 W D P X1X2Y Q 1 Q 2 P Y P X1 X 2 Y T 12Q 1 Q 2 W I P X 1;Y R 1, I P X 2;Y R 2 27 Proof: The conditions in 25 27 are obtained from the error exponents in 19 21 respectively Focusing on 27, we note that the objective in 21 is always positive

when DP X1X 2Y Q 1 Q 2 W > In the case that the imizing P X1X 2Y satisfies DP X1X 2Y Q 1 Q 2 W =, we obtain that P X1X 2Y = Q 1 Q 2 W, and hence E r,12 is positive provided that either or R 2 I P X 2 ; Y + I P X 1 ; X 2, Y R 1 + 28 R 1 I P X 1 ; Y + I P X 2 ; X 1, Y R 2 + 29 under the imizing P X1X 2Y in 21 The condition in 28 corresponds to the the case that Ψ 1 achieves the maximum in 21, and 29 corresponds to the case that Ψ 2 achieves the maximum Finally, 28 and 29 can be combined to obtain 27 by noting that 28 respectively, 29 always holds when I P X 2 ; Y > R 2 respectively, I P X 1 ; Y > R 1, and using I P X 2 ; Y + I P X 1 ; X 2, Y = D P X1X 2Y Q 1 Q 2 P Y 3 I P X 1 ; Y + I P X 2 ; X 1, Y = D P X1X 2Y Q 1 Q 2 P Y 31 Using the usual time-sharing argument 1, 9, it follows from Theorem 2 that we can achieve any rate pair in the convex hull of R LM Q Q where the union is over all input distributions Q 1 and Q 2 on X 1 and X 2 respectively In the proof of Theorem 1, it will be shown that a weaker analysis yields the achievable type-12 error exponent E r,12q, R 1, R 2 = P X1 X 2 Y SQ DP X1X 2Y Q 1 Q 2 W P X1 X 2 Y T 12P X1 X 2 Y + D P X1X 2Y Q 1 Q 2 P Y R 1 + R 2 + 32 which coincides with an achievable exponent given in 6 Using a similar argument to the proof of Theorem 2, we see that 32 yields a similar achievable rate region to 25 25, but with 27 replaced by R 1 + R 2 D P X1X 2Y Q 1 Q 2 P Y P X1 X 2 Y T 12Q 1 Q 2 W 33 In the following subsection, we compare the ensembletight type-12 exponent and the corresponding achievable rate region with that of 32 33 A Numerical Example We now return to the parallel BSC example given in Figure 1, where the output is given by y = y 1, y 2 As mentioned in the introduction, the decoder assumes that both crossover probabilities are equal It is straightforward to show that the corresponding decoding rule is equivalent to imizing the sum of t 1 and t 2, where t ν is the number of bit flips from the input sequence x ν to the output sequence y ν As noted in 1, this decision rule is in fact equivalent to ML This channel could easily be analyzed by treating the two subchannels separately, but we treat it as a MAC because it serves as a good example for comparing the ensemble-tight achievability results with 32 33 We let both Q 1 and Q 2 be the uniform distribution on, 1 With this choice of input distributions, it was shown in 1 that the right-hand side of 33 is no greater than δ1 + δ 2 2 1 H 2 34 2 On the other hand, the refined condition in 27 can be used to prove the achievability of any R 1, R 2 within the rectangle defined by the corners, and C 1, C 2, where C ν = 1 H2 δ ν 1 This observation is analogous to the comparison between 1 and 2 in the introduction The main difference is that the weakness in 1 is in the randomcoding ensemble itself, whereas the weakness in 34 is in the bounding techniques used in the analysis We evaluate the error exponents using the optimization software YALMIP 1 Figure 2 plots each of the exponents as a function of α, where the rate pairs are given by R 1, R 2 = αc 1, αc 2 While the overall error exponent E r Q, R 1, R 2 in 24 is unchanged at low to moderate values of α when E r,12 is used in place of E r,12, this is not true for high values of α Furthermore, consistent with the preceding discussion, E r,12 is non-zero only for α < 865, whereas E r,12 is positive for all α < 1 It is interesting to note that the curves E r,12 and E r,12 coincide at low values of α Roughly speaking, the reason for this is that the the arguments to the + functions in 22 23 are positive when the rates are sufficiently small This is consistent with 6, Corollary 5, which states that 32 is ensemble-tight at low rates III PROOF OF THEOREM 1 While the random-coding error probabilities p e,1 and p e,2 can be handled very similarly to the single-user setting 8, p e,12 requires a more refined analysis Furthermore, equivalent error exponents to 19 2 are given in 6; we therefore focus exclusively on p e,12 We first write p e,12 = c 12 E P i qx 1, Xj 2, Y 1 X 1, X 2, Y qx 1, X 2, Y for some c 12 1 2, 1 Setting c 12 = 1 yields the average probability of error when ties are decoded as errors, and the condition c 12 1 2, 1 arises since decoding ties at random reduces the error probability by at most a factor of two 35

35 3 25 4 x 1 3 3 2 can be written as P X1 X 2 Y T 12,nP X1 X 2 Y X i 1, Xj 2, y T P X1X 2Y 39 Error Exponent 2 15 1 5 1 8 9 1 2 4 6 8 1 α Figure 2 Error exponents E r,1 dotted, E r,2 dash-dot, E r,12 solid and E r,12 dashed for the parallel channel shown in Figure 1 using δ 1 = 5, δ 2 = 25 and equiprobable input distributions We will rewrite 35 in terms of the possible joint types of X 1, X 2, Y and X i, Y To this end, we define S n Q n = T 12,n P X1X 2Y = 1, Xj 2 P X1X 2Y P n X 1 X 2 Y : P X1 = Q 1,n, P X2 = Q 2,n P X1X 2Y P n X 1 X 2 Y : 36 P X1 = P X1, P X2 = P X2, P Y = P Y, log qx 1, X 2, Y E P log qx 1, X 2, Y 37 Roughly speaking, S n is the set of possible joint types of X 1 1, X1 2, Y, and T 12,n is the set of types of X i 1, Xj 2, Y which lead to decoding errors when X 1, X 2, Y T P X1X 2Y The constraints on P Xν and P Xν arise from the fact that we are using constant-composition random coding, and the constraint log qx 1, X 2, Y E P log qx 1, X 2, Y holds if and only if qx1,x2,y qx 1 for x 1,x 2,y 1, x 2, y T P X1X 2Y and x 1, x 2, y T P X1X 2Y Fixing P X1X 2Y S n Q n and letting x 1, x 2, y be a triplet of sequences such that x 1, x 2, y T P X1X 2Y, it follows that the event Expanding the probability and expectation in 35, substituting 39, and interchanging the order of the unions, we obtain p e,12 = c 12 P where P X1 X 2 Y S nq n P X1 X 2 Y T 12,nP X1 X 2 Y X1X 2Y = P X 1, X 2, Y T P X1X 2Y X1X 2Y Y T P Y 4 X i 1, Xj 2, Y T P X1X 2Y 41 We define the conditional probability p E P X1X 2Y = P X1X 2Y Y T P Y 42 We have replaced the condition Y T P Y by Y T P Y since we have that PY = P Y for all PX1X 2Y T 12,n P X1X 2Y Applying the union bound to 4 and using the fact that the number of joint types is polynomial in n, we obtain p e,12 = max P X1 X 2 Y S nq n P X 1, X 2, Y T P X1X 2Y max p E P X1X 2Y 43 P X1 X 2 Y T 12,nP X1 X 2 Y = max P X1 X 2 Y S nq n exp ndp X1X 2Y Q 1 Q 2 W max p E P X1X 2Y 44 P X1 X 2 Y T 12,nP X1 X 2 Y where 44 follows from the property of types in 7 It remains to detere the exponential behavior of p E Lemma 3 The probability p E P X1X 2Y satisfies p E P X1X 2Y = exp n max Ψ ν P X1X 2Y, R 1, R 2 ν 1,2 45 i qx 1, Xj 2, y 1 qx 1, x 2, y 38 for any P X1X 2Y such that PX1X 2Y T 12,n P X1X 2Y for some P X1X 2Y S n Q, where Ψ 1 and Ψ 2 are defined in 22 and 23 respectively

Proof: See Section III-A for the upper bound, and Section III-C for the matching lower bound Substituting 45 into 44 and noting that the sets S n and T 12,n can be replaced by S and T 12 respectively, we recover the exponent in 21, and the proof is complete A Upper Bound on p E P X1X 2Y In this subsection, it will be convenient to write p E as p E P X1X 2Y = P X i 1, Xj i 1 j 1 2, y T P X1X 2Y 46 where y is an arbitrary sequence such that y T P Y We upper bound the probability in 46 by applying the truncated union bound to one union at a time Since there are M 2 1 identically distributed codewords for user 2, we have p E P X1X 2Y 1, M 2 1 P X i 1, X 2, y T P X1X 2Y 47 i 1 i 1 = 1, M 2 1 2 E P X i 1, X 2, y T P X1X 2Y X 48 Similarly, since there are M 1 1 identically distributed codewords for user 1, we obtain p E P X1X 2Y 1, M 2 1E 1, M 1 1P X 1, X 2, y T P X1X 2Y X 2 49 The inner probability in 49 is zero unless X 2, y T P X2Y, since any other joint marginal must give a joint type of X 1, X 2, y which differs from P X1X 2Y Hence, instead of writing the expectation in 49 as a summation over joint types of X 2, y, we can limit attention to the case that X 2, y T P X2Y, yielding p E P X1X 2Y 1, M 2 1P X 2, y T P X2Y M 1 1P X 1, x 2, y T P X1X 2Y 5 where x 2 is an arbitrary sequence such that x 2, y T P X2Y Substituting the properties of types in 68 and 69 into 5 yields p E P X1X 2Y exp nψ 1 P X1X 2Y, R 1, R 2 51 where Ψ 1 is defined in 22 By following the steps from 47 51 with the union bounds applied in the opposite order, it can similarly be shown that p E P X1X 2Y exp nψ 2 P X1X 2Y, R 1, R 2 52 where Ψ 1 is defined in 23 We therefore obtain the righthand side of 45 B Discussion The key idea used in Section III-A is to apply the union bound to 46 one union at a time If the union bound was instead applied to all M 1 1M 2 1 events at once, then the inner + functions of 22 and 23 would have been replaced by their argument, yielding the exponent E r,12 in 32 and the corresponding achievable rate condition in 33 Hence, only the refined analysis is powerful enough to yield the ensemble-tight error exponent We state without proof that under ML decoding ie qx 1, x 2, y = W y x 1, x 2, the overall error exponent E r Q, R 1, R 2 given in 24 is unchanged when E r,12 in 32 is used in place of E r,12 1 That is, while the refined analysis of Section III-A is necessary to obtain the ensemble-tight error exponent E r Q, R 1, R 2 under mismatched decoding, the analysis of 6 suffices under ML decoding C Lower Bound on p E P X1X 2Y In order to lower bound p E, we will make use of the following result due to de Caen 11 Proposition 4 11 Let A 1,, A k be a sequence of probabilistic events Then k k PA i 2 P A i k j=1 PA i A j 53 i=1 i=1 We begin by rewriting 42 as p E P X1X 2Y = P Eij P X1X 2Y Y T P Y 54 where E ij P X1X 2Y is the event that X i 1, Xj 2, Y T P X1X 2Y Using 53, we obtain from 54 that p E P X1X 2Y PE ij 2 i 1,j 1 PE ij E i j 55 where the argument to E ij and the conditioning of the probabilities on the event Y T P Y is kept implicit 1 While it is possible that E r,12 > E r,12 under ML decoding, it can be shown that this never occurs in the region where E r,12 is the doant exponent ie achieves the imum in 24

We claim that the pairwise probabilities of the events E ij P X1X 2Y satisfy PE ij E ij = e nd P X1 X 2 Y Q 1 Q 2 P Y 56 PE ij E i j = e n D P X1 X 2 Y Q 1 Q 2 P Y +I P X 1;X 2,Y PE ij E ij = e n D P X1 X 2 Y Q 1 Q 2 P Y +I P X 2;X 1,Y PE ij E i j = e n 2D P X1 X 2 Y Q 1 Q 2 P Y 57 58 59 where i i and j j The first case follows from the property of types in 71, and the final case follows from the pairwise conditional independence of E ij and E i j when i i and j j The second case follows from the property of types in 72, whose proof is outlined in the Appendix The third case is analogous to the second with the roles of users 1 and 2 reversed An inspection of the denoator in 55 reveals that there are 1, M 1 2, M 2 2 and M 1 2M 2 2 terms in the sum corresponding to the four cases in 56 59 respectively Furthermore, by symmetry, each term in the outer summation of 55 is equal Hence, substituting 56 59 into 55 and canceling a common term of e nd P X1 X 2 Y Q 1 Q 2 P Y from the numerator and denoator, we obtain p E P X1X 2Y M 1 1M 2 1e nd P X1 X 2 Y Q 1 Q 2 P Y 1 + M 1 2e ni P X 1;X 2,Y + M 2 2e ni P X 2;X 1,Y + M 1 2M 2 2e nd P X1 X 2 Y Q 1 Q 2 P Y 1 6 = M 1 M 2 e nd P X1 X 2 Y Q 1 Q 2 P Y 1 + M 1 M 2 e nd P X1 X 2 Y Q 1 Q 2 P Y 1 + max M 1 e ni P X 1;X 2,Y, M 2 e ni P X 2;X 1,Y 61 where 61 follows since the sum of two terms has the same exponential behavior as their maximum Let us first consider the case that the maximum in 61 is achieved by M 1 e ni P X 1;X 2,Y In this case, we have p E P X1X 2Y M 1 M 2 e nd P X1 X 2 Y Q 1 Q 2 P Y 1 1 +M 1 e ni P X 1;X 2,Y +M 1 M 2 e nd P X1 X 2 Y Q 1 Q 2 P Y = M 2 e ni P X 2;Y M 1 e ni P X 1;X 2,Y 1 + M 1 e ni P X 1;X 2,Y 1 + M 2 e ni P X 2;Y M 1 e ni P X 1;X 2,Y 1 63 1 + M 1 e ni P X 1;X 2,Y 62 where 63 follows by dividing the numerator and denoator by 1 + M 1 e ni P X 1;X 2,Y and making use of 3 Applying the inequality 1, a twice, we obtain a 1+a 1 2 p E P X1X 2Y 1, M 2 e ni P X 2;Y 1, M 1 e ni P X 1;X 2,Y 64 = exp nψ 1 P X1X 2Y, R 1, R 2 65 where Ψ 1 is defined in 22 Similarly, in the case that the maximum in 61 is achieved by M 2 e ni P X 2;X 1,Y, we obtain p E P X1X 2Y exp nψ 2 P X1X 2Y, R 1, R 2 66 where Ψ 2 is defined in 23 Combining 65 and 66, we obtain the right-hand side of 45 APPENDIX Here we state the properties of types used in this paper We use the notation and definitions given at the beginning of Section II The random variables X 1, X 2, Y, X 1, X 2, X 1, X 2 are distributed according to Q X1 x 1 Q X2 x 2 W y x 1, x 2 Q X1 x 1 Q X2 x 2 Q X1 x 1 Q X2 x 2 67 We then have the following 1 For ν 1, 2, if y T P Y then P X ν, y T P = XνY exp ni P X ν; Y 68 for any P XνY such that P Xν = Q ν,n and P Y = P Y 2 If x 2, y T P X2Y then P X 1, x 2, y T P X1X 2Y = exp ni P X 1 ; X 2, Y 69 for any P X1X 2Y such that PX1 = Q 1,n and P X2Y = P X2Y 3 The probability of X 1, X 2, Y having a given type satisfies P X 1, X 2, Y T P X1X 2Y = exp ndp X1X 2Y Q 1 Q 2 W 7 for any P X1X 2Y with marginals P X1 = Q 1,n and P X2 = Q 2,n 4 If y T P Y, then P X 1, X 2, y T P X1X 2Y = exp nd PX1X 2Y Q 1 Q 2 P Y 71

P X 1, X 2, y T P X1X 2Y X 1, X 2, y T P X1X 2Y = exp n D PX1X2Y Q 1 Q 2 P Y + I P X 1 ; X 2, Y 72 where each equation holds for any P X1X 2Y such that P X1 = Q 1,n, PX2 = Q 2,n and P Y = P Y The proofs of 68 71 are omitted, since each is either a known property or a straightforward extension thereof, eg see 7, 8 To prove 72, we write the left-hand side as 2 P X 2, y T P X2Y P X 1, x 2, y T P X1X 2Y 73 where x 2 is an arbitrary sequence such that x 2, y T P X2Y Substituting 68 and 69 into 73 and using the identity in 3, we obtain 72 ACKNOWLEDGEMENT The authors would like to thank Alfonso Martinez for helpful discussions REFERENCES 1 A Lapidoth, Mismatched decoding and the multiple-access channel, IEEE Trans Inf Theory, vol 42, no 5, pp 1439 1452, Sep 1996 2 N Merhav, G Kaplan, A Lapidoth, and S Shamai, On information rates for mismatched decoders, IEEE Trans Inf Theory, vol 4, no 6, pp 1953 1967, Nov 1994 3 J Hui, Fundamental issues of multiple accessing, PhD dissertation, MIT, 1983 4 I Csiszár and P Narayan, Channel capacity for a given decoding metric, IEEE Trans Inf Theory, vol 45, no 1, pp 35 43, Jan 1995 5 A Ganti, A Lapidoth, and E Telatar, Mismatched decoding revisited: general alphabets, channels with memory, and the wide-band limit, IEEE Trans Inf Theory, vol 46, no 7, pp 2315 2328, Nov 2 6 A Nazari, A Anastasopoulos, and S S Pradhan, Error exponent for multiple-access channels: Lower bounds, arxiv:11133v1 csit 7 I Csiszár and J Körner, Information Theory: Coding Theorems for Discrete Memoryless Systems, 2nd ed Cambridge University Press, 211 8 R Gallager, Fixed composition arguments and lower bounds to error probability, http://webmitedu/gallager/www/notes/notes5pdf 9 A El Gamal and Y H Kim, Network Information Theory Cambridge University Press, 211 1 J Löfberg, YALMIP : A toolbox for modeling and optimization in MATLAB, in Proc CACSD Conf, Taipei, Taiwan, 24 11 D de Caen, A lower bound on the probability of a union, Discrete Mathematics, vol 169, pp 217 22, 1997