Asymptotic Eigenvalue Moments for Linear Multiuser Detection

Similar documents
Design of MMSE Multiuser Detectors using Random Matrix Techniques

Spectral Efficiency of CDMA Cellular Networks

Random Matrices and Wireless Communications

Performance of Reduced-Rank Linear Interference Suppression

Multiuser Receivers, Random Matrices and Free Probability

ANALYSIS OF A PARTIAL DECORRELATOR IN A MULTI-CELL DS/CDMA SYSTEM

MULTICARRIER code-division multiple access (MC-

On the Optimum Asymptotic Multiuser Efficiency of Randomly Spread CDMA

DS-GA 1002 Lecture notes 0 Fall Linear Algebra. These notes provide a review of basic concepts in linear algebra.

These outputs can be written in a more convenient form: with y(i) = Hc m (i) n(i) y(i) = (y(i); ; y K (i)) T ; c m (i) = (c m (i); ; c m K(i)) T and n

Decoupling of CDMA Multiuser Detection via the Replica Method

Optimal Sequences, Power Control and User Capacity of Synchronous CDMA Systems with Linear MMSE Multiuser Receivers

LECTURE 18. Lecture outline Gaussian channels: parallel colored noise inter-symbol interference general case: multiple inputs and outputs

Random Matrix Theory Lecture 1 Introduction, Ensembles and Basic Laws. Symeon Chatzinotas February 11, 2013 Luxembourg

Mathematical Induction

CDMA Systems in Fading Channels: Admissibility, Network Capacity, and Power Control

0 Sets and Induction. Sets

Linear Algebra Notes. Lecture Notes, University of Toronto, Fall 2016

Introduction Reduced-rank ltering and estimation have been proposed for numerous signal processing applications such as array processing, radar, model

ADVANCED CALCULUS - MTH433 LECTURE 4 - FINITE AND INFINITE SETS

Determinants of Partition Matrices

NATIONAL BOARD FOR HIGHER MATHEMATICS. Research Scholarships Screening Test. Saturday, January 20, Time Allowed: 150 Minutes Maximum Marks: 40

Counting Matrices Over a Finite Field With All Eigenvalues in the Field

Performance Analysis of Spread Spectrum CDMA systems

arxiv:cs/ v1 [cs.it] 11 Sep 2006

INVERSE EIGENVALUE STATISTICS FOR RAYLEIGH AND RICIAN MIMO CHANNELS

Modeling and Stability Analysis of a Communication Network System

A matrix over a field F is a rectangular array of elements from F. The symbol

Using Laplacian Eigenvalues and Eigenvectors in the Analysis of Frequency Assignment Problems

Math 554 Qualifying Exam. You may use any theorems from the textbook. Any other claims must be proved in details.

Asymptotic Spectral Distribution of Crosscorrelation Matrix in Asynchronous CDMA

SPECTRAL THEOREM FOR COMPACT SELF-ADJOINT OPERATORS

Power Control in Multi-Carrier CDMA Systems

The Free Central Limit Theorem: A Combinatorial Approach

ELEMENTARY LINEAR ALGEBRA

Representations of Sp(6,R) and SU(3) carried by homogeneous polynomials

Lecture 5: Antenna Diversity and MIMO Capacity Theoretical Foundations of Wireless Communications 1. Overview. CommTh/EES/KTH

In particular, if A is a square matrix and λ is one of its eigenvalues, then we can find a non-zero column vector X with

Received Signal, Interference and Noise

EE6604 Personal & Mobile Communications. Week 13. Multi-antenna Techniques

Upper triangular matrices and Billiard Arrays

Characteristic polynomials of skew-adjacency matrices of oriented graphs

Non white sample covariance matrices.

On the Optimal Phase Control in MIMO Systems with Phase Quantization

CONSIDER transmit and receive antennas with

Doubly Indexed Infinite Series

Math212a1413 The Lebesgue integral.

Boolean Inner-Product Spaces and Boolean Matrices

Chapter 5: Integer Compositions and Partitions and Set Partitions

Spanning and Independence Properties of Finite Frames

Lecture Notes 1: Vector spaces

Throughout these notes we assume V, W are finite dimensional inner product spaces over C.

Notes on SU(3) and the Quark Model

APPENDIX A. Background Mathematics. A.1 Linear Algebra. Vector algebra. Let x denote the n-dimensional column vector with components x 1 x 2.

Measure Theory on Topological Spaces. Course: Prof. Tony Dorlas 2010 Typset: Cathal Ormond

Eventually reducible matrix, eventually nonnegative matrix, eventually r-cyclic

Foundations of Matrix Analysis

Approximately achieving the feedback interference channel capacity with point-to-point codes

Linear Algebra March 16, 2019

Bounds on fast decodability of space-time block codes, skew-hermitian matrices, and Azumaya algebras

Transmit Directions and Optimality of Beamforming in MIMO-MAC with Partial CSI at the Transmitters 1

Multiuser Detection. Ateet Kapur Student Member, IEEE and Mahesh K. Varanasi Senior Member, IEEE

Out-colourings of Digraphs

From Satisfiability to Linear Algebra

Metric Spaces and Topology

Reaching a Consensus in a Dynamically Changing Environment A Graphical Approach

NATIONAL BOARD FOR HIGHER MATHEMATICS. Research Scholarships Screening Test. Saturday, February 2, Time Allowed: Two Hours Maximum Marks: 40

Lecture 8: MIMO Architectures (II) Theoretical Foundations of Wireless Communications 1. Overview. Ragnar Thobaben CommTh/EES/KTH

DISTRIBUTION OF EIGENVALUES OF REAL SYMMETRIC PALINDROMIC TOEPLITZ MATRICES AND CIRCULANT MATRICES

Optimum Signature Sequence Sets for. Asynchronous CDMA Systems. Abstract

Multi-coloring and Mycielski s construction

ADDITIVE noise is most often represented by a fixed

INDISTINGUISHABILITY OF ABSOLUTELY CONTINUOUS AND SINGULAR DISTRIBUTIONS

Linear Algebra M1 - FIB. Contents: 5. Matrices, systems of linear equations and determinants 6. Vector space 7. Linear maps 8.

THE SEMIGROUP βs APPLICATIONS TO RAMSEY THEORY

Interactions of Information Theory and Estimation in Single- and Multi-user Communications

THE UNIVERSITY OF TORONTO UNDERGRADUATE MATHEMATICS COMPETITION. Sunday, March 14, 2004 Time: hours No aids or calculators permitted.

Open Issues on the Statistical Spectrum Characterization of Random Vandermonde Matrices

Lecture 7 MIMO Communica2ons

Counting k-marked Durfee Symbols

A PECULIAR COIN-TOSSING MODEL

MATH 61-02: PRACTICE PROBLEMS FOR FINAL EXAM

Ir O D = D = ( ) Section 2.6 Example 1. (Bottom of page 119) dim(v ) = dim(l(v, W )) = dim(v ) dim(f ) = dim(v )

First we introduce the sets that are going to serve as the generalizations of the scalars.

Introduction to Decision Sciences Lecture 6

Quantum Computing Lecture 2. Review of Linear Algebra

Finite-dimensional spaces. C n is the space of n-tuples x = (x 1,..., x n ) of complex numbers. It is a Hilbert space with the inner product

4184 IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 51, NO. 12, DECEMBER Pranav Dayal, Member, IEEE, and Mahesh K. Varanasi, Senior Member, IEEE

Noncooperative Optimization of Space-Time Signals in Ad hoc Networks

MTH 309 Supplemental Lecture Notes Based on Robert Messer, Linear Algebra Gateway to Mathematics

Course 311: Michaelmas Term 2005 Part III: Topics in Commutative Algebra

Applications of Lattices in Telecommunications

MAT115A-21 COMPLETE LECTURE NOTES

Contents Ordered Fields... 2 Ordered sets and fields... 2 Construction of the Reals 1: Dedekind Cuts... 2 Metric Spaces... 3

8. Prime Factorization and Primary Decompositions

a 11 x 1 + a 12 x a 1n x n = b 1 a 21 x 1 + a 22 x a 2n x n = b 2.

Central Groupoids, Central Digraphs, and Zero-One Matrices A Satisfying A 2 = J

Kernels of Directed Graph Laplacians. J. S. Caughman and J.J.P. Veerman

EIGENVALUES AND EIGENVECTORS 3

Week 9-10: Recurrence Relations and Generating Functions

Transcription:

Asymptotic Eigenvalue Moments for Linear Multiuser Detection Linbo Li and Antonia M Tulino and Sergio Verdú Department of Electrical Engineering Princeton University Princeton, NJ 08544, USA flinbo, atulino, verdug@eeprincetonedu January 27, 2002 Abstract Using the theory of lattices of non-crossing partitions, an explicit expression for the moments and free cumulants of the asymptotic distribution of certain infinite random matrices is obtained and extended to several cases From the explicit expression, we give a self-contained proof of the Tse-Hanly formula for the output signal-tointerference-plus-noise ratio of MMSE multiuser detector We use the moment results to design an asymptotic reduced-rank linear multiuser detector Also we apply these results to analyze linear-conjugate MMSE receivers and multiple antenna receivers Introduction In order to gain insight into the performance of receivers in a DS-CDMA system with large processing gain and many users, much work has been devoted to the asymptotic analysis for synchronous DS-CDMA with random spreading [7, 7, 5] In [5], the asymptotic output signal-to-interference-plus-noise ratio (SINR) of the MMSE receiver is shown to satisfy a fixed point equation by using recent results on the asymptotic eigenvalue distribution of certain infinite random matrices The reduced-rank MMSE receiver is proposed and analyzed in [0] As discussed in [0] the moments of the asymptotic eigenvalue distribution of some infinite random matrix are relevant to find the large system limit of the output SINR and the optimal weights of the reduced-rank receiver Several recursive procedures to obtain the asymptotic eigenvalue moments are proposed in [20, 0] A certain combinatorial problem is shown in [20] to be equivalent to the problem of computing the moments explicitly In Section 2, using the theory of lattices of non-crossing partitions, explicit expressions for the asymptotic eigenvalue moments of the correlation matrix of the CDMA signal SDS H (S is a matrix whose columns are the normalized spreading sequences of the

users, and D is a diagonal matrix with diagonal elements the received powers of the users), with ( ) H denoting the conjugate transpose, are obtained for several classes of infinite random matrices In case (i) S consists of iid random variables, and the explicit expression for the moments is obtained by solving the combinatorial problem formulated in [20] This kind of S (iid entries) arises when we consider a single receive antenna DS- CDMA system with random spreading In case (ii) each column of S consists of repeated iid sequences with block iid phases; this kind of S arises when we consider multiple receive antenna randomly-spread DS-CDMA where the phases of the fading coefficients at different antenna elements are independent In case (iii) each column of S consists of repeated iid sequences with linearly increasing phases; this kind of S arises when we consider multiantenna receiver for DS-CDMA in line-of-sight transmission In cases (iv), (v) and (vi), each column of S is formed by stacking the matrices in (i), (ii) and (iii) with their conjugate These kinds of S arise when we consider the newly proposed linearconjugate MMSE (LCM) receiver [6] for single antenna and multiple antenna systems There are several motivations and applications for the results in Section 2, which are discussed in detail in Section 3 In Section 3, the large-system limit of the optimal weights for the reduced-rank MMSE/LCM receivers is obtained from the explicit expression of the moments Six environments are considered, corresponding to the direct applications of the results of the six classes of matrices obtained in Section 2 Receivers that employ the asymptotic limit of the optimal weights instead of the optimal weights themselves are called asymptotic reduced-rank receivers The desirable feature of these receivers is that the asymptotic limit of the optimal weights does not depend on the realizations of the spreading sequences, which is particularly useful when the CDMA system uses long sequences With the pre-calculated asymptotic weights, the reduced-rank MMSE/LCM receivers can be implemented in an efficient way as discussed in [2] In Section 32 the asymptotic performance of the receivers considered in 3 is given and the asymptotic equivalence of them is established In Section 4, based on the result on the moments we give a self-contained proof of the fixed point equation satisfied by the asymptotic output SINR of the MMSE receiver, which is originally derived in [5] by means of the Silverstein and Bai theorem [3] Section 5 contains our numerical results, which show that for reasonably large processing gain (N = 32), the output SINR of the asymptotic reduced-rank receiver lies within db from that of the reduced-rank receiver using spreading-sequence-dependent optimal weights For larger processing gain, say 80 or larger, the performance gap between the two receivers is less than 0:5 db 2 Asymptotic Eigenvalue Moments In this section we state our results on the asymptotic eigenvalue moments Corresponding situations in which these results can be applied are discussed in Section 3 Definition: Let us consider the vector space of the n n matrices M n (C) whose elements are complex random variables M n (C) with the ordinary sum, and product, and adjunction (Hermitian transpose) is a Λ algebra It said that the random matrix V n 2 M n (C) 2

has the m-th limit moment m if exists finite the following limit m, lim n! n E[TracefVm n g]: The set of all limit moments of the matrix V n defines the the limit distribution of V n If V n is a self-adjoint matrix, the limit distribution of V n is the function, F ( ), described by the collection of its moments [8, 9] as follow: Z m df ( ), lim n! n E[TracefVm n g] F ( ) is also called the limit eigenvalue distribution of V n and it is easy to see [8, 9] that: F ( ) = lim n! E[F n ( )] where F n ( ) is the empirical distribution of the eigenvalues of V n [8] Let us introduce here the necessary notation used in the following results: suppose a vector of k integers m ;:::;m k is partitioned into n equivalence classes under the equivalence relation a = b, and the cardinalities of the equivalence classes are given by f ;:::;f n, then we define the following function: f (m ;:::;m k ), f! f n!: For example, f (; ; 4; 2; ; 2) = 3! 2!! The proofs of Propositions and 4 are given in Appendices A and B respectively Propositions 2, 3, 5, 6 can be shown following similar methods used in the proof of Proposition 4 and a brief explanation of the proofs is given at the end of Appendix B 2 IID Sequences Proposition : Suppose the N K matrix S consists of iid zero-mean random variables with variance =N, D is a K K diagonal matrix with non-negative diagonal elements, and the empirical distribution of the diagonal elements of D converges almost surely (as) to a nonrandom limit distribution Then the mth moment of the limit eigenvalue distribution G( ) of SDS H as K, N go to infinity with K=N = fi is: Z m dg( ) = mx k= fi k X m + +m k =m c(m ;:::;m k ) E[Λ m ] E[Λ mk ]; () where Λ is a nonnegative random variable whose distribution is the non-random limit distribution of the diagonal elements of D, and c(m ;:::;m k )= with f (m ;:::;m k ) is as defined above Proof: In Appendix A m! (m k + )! f (m ;:::;m k ) ; (2) 3

An equivalent result on the asymptotic eigenvalue moments in the iid-sequences case can be found in [2] in a slightly different form The reason we still present our proof for this case (in Appendix A) is to introduce necessary techniques that are to be used in the generalization to other cases A few remarks on the assumption made in Proposition will be made in Section 3 22 Repeated Sequences with Block IID Phases Proposition 2: Let N = LN 0 Let t k =(U k; ;:::;U k;n 0) T (» k» K), where U k;j s (» k» K;» j» N 0 ) are an array of real- valued iid zero-mean random variables with unit variance Let S =(s ;:::;s K ) where s k = pn (tt k ejffi k; ;:::;t T k ejffi k;l ) T ; (3) and ffi k;l s (» k» K;» l» L) are iid random variables uniformly distributed on [0; 2ß), and p = N is the normalizing factor D is a K K diagonal matrix with non-negative diagonal elements, and the empirical distribution of the diagonal elements of D converges as to a nonrandom limit distribution Then the mth moment of the limit eigenvalue distribution G( ) of SDS H as K, N go to infinity with K=N = fi is the same as in Proposition, ie () Proof: Refer to Appendix B for a brief explanation of the proof 23 Conjugate Sequences Proposition 3: Let N=2N Let t k = (U k; ;:::;U k;n 0) T (» k» K), where U k;j s (» k» K;» j» N 0 ) are an array of real- valued iid zero-mean random variables with unit variance Let S =(s ;:::;s K ) where s k = pn (tt k ejffi k ; t T k e jffik ) T ; (4) p and ffi k s (» k» K) are iid random variables uniformly distributed on [0; 2ß), and = N is the normalizing factor D is a K K diagonal matrix with non-negative diagonal elements, and the empirical distribution of the diagonal elements of D converges as to a non-random limit distribution Then the mth moment of the limit eigenvalue distribution G( ) of SDS H as K, N go to infinity with K=N = fi is the same as in Proposition Proof: Refer to Appendix B for a brief explanation of the proof 24 Repeated Sequences with Conjugate Block IID Phases Proposition 4: Let N=2LN Let t k = (U k; ;:::;U k;n 0) T (» k» K), where U k;j s (» k» K;» j» N 0 ) are an array of real- valued iid zero-mean random variables with unit variance Let S =(s ;:::;s K ) where s k = p N (t T k ejffi k; ;:::;t T k ejffi k;l ; t T k e jffi k; ;:::;t T k e jffi k;l ) T ; (5) 4

and ffi k;l s (» k» K;» l» L) are iid random variables uniformly distributed on [0; 2ß), and = p N is the normalizing factor D is a K K diagonal matrix with non-negative diagonal elements, and the empirical distribution of the diagonal elements of D converges as to a nonrandom limit distribution Then the mth moment of the limit eigenvalue distribution G( ) of SDS H as K, N go to infinity with K=N = fi is the same as in Proposition Proof: In Appendix B 25 Repeated Sequences with Linearly Increasing Phases Proposition 5: Let N=LN Let t k = (U k; ;:::;U k;n 0) T (» k» K), where U k;j s (» k» K;» j» N 0 ) are an array of real- valued iid zero-mean random variables with unit variance Let S =(s ;:::;s K ) where s k = p N (t T k ejffi k ; t T k e2jffi k ;:::;t T k eljffik ) T ; (6) and ffi k s (» k» K) are iid random variables uniformly distributed on [0; 2ß), and p = N is the normalizing factor D is a K K diagonal matrix with non-negative diagonal elements, and the empirical distribution of the diagonal elements of D converges as to a non-random limit distribution Then the mth moment of the limit eigenvalue distribution G( ) of SDS H as K, N go to infinity with K=N = fi is the same as in Proposition Proof: Refer to Appendix B for a brief explanation of the proof 26 Repeated Sequences with Conjugate Linearly Increasing Phases Proposition 6: Let N=2LN Let t k = (U k; ;:::;U k;n 0) T (» k» K), where U k;j s (» k» K;» j» N 0 ) are an array of real- valued iid zero-mean random variables with unit variance Let S =(s ;:::;s K ) where s k = p N (t T k ejffi k ;:::;t T k eljffi k ; t T k e jffi k ;:::;t T k e Ljffik ) T ; (7) and ffi k s (» k» K) are iid random variables uniformly distributed on [0; 2ß), and p = N is the normalizing factor D is a K K diagonal matrix with non-negative diagonal elements, and the empirical distribution of the diagonal elements of D converges as to a non-random limit distribution Then the mth moment of the limit eigenvalue distribution G( ) of SDS H as K, N go to infinity with K=N = fi is the same as in Proposition Proof: Refer to Appendix B for a brief explanation of the proof 3 Motivations and Applications In this section, we discuss motivations and applications of the results in Section 2 5

3 Asymptotic Reduced-Rank Receiver Expressions for the asymptotic moments can be used to implement the asymptotic reduced-rank MMSE/LCM receivers With the pre-calculated asymptotic weights, the reduced-rank MMSE/LCM receivers can be implemented in an efficient way as discussed in [2] In what follows, we consider six situations: (i) reduced-rank single antenna MMSE receiver; (ii) reduced-rank multiantenna MMSE receiver; (iii) reduced-rank single antenna LCM receiver; (iv) reduced-rank multiantenna LCM receiver; (v) reduced-rank multiantenna MMSE receiver in line-of-sight transmission; (vi) reduced-rank multiantenna LCM receiver in line-of-sight transmission These situations correspond to applications of Propositions -6, respectively In these situations, we assume synchronous DS-CDMA with processing gain N and number of users K A k is the received amplitude of user k, b k 2 f±g is the bit transmitted by user k, and s k is the spreading sequence of user k For a single antenna receiver, the phase of the complex fading coefficient of user k is e jffi k For a multiantenna receiver, the phase of the complex fading coefficient of user k at the lth antenna element is e jffi k(l) The phases (ffi k s or ffi k (l) s) are modelled as iid random variables uniformly distributed on [0; 2ß) The spreading sequences are assumed known at the receiver Perfect knowledge of the fading channel gain (magnitude and phase) at the receiver is also assumed, which could be the case at a base station For a single antenna receiver, the noise vector at the output of the chip-matched filter is n For a multiantenna receiver, n l is the chip-matched filter output noise vector at the lth antenna element, and they are independent across antenna elements The distribution of the noise vector is N(0;ff 2 I) Random spreading is used to model the spreading sequences Specifically, s k = p N (V k; ;:::;V k;n ) T ; (8) where V k;j s (» k» K;» j» N) are iid random variables taking equiprobable values on f±g = p N normalizes the energy of the spreading sequences 3 Reduced-Rank Single Antenna MMSE Receiver In a single antenna DS-CDMA system, the output of the chip-matched filter is r = KX k= A k e jffi k b k s k + n: (9) where A k e jffi k is a complex gain taking into account the channel effects Put in matrix form r = SAb + n; (0) where S = (s e jffi ;:::;s K e jffi K ) is an N K matrix, A = diagfa ;:::;A K g, and b = (b ;:::;b K ) T The MMSE receiver for user is c =(S D S H + ff2 I) s e jffi ; () 6

where S = (s 2 e jffi 2 ;:::;s K e jffi K ), and D = diagfa 2 2 ;:::;A2 Kg The rank D (D» N) reduced-rank MMSE receiver for user is c D = D X m=0 w m S D S H + ff2 I m s e jffi ; (2) where the weight vector w =(w 0 ;:::;w D ) T is chosen to maximize the output SINR and is given by [, 6] w = 2 6 4 H + A 2H 0H 0 H D + A 2H D H 0 H D + A 2H D H 0 H 2D + A 2H D H D 3 7 5 2 6 4 H 0 H D 3 7 5 ; (3) where the (i; j)-entry of the matrix above is H i+j + A 2H i H j with H m = s H S D S H + I m ff2 s : (4) From the expression above, we see that the optimal weights depend on the realizations of the spreading sequences Therefore in a system using long sequences, they need to be computed and updated symbol to symbol, which hampers real-time implementation It is therefore desirable to find a set of suboptimal weights that do not depend on the spreading sequences while maintaining acceptable performance The asymptotic values of the weights in the large system limit seem to be a promising solution The large system limit is taken as N and K go to infinity with K=N = fi Using Corollary in [4] and the results in [2], it can be shown that if the empirical distribution of the diagonal elements of the matrix D converges almost surely (as) to a fixed non-random limit distribution, then lim s H N;K! S D S H m a:s: s! lim E[Tracef S D S H mg] N;K! =Z m dg( ); (5) N K N =fi K N =fi where G( ) is the limit eigenvalue distribution of S D S H and the convergence is almost surely For proof see Appendix C Let w denote the asymptotic weight vector, from (5) it is readily shown that w = 2 6 4 H + A2 H 0 H 0 H D + A2 H D H 0 3 7 5 2 6 4 H 0 3 7 5 ; (6) H D + A2 H D H 0 H 2D + A2 H D H D H D where H m = Z ( + ff 2 ) m dg( ): (7) Now we can apply Proposition to obtain H m and therefore the asymptotic weights 7

Proposition requires that the empirical distribution of the received powers of the interfering users, which are the diagonal elements of D, converges as to a fixed nonrandom limit distribution Based also on the observation given at beginning of the Section 2, we recall here that in order to verify the as convergence of empirical distribution of the diagonal elements of D to a fixed non-random limit distribution it is enough to verify that all the moments of the matrix D exist, ie the limit in the following equation exists D p, (K ) lim K! E[TracefDp g] and that 8p the normalized trace of the matrix D p converges as to a non-random limit given by the moments of D ie: (K ) lim K! TracefDp g a:s:! D p 0» p< Notice also that the received power of each user is a positive random variable with a distribution function induced by the channel fading If all the users experience independent fading, then a sufficient condition for the as convergence of the empirical distribution of the interfering received powers to a non-random limit is given in [3, Theorem 23] in terms of the diagonal elements of the matrices D p with 0» p < It easy to verify that most cases of practical interest satisfy those conditions In particular, assuming that all the users experience iid fading with uniformly bounded transmitted powers, it can be easily shown that if the common distribution of the users fading coefficients is p integrable with 0» p<, then the the empirical distribution of the received powers converges as to a non-random limit Certainly, in all cases, this limit depends on the fading coefficients distribution and, consequently, it is subject to change if the channel characteristics change considerably In practice, this means we need to update the distribution of the received powers periodically or when we detect a considerable change in it In the implementation of the asymptotic reduced-rank receiver, we can use the moments of the empirical distribution of A 2 2 ;:::;A2 K when K is finite to approximate the moments of the limit empirical distribution of the received powers used in (), ie E[Λ m ] ß K (8) is in fact what we use in the numerical results of this paper KX k=2 32 Reduced-Rank Multiantenna MMSE Receiver A 2m k : (8) In a multiantenna DS-CDMA system, the output of the chip-matched filter at the lth antenna element is r l = KX k= A k e jffi k(l) b k s k + n l l =;:::;L: (9) In the equation above, we assume that for each user, the received amplitudes at different antenna elements are the same for simplification This is valid if the antenna elements 8

are close enough to each other so that for a given user, the received powers at different antenna elements vary little However, the received phases still vary considerably among antenna elements because different propagation paths have a much greater impact on the received phases due to the usually high carrier frequency Put in matrix form r = SAb + n; (20) where r = (r T ;:::;rt L )T is an LN- vector, S = (s ;:::;s K ) is an (LN) K matrix, s k = (s T k ejffi k() ;:::;s T k ejffi k(l) ) T, A = diagfa ;:::;A K g, b =(b ;:::;b K ) T, and n =(n T ;:::;nt L )T The MMSE receiver for user is c =(S D S H + ff2 I) s ; (2) where S =(s 2 ;:::;s K ), and D = diagfa 2 2 ;:::;A2 Kg The rank D (D» LN) reduced-rank MMSE receiver for user is c D = D X m=0 w m S D S H + ff2 I m s ; (22) where the weight vector w =(w 0 ;:::;w D ) T is given by (3), and H m = s H S D S H + ff2 I m s : (23) To calculate the limit of H m as N and K go to infinity, we notice that the situation here is similar to the situation in 3, the only difference being that the energy (defined by Efk k 2 g)ofs and every column of S is now L After taking out the L factor from s and absorbing the L factor of S into D, we have that H m = L Z ( + ff 2 ) m dg( ); (24) where G( ) is the limit eigenvalue distribution of S ~ (LD ) S ~ H with ~ p S = S = L Proposition 2 can be applied to find H m and therefore the asymptotic weights (6) 33 Reduced-Rank Single Antenna LCM Receiver The single antenna LCM receiver for user is [6] c = F S a D S H a + ff2 I sa ; (25) where F =[I N 0], I N is an N N identity matrix, and»» s e jffi s2 e jffi 2 s K e jffi K s a = s e jffi ; S a = s 2 e jffi 2 s K e jffi K The rank D (D» 2N) reduced-rank LCM receiver for user is [6] : (26) 9

c D = D X m=0 with the weight vector w =(w 0 ;:::;w D ) T given by [6] where w = 2 6 4 w m Sa D S H a + ff2 I m sa ; (27) H H D H D H 2D After taking out the factor of 2 from s a and S a H m =2m+ Z ( + ff2 3 7 5 2 6 4 H H D 3 7 5 ; (28) H m = s H a Sa D S H a + ff2 I m sa : (29) 2 )m dg( ); (30) where G( ) is the asymptotic eigenvalue distribution of ~ S a D ~ S H a with ~ S a = S a = p 2 Proposition 3 can be applied to find H m and the asymptotic weight vector w 34 Reduced-Rank Multiantenna LCM Receiver As discussed in 32, the output of the chip-matched filter in a multiantenna system is r = SAb + n: (3) Similar to the single antenna case, the multiantenna LCM receiver for user is c = F S a D S H a + ff2 I s a ; (32) where F =[I LN 0], and»» s s2 s K s a = s Λ ; S a = s Λ 2 s Λ : (33) K The rank D (D» 2LN) reduced-rank LCM receiver for user is c D = D X m=0 w m S a D S H a + ff2 I m s a ; (34) where the weight vector w =(w 0 ;:::;w D ) T is given by (28), and H m = s H a S a D S H a + ff2 I m s a : (35) To calculate the limit of H m as N and K go to infinity, we notice that the energy (defined by Efk k 2 g)ofs a and every column of S a is 2L By following similar arguments in Section (32) and (33), it is easily shown that 0

Z H m = L2m+ ( + ff2 2 )m dg( ); (36) where G( ) is the asymptotic eigenvalue distribution of ~ S a (LD ) ~ S H a with ~ S a = S a = p 2L Proposition 4 can be applied to find H m and the asymptotic weight vector w 35 Reduced-Rank Multiantenna MMSE Receiver in Line-of-Sight Transmission In a line-of-sight transmission DS-CDMA system with multiple receive antenna, the output of the chip-matched filter at the lth antenna element is r l = KX k= A k e ljffi k b k s k + n l l =0;:::;L ; (37) where ffi k = 2ßd cos ff k =, d is the distance between two adjacent antenna elements, ff k is the incident angle of user k, and is the wavelength of the carrier Due to the randomness in the geographic distribution of users, it is reasonable to assume that ffi k s are iid random variables uniformly distributed on [0; 2ß) Put in matrix form r = SAb + n; (38) where r = (r T 0 ;:::;rt L )T is an LN-vector, S = (s ;:::;s K ) is an (LN) K matrix, s k = (s T k e0jffi k ;:::;s T k e(l )jffi k ) T, A = diagfa ;:::;A K g, b =(b ;:::;b K ) T, and n =(n T ;:::;nt L )T The MMSE receiver for user is c =(S D S H + ff2 I) s ; (39) where S =(s 2 ;:::;s K ), and D = diagfa 2 2 ;:::;A2 Kg The rank D (D» LN) reduced-rank MMSE receiver for user is c D = with the weights given by (3) and D X m=0 H m = s H w m S D S H + ff2 I m s (40) S D S H + ff2 I m s : (4) The asymptotic weights can be obtained by using Proposition 5 in the same way we used Proposition 2 in 32 36 Reduced-Rank Multiantenna LCM Receiver in Line-of-Sight Transmission As discussed in 35, the output of the chip-matched filter in a line-of-sight transmission DS-CDMA system is r = SAb + n: (42)

Similar to the discussion in 34, the LCM receiver for user is c = F S a D S H a + ff2 I s a ; (43) where F =[I LN 0], and»» s s2 s K s a = s Λ ; S a = s Λ 2 s Λ : (44) K The rank D (D» 2LN) reduced-rank LCM receiver for user is c D = D X m=0 with the weights given by (28) and H m = s H a w m S a D S H a + ff2 I m s a (45) S a D S H a + ff2 I m s a : (46) The asymptotic weights can be obtained by using Proposition 6 in the same way we used Proposition 4 in 34 32 Relationships among Asymptotic Eigenvalue Distributions in Various Cases In Section 2, the moments of the asymptotic eigenvalue distributions of several classes of random matrices are shown to be equal In this section, we study the relationships among these distributions through the moments And because the asymptotic output SINR is a certain integrable function averaged with respect to the asymptotic eigenvalue distribution, the relationships among them are also characterized The multiantenna LCM receiver serves as an example to illustrate the idea Similar results on the multiantenna MMSE receiver and the single antenna LCM receiver were obtained in [8] and [6], respectively, by using free probability theory Similar to the result on the single antenna receiver in [6], the output SINR of the multiantenna LCM receiver is P s H a S a D S H a + ff2 I s a ; (47) where D = diagfp 2 ;:::;P K g, and P k = A 2 k is the received power of user k at one antenna element s a and S a are given by (33) We notice that in (47), the energy of s a and each column of S a (defined as E fk k 2 g)is2l After absorbing the factor L of s a and S a into P and D, and taking the factor 2 of S a out of the matrix inversion, we have P s H a S a D S H a + ff2 I s a =(LP ) ~ s H a ~ S a (LD ) ~ S H a + ff2 2 I ~s a ; (48) 2

where tilded quantities denote the normalized untilded ones Now we consider a single antenna MMSE receiver in a DS-CDMA system with processing gain 2LN, number of users K, noise level ff 2 =2, and received power of user klp k, its output SINR is (LP )s H H ff2 S I (LD )S + s ; (49) 2 where the 2LN-vector s k is the spreading sequence of user k, S = (s 2 ;:::;s K ), D = diagfp 2 ;:::;P K g, and P k = A 2 k All the spreading sequences are normalized Suppose the asymptotic eigenvalue distribution of S (LD )S H is G( ), the asymptotic eigenvalue distribution of S ~ a (LD ) S ~ H a is G Λ ( ), then from Proposition 4 Z m dg( ) = Z m dg Λ ( ) m =0; ; 2;::: (50) Denote the distribution density functions of G and G Λ by g and g Λ, we have Z m (g( ) g Λ ( ))d =0 m =0; ; 2;::: (5) So, g g Λ is orthogonal to every polynomial, therefore g g Λ is zero almost everywhere, as a result G = G Λ almost everywhere From [5] s H H ff2 S I (LD )S + s! 2 Z + ff2 2 dg( ); (52) and from a similar argument in the proof of Lemma 43 in [5] Z ~s H ~ a S a (LD ) S ~ H a + ff2 I ~s a! dg Λ ( ): (53) 2 + ff2 2 Therefore the asymptotic output SINR of the single antenna MMSE receiver with (K; 2LN; ff 2 =2) is the same as that of the multiantenna LCM receiver with (K; N; ff 2 ) Similar results hold for the MMSE/LCM receivers in line-of-sight transmission Remark: In [8], the output SINR of the multiantenna MMSE receiver with (K; N; ff 2 ) and L antenna elements is shown to be equal to that of the single antenna MMSE receiver with (K; LN; ff 2 ) asymptotically In that proof, free probability theory was used to obtain the asymptotic equality By using the combinatorial techniques provided in the appendices of this paper, it can be shown that the expected values (with respect to the spreading sequences) of the SINR s of both systems are equal even for K and N finite For the LCM receiver, however, since there are conjugate parts in the effective spreading sequences, from the proof in Appendix B, we need to let K and N go to infinity to make some terms vanish Therefore, it is still open whether the expected values (with respect to the spreading sequences) of the SINR s of both systems (single antenna MMSE with (K; 2LN; ff 2 =2) and multiantenna LCM with (K; N; ff 2 )) are equal for K and N finite Since the output SINR s are random variables for K and N finite, they themselves are generally not equal when K and N are finite 3

4 Deriving the Tse-Hanly Formula by Means of the Moments By using the Silverstein-Bai theorem [3], it is shown in [5] that the asymptotic output SINR fl of the MMSE receiver in a DS-CDMA system with random spreading satisfies the following fixed point equation (without loss of generality, the desired user is received with unit power) fl = R (54) ff 2 fip + df (P ); +Pfl where F (P ) is the limit of the empirical distribution of P 2 ;:::;P K as K goes to infinity In this section, we give a self-contained proof of (54) using the combinatorial convolution defined on the incidence algebra of non-crossing partitions Based on these tools, we reduce (54) to a combinatorial equality involving the asymptotic moments Let G( ) be the asymptotic eigenvalue distribution of S D S H If G( ) is a compactly supported probability measure on R, its Stieltjes transform is defined as: m(z) = Z dg( ) (55) z which is an analytic function in C n supp(g) By expanding with respect to z, exchanging the summation and integration and by analytical extension, (55) can be written z 8z 2 C n supp(g) as: where m(z) = z k = Z X k=0 k z k ; (56) k dg( ) (57) is given in Proposition Notice that regardless of G( ), C n supp(g) ff z 2 C + = fz 2 C :Im(z) > 0g Notice that if z 2 C + then also m(z) 2 C + To avoid cumbersome negative powers of we denote g(z) = m(z), so g(z) = z + X k= k z k+ = X k= k z k : (58) As in Remark 333 in [9], g(z) has a unique inverse k( ), ie = g(z); z = k( ), k( ) = + X k=0 a k+ k ; (59) where a k is a polynomial in ;:::; k Thus, from (59) we have: z = k(g(z)) = g(z) + 4 X k=0 a k+ g(z) k ; (60)

or equivalently: zg(z) = X k= k z k = X k=0 a k+ g(z) k+ : (6) Finally, from (6) we get the following equality between two power series: with X k= k x k = X k= x = z ; y = X k= k x k : a k y k ; (62) Eq (62) is satisfied when the sequence (a ;a 2 ;:::) satisfies the following implicit equation [6]: The series X k=0 m = mx k= a k X m + +m k =m m mk : (63) a k k is known as the R-transform of the asymptotic eigenvalue distribution, G( ), and the series coefficients fa k g are known as the free cumulants of G( ) [9, 4] In the following we will prove an explicitly expression for the free cumulants fa k g: a k =EfΛ k gfi; (64) where Λ is a random variable with cumulative distribution F (P ) as defined in Proposition The proof of (64) requires tools from set partition theory Our treatment here is very brief; for more details please consult [] and [4] Definition : Let m be a natural number A partition ß = 4 fv ;:::;V k g of the set f;:::;mg S k is a decomposition of f;:::;mg into disjoint and non-empty sets V i such that V i= i = f;:::;mg The elements V i are called the blocks of the partition ß We will denote the set of all partitions of f;:::;mg by P(m) This set becomes a lattice if we introduce the following partial order (called refinement order): ß» ff if each block of ff is a union of blocks of ß We will denote the smallest and the biggest elements of P(m) consisting of m blocks and one block, by: 0 m = f(); (2);:::;(m)g; m = f(; 2;:::;m)g (65) Definition 2: A partition ß 2 P(m) is a crossing partition if there exist four numbers» i<k <j<l< m such that i and j are in the same block V, k and l are in the same block V 0 and V 6= V 0 The set of all non-crossing partitions in P(m) is denoted by NC(m), ie: NC(m) 4 = fß 2 P(m)j ß non crossingg (66) 5

Define interval in NC(m) as the following set [ß; ff] = ffi 2 NC(m)j ß» fi» ffg 8ß» ff 2 NC(m) Each of these intervals can be decomposed into a product of lattices of noncrossing partitions, ie for all ß; ff 2 NC(m) with ß» ff there exist canonical natural numbers k ;k 2 ;:::;such that For example we have: [ß; ff] ο = NC() k NC(2) k 2 (67) [f(); (2); (3); (4); (5); (6); (7); (8)g; f(; 2; 3); (4; 5); (6); (7; 8)g]NC(8) ο = ο= [f(); (2); (3)g; f(; 2; 3)g]NC(3) [f(4); (5)g; f(4; 5)g]NC(2) [(6); (6)]NC() [f(7); (8)g; f(7; 8)g]NC(2) ο = ο = NC() NC(2) 2 NC(3) Having this factorization property at hand it is quite natural to define a multiplicative function f (for non-crossing partitions) corresponding to a sequence (a ;a 2 ;:::) of complex numbers by requirement that: f(ß; ff) 4 = ρ a k ak 2 2 0 whenever ß ff (68) if [ß; ff] has a factorization as above We use the notation f! (a ;a 2 ;:::) to denote the dependence of f on the sequence (a ;a 2 ;:::) In particular we have that f(0 m ; m )=a m Definition 3: The (combinatorial) convolution ( Λ ) of f and g defined as in (68), ( f Λ g), is equal to: ( f Λ g) 4 = X fi 2 NC(m) ß» fi» ff f(ß; fi) g(fi; ff) for ß» ff 2 NC(m): (69) Now we have the machinery to prove (54) According to Proposition : m = = mx k= fi k X m + +m k =m»k»m X m + +m k =m c(m ;:::;m k ) EfΛ m g EfΛ m k g c(m ;:::;m k ) Q m Q mk ; (70) where Q mj = EfΛ m j gfi Define the following multiplicative functions on the lattice of non-crossing partitions: - L! ( ; 2 ;:::) as the multiplicative function on the lattice of non-crossing partitions corresponding to the moments m of the random matrix SDS H, - Q! (Q ;Q 2 ;:::) as the multiplicative function on the lattice of non-crossing partitions corresponding to the sequence Q m 6

It is easy to see that: m = L (0 m ; m ) Q (ß; ff) = Q k Qk 2 2 iff [ß; ff] ο = NC() k NC(2) k 2 (7) From Theorem 2 in Appendix A, we know that c(m ;:::;m k ) is the number of noncrossing partitions of m into k disjoint sets, such that the numbers of elements in these sets are m ;:::;m k As a consequence, from (69) and (7), (70) can be rewritten as m = X fi 2 NC(m) 0 m» fi» m where zeta is the zeta function defined as in [, 4]: Q(0 m ;fi) zeta(fi; m )= Q Λ zeta (72) ρ ; ß» ff zeta(ß; ff) = 0; otherwise (73) By taking into account the non-crossing character of the involved partition, the relation L = Q Λ zeta can be written more concretely in a recursive way as ( 0 =) [4]: m = mx k= Q k X i ;:::;i k 0 i + + i k = m i ik : (74) Comparing Eq (63) with Eq (74) it follows immediately that a k = Q k = EfΛ k gfi, aswe wanted to show As a consequence, k(g(z)) = g(z) + X k= = m(z) + = m(z) + Z EfΛ k gfi(g(z)) k X k= EfΛ k gfi( ) k (m(z)) k fip df (P ): +Pm(z) (75) For (75) we have used the fact that for z 2 C + : X k= EfΛ k gfi( ) k z k = Z fip df (P ) (76) +Pz which can be easily shown by analytic extension expanding with respect to z and +Pz exchanging the summation and the integration with respect to F (P ) Since k(g(z)) = z, so: m(z) = R (77) fip z + df (P ): +Pm(z) 7

Therefore, via the asymptotic moments we obtain the same result originally shown by Silverstein and Bai [3] Using Corollary in [4] and the results in [2], it can be shown that if the empirical distributions of the diagonal elements of the matrix D converges almost surely (as) to a fixed distribution, then as N and K go to infinity with K=N = fi, the output SINR of the MMSE receiver for user, fl, converges almost surely for N!to : Since lim N! fl = s H (S D S H + ff2 I) a:s: s! lim N! N E Trace Φ (S D S H + ff2 I) ΨΛ = lim letting z = ff 2 in (77), (54) is obtained N E Trace Φ (S D S H + ff2 I) ΨΛ : (78) z! ff 2 Z z dg( ) = lim z! ff 2 m(z) (79) 5 Numerical Results 0 20 8 6 Reduced Rank MMSE Asymptotic Reduced Rank 5 Equivalent Single Antenna Full Rank MMSE Reduced Rank MMSE Asymptotic Reduced Rank Output SINR (db) 4 2 Output SINR (db) 0 5 0 0 2 4 0 2 4 6 8 0 2 4 6 8 20 Signal to Noise Ratio (db) 5 0 2 4 6 8 0 2 4 6 8 20 Signal to Noise Ratio (db) Figure : Output SINR vs input signalto-noise ratio for two reduced-rank single antenna receivers Figure 2: Output SINR vs input signalto-noise ratio for multiantenna MMSE receivers In this section, we compare the performance of four receivers: (i) the asymptotic reduced-rank MMSE/LCM receiver that uses the asymptotic weights; (ii) the reducedrank MMSE/LCM receiver that uses the exact optimal weights; (iii) the full-rank MMSE- /LCM receiver; (iv) the asymptotically equivalent single antenna MMSE receiver with enlarged processing gain and possibly reduced noise level In all the results, N = 32, K = 25, L = 2 (number of receive antenna elements), D = 5 (number of stages of the reduced-rank receiver) are used For simplification, we simulate the case in which the received powers of the interfering users are equal and are 5 db above the desired user This corresponds to power-controlled interferers and makes the desired user to operate in a relatively challenging environment In the case of single antenna receivers, the horizontal axis represents the input signal-to-noise ratio P =ff 2 in db For multiantenna receivers, the horizontal axis is LP =ff 2, taking into account the total received power of the desired user 8

Figure shows the performance of two reduced-rank single antenna MMSE receivers The upper curve is the output SINR of the reduced-rank MMSE receiver that uses exact optimal weights, and the lower curve is for the asymptotic reduced-rank receiver that uses asymptotic weights From the figure we can see that the later lies within db from the former And we found that (not shown in the figure) if N > 80, the performance loss due to using asymptotic weights will be less than 0:5 db Figure 2 shows the performance of the multiantenna MMSE receivers From the figure we can see that we lose about db in SINR by using asymptotic weights And as predicted, the performance of the full-rank MMSE receiver is very close to that of the asymptotically equivalent single antenna MMSE receiver with processing gain LN We also found that (not shown in the figure), if LN (the effective processing gain) is raised to above 00, the penalty of using asymptotic weights will be less than 0:5 db, and if D 8, the performance gap between the reduced-rank receiver and the full-rank one will be less than 0:5 db 25 25 20 Equivalent MMSE Full Rank LCM Reduced Rank LCM Asymp Reduced Rank Reduced Rank MMSE 20 Equiv Single Antenna MMSE Full Rank LCM Reduced Rank LCM Asymptotic Reduced Rank 5 Output SINR (db) 0 Output SINR (db) 5 0 5 0 5 5 0 2 4 6 8 0 2 4 6 8 20 Signal to Noise Ratio (db) 0 0 2 4 6 8 0 2 4 6 8 20 Signal to Noise Ratio (db) Figure 3: Output SINR vs input signalto-noise ratio for single antenna LCM receivers Figure 4: Output SINR vs input signalto-noise ratio for multiantenna LCM receivers Figure 3 illustrates the performance of the single antenna LCM receivers We observe that the penalty due to using asymptotic weights is within 25 db And as predicted, performance of the full-rank LCM receiver is very close to that of the asymptotically equivalent single antenna MMSE receiver with processing 2N and noise level ff 2 =2 We found that (not shown in the figure) if N > 60, the performance loss due to using asymptotic weights will be less than 0:5 db, and if D 8, the performance gap between the reduced- rank receiver and the full-rank one will be less than 0:5 db For comparison purposes, the output SINR of the reduced-rank MMSE receiver in Figure is also shown here We can see the significant performance gain obtained by LCM processing Figure 4 illustrates the performance of the multiantenna LCM receivers From the figure we see that the performance loss due to using asymptotic weights is very small (about 0:5 db) Additionally the performance difference between the full-rank LCM receiver and the asymptotically equivalent single antenna MMSE receiver with processing 2LN and with noise level ff 2 =2 is also very small, almost indistinguishable This is because the effective processing 2LN = 28, which is large enough to guarantee sufficient convergence By comparison of Figures 2 and 4, we can see that the output SINR in Figure 9

4is4 5 db higher than those in Figure 2 This confirms the analytical result given in (53) that the LCM receiver asymptotically reduces the number of interferers and the noise level by half 20 25 5 Equivalent Single Antenna Full Rank MMSE Reduced Rank MMSE Asymptotic Reduced Rank 20 Equiv Single Antenna MMSE Full Rank LCM Reduced Rank LCM Asymptotic Reduced Rank Output SINR (db) 0 5 Output SINR (db) 5 0 0 5 5 0 2 4 6 8 0 2 4 6 8 20 Signal to Noise Ratio (db) 0 0 2 4 6 8 0 2 4 6 8 20 Signal to Noise Ratio (db) Figure 5: Output SINR vs input signal-tonoise ratio for MMSE receivers in line-ofsight transmission Figure 6: Output SINR vs input signalto-noise ratio for LCM receivers in line-ofsight transmission Figure 5 illustrates the performance of the MMSE receivers in line-of- sight transmission We see that Figure 5 quite resembles Figure 2 This agrees with our analytical result that their performances are asymptotically equal All observations in Figure 2 can be directly applied here Figure 6 illustrates the performance of the LCM receivers in line-of- sight transmission Again we see that Figure 6 resembles Figure 4 This agrees with our analytical result that their performances are asymptotically equal Observations in Figure 4 can again be applied here 6 Conclusion In this paper, we obtain explicit expressions for the asymptotic eigenvalue moments of several classes of infinite random matrices that arise in multiuser detection in DS- CDMA system with random spreading The results are used in the design of the asymptotic reduced-rank MMSE/LCM receivers that use the asymptotic values of the optimal weights Numerical results show that the penalty in the output SINR due to using the asymptotic weights in place of the exact optimal weights is about db for reasonably large processing gain, eg 32 For larger processing gain, say 80 or larger, the performance loss is less than 0:5 db The asymptotic reduced-rank receivers are desirable in real-time implementation of systems using long sequences because they do not depend on realizations of the spreading sequences The equivalence of the asymptotic moments in these cases is used to obtain the asymptotic equality of the output SINR s of these receivers to that of the single antenna MMSE receiver with enlarged processing gain and possibly reduced noise level The asymptotic equality is confirmed by numerical results An explicit expression for the free cumulants of the asymptotic eigenvalues distribution of several classes of infinite random matrices is given from which the Tse-Hanly formula 20

is proved The technique used for finding the free cumulants of our infinite random matrices, has independent interest: in fact it permits, in the more general case, to find, using the free cumulants property of some suitable random matrix [9], an explicit expression for the moments of random matrices with more complex structure Appendix A Proof of Proposition Suppose that s k = p N (V k; ;:::;V k;n ) T ; (80) where V k;j s are iid random variables that take equiprobable values on f±g Then Φ mψ Trace (SDS H ) (8) (ψ KX! ψ KX Pi si s Hi i= = Trace = Trace = Trace = N m where P i = A 2 i So, ( KX ( KX i m= P im s im s H i m!) i ;:::;i m= P i P im s i s H i s im s H i m ) i ;:::;i m= P i P im s H i m s i s H i m s im ) KX i ;:::;i m= NX (82) (83) (84) j ;:::;j m= P i P im V Λ i m;j V i ;j V Λ i m ;j m V im;j m : (85) = N E Φ mψλ Trace (SDS H ) (86) KX NX Λ P N m+ i P im E V Λ i m;j V i ;j V Λ i m ;j m V im;jm : (87) j ;:::;j m= i ;:::;i m= We then investigate all the possible ways to partition indices i ;:::;i m into k disjoint sets for» k» m Indices in the same set keep identical in the summation and indices in different sets keep distinct in the summation Suppose for a given partition of i ;:::;i m into k disjoint sets such that numbers of elements in the sets are m ;:::;m k, then this partition contributes a non-zero term EfΛ m g EfΛm kg fi k to R m dg( ) if there exists a partition of j ;:::;j m into m k +disjoint sets such that in the following figure, every ordered pair in the upper row is matched with exactly one ordered pair in the lower 2

row, and every ordered pair in the lower row is matched with exactly one ordered pair in the upper row (i ;j ) (i 2 ;j 2 ) (i m ;j m ) (i ;j 2 ) (i 2 ;j 3 ) (i m ;j ) Two ordered pairs are matched iff the first indices in both of them are in the same set, and so are the second indices The matching above ensures that no random variable appears alone in the expectation, which would make the expectation zero If such a partition of j ;:::;j m exists, we call it a good partition for the given partition of i ;:::;i m Therefore, from the discussion above, R m dg( ) is the sum of all the terms c(m ;:::;m k ) E[Λ m ] E[Λ mk ] fi k ; (88) for all m + + m k = m and all» k» m The coefficient c(m ;:::;m k ) is the solution to the following combinatorial problem: Theorem : Among all the partitions of i ;:::;i m into k disjoint sets such that the cardinalities of these sets are m ;:::;m k (totally there are m! m! m k! f (m ;:::;m k such partitions, where ) f (m ;:::;m k ) is defined in Section 2), the coefficient c(m ;:::;m k ) in (88) is the number of partitions such that there exists a good partition of j ;:::;j m into m k +disjoint sets for each of them Another combinatorial problem the solution of which is c(m ;:::;m k ) is proposed in [20], basically it is: Theorem 2: Consider m balls arranged on a circle, numbered from to m clockwise Among all the partitions of the m balls into k disjoint sets, such that the cardinalities of these sets are m ;:::;m k, the coefficient c(m ;:::;m k ) in (88) is the number of non-crossing partitions A partition is called crossing if there exist two sets A and B and for some a; b 2 A and c; d 2 B such that the segment drawn from a to b and the segment drawn from c to d cross We use Theorem 2 to obtain c(m ;:::;m k ), while the concept of mutually good partitions defined in Theorem is used in the proof of extensions to Proposition In what follows, we prove that m! c(m ;:::;m k )= (m k + )! f (m ;:::;m k ) : (89) In Theorem 2, sets that contain the same number of elements are considered indistinguishable That is to say, if for some partition two sets (denoted by A and B) both contain p balls, then we consider the partition obtained by switching balls in A and B the same as the original one Here we first consider the case in which these sets are distinguishable and prove that the number of non-crossing partitions is given by m! (m k +)! : (90) 22

To make the sets distinguishable, we number them from to k The caridinality of S i is m i (» i» k) Now we select a ball, denoted by b,fromthem balls, then we select another ball, denoted by b 2, from the remaining m balls, and so on until we select b k Itis easy to see that there are m!=(m k + )! ways to make the selection Next, we describe a procedure that generates a non-crossing partition from any such selection Then we prove that non-crossing partitions generated from different selections must also be different, and for each non-crossing partition, there must exist a selection that generates it (i) Suppose b ;:::;b k are selected as in Figure 7 (without loss of generality, we assume that b 2 is located after b on the circle, and so on) Because m + + m k = m, there must exist i such that there are at least m i balls between b i and b i+, with b i included and b i+ excluded We assign the m i balls starting from b i (b i included) to set S i Then we take away the already assigned m i balls from the circle Now we have a smaller circle of m m i balls with k 2 selected Then for the same reason, there exists j such that there are at least m j balls between b j and the next selected ball after b j, with b j included and the next selected ball after b j excluded We then assign m j balls starting from b j to set S j Then the already assigned m j balls are taken away from the circle This procedure can be repeated until the k selected balls are used up At that stage, we have m m m k = m k balls left, and we assign them to set S k Since at each stage of the procedure described above, we always assign a continuous block of balls to a set, it is readily shown by induction that a partition generated by this procedure is non-crossing b b k- b 2 b 3 b ' b Figure 7: Figure 8: (ii) Here we prove that two different selections of balls, say b ;:::;b k and b 0 ;:::;b0 k, generate different partitions We prove this by means of contradiction Suppose that b i and b 0 i are different shown as in Figure 8, and the two selections generate the same partition Now we consider the balls assigned to set S i on the original circle of m balls According to the procedure in (i), we know that on the original circle, the m i balls in S i consist of several continuous blocks of balls, and both b i and b 0 i must be the first (in clockwise direction) ball of the block they belong to From the procedure in (i), for the first set of selected balls, we go clockwise from b i when we assign balls to S i Since b 0 i 2 S i, b 0 i is reached Similarly, for the second set of selected balls, b i is reached as we go clockwise 23

from b 0 i to assign balls to S i So, at the stage that we assign balls to S i, we go clockwise from b i and returned to b i again (shown in Figure 8) That means, all the unassigned balls are traversed and assigned to S i However, this is a contradiction because after all the selected balls are used up, we still have to have m k unassigned balls left for S k Therefore we proved that different selections generate different partitions (iii) Here we prove that for every non-crossing partition, there exists a set of k selected balls that generates it First we introduce the following lemma which is proved at the end of Appendix A Lemma: For a given non-crossing partition, there exists a set S i such that all the balls in S i form a continuous block on the original circle Suppose that S i is such a set, we select the first ball (in clockwise direction) of S i as b i Then we take away S i from the circle Now we have a partition of m m i balls into k disjoint sets which is non- crossing The lemma above can be applied again The same procedure can be repeated until k balls are selected It is easy to verify that this set of selected balls indeed generates the same partition that we start with From the discussion in (i), (ii), and (iii), a one-to-one and onto mapping from the selections of k balls to the non-crossing partitions has been established Therefore, the number of non-crossing partitions is given by m! (m k + )! when sets are distinguishable If instead, sets are considered indistinguishable, the number of non-crossing partitions will be reduced This is because we are free to switch balls in sets that contain the same number of balls and still get the same partition For example, the partition does not change no matter S = fa; bg and S 2 = fc; dg or S 2 = fa; bg and S = fc; dg It is readily shown the number of non-crossing partitions is reduced by a factor f (m ;:::;m k ) and is given by m! (m k + )! f (m ;:::;m k ) : (92) The factor f (m ;:::;m k ) makes the sets of the same size indistinguishable by mixing up these sets Proof of the Lemma: We pick up any set from the partition, if it consists a single continuous block (such a set is called a continuous set), the proof ends If it consists of at least two separate continuous blocks, we pick up any two of them (denoted by Block and Block 2, shown as in Figure 9) If there is a continuous set between x and x 2 (with x and x 2 included), the proof ends If none of the sets between x and x 2 is continuous, we pick up any set between x and x 2 and consider any two blocks that belong to this set (denoted by Block 3 and Block 4, as shown in Figure 0) The same reasoning can be applied when we further consider sets between y and y 2 This procedure must stop after finite number of steps A continuous set is found when the procedure stops The lemma is therefore proved 24 (9)