arxiv: v1 [stat.co] 23 Jun 2012

Similar documents
A note on asymptotic formulae for one-dimensional network flow problems Carlos F. Daganzo and Karen R. Smilowitz

Least-Squares Regression on Sparse Spaces

Robust Forward Algorithms via PAC-Bayes and Laplace Distributions. ω Q. Pr (y(ω x) < 0) = Pr A k

Expected Value of Partial Perfect Information

Jointly continuous distributions and the multivariate Normal

arxiv: v4 [math.pr] 27 Jul 2016

Quantile function expansion using regularly varying functions

Computing Exact Confidence Coefficients of Simultaneous Confidence Intervals for Multinomial Proportions and their Functions

Some Examples. Uniform motion. Poisson processes on the real line

A simple tranformation of copulas

model considered before, but the prey obey logistic growth in the absence of predators. In

Linear First-Order Equations

Least Distortion of Fixed-Rate Vector Quantizers. High-Resolution Analysis of. Best Inertial Profile. Zador's Formula Z-1 Z-2

Introduction to the Vlasov-Poisson system

Time-of-Arrival Estimation in Non-Line-Of-Sight Environments

Logarithmic spurious regressions

1. Aufgabenblatt zur Vorlesung Probability Theory

Some vector algebra and the generalized chain rule Ross Bannister Data Assimilation Research Centre, University of Reading, UK Last updated 10/06/10

Applications of the Wronskian to ordinary linear differential equations

Modelling and simulation of dependence structures in nonlife insurance with Bernstein copulas

Bivariate distributions characterized by one family of conditionals and conditional percentile or mode functions

Euler equations for multiple integrals

Lower Bounds for the Smoothed Number of Pareto optimal Solutions

Lecture Introduction. 2 Examples of Measure Concentration. 3 The Johnson-Lindenstrauss Lemma. CS-621 Theory Gems November 28, 2012

THE VAN KAMPEN EXPANSION FOR LINKED DUFFING LINEAR OSCILLATORS EXCITED BY COLORED NOISE

Chapter 2 Lagrangian Modeling

6 General properties of an autonomous system of two first order ODE

Monotonicity for excited random walk in high dimensions

Quantum Mechanics in Three Dimensions

State-Space Model for a Multi-Machine System

LATTICE-BASED D-OPTIMUM DESIGN FOR FOURIER REGRESSION

Tutorial on Maximum Likelyhood Estimation: Parametric Density Estimation

Table of Common Derivatives By David Abraham

Lecture XII. where Φ is called the potential function. Let us introduce spherical coordinates defined through the relations

Problem Sheet 2: Eigenvalues and eigenvectors and their use in solving linear ODEs

Parameter estimation: A new approach to weighting a priori information

Implicit Differentiation

REAL ANALYSIS I HOMEWORK 5

Agmon Kolmogorov Inequalities on l 2 (Z d )

Monte Carlo Methods with Reduced Error

Gaussian processes with monotonicity information

Hyperbolic Moment Equations Using Quadrature-Based Projection Methods

Lecture 6 : Dimensionality Reduction

FLUCTUATIONS IN THE NUMBER OF POINTS ON SMOOTH PLANE CURVES OVER FINITE FIELDS. 1. Introduction

Modeling of Dependence Structures in Risk Management and Solvency

CHAPTER 1 : DIFFERENTIABLE MANIFOLDS. 1.1 The definition of a differentiable manifold

θ x = f ( x,t) could be written as

1 dx. where is a large constant, i.e., 1, (7.6) and Px is of the order of unity. Indeed, if px is given by (7.5), the inequality (7.

The Three-dimensional Schödinger Equation

Convergence of Random Walks

Introduction to Markov Processes

Math Notes on differentials, the Chain Rule, gradients, directional derivative, and normal vectors

Introduction to variational calculus: Lecture notes 1

Schrödinger s equation.

Witten s Proof of Morse Inequalities

Multi-View Clustering via Canonical Correlation Analysis

On conditional moments of high-dimensional random vectors given lower-dimensional projections

The derivative of a function f(x) is another function, defined in terms of a limiting expression: f(x + δx) f(x)

23 Implicit differentiation

arxiv: v4 [cs.ds] 7 Mar 2014

A Unified Theorem on SDP Rank Reduction

Conservation Laws. Chapter Conservation of Energy

arxiv: v2 [cond-mat.stat-mech] 11 Nov 2016

Final Exam Study Guide and Practice Problems Solutions

Robust Low Rank Kernel Embeddings of Multivariate Distributions

A new proof of the sharpness of the phase transition for Bernoulli percolation on Z d

Spurious Significance of Treatment Effects in Overfitted Fixed Effect Models Albrecht Ritschl 1 LSE and CEPR. March 2009

1 Math 285 Homework Problem List for S2016

Diagonalization of Matrices Dr. E. Jacobs

A. Exclusive KL View of the MLE

Influence of weight initialization on multilayer perceptron performance

arxiv: v1 [math-ph] 5 May 2014

Entanglement is not very useful for estimating multiple phases

A Modification of the Jarque-Bera Test. for Normality

SYSTEMS OF DIFFERENTIAL EQUATIONS, EULER S FORMULA. where L is some constant, usually called the Lipschitz constant. An example is

Survey Sampling. 1 Design-based Inference. Kosuke Imai Department of Politics, Princeton University. February 19, 2013

The Exact Form and General Integrating Factors

Construction of the Electronic Radial Wave Functions and Probability Distributions of Hydrogen-like Systems

PDE Notes, Lecture #11

A Review of Multiple Try MCMC algorithms for Signal Processing

Topic 7: Convergence of Random Variables

NOTES ON EULER-BOOLE SUMMATION (1) f (l 1) (n) f (l 1) (m) + ( 1)k 1 k! B k (y) f (k) (y) dy,

Rank, Trace, Determinant, Transpose an Inverse of a Matrix Let A be an n n square matrix: A = a11 a1 a1n a1 a an a n1 a n a nn nn where is the jth col

EIGEN-ANALYSIS OF KERNEL OPERATORS FOR NONLINEAR DIMENSION REDUCTION AND DISCRIMINATION

Assignment 1. g i (x 1,..., x n ) dx i = 0. i=1

Stable and compact finite difference schemes

Multivariate Random Variable

Energy behaviour of the Boris method for charged-particle dynamics

TAYLOR S POLYNOMIAL APPROXIMATION FOR FUNCTIONS

On the number of isolated eigenvalues of a pair of particles in a quantum wire

Introduction. A Dirichlet Form approach to MCMC Optimal Scaling. MCMC idea

Mathematical Review Problems

7.1 Support Vector Machine

SYNCHRONOUS SEQUENTIAL CIRCUITS

Function Spaces. 1 Hilbert Spaces

arxiv:math/ v1 [math.pr] 19 Apr 2001

THE EFFICIENCIES OF THE SPATIAL MEDIAN AND SPATIAL SIGN COVARIANCE MATRIX FOR ELLIPTICALLY SYMMETRIC DISTRIBUTIONS

Permanent vs. Determinant

Chapter 6: Energy-Momentum Tensors

Partial Differential Equations

Transcription:

Noname manuscript No. (will be inserte by the eitor) Moments Calculation For the Doubly Truncate Multivariate Normal Density Manjunath B G Stefan Wilhelm arxiv:1206.5387v1 [stat.co] 23 Jun 2012 This version: 23.06.2012 Abstract In the present article we erive an explicit expression for the truncate mean an variance for the multivariate normal istribution with arbitrary rectangular ouble truncation. We use the moment generating approach of Tallis (1961) an exten it to general µ, Σ an all combinations of truncation. As part of the solution we also give a formula for the bivariate marginal ensity of truncate multinormal variates. We also prove an invariance property of some elements of the inverse covariance after truncation. Computer algorithms for computing the truncate mean, variance an the bivariate marginal probabilities for oubly truncate multivariate normal variates have been written in R an are presente along with three examples. Keywors multivariate normal; ouble truncation; moment generating function; bivariate marginal ensity function; graphical moels; conitional inepenence Mathematics Subject Classification (2010) 60E05 62H05 B. G. Manjunath CEAUL an DEIO, FCUL, University of Lisbon, Portugal E-mail: bgmanjunath@gmail.com Stefan Wilhelm Department of Finance, University of Basel, Switzerlan Tel.: +49-172-3818512 E-mail: Stefan.Wilhelm@stu.unibas.ch

2 Manjunath B G, Stefan Wilhelm 1 Introuction The multivariate normal istribution arises frequently an has a wie range of applications in fiels like multivariate regression, Bayesian statistics or the analysis of Brownian motion. One motivation to eal with moments of the truncate multivariate normal istribution comes from the analysis of special financial erivatives ( auto-callables or Expresszertifikate ) in Germany. These proucts can expire early epening on some restrictions of the unerlying trajectory, if the unerlying is above or below certain call levels. In the framework of Brownian motion the finite-imensional istributions for log returns at any points in time are multivariate normal. When some of the multinormal variates X = (x 1,..., x ) N(µ, Σ) are subject to inequality constraints (e.g. a i x i b i ), this results in truncate multivariate normal istributions. Several types of truncations an their moment calculation have been escribe so far, for example the one-sie rectangular truncation x a (Tallis 1961), the rather unusual elliptical an raial truncations a x Rx b (Tallis 1963) an the plane truncation Cx p (Tallis 1965). Linear constraints like a Cx b can often be reuce to rectangular truncation by transformation of the variables (in case of a full rank matrix C : a = C 1 a x C 1 b = b ), which makes the ouble rectangular truncation a x b especially important. The existing works on moment calculations iffer in the number of variables they consier (univariate, bivariate, multivariate) an the types of rectangular truncation they allow (single vs. ouble truncation). Single or onesie truncation can be either from above (x a) or below (x a), but only on one sie for all variables, whereas ouble truncation a x b can have both lower an upper truncations points. Other istinguishing features of previous works are further limitations or restrictions they impose on the type of istribution (e.g. zero mean) an the methos they use to erive the results (e.g. irect integration or moment-generating function). Next, we will briefly outline the line of research. Rosenbaum (1961) gave an explicit formula for the moments of the bivariate case with single truncation from below in both variables by irect integration. His results for the bivariate normal istribution have been extene by Shah an Parikh (1964), Regier an Haman (1971) an Muthén (1990) to ouble truncation. For the multivariate case, Tallis (1961) erive an explicit expression for the first two moments in case of a singly truncate multivariate normal ensity with zero mean vector an the correlation matrix R using the moment generating function. Amemiya (1974) an Lee (1979) extene the Tallis (1961) erivation to a general covariance matrix Σ an also evaluate the relationship between the first two moments. Gupta an Tracy (1976) an Lee (1983) gave very simple recursive relationships between moments of any orer for the oubly truncate case. But since except for the mean there are fewer equations than parameters, these recurrent conitions o not uniquely ientify

Moments Calculation For the Doubly Truncate Multivariate Normal Density 3 Table 1 Survey of previous works on the moments for the truncate multivariate normal istribution Author #Variates Truncation Focus Rosenbaum (1961) bivariate single moments for bivariate normal variates with single truncation, b 1 < y 1 <, b 2 < y 2 < Tallis (1961) multivariate single moments for multivariate normal variates with single truncation from below Shah an Parikh (1964) bivariate ouble recurrence relations between moments Regier an Haman (1971) bivariate ouble an explicit formula only for the case of truncation from below at the same point in both variables Amemiya (1974) multivariate single relationship between first an secon moments Gupta an Tracy (1976) multivariate ouble recurrence relations between moments Lee (1979) multivariate single recurrence relations between moments Lee (1983) multivariate ouble recurrence relations between moments Leppar an Tallis (1989) multivariate single moments for multivariate normal istribution with single truncation Muthén (1990) bivariate ouble moments for bivariate normal istribution with ouble truncation, b 1 < y 1 < a 1, b 2 < y 2 < a 2 Manjunath/Wilhelm multivariate ouble moments for multivariate normal istribution with ouble truncation in all variables a x b moments of orer 2 an are therefore not sufficient for the computation of the variance an other higher orer moments. Table 1 summarizes our survey of existing publications ealing with the computation of truncate moments an their limitations. Even though the rectangular truncation a x b can be foun in many situations, no explicit moment formulas for the truncate mean an variance in the general multivariate case of ouble truncation from below an/or above have been presente so far in the literature an are reaily apparent. The contribution of this paper is to erive these formulas for the first two truncate moments an to exten an generalize existing results on moment calculations from especially Tallis (1961); Lee (1983); Leppar an Tallis (1989); Muthén (1990). The remainer of this paper is organize as follows. Section 2 presents the moment generating function (m.g.f) for the oubly truncate multivariate normal case. In Section 3 we erive the first an secon moments by ifferentiating the m.g.f. These results are complete in Section 4 by giving a formula for computing the bivariate marginal ensity. In Section 5 we present two numerical examples an compare our results with simulation results. Section 6 links our results to the theory of graphical moels an erives some properties of the inverse covariance matrix. Finally, Section 7 summarizes our results an gives an outlook for further research.

4 Manjunath B G, Stefan Wilhelm 2 Moment Generating Function The imensional normal ensity with location parameter vector µ R an non-singular covariance matrix Σ is given by { 1 ϕ µ,σ (x) = (2π) /2 exp 1 } Σ 1/2 2 (x µ) Σ 1 (x µ), x R. (1) The pertaining istribution function is enote by Φ µ,σ (x). Corresponingly, the multivariate truncate normal ensity, truncate at a an b, in R, is efine as ϕ µ,σ (x), for a x b, ϕ αµ,σ (x) = P{a X b} (2) 0, otherwise. Denote α = P{a X b} as the fraction after truncation. The moment generating function (m.g.f) of a imensional truncate ranom variable X, truncate at a an b, in R, having the ensity f(x) is efine as the fol integral of the form ( m(t) = E e t X ) b = x a et f(x)x. Therefore, the m.g.f for the ensity in (2) is m(t) = 1 b { α(2π) /2 Σ 1/2 a exp 1 [ (x µ) Σ 1 (x µ) 2t x] } x. (3) 2 In the following, the moments are first erive for the special case µ = 0. Later, the results will be generalize to all µ by applying a location transformation. Now, consier only the exponent term in (3) for the case µ = 0. Then we have 1 [ ] x Σ 1 x 2t x 2 which can also be written as where ξ = Σt. 1 2 t Σt 1 2 [ ] (x ξ) Σ 1 (x ξ), Consequently, the m.g.f of the rectangularly oubly truncate multivariate normal is e m(t) = T b { α(2π) /2 Σ 1/2 a exp 1 [ (x ξ) Σ (x ξ)] } 1 x, (4) 2

Moments Calculation For the Doubly Truncate Multivariate Normal Density 5 where T = 1 2 t Σt. The above equation can be further reuce to e m(t) = T b ξ { α(2π) /2 Σ 1/2 a ξ exp 1 } 2 x Σ 1 x For notational convenience, we write equation (5) as x. (5) m(t) = e T Φ ασ (6) where Φ ασ = 1 b ξ { α(2π) /2 Σ 1/2 a ξ exp 1 } 2 x Σ 1 x x. 3 First An Secon Moment Calculation In this section we erive the first an secon moments of the rectangularly oubly truncate multivariate normal ensity by ifferentiating the m.g.f.. Consequently, by taking the partial erivative of (6) with respect to t i we have m(t) t i = e T Φ ασ t i + Φ ασ e T t i. (7) In the above equation the only essential terms which will be simplifie are an Φ ασ t i e T t i = e T k=1 σ i,k t k = b 1 b... ϕ ασ (x)x t...x 1, (8) i a1 a where a i = a i k=1 σ i,kt k an b i = b i k=1 σ i,kt k. Subsequently, (8) is where Φ ασ = t i σ i,k (F k (a k ) F k(bk )), (9) k=1 F i (x) = b 1 a 1... b i 1 b i+1 a i 1 a i+1... b a ϕ ασ (x 1,.., x i 1, x, x i+1,..x )x...x i+1 x i 1...x 1. (10)

6 Manjunath B G, Stefan Wilhelm Note that at t k = 0, for all k = 1, 2,...,, we have a i = a i an b i = b i. Therefore, F i (x) will be the i th marginal ensity. An especially convenient way of computing these one-imensional marginals is given in Cartinhour (1990). From (7) (9) for k = 1, 2,..., all t k = 0. Hence, the first moment is E(X i ) = m(t) t t=0 = i σ i,k (F k (a k ) F k (b k )). (11) k=1 Now, by taking the partial erivative of (7) with respect to t j, we have 2 m(t) t j t i = e T 2 Φ ασ t j t i + Φ ασ t i e T t j + Φ ασ 2 e T t j t i + et t i Φ ασ t j. (12) The essential terms for simplification are 2 e T t j t i = σ i,j an clearly, the partial erivative of (9) with respect to t j gives ( 2 Φ ασ F k (a = t j t i k σ ) ) ( F k (bk i,k t k=1 j σ ) ) i,k. (13) t k=1 j In the above equation merely consier the partial erivative of the marginal ensity F k (a k ) with respect to t j. With further simplification it reuces to F k (a k ) t j = b 1 b k 1 b k+1 b...... ϕ ασ (x 1,.., x t k 1, a j a1 a a k 1 a k+1 k, x k+1,..x )x k = σ j,ka k F k(a k ) + q =k σ k,k ( σ j,q σ k,qσ j,k σ k,k ) ( ) F k,q(a k, a q) F k,q (a k, b q), (14) where F k,q (x, y) = b 1... a 1 b k 1 b k+1 a k 1 a k+1 b q 1 b q+1 b...... ϕ ασ (x, y, x k, q )x k, q, (15) a q 1 a q+1 a an the short form x k enotes the vector (x 1,.., x k 1, x k+1,..x ) in ( 1) imensions an x k, q enotes the ( 2) imensional vector (x 1,..., x k 1, x k+1,..., x q 1, x q+1,..., x ) for k = q. The above equation (14) is euce from Lee (1979), pp. 167. Note that for all t k = 0 the term F k,q (x, y) will be the bivariate marginal ensity for which we will give a formula in the next section.

Moments Calculation For the Doubly Truncate Multivariate Normal Density 7 Subsequently, F k(bk ) t can be obtaine by substituting a j k by b k. From (12) (15) at all t k = 0, k = 1, 2,...,, the secon moment is E(X i X j ) = 2 m(t) t j t i t=0 = σ i,j + + k=1 k=1 σ i,k σ j,k (a k F k (a k ) b k F k (b k )) σ i,k q =k σ k,k ( σ j,q σ k,qσ j,k σ k,k ( F k,q (b k, a q ) F k,q (b k, b q ) ) [( F k,q(a k, a q ) F k,q(a k, b q )) )]. (16) Having erive expressions for the first an secon moments for ouble truncation in case of µ = 0, we will now generalize to all µ: if Y N(µ, Σ) with a y b, then X = Y µ N(0, Σ) with a = a µ x b µ = b an E(Y) = E(X)+µ an Cov(Y) = Cov(X). Equations (11) an (16) can then be use to compute E(X) an Cov(X). Hence, for general µ, the first moment is The covariance matrix E(Y i ) = σ i,k (F k (a k ) F k (b k ))+µ i. (17) k=1 Cov(Y i, Y j ) = Cov(X i, X j ) = E(X i X j ) E(X i )E(X j ) (18) is invariant to the shift in location. The equations (17) an (18) in combination with (11) an (16) form our esire result an allow the calculation of the truncate mean an truncate variance for general ouble truncation. A formula for the term F k,q (x k, x q ), the bivariate marginal ensity, will be given in the next section. We have implemente the moment calculation for mean vectormean, covariance matrixsigma an truncation vectorslower anupper as a function mtmvnorm(mean, sigma, lower, upper) in the R package tmvtnorm (Wilhelm an Manjunath 2010a;?), where the coe is open source. In Section 5 we show a usage example for this function. 4 Bivariate Marginal Density Computation In orer to compute the bivariate marginal ensity in this section we mainly follow Tallis (1961), p. 223 an Leppar an Tallis (1989) who implicitly use the bivariate marginal ensity as part of the moments calculation for single truncation, evaluate at the integration bouns. However, we exten it to the

8 Manjunath B G, Stefan Wilhelm oubly truncate case an state the function for all points within the support region. Without loss of generality we use a z-transformation for all variates x = (x 1,..., x ) as well as for all lower an upper truncation points a = (a 1,..., a ) an b = (b 1,..., b ), resulting in a N(0, R) istribution with correlation matrix R for the stanarize untruncate variates. In this section we treat all variables as if they are z-transforme, leaving the notation unchange. For computing the bivariate marginal ensity F q,r (x q, x r ) with a q x q b q, a r x r b r, q = r, we use the fact that for truncate normal ensities the conitional ensities are truncate normal again. The following relationship hols for x s, z s R 2 if we conition on x q = c q an x r = c r (s = q = r): α 1 ϕ (x s, x q = c q, x r = c r ; R) = α 1 ϕ(c q, c r ; ρ qr )ϕ 2 (z s ; R qr ), (19) where z s = (x s β sq.r c q β sr.q c r )/ (1 ρ 2 sq)(1 ρ 2 sr.q) (20) an R qr is the matrix of secon-orer partial correlation coefficients for s = q = r. β sq.r an β sr.q are the partial regression coefficients of x s on x q an x r respectively an ρ sr.q is the partial correlation coefficient between x s an x r for fixe x q. Integrating out( 2) variables x s leas to F q,r (x q, x r ) as a prouct of a bivariate normal ensity ϕ(x q, x r ) an a( 2)-imension normal integral Φ 2 : b1 F q,r (x q = c q, x r = c r ) =... a 1 br+1... a r+1 bq 1 a q 1 b a bq+1 br 1... a q+1 a r 1 ϕ αr (x s, c q, c r )x s = α 1 ϕ(c q, c r ; ρ qr )Φ 2 (A q rs; B q rs; R qr ) (21) where Ars q an Brs q enote the lower an upper integration bouns of Φ 2 given x q = c q an x r = c r : Ars q = (a s β sq.r c q β sr.q c r )/ (1 ρ 2 sq)(1 ρ 2 sr.q) (22) Brs q = (b s β sq.r c q β sr.q c r )/ (1 ρ 2 sq)(1 ρ 2 sr.q). (23) The computation of F q,r (x q, x r ) just nees the evaluation of the normal integral Φ 2 in 2 imensions, which is reaily available in most statistics software packages, for example as the function pmvnorm() in the R package mvtnorm (?). The bivariate marginal ensity function tmvnorm(x, mean, sigma, lower, upper, margin=c(q,r)) is also part of the R package tmvtnorm (Wilhelm an Manjunath 2010a;?), where reaers can fin the source coe as well as help files an aitional examples.

Moments Calculation For the Doubly Truncate Multivariate Normal Density 9 bivariate marginal ensity (x 1,x 2 ) x 2 4 2 0 2 4 0.15 0.05 0.05 0.3 0.1 0.25 0.4 0.15 0.45 4 2 0 2 4 x 1 Fig. 1 Contour plot for the bivariate truncate ensity function Marginal ensity x 1 Marginal ensity x 2 Density 0.2 0.4 0.6 0.8 Density 0.0 0.1 0.2 0.3 0.4 0.5 1.0 0.5 0.0 0.5 3 2 1 0 1 x 1 x 2 Fig. 2 Marginal ensities F k (x) (k = 1, 2) for x 1 an x 2 obtaine from Kernel ensity estimation of ranom samples an from irect evaluation of F k (x) 5 Numerical Examples 5.1 Example 1 We will use the following bivariate example with µ = (0.5, 0.5) an covariance matrix Σ ( ) 1 1.2 Σ = 1.2 2 as well as lower an upper truncation points a = ( 1, ), b = (0.5, 1), i.e. x 1 is oubly, while x 2 is singly truncate. The bivariate marginal ensity F q,r (x, y) is the ensity function itself an is shown in figure 1, the oneimensional ensities F k (x) (k = 1, 2) in figure 2. The moment calculation for our example can be performe in R as

10 Manjunath B G, Stefan Wilhelm Monte Carlo estimator for µ^1 Monte Carlo estimator for µ^2 µ^1 0.20 0.18 0.16 0.14 0.12 0.10 theoretical value 95% MC confience interval µ^2 0.50 0.45 0.40 0.35 0.30 theoretical value 95% MC confience interval 0 2000 6000 10000 0 2000 6000 10000 sample size sample size Fig. 3 Trace plots of the Monte Carlo estimator for µ > library(tmvtnorm) > mu <- c(0.5, 0.5) > sigma <- matrix(c(1, 1.2, 1.2, 2), 2, 2) > a <- c(-1, -Inf) > b <- c(0.5, 1) > moments <- mtmvnorm(mean=mu, sigma=sigma, > lower=a, upper=b) an results in µ = ( 0.152, 0.388) an covariance matrix Σ = ( ) 0.163 0.161 0.161 0.606 The trace plots in figures 3 an 4 show the evolution of a Monte Carlo estimate for the elements of the mean vector an the covariance matrix respectively for growing sample sizes. Furthermore, the 95% confience interval obtaine from Monte Carlo using the full sample of 10000 items is shown. All confience intervals contain the true theoretical value, but Monte Carlo estimates still show substantial variation even with a sample size of 10000. Simulation from a truncate multivariate istribution an calculating the sample mean or the sample covariance respectively also leas to consistent estimates of µ an Σ. Since the rate of convergence of the MC estimator is O( n), one has to ensure sufficient Monte Carlo iterations in orer to have a goo approximation or to choose variance reuction techniques.

Moments Calculation For the Doubly Truncate Multivariate Normal Density 11 Monte Carlo estimator for Cov(x 1, x 1) Monte Carlo estimator for Cov(x 1, x 2) Monte Carlo estimator for Cov(x 2, x 2) Cov(x1, x1) 0.10 0.12 0.14 0.16 0.18 theoretical value Cov(x1, x2) 0.10 0.12 0.14 0.16 0.18 0.20 theoretical value Cov(x2, x2) 0.4 0.5 0.6 0.7 0.8 theoretical value 0 2000 6000 10000 0 2000 6000 10000 0 2000 6000 10000 sample size sample size sample size Fig. 4 Trace plots of the Monte Carlo estimator for the 3 elements of Σ (σ 11, σ 12 = σ 21 an σ 22 ) 5.2 Example 2 Let µ = (0, 0, 0),the covariance matrix 1.1 1.2 0 Σ = 1.2 2 0.8 0 0.8 3 an the lower an upper truncation points a = ( 1,, ) an b = (0.5,, ), then the only truncate variable is x 1, which is furthermore uncorrelate with x 3. Our formula results in µ = c( 0.210, 0.229, 0) an 0.174 0.190 0.0 Σ = 0.190 0.898 0.8 0 0.8 3.0 For this special case of only k < truncate variables(x 1,..., x k ), the remaining k variables(x k+1,..., x ) can be regresse on the truncate variables, an a simple formula for the mean an covariance matrix can be given (see Johnson an Kotz (1971), p. 70). Let the covariance matrix Σ of(x 1,..., x ) be partitione as Σ = ( ) V11 V 12 V 21 V 22 (24)

12 Manjunath B G, Stefan Wilhelm where V 11 enotes the k k covariance matrix of (x 1,..., x k ). The mean vector 1 an the covariance matrix Σ of all variables can be compute as an ( Σ = U 11 (ξ 1, ξ 1 V 1 11 V 12) (25) U 11 V 1 11 V 12 V 21 V 1 11 U 11 V 22 V 21 (V 1 11 V 1 11 U 11V 1 11 )V 12 where ξ 1 an U 11 are the mean an covariance of the (x 1,..., x k ) after truncation. The mean an stanar eviation for the univariate truncate normal x 1 are ξ 1 = µ 1 = σ ϕ µ1,σ 11 (a 1 ) ϕ µ1,σ 11 (b 1 ) 11 Φ µ1,σ 11 (b 1 ) Φ µ1,σ 11 (a 1 ) σ 11 = σ 11+σ 11 a 1 ϕ µ1,σ 11 (a 1 ) b 1 ϕ µ1,σ 11 (b 1 ) Φ µ1,σ 11 (b 1 ) Φ µ1,σ 11 (a 1 ) Letting U 11 = σ 11 an inserting ξ 1 an U 11 into equations (25) an (26), one can verify our formula an the results for µ an Σ. However, the crux in using the Johnson/Kotz formula is the nee to first compute the moments of the truncate variables (x 1,..., x k ) for k 2. But this has been exactly the subject of our paper. ) (26) 6 Moment Calculation an Conitional Inepenence In this section we establish a link between our moment calculation an the theory of graphical moels (Whittaker (1990), Ewars (1995) an Lauritzen (1996)). We present some properties of the inverse covariance matrix an show how the epenence structure of variables is affecte after selection. Graphical moelling uses graphical representations of variables as noes in a graph an epenencies among them as eges. A key concept in graphical moelling is the conitional inepenence property. Two variables X an Y are conitional inepenent given a variable or a set of variables Z (notation X Y Z), when X an Y are inepenent after partialling out the effect of Z. For conitional inepenent X an Y the ege between them in the graph is omitte an the joint ensity factorizes as f(x, y z) = f(x z) f(y z). Conitional inepenence is equivalent to having zero elements Ω xy in the inverse covariance matrix Ω = Σ 1 as well as having a zero partial covariance/correlation between X an Y given the remaining variables: X Y Rest Ω xy = 0 ρ xy.rest = 0 1 The formula for the truncate mean given in Johnson an Kotz (1971), p. 70 is only vali for a zero-mean vector or after emeaning all variables appropriately. For non-zero means µ = (µ 1, µ 2 ) it will be(ξ 1, µ 2 +(ξ 1 µ 1 )V 1 11 V 12).

Moments Calculation For the Doubly Truncate Multivariate Normal Density 13 Both marginal inepenence an conitional inepenence between variables simplify the computations of the truncate covariance in equation (16). In the presence of conitional inepenence of i an j given q, the terms σ ij σ iq σqq 1σ qj = 0 vanish as they reflect the partial covariance of i an j given q. As has been shown by Marchetti an Stanghellini (2008), the conitional inepenence property is preserve after selection, i.e. the inverse covariance matrices Ω an Ω before an after truncation share the same zero-elements. We prove that many elements of the precision matrix are invariant to truncation. For the case of k < truncate variables, we efine the set of truncate variables with T = {x 1,..., x k }, an the remaining k variables as S = {x k+1,..., x }. We can show that the off-iagonal elements Ω i,j are invariant after truncation for i T S an j S: Proposition 6.1. The off-iagonal elements Ω i,j an the iagonal elements Ω j,j are invariant after truncation for i T S an j S. Proof. The proof is a irect application of the Johnson/Kotz formula in equation (26) in the previous section. As a result of the formula for partitione inverse matrices (Greene (2003), section A.5.3), the corresponing inverse covariance matrix Ω of the partitione covariance matrix Σ is ( V 1 Ω = 11 (I+V 12F 2 V 21 V11 1) V 1 11 V ) 12F 2 F 2 V 21 V11 1 (27) F 2 with F 2 = (V 22 V 21 V 1 11 V 12) 1. Inverting the truncate covariance matrix Σ in equation (26) using the formula for the partitione inverse leas to the truncate precision matrix ( Ω U 1 = 11 + V 1 11 V 12F 2 V 21 V11 1 V11 1 V ) 12F 2 F 2 V 21 V11 1 (28) F 2 where the Ω12 an Ω 21 elements are the same as Ω 12 an Ω 21 respectively. The same is true for the elements in Ω22, especially the iagonal elements in Ω22. Here, we prove this invariance property only for a subset of truncate variables. Base on our experiments we conjecture that the same is true also for the case of full truncation (i.e. all off-iagonal elements in Ω11 ), but we o not give a rigorous proof here an leave it to future research. 6.1 Example 3 We illustrate the invariance of the elements of the inverse covariance matrix with the famous mathematics marks example use in Whittaker (1990) an Ewars (1995), p. 49. The inepenence graph of the five variables(w, V, X,

14 Manjunath B G, Stefan Wilhelm Y, Z) in this example takes the form of a butterfly. vectors (W) statistics (Z) algebra (X) mechanics (V) analysis (Y) Here, we have the conitional inepenencies (W, V) (Y, Z) X. A corresponing precision matrix might look like (sample ata; zero-elements marke as. ): 1 0.2 0.3.. 0.2 1 0.1.. Ω = 0.3 0.1 1 0.4 0.5.. 0.4 1 0.2.. 0.5 0.2 1 After truncation in some variables (for example(w, V, X) as 2 W 1, 1 V 1, 0 X 1), we apply equation (16) to compute the truncate secon moment an the inverse covariance matrix as: (29) 1.88 0.2 0.3.. Ω 0.2 3.45 0.1.. = 0.3 0.1 12.67 0.4 0.5.. 0.4 1 0.2.. 0.5 0.2 1 (30) The precision matrix Ω after selection iffers from Ω only in the iagonal elements of(w, V, X). From Ω we can rea how partial correlations between variables have change ue to the selection process. Each iagonal element Ω yy of the precision matrix is the inverse of the partial variance after regressing on all other variables (Whittaker (1990),p. 143). Since only those iagonal elements in the precision matrix for the k of the truncate variables will change after selection, this leas to the iea to just compute these k elements after selection rather than the full k(k + 1)/2 symmetric elements in the truncate covariance matrix an applying the Johnson/Kotz formula for the remaining k variables. However, the inverse partial variance of a scalar Y given the remaining variables X = {x 1,..., x }\ y [ ] 1 [ ] 1 Ω yy = Σ y.x = Σ yy Σ yx Σ 1 XX Σ Xy still requires the truncate covariance results erive in Section 3.

Moments Calculation For the Doubly Truncate Multivariate Normal Density 15 7 Summary In this paper we erive a formula for the first an secon moments of the oubly truncate multivariate normal istribution an for their bivariate marginal ensity. An implementation for both formulas has been mae available in the R statistics software as part of the tmvtnorm package. We linke our results to the theory of graphical moels an prove an invariance property for elements of the precision matrix. Further research can eal with other types of truncation than we consiere (e.g. elliptical). Another line of research can look at the moments of the oubly truncate multivariate Stuent-t istribution, which contains the truncate multivariate normal istribution as a special case. References Amemiya T (1974) Multivariate regression an simultaneous equations moels when the epenent variables are truncate normal. Econometrica 42:999 1012 Cartinhour J (1990) One-imensional marginal ensity functions of a truncate multivariate normal ensity function. Communications in Statistics - Theory an Methos 19:197 203 Ewars D (1995) Introuction to graphical moelling. Springer Genz A, Bretz F, Miwa T, Mi X, Leisch F, Scheipl F, Hothorn T (2010) mvtnorm: Multivariate normal an t istributions. URL http://cran.r-project.org/package=mvtnorm, R package version 0.9-95 Greene WH (2003) Econometric Analysis, 5th en. Prentice-Hall Gupta AK, Tracy DS (1976) Recurrence relations for the moments of truncate multinormal istribution. Communications in Statistics - Theory an Methos 5(9):855 865 Johnson NL, Kotz S (1971) Distributions in Statistics: Continuous Multivariate Distributions. John Wiley & Sons Lauritzen S (1996) Graphical Moels. Oxfor University Press Lee LF (1979) On the first an secon moments of the truncate multi-normal istribution an a simple estimator. Economics Letters 3:165 169 Lee LF (1983) The etermination of moments of the oubly truncate multivariate tobit moel. Economics Letters 11:245 250 Leppar P, Tallis GM (1989) Algorithm AS 249: Evaluation of the mean an covariance of the truncate multinormal istribution. Applie Statistics 38:543 553 Marchetti GM, Stanghellini E (2008) A note on istortions inuce by truncation with applications to linear regression systems. Statistics & Probability Letters 78:824829 Muthén B (1990) Moments of the censore an truncate bivariate normal istribution. British Journal of Mathematical an Statistical Psychology 43:131 143 Regier MH, Haman MA (1971) Correlation in a bivariate normal istribution with truncation in both variables. Australian Journal of Statistics 13:77 82 Rosenbaum S (1961) Moments of a truncate bivariate normal istribution. Journal of the Royal Statistical Society Series B (Methoological) 23:405 408 Shah SM, Parikh NT (1964) Moments of single an oubly truncate stanar bivariate normal istribution. Viya (Gujarat University) 7:82 91 Tallis GM (1961) The moment generating function of the truncate multinormal istribution. Journal of the Royal Statistical Society, Series B (Methoological) 23(1):223 229 Tallis GM (1963) Elliptical an raial truncation in normal populations. The Annals of Mathematical Statistics 34(3):940 944 Tallis GM (1965) Plane truncation in normal populations. Journal of the Royal Statistical Society, Series B (Methoological) 27(2):301 307 Whittaker J (1990) Graphical moels in applie multivariate statistics. John Wiley & Sons

16 Manjunath B G, Stefan Wilhelm Wilhelm S, Manjunath BG (2010a) tmvtnorm: A Package for the Truncate Multivariate Normal Distribution. The R Journal 2(1):25 29, URL http://journal.r-project.org/archive/2010-1/rjournal_2010-1_wilhelm+manjunath.pf Wilhelm S, Manjunath BG (2010b) tmvtnorm: Truncate multivariate normal istribution an Stuent t istribution. URL http://cran.r-project.org/package=tmvtnorm, R package version 1.1-5