CONDITIONAL PROBABILITY INTEGRAL TRANSFORMATIONS FOR MULTIVARIATE NORMAL DISTRIBUTIONS

Similar documents
Direction: This test is worth 250 points. You are required to complete this test within 50 minutes.

5. Likelihood Ratio Tests

First Year Quantitative Comp Exam Spring, Part I - 203A. f X (x) = 0 otherwise

Stat 319 Theory of Statistics (2) Exercises

Introducing a Novel Bivariate Generalized Skew-Symmetric Normal Distribution

Topic 9: Sampling Distributions of Estimators

Double Stage Shrinkage Estimator of Two Parameters. Generalized Exponential Distribution

Probability 2 - Notes 10. Lemma. If X is a random variable and g(x) 0 for all x in the support of f X, then P(g(X) 1) E[g(X)].

Let us give one more example of MLE. Example 3. The uniform distribution U[0, θ] on the interval [0, θ] has p.d.f.

Exact Conditional Tests and Approximate Bootstrap Tests for the von Mises Distribution

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara

Exponential Families and Bayesian Inference

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.

Lecture 7: Properties of Random Samples

Topic 9: Sampling Distributions of Estimators

Probability and Statistics

1.010 Uncertainty in Engineering Fall 2008

Lecture 3. Properties of Summary Statistics: Sampling Distribution

APPLIED MULTIVARIATE ANALYSIS

Bayesian Methods: Introduction to Multi-parameter Models

Berry-Esseen bounds for self-normalized martingales

Random Variables, Sampling and Estimation

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4

Stat 421-SP2012 Interval Estimation Section

Goodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen)

Advanced Stochastic Processes.

Self-normalized deviation inequalities with application to t-statistic

Rank tests and regression rank scores tests in measurement error models

A Note on Effi cient Conditional Simulation of Gaussian Distributions. April 2010

Generating Functions for Laguerre Type Polynomials. Group Theoretic method

ECONOMETRIC THEORY. MODULE XIII Lecture - 34 Asymptotic Theory and Stochastic Regressors

Topic 9: Sampling Distributions of Estimators

Properties and Hypothesis Testing

Maximum likelihood estimation from record-breaking data for the generalized Pareto distribution

DERIVING GENERALIZED MEANS AS LEAST SQUARES AND MAXIMUM LIKELIHOOD ESTIMATES

A note on self-normalized Dickey-Fuller test for unit root in autoregressive time series with GARCH errors

Direction: This test is worth 150 points. You are required to complete this test within 55 minutes.

Estimation for Complete Data

Confidence interval for the two-parameter exponentiated Gumbel distribution based on record values

EECS564 Estimation, Filtering, and Detection Hwk 2 Solns. Winter p θ (z) = (2θz + 1 θ), 0 z 1

Problem Set 4 Due Oct, 12

UNIVERSITY OF NORTH CAROLINA Department of Statistics Chapel Hill, N. C. FOR MULTIVARIATE POPULATIONS. J. N. Srivastava.

Notes 19 : Martingale CLT

EDGEWORTH SIZE CORRECTED W, LR AND LM TESTS IN THE FORMATION OF THE PRELIMINARY TEST ESTIMATOR

Journal of Multivariate Analysis. Superefficient estimation of the marginals by exploiting knowledge on the copula

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

Lecture 33: Bootstrap

Parameter, Statistic and Random Samples

Unbiased Estimation. February 7-12, 2008

SOME THEORY AND PRACTICE OF STATISTICS by Howard G. Tucker

17. Joint distributions of extreme order statistics Lehmann 5.1; Ferguson 15

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1.

Goodness-Of-Fit For The Generalized Exponential Distribution. Abstract

ESTIMATION AND PREDICTION BASED ON K-RECORD VALUES FROM NORMAL DISTRIBUTION

Matrix Representation of Data in Experiment

Slide Set 13 Linear Model with Endogenous Regressors and the GMM estimator

Math 152. Rumbos Fall Solutions to Review Problems for Exam #2. Number of Heads Frequency

Review Questions, Chapters 8, 9. f(y) = 0, elsewhere. F (y) = f Y(1) = n ( e y/θ) n 1 1 θ e y/θ = n θ e yn

Probability and statistics: basic terms

MA Advanced Econometrics: Properties of Least Squares Estimators

Precise Rates in Complete Moment Convergence for Negatively Associated Sequences

1 Introduction to reducing variance in Monte Carlo simulations

Chi-Squared Tests Math 6070, Spring 2006

Confidence Intervals

Lecture 11 and 12: Basic estimation theory

Introduction to Extreme Value Theory Laurens de Haan, ISM Japan, Erasmus University Rotterdam, NL University of Lisbon, PT

IIT JAM Mathematical Statistics (MS) 2006 SECTION A

Testing Statistical Hypotheses for Compare. Means with Vague Data

MATH/STAT 352: Lecture 15

Discrete Mathematics for CS Spring 2008 David Wagner Note 22

SDS 321: Introduction to Probability and Statistics

Lecture 2: Monte Carlo Simulation

Efficient GMM LECTURE 12 GMM II

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures

4. Partial Sums and the Central Limit Theorem

RELATIONSHIPS BETWEEN MOMENTS OF TWO RELATED SETS OF ORDER STATISTICS AND SOME EXTENSIONS

A Generalized Gamma-Weibull Distribution: Model, Properties and Applications

Kernel density estimator

A goodness-of-fit test based on the empirical characteristic function and a comparison of tests for normality

Exam II Review. CEE 3710 November 15, /16/2017. EXAM II Friday, November 17, in class. Open book and open notes.

Module 1 Fundamentals in statistics

A statistical method to determine sample size to estimate characteristic value of soil parameters

Basis for simulation techniques

Expectation and Variance of a random variable

Confidence Interval for Standard Deviation of Normal Distribution with Known Coefficients of Variation

1 General linear Model Continued..

Large Sample Theory. Convergence. Central Limit Theorems Asymptotic Distribution Delta Method. Convergence in Probability Convergence in Distribution

PREDICTION INTERVALS FOR FUTURE SAMPLE MEAN FROM INVERSE GAUSSIAN DISTRIBUTION

Nonnegative-Definite Covariance Structures for. which the Least Squares Estimator is the Best. Linear Unbiased Estimator

Using the IML Procedure to Examine the Efficacy of a New Control Charting Technique

TAMS24: Notations and Formulas

Estimation of the Mean and the ACVF

Access to the published version may require journal subscription. Published with permission from: Elsevier.


Complex Analysis Spring 2001 Homework I Solution

Lecture 18: Sampling distributions

DEEPAK SERIES DEEPAK SERIES DEEPAK SERIES FREE BOOKLET CSIR-UGC/NET MATHEMATICAL SCIENCES

Chapter 12 EM algorithms The Expectation-Maximization (EM) algorithm is a maximum likelihood method for models that have hidden variables eg. Gaussian

Chapter 3. Strong convergence. 3.1 Definition of almost sure convergence

Chi-squared tests Math 6070, Spring 2014

Transcription:

CONDITIONAL PROBABILITY INTEGRAL TRANSFORMATIONS FOR MULTIVARIATE NORMAL DISTRIBUTIONS Satiago Rico Gallardo, C. P. Queseberry, F. J. O'Reilly Istitute of Statistics Mimeograph Series No. 1148 Raleigh, November 1977

CONDITIONAL PROBABILITY mtegral TRANSFORMATIONS FOR MULTIVARIATE NORMAL DISTRIBUTIONS Satiago Ricm-Gallardo l ad C. P. Q.ueseberry2 North Carolia State Uiversity ad Federico J. O'ReillY limas, Uiversidad Nacioal Aut6oma de M~xico Let Xl' '" X be a radom sample from a full-rak multivariate ormal distributio N(~,~). The two cases (i) ~ ukow :d ~ = a2~o'~o kow, ad (ii) ~ ad ~ completely ukow are cosidered here. Trasformatios are give that trasform the observatio vectors to a (smaller) set of i.i.d. uiform rv's. These trasformatios ca be used to costruct goodess-of-fit tests for these multivariate ormal distributios. 1.. ItroductioI) ad S1JX!lmau. There is a large literature that cosiders the multivariate ormal distributio. However, there is very little available i the way of goodess-of-fit tests for multivariate ormality, ad othig whatever that is based upo exact distributio theory ad therefore is applicable for small ad moderate size samples. Recetly, Moore (1976) has commeted upo the eed for goodess-of-fit tests for multivariate ormality. I this work we give trasformatios which ca be used to costruct exact level goodess-of-fit tests for multivariate ormality. O'Reilly ad Queseberry (1973), O-Q, itroduced the coditioal probability itegral trasformatios, CPIT's. Trasformatios were give i that paper for a multivariate ormal paret N(~,~) for the case whe ~ is ukow ad 1 Supported by limas, UNAM. 2Research supported by Natioal Sciece Foudatio Grat MCS76-82652.

2 t = to is kow. Here we give trasformatios for the two cases: (i) ~ ukow, ad t = a 2t o with EO kow, ad (ii) ~ ad E ukow. I both of these cases the compoets of the observatio vectors are trasformed usig certai Studet-t distributio fuctios. The trasformatios ad model testig techiques cosidered here ca be carried out o high-speed computers. g. Notatio~ ~ Prelimiaries. Let P deote the class of k-variate full-rak ormal distributios with mea vector ~ ad variace-covariace matrix E. For X X i.i.d. (colum) vector rv's from PEP, with l', correspodig probability desity fuctio f ad distributio f~~ctio F, both defied o ~, a complete sufficiet statistic for P is T = (X, s ), where X = (li).2:_ X. ad S =.tlx.x' - X X' ~=1 ~ ~= ~ i readily verified that T is doubly trasitive (cf. O-Q), i.e., -- It is a(t, X ) =a(t l' X ), where a(w) deotes the a - algebra iduced by a - statistic W. Cosider the coditioal distributio fuctio -F of a sigle observatio give the statistic T. For > k+l ~ is absolutely cotiuous ad possesses - a desity fuctio f which is the miimum variace ubiased, MVU, estimator of the paret desity fuctio. These fuctios were obtaied by Ghurye ad Olki (1969), p. 1265, cases 3.2 ad 3.4, for the cases (i) ad (ii) above. Case (ii) will be developed i detail, but we will oly summarize the results for case (i). The ext lemma gives the desity f i a form that will be - coveiet i this work. The idicator fuctio of the set satisfyig coditio [.] is deoted by I[.] LEMMA 2.1. If Xl"'" X are i.i.d. rv's with a commo multivariate ormal distributio PEP, the MVU estimator f of the correspodig ormal

3 probability desity fuctio is (2.1).I[(x-X )' S-l(x_X ) ~ (-l)/], > k + 1 PROOF. This result is immediate from Ghurye ad Olki (1969) ad the two facts: (a) For B(k x k) osigular ad x(k.x 1), IB-xx'\ = IBI(l-x'B-lx) (b) If B is p.d. the B-xx' is p.d. iff x'bx < 1. LEMMA 2.2. Suppose Y is a TV which has for fixed T the coditioal desity fuctio f of (2.1). The for Z = A (Y-X )/[((-l)/) - (Y-X )'s-l(y_x )}t, where A'A = s-l, the coditioal desity fuctio of Z give T is 1 PROOF. If z = A (y-x )/[((-l)/) - (y-x )' s-l(y_x )}2, 1 1 the y = A- l z((_l)/)2 (1+ Z'Z)-2 + X The Jacobia is +! [((-l)/)/(l+ z'z)} 2 IA I-III - zz'/(l+ z'z)1, ad usig the relatios I '/( ')-11 (,)-1 (- )' -l( -) I - zz 1+ z z = 1+ z z ad y-x S y-x = the result follows from Lemma 2.1. ((-l)/)z'z/(l+ z'z), The desity fuctio g of (2.2) has the form of a geeralized multivariate t distributio. Dickey (1967), Theorems 3.2 ad 3.3, gives coditioal ad

4 margial distributios for geeralized multivariate t distributios from which the coditioal ad margial distributios of g of (2.2) ca be obtaied. Let G deote the distributio fuctio of a uivariate Studet-t \.l distributio with \.l from results give by Dickey. degrees of freedom. The the followig ca be obtaied LEMMA 2.3. Let Z' = (Zl' ' \) deote a vector rv with (coditioal) probability desity fuctio g(z) of equatio (2.2). The for i = 1,, k. Cosider agai the origial sample Xl' " X ' ad put 1 Z. = A.(X. - X.)/([(j-l)/j] _ (X. _ X.)'S:l(X. _ X.)}2", J JJ J J J J J J ad deote Z~ = (Zl.,.", ~.) for j = k+2,..,. The the ext theorem J ~J ~,J follows from Lemma 2.3 ad a slight extesio of Theorem 5.1 of O-Q. THEOREM: 2.1. The k(-k-l) radom variables give by (2.4) u.. = G. k. 2 (Z..[(j-k+i-2)/(1 + Z2 l. + + i 1.)J~, ~,J J- +~- ~,J,J ~-,J for j = k+2,, ad i =1,", k; are i.i.d. U(O,l) rv's. We ow summarize the results for case (i) whe IJ. is ukow ad 2 E =a Eo for EO kow. For - X 'ad S defied above put here ad

5 THEOREM 2.2. (-2)k radom variables give by For Xl' ' X LLd. from N(I-L,ciE o )' EO kow, the for j = 3,, ad i = 1,, k; are i.i.d. U(O,l) rv's. ~. Discuss~. After the multivariate sample Xl' ' X has bee trasformed by usig either (2.4) or (2.5), the a level ~ goodess-of-fit test for the correspodig composite multivariate ormal ull hypothesis class (case (i) or case (ii)) ca be made by testig the surrogate simple ull hypothesis that the trasformed values are i.i.d. U(O,l). Queseberry ad Miller (1977) ad Miller ad Queseberry (1977) have studied power properties of omibus tests for uiformity ad recommed either the Watso ~ test (Watso (1962)) or the Neyma smooth test (Neyma (1937)) for testig simple uiformity. REFERENCES Dickey, J. M. (1967). Matricvariate geeralizatios of the multivariate t distributio ad the iverted multivariate t distributio. A. Math. Statist. j, 511-18. Ghurye, S. G. ad Olki, I. (1969). Ubiased estimatio of some multivariate probability desities ad related fuctios. A. Math. Statist.~, 1261-71. Miller, F. L., Jr. ad Queseberry, C. P. (1977). Power studies of tests for uiformity, II. Commu. Statist. submitted. Moore, D. D. (1976). Recet developmets i chi-square tests for goodess-of-fit. Departmet of Statistics, Purdue Uiversity, Mimeograph Series #459.

6 Neyma, Jerzy (1937). "Smooth" test for goodess-of-fit. Skadiavisk Aktuarietidskrift 20, 149-99...;;;...;.;.;,--"~---..-...;...;;;- -- O'Reilly, F. J. ad Queseberry, C. P. (1973). The coditioal probability itegral trasformatio ad applicatios to obtai composite chi-square goodess-of-fit tests. A. Statist. 1, 74-83. Queseberry, C. P. ad Miller, F. L., Jr. (1977). Power studies of some tests for uiformity. J. Statist. Comput. Simul. 2, 169-91. Watso, G. S. (1962). Goodess-of-fit tests o a circle. II. Biometrika ~, 57-63.