The Poisson Correlation Function

Similar documents
PHYSICS 116A Homework 7 Solutions. 0 i

Asymptotics of Integrals of. Hermite Polynomials

Generating Functions of Partitions

Least Squares Regression

5.4 Bessel s Equation. Bessel Functions

The Periodogram and its Optical Analogy.

Bilinear generating relations for a family of q-polynomials and generalized basic hypergeometric functions

MATH 2030: MATRICES ,, a m1 a m2 a mn If the columns of A are the vectors a 1, a 2,...,a n ; A is represented as A 1. .

G-SPLINE INTERPOLATION FOR APPROXIMATING THE SOLUTION OF FRACTIONAL DIFFERENTIAL EQUATIONS USING LINEAR MULTI- STEP METHODS

DIFFERENCE EQUATIONS

THE NUMBER OF LOCALLY RESTRICTED DIRECTED GRAPHS1

STATISTICS SYLLABUS UNIT I

An Involution for the Gauss Identity

IOSR Journal of Mathematics (IOSR-JM) e-issn: , p-issn: x. Volume 9, Issue 1 (Nov. Dec. 2013), PP

On the Computation of a Lagrangian Interpolation.

THEODORE VORONOV DIFFERENTIABLE MANIFOLDS. Fall Last updated: November 26, (Under construction.)

SOME FINITE IDENTITIES CONNECTED WITH POISSON'S SUMMATION FORMULA

arxiv: v1 [math.nt] 22 Jan 2019

Lecture 2. The Simple Linear Regression Model: Matrix Approach

ACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 2 MATH00040 SEMESTER / Probability

Vectors To begin, let us describe an element of the state space as a point with numerical coordinates, that is x 1. x 2. x =

8 RESPONSE SURFACE DESIGNS

GEOMETRY OF THE HEAT EQUATION: FIRST PAPER. they forn a two-parameter family, the parameters being temperature and

Solutions for Math 225 Assignment #4 1

ALL TEXTS BELONG TO OWNERS. Candidate code: glt090 TAKEN FROM

A General Formula of Polynomial Interpolation. A. C. AITKEN. (Received 2nd March Bead 3rd May 1929.) 1. ENUNCIATION AND PEOOF.

Dr. Maddah ENMG 617 EM Statistics 10/15/12. Nonparametric Statistics (2) (Goodness of fit tests)

On Systems of Diagonal Forms II

An Overview of Complex Hadamard Cubes

Elementary maths for GMT

ON GENERALIZED WEYL FRACTIONAL q-integral OPERATOR INVOLVING GENERALIZED BASIC HYPERGEOMETRIC FUNCTIONS. Abstract

Regression. Oscar García

BENT POLYNOMIALS OVER FINITE FIELDS

ALGEBRAIC PROPERTIES OF SELF-ADJOINT SYSTEMS*

Newtonian Mechanics. Chapter Classical space-time

The Perrin Conjugate and the Laguerre Orthogonal Polynomial

OWN ZEROS LUIS DANIEL ABREU

Before you begin read these instructions carefully.

5 CORRELATION. Expectation of the Binomial Distribution I The Binomial distribution can be defined as: P(X = r) = p r q n r where p + q = 1 and 0 r n

Math 54. Selected Solutions for Week 5

Linear Regression and Its Applications

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

Testing the homogeneity of variances in a two-way classification

Difference Equations for Multiple Charlier and Meixner Polynomials 1

Infinite series, improper integrals, and Taylor series

Chapter 4 Sequences and Series

Math 113 (Calculus 2) Exam 4

Math Camp. September 16, 2017 Unit 2. MSSM Program Columbia University Dr. Satyajit Bose

Math Exam 3 Solutions

Definition 5.1. A vector field v on a manifold M is map M T M such that for all x M, v(x) T x M.

A Note on Solutions of the Matrix Equation AXB = C

CONVOLUTION TREES AND PASCAL-T TRIANGLES. JOHN C. TURNER University of Waikato, Hamilton, New Zealand (Submitted December 1986) 1.

Journal of Computational and Applied Mathematics

There are six more problems on the next two pages

On the independence of quadratic forms in a noncentral. Osaka Mathematical Journal. 2(2) P.151-P.159

The solution of linear differential equations in Chebyshev series

PROBABILITY DISTRIBUTIONS: DISCRETE AND CONTINUOUS

On the Frequency-Domain Properties of Savitzky-Golay Filters

Taylor series. Chapter Introduction From geometric series to Taylor polynomials

Chapter 4. Matrices and Matrix Rings

POLYNOMIAL EQUATIONS OVER MATRICES. Robert Lee Wilson. Here are two well-known facts about polynomial equations over the complex numbers

ANALYTIC TRANSFORMATIONS OF EVERYWHERE DENSE POINT SETS*

So far we discussed random number generators that need to have the maximum length period.

Transformation formulas for the generalized hypergeometric function with integral parameter differences

MASSACHUSETTS INSTITUTE OF TECHNOLOGY Chemistry 5.76 Revised February, 1982 NOTES ON MATRIX METHODS

GENERAL SOLUTION OF A FIBONACCI-LIKE RECURSION RELATION AND APPLICATIONS

COVENANT UNIVERSITY NIGERIA TUTORIAL KIT OMEGA SEMESTER PROGRAMME: ECONOMICS

Polynomial Properties in Unitriangular Matrices 1

On Cauchy-Kowalevski's Theorem; A Necessary Condition

Large Sample Properties of Estimators in the Classical Linear Regression Model

Linear Algebra. The analysis of many models in the social sciences reduces to the study of systems of equations.

Relevant sections from AMATH 351 Course Notes (Wainwright): Relevant sections from AMATH 351 Course Notes (Poulin and Ingalls):

Fall TMA4145 Linear Methods. Exercise set Given the matrix 1 2

POLYNOMIAL SOLUTIONS OF PELL S EQUATION AND FUNDAMENTAL UNITS IN REAL QUADRATIC FIELDS

w = X ^ = ^ ^, (i*i < ix

Ultraspherical moments on a set of disjoint intervals

P -adic root separation for quadratic and cubic polynomials

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

On the N-tuple Wave Solutions of the Korteweg-de Vnes Equation

3.5 Undetermined Coefficients

Alphonse Magnus, Université Catholique de Louvain.

Chapter 1 Vector Spaces

Linear Algebra Practice Problems

Quantum Mechanics- I Prof. Dr. S. Lakshmi Bala Department of Physics Indian Institute of Technology, Madras

The Riemann Hypothesis Project summary

SMOOTHNESS PROPERTIES OF SOLUTIONS OF CAPUTO- TYPE FRACTIONAL DIFFERENTIAL EQUATIONS. Kai Diethelm. Abstract

A NOTE ON THE SINGULAR MANIFOLDS OF A DIFFERENCE POLYNOMIAL

This ODE arises in many physical systems that we shall investigate. + ( + 1)u = 0. (λ + s)x λ + s + ( + 1) a λ. (s + 1)(s + 2) a 0

Do not copy, post, or distribute

MATH Max-min Theory Fall 2016

Real Analysis Math 131AH Rudin, Chapter #1. Dominique Abdi

MS 2001: Test 1 B Solutions

ON SIMILARITIES BETWEEN EXPONENTIAL POLYNOMIALS AND HERMITE POLYNOMIALS

Physics 116A Determinants

Hankel Operators plus Orthogonal Polynomials. Yield Combinatorial Identities

arxiv:physics/ v1 [physics.class-ph] 26 Sep 2003

JUST THE MATHS UNIT NUMBER DIFFERENTIATION APPLICATIONS 5 (Maclaurin s and Taylor s series) A.J.Hobson

Formula for the inverse matrix. Cramer s rule. Review: 3 3 determinants can be computed expanding by any row or column

T h e C S E T I P r o j e c t

The Use of Large Intervals in Finite- Difference Equations

Transcription:

The Poisson Correlation Function By J. T. CAMPBELL, Edinburgh University. {Received 8th July, 9. Read Uh November, 9.). Introduction. In this study, the use of factorial moments and factorial moment generating functions as applied () to the Poisson frequency function and Charlier's Type B function is further extended towards developing a theory of these distributions for the case of two or more correlated variables. The results obtained bear a close analogy to the known ones for the correlated normal function and Type A function, though there does not appear to be a simple closed form for the multiple Poisson function of correlated variables compared with that of the normal frequency function of several linearly correlated variables. In the last section a numerical example is given to verify the theoretical formulae of double Poisson correlation. As no statistical records of correlation of two variables, each following a Poisson law, could be discovered, an artificial experiment, suggested by Darbishire's experiments in throws of dice, was devised to provide the necessary data.. The Poisson Correlation Function. Consider first the case of the " fourfold table." Let the following fourfold table display the frequency in a population of two characters A and B which are not independent: A B a b P- B c d P Pi P Whittaker and Robinson, Calculus of Observations, p. 0.

THE POISSON CORRELATION FUNCTION 9 Here p } and p are the relative frequency (probability) of A and B respectively, p x and p that of the absence of thes echaracters. Since A and B are not independent the relative frequency d of both together will be different from pip - The frequency generating function () for this population will be a + ba + cb + dab = l +p l (A-l) + p (B - ) +d (A- ) (B - ). Putting A = + a, B = -j- B, we have the factorial moment generating function (m. g. f.) of the population, +p ia +p B + dab, and so that of samples of N drawn from it, with replacement of individuals if the population is finite, is the N ih power of the above. The case of interest to us is when p lt p and d are all 0 (N* ). We then have which tends to where +p B + dab) N = [(+Pia) (-B>j8) {l+d - Plp ab + 0(N-*)}]" gnha+m./+ma/^ (.) Np t = m lt Np = m, N (d p x p ) = in. To find the corresponding frequency function <f> (x, y) we have tosolve the sum equation Now e»i.a+»«a P+m^ = SS^(a;, y) ( +a) x (l + B)». X y m x a + m B + maj8 = (m m) (a + ) + (»%»w) (P + ) + m (a + ) (j8 + ) + m - (m x + m ), and so, picking out the coefficient of ( + a)* ( + B) v on the left hand side of the m. g. f. just found, we have the frequency function. - (m, m) x ~ s (m m) v ~ 8 in'. where the upper limit of the summation is the lesser of x and y. This series is identical with that obtained by A. G. M c Kendrick () by a quite different approach, the assumptions of which were that the x and y series were distributed according to the Poisson law, so that the variances were equal to the means.

0 J. T. CAMPBELL. A Series Form in Products of Orthogonal Polynomials. The form (.) of the frequency function is not very convenient for most purposes, and so we proceed to derive another, analogous to that form of the normal correlation function which involves a power series in r, the correlation coefficient. If (.) be expanded as a power series in m, we have a e m > a.p e m *t >.m*/s\, 8=0 which () can be written S { ( - ) K, (x) $ (x) ( + a)*}{s ( - )'K. (y) «A (y) ( + ft*) -y-, Si «=0 x V where tp (x) = e~ m <- mf/a!, <p (y) = e,- m >m\ly\, and K e {x), K s (y) are the orthogonal polynomials () appropriate to the Poisson frequency function. The theorem used here is (, ) that multiplication of a factorial m. g. f. by a r is equivalent to the operation ( V) r on the frequency function, where V/(x) = / (z) f(x ). Thus finally, assuming that the order of summation may be interchanged, we have + (,v) ^ ^ * is ' which could also be written in operational form +.., (.) ' ) <f>{x, y) = e. The Factorial Moments of the Distribution. Let m (rs) be the factorial moment of order r in x, s in y, denned by m (r> s) = T,X<f>(x,y)x(x - ).... (x - r + l)y(y - )....(y - s + ). x y Since it is the coefficient of a r j s /(r\ s\) in (.), we have where the upper limit for t is the lesser of r and s. In particular m (l, 0) = m l>

THE POISSON CORRELATION FUNCTION These three moments can be computed directly from any set of data, and so the three parameters of the frequency function are completely determined. Also, further factorial moments of the fitted distribution can be obtained by summing the necessary terms of (.). It is easy to prove, by summing (.) over all values of y, that the distribution of x alone is an ordinary Poisson distribution with mean m u with a similar result for the distribution of y. Thus the totals of the x and y arrays in the correlation table are distributed according to the Poisson exponential law.. Regression Lines. To find the mean x corresponding to a fixed y = k, which we shall denote by x k> we have, by (.), and m-\-l) (m<> m) k ~ m E x* (x, fcjr-e \ v - V, " '- + (* - ) I! + (m m+k) in k ] n, x m) m\ k\ < (a;, A) = e~ m * m\\k\, IT rf^rrf so that ^ = ^;^ {x, &)/S ^ (x, i) = m x m -\ k. m Hence the locus of the means of the ct's corresponding to any y is x m = (y m ), with a similar expression for the locus of the means of the y's corresponding to any x. These straight lines may be regarded as the "regression lines" of the distribution, though they do not, as in the case of ordinary normal correlation, give the most probable value of one variable corresponding to any value of another; they give the mean value. We may further extend the analogy and define a coefficient of correlation r by r ; I A ^L\ =! I -) m (Q. i) m d,q) m ) V (n

J. T. CAMPBELL Since the totals of the x and y arrays are Poisson distributions, the variances of these are equal to their means; so that the above result may be written r = /^,0) where /* : is the ordinary product moment, /J. 0 and ^0 the variances of the x and?/ arrays, all three being calculated from the means. This is simply the ordinary Pearsonian coefficient of correlation. The definition may be justified in another way. In the case of Poisson correlation the factor expressing the correlation involves orthogonal polynomials K s (x), K s {y), which contain the parameters m and ra. In the corresponding series for normal correlation, as usually given, the variables are normalized, so that r is given absolutely, free from scale units. The true analogue of our series will therefore be the normal series, when the Hermite polynomials that occur in it are defined by non-normalized variables. In such a case we must have H. (x, a x ) = e**'/v (- and the series for the normal correlation function then takes the shape r ni + ra x a v H x (x, o x ) H x (y, a v ) + ^> H (x, o x ) H (y, <r y ) +..., with which our form for the Poisson correlation function, _ + m K x (x, mi) K (x, m ) + ~ K (x, m x ) Z (x, m ) + is now in satisfactory correspondence. From the nature of the Poisson function and its orthogonal polynomials, we are compelled to leave the parameters m x and m implicit, being unable to remove them by normalizing the variables, since not powers but factorials in. x and y are involved. But we may normalize m in terms of m x and m, thus arriving at a proper analogy and at the definition of the correlation coefficient given above. It is of course known that the Pearsonian coefficient can be applied to distributions other than the normal; what is not always known in such cases is its sampling distribution. CT

THE POISSON CORRELATION FUNCTION. The Correlation Function of Type B. The distribution given by (.) can be derived under more general conditions. We shall now consider the samples of N to be formed by taking one individual from each of N different populations constituted similarly to the population taken in, that is, consisting of two correlated characters. If the parameters of the populations are p u p\, d x ; p, p', d ;.... ; p s, p' x, d s, then the factorial m. g. f. of the samples of JV, since the selections are from independent populations, is p i a){l + p\ js) { + d t - Pip' t a..}, (.) where m 'Lpi, m = ^p'i, m = 'L{d i Here B 0 and B 0 are 0 (N~ ), later terms being of smaller order. Hence in the limit we have the same factorial m. g. f. as in, and therefore the same kind of frequency function cf> (x, y). If however we do not proceed to the limit but retain the terms of higher order, then by the operational theorem mentioned in we have, from (.), the more general correlation function, /(*, y) = >{x, y) + B. to^l>{x, y) + B Oti V*>(z, y) +..., (.) where <f> (x, y) has the form given in (.). The function f(x, y) may be called the correlation function of Type B, and may be compared with the correlation function () of Type A, where h + r ^, and <f> (x, y) is the normal correlation function for two variables. The parameters ra x, m and m and the B's can be expressed directly () in terms of the factorial moments of the distribution. Expansions similar to those for more than two normally correlated variables can be developed by the method of the present paper, but the analogies are then less marked.. Experimental Tests and Numerical Results. The data for these were obtained in the following way. A stock of 0 marbles, of which differed from the rest in colour only, was subjected to thorough mixing and then distributed at random in 0

J. T. CAMPBELL grooves capable of holding 0 marbles each exactly. (An excess of 0 over the 00 was taken in order that the mean number of coloured marbles in a groove should not be forced to be unity.) The number of coloured marbles in each groove was then noted. The 0 marbles were then mixed again and redistributed at random. This was carried out 0 times in all, and so a frequency distribution of 000 observations was obtained. The successive frequencies recorded being denoted by %, u, u,...., the sequence (u x + u ), (u +?t ), (M + tt ),... was then constructed, and the correlated pairs (u ± + u, u + %), (w + u i: w + u a ),... were taken to be the values (x it y t ) of the double distribution. The single frequencies u lt u are uncorrelated, and can be regarded as distributed, to a sufficient order of approximation j according to the Poisson law, with mean unity and so with variance unity. (Actually the distribution, obtained by sampling without replacement, is of hypergeometric type, but since the probability of success is small, p = 0-0, the Poisson law is a close approximation.) Hence the variance of the a;-series alone is a\, since the series is composed of sums of two independent terms each of variance unity; similarly a^ =. Hence the expected value of the correlation coefficient r, as given by ra x ov = {Ui + Uj) (Uj + u k )/N = Sw?/ff=l, (since u t, u jt u k are uncorrelated), is r = 0-, as might be expected. The experimental results based on these pairs of terms gave the following correlation table. x 0 y 0 8 9 9 9 u 0 9 /. 9 9 8 9 00

THE POISSON CORRELATION FUNCTION In the usual way we obtain from this the values of the required parameters: m = m x = -0, a x = -, m = m y = -00, o u =l-s, m = -0. Hence r = 0-. The next table shows the theoretical uncorrelated distribution of 00 observations having the totals of rows and columns distributed according to a Poisson law with m x, m v =, the frequency in class (x, y) being thus given by 00 e- x.e- y o x 0 8 9-8- 8- - - 0-8 0-0 8- - - - - -9-0- 0 8- - - - - -9-0- 0 - - - - 8- - 0-0 - - 8 - - 0-0- - -9-9 - - 0-0- 0 0-8 - - 0-0- 0-0 0 0 0 0 0 0 0 0 0 - - - 90- -0 8- -9-8 0-99- In order to obtain the theoretical correlation distribution, the frequencies in the cells of the above table must be multiplied by the proper value of the series in orthogonal polynomials of (.), with m =. For this purpose we require the values of these polynomials for m =, as far as the th order. Tables of the polynomials have been constructed (), depending on the recurrence relation () K r (x + ) - K r (x) = K r _ x (x), m a partial control on the computations being provided by the relation

J. T. CAMPBELL In this way we obtain the theoretical correlated frequencies of the following table, which is to be compared with that first given: y 0 8 x 0 8-9 -9 - - 0 0- -9 9- - - - - 0-0- - - - 0-9 - 0-9 0- - - - - - - - 0-0 0-0-9-90 - - 0-0 0- - - - - - - 0-0- 0-0-9 - - - 0-0- 0 0-0- 0-0- 0-0- 0 0 0 - - - 90-8- -9-0 99- As a measure of the agreement Pearson's x -test of goodness of fit may be applied, but since the test is not valid for too fine a grouping, it was decided to pool the frequencies for x and y S:. When this is done we have frequency classes with degrees of freedom, since the total frequencies of the tables have been made to agree by the process of fitting. For the correlation table of the experimental results and the theoretical correlation table just given X proves to be 0-, for which the value of P, the probability of obtaining as great or greater values of x > * s 0-80. This can be considered a reasonable value. On the other hand, applying a similar test to the theoretical uncorrelated table, we find % = 0-, for which the value of P is of extreme smallness. We conclude from the experiment that correlation of the type discussed in this paper is present, and that the mathematical representation of it by the bracket factor involving the orthogonal polynomials of Type B is an adequate one. REFERENCES.. A. C. Aitken. Proc. Boy. Soc. Edin., (9), -8.. J. T. Campbell. Proc. Edin. Math. Soc. (), (9), 99-0. Also, for tables of the orthogonal polynomials, and other details, a Thesis, deposited in the Library of the University of Edinburgh, 9.. C. V. L. Charlier. Arkiv for Mat., Astr. och Fys. 9 (9),.. A. G. M c Kendrick. Proc. Edin. Math. Soc, (9), 0.. H. E. Soper. Frequency Arrays, Cambridge, 9.