An FKG equality with applications to random environments

Similar documents
Some Correlation Inequalities for Ising Antiferromagnets

Realization of set functions as cut functions of graphs and hypergraphs

Spatial autoregression model:strong consistency

A characterization of consistency of model weights given partial information in normal linear models

Linear Algebra (part 1) : Vector Spaces (by Evan Dummit, 2017, v. 1.07) 1.1 The Formal Denition of a Vector Space

Anew index of component importance

The symmetry in the martingale inequality

Introduction to Real Analysis

Statistical Theory 1

On some properties of elementary derivations in dimension six

Note On Parikh slender context-free languages

Markov processes Course note 2. Martingale problems, recurrence properties of discrete time chains.

Consistent semiparametric Bayesian inference about a location parameter

Some inequalities for sum and product of positive semide nite matrices

A converse Gaussian Poincare-type inequality for convex functions

Vector Space Basics. 1 Abstract Vector Spaces. 1. (commutativity of vector addition) u + v = v + u. 2. (associativity of vector addition)

1 INFO Sep 05

1 Introduction This work follows a paper by P. Shields [1] concerned with a problem of a relation between the entropy rate of a nite-valued stationary

Implicational classes ofde Morgan Boolean algebras

Contents. 2.1 Vectors in R n. Linear Algebra (part 2) : Vector Spaces (by Evan Dummit, 2017, v. 2.50) 2 Vector Spaces

LOGARITHMIC MULTIFRACTAL SPECTRUM OF STABLE. Department of Mathematics, National Taiwan University. Taipei, TAIWAN. and. S.

Comments on integral variants of ISS 1

Monte Carlo Methods for Statistical Inference: Variance Reduction Techniques

A fourth-order nite-dierence method based on non-uniform mesh for a class of singular two-point boundary value problems

The Erwin Schrodinger International Pasteurgasse 6/7. Institute for Mathematical Physics A-1090 Wien, Austria

MARKOV CHAINS: STATIONARY DISTRIBUTIONS AND FUNCTIONS ON STATE SPACES. Contents

Homework 1 Solutions ECEn 670, Fall 2013

Lecture 1: Overview of percolation and foundational results from probability theory 30th July, 2nd August and 6th August 2007

WARDROP EQUILIBRIA IN AN INFINITE NETWORK

Expectations and moments

Extremal Cases of the Ahlswede-Cai Inequality. A. J. Radclie and Zs. Szaniszlo. University of Nebraska-Lincoln. Department of Mathematics

16 Chapter 3. Separation Properties, Principal Pivot Transforms, Classes... for all j 2 J is said to be a subcomplementary vector of variables for (3.

Measure-theoretic probability

POSITIVE AND NEGATIVE CORRELATIONS FOR CONDITIONAL ISING DISTRIBUTIONS

Probability Review. Yutian Li. January 18, Stanford University. Yutian Li (Stanford University) Probability Review January 18, / 27

Minimum and maximum values *

3.1 Basic properties of real numbers - continuation Inmum and supremum of a set of real numbers

Total positivity in Markov structures

On reaching head-to-tail ratios for balanced and unbalanced coins

IBM Almaden Research Center, 650 Harry Road, School of Mathematical Sciences, Tel Aviv University, TelAviv, Israel

Basic Probability. Introduction

Reimer s Inequality on a Finite Distributive Lattice

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 3 9/10/2008 CONDITIONING AND INDEPENDENCE

LECTURE 1. 1 Introduction. 1.1 Sample spaces and events

2 Interval-valued Probability Measures

A primer on basic probability and Markov chains

THE LARGEST INTERSECTION LATTICE OF A CHRISTOS A. ATHANASIADIS. Abstract. We prove a conjecture of Bayer and Brandt [J. Alg. Combin.

Measure and integration

From Fractional Brownian Motion to Multifractional Brownian Motion

Part II Probability and Measure

Analysis on Graphs. Alexander Grigoryan Lecture Notes. University of Bielefeld, WS 2011/12

Picking Winners in Rounds of Elimination 1

A shrinking lemma for random forbidding context languages

Entrance Exam, Real Analysis September 1, 2017 Solve exactly 6 out of the 8 problems


lecture notes October 22, Probability Theory

5 Birkhoff s Ergodic Theorem

Stochastic Dynamic Programming. Jesus Fernandez-Villaverde University of Pennsylvania

Richard DiSalvo. Dr. Elmer. Mathematical Foundations of Economics. Fall/Spring,

Note Watson Crick D0L systems with regular triggers

LEBESGUE INTEGRATION. Introduction

Course 212: Academic Year Section 1: Metric Spaces

On the number of generators for transeunt triangles

A lower bound on the two arms exponent for critical percolation on the lattice

Stanford Mathematics Department Math 205A Lecture Supplement #4 Borel Regular & Radon Measures

Lecture 2: Review of Prerequisites. Table of contents

Onthe reconstructionof perfect codes

G-frames in Hilbert Modules Over Pro-C*-algebras

Notes on Iterated Expectations Stephen Morris February 2002

The best expert versus the smartest algorithm

Course Notes. Part IV. Probabilistic Combinatorics. Algorithms

Linear Algebra, 4th day, Thursday 7/1/04 REU Info:

Fundamental gaps in numerical semigroups

Algorithms for Uncertainty Quantification

Advanced Probability

Optimal blocking of two-level fractional factorial designs

Probability Theory. Richard F. Bass

The average dimension of the hull of cyclic codes

Alternative theorems for nonlinear projection equations and applications to generalized complementarity problems

Random Process Lecture 1. Fundamentals of Probability

Linear Regression and Its Applications

Quick Tour of Basic Probability Theory and Linear Algebra

4.1 Eigenvalues, Eigenvectors, and The Characteristic Polynomial

When does a planar bipartite framework admit a continuous deformation?

A Refresher in Probability Calculus

If Y and Y 0 satisfy (1-2), then Y = Y 0 a.s.

The random variable 1

Classication of greedy subset-sum-distinct-sequences

A Finite Element Method for an Ill-Posed Problem. Martin-Luther-Universitat, Fachbereich Mathematik/Informatik,Postfach 8, D Halle, Abstract

Probability and Measure

FE 490 Engineering Probability and Statistics. Donald E.K. Martin Department of Statistics

Percolation and Random walks on graphs

Chapter 1: Probability Theory Lecture 1: Measure space and measurable function

Lecture Notes 1 Probability and Random Variables. Conditional Probability and Independence. Functions of a Random Variable

On a kind of restricted edge connectivity of graphs

Recitation 2: Probability

Chapter 2. Probability

Dynkin (λ-) and π-systems; monotone classes of sets, and of functions with some examples of application (mainly of a probabilistic flavor)

Finite-dimensional spaces. C n is the space of n-tuples x = (x 1,..., x n ) of complex numbers. It is a Hilbert space with the inner product

On the exact computation of the density and of the quantiles of linear combinations of t and F random variables

Transcription:

Statistics & Probability Letters 46 (2000) 203 209 An FKG equality with applications to random environments Wei-Shih Yang a, David Klein b; a Department of Mathematics, Temple University, Philadelphia, PA 19122, USA b Department of Mathematics, California State University, 18111 Nordho Street, Northridge, CA 91330-8313, USA Received October 1998; received in revised form March 1999 Abstract We consider a sequence of independent random variables, X 1;X 2;X 3;:::; taking values in {1; 2;:::;m}. We introduce a -algebra of nonpivotal events and prove the following 0 1 Law: P(A) = 0 or 1 if and only if A is nonpivotal. All tail events are nonpivotal. The proof is based on an FKG equality which provides exact error terms to the FKG inequality. We give some applications for independent random variables in a random environment in the sense that the probabilities of particular outcomes are random. c 2000 Elsevier Science B.V. All rights reserved MSC: 60F20; 60G50 Keywords: Zero one laws; Correlations; Random environment; FKG inequality The FKG inequality has resulted in great advances in probabilty theory, especially in statistical mechanics, percolation theory, and reliability theory. With appropriate restrictions on the probability measure P, the FKG inequality says that the correlation between increasing events, A and B, is always nonnegative, i.e., P(AB) P(A)P(B) 0: This inequality was rst established by Harris (1960) in the case of a product measure P and subsequently generalized by Fortuin et al. (1971), and many others. Considering the importance of the FKG inequality, it is natural to investigate the existence of a more detailed relationship between P(AB) and P(A)P(B) of the form P(AB) = P(A)P(B) + error terms; where the error terms are nonnegative in the case of increasing events A and B, and arise in a natural way related to the structure of the underlying probability space in general. In this paper we carry out this project for the case of a product measure P associated with nitely-valued random variables. We give an expansion Corresponding author. E-mail addresses: yang@euclid.math.temple.edu (W.-S. Yang), david.klein@csun.edu (D. Klein) 0167-7152/00/$ - see front matter c 2000 Elsevier Science B.V. All rights reserved PII: S0167-7152(99)00107-8

204 W.-S. Yang, D. Klein / Statistics & Probability Letters 46 (2000) 203 209 formula for P(AB) P(A)P(B) in terms of the probabilities of pivotal events determined by A and B. Precise denitions are given below. Our expansion formula can be viewed as an FKG equality. The idea for pivotal events rst appeared in Russo (1981) and it plays a prominent role in percolation theory. Our expansion connects this idea of pivotal event with the FKG inequality. Our second theorem provides an application of our FKG expansion formula to obtain a zero one law for product measures. It says that an event A is nonpivotal if and only if P(A) = 0 or 1. We then apply this new zero one law to get results for random variables in random environments. An application to Bayesian statistics of our nal corollary is also discussed in the context of a simple model for human sex ratios. Let S = {1; 2;:::;m} and = {1; 2; 3;:::}. Let X 1 ;X 2 ;X 3 ;::: be independent random variables with P(X i = )=p i () 0 so that m =1 p i() = 1 for all i. Let = S. For x =(x 1 ;x 2 ;:::) ; b and S, dene x b by { xi for i b; xi b = for i = b: Let F denote the -algebra of subsets of generated by {X i : i }, where X i (x)=x i. For A F, dene A b (; )={x :x b A and x b A}: We describe A b (; ) as the event that b is pivotal for A. Note that A b (; ) does not depend on x b, i.e., A b (; ) (X i : i b), the -algebra generated by all X i except for i = b. The indicator function for A b (; ) satises 1 Ab (; )(x)=1 A (x b )1 A c(x b ); (1) where A c denotes the complement of A. The notion of pivotal events when m=2 has appeared in percolation theory, as for example in Russo (1981) and Yang and Zhang (1992). Consider a duplicate system. Let 2 = and P 2 = P P, where P is the probability measure on generated by the random variables {X i }. Dene a projection mapping k : for each k =1;2;3;::: by k (x; y)=(y 1 ;y 2 ;:::y k 1 ;x k ;x k+1 ;:::): Let k(a)= 1 k (A)={(x; y) : (y 1 ;y 2 ;:::;y k 1 ;x k ;x k+1 ;:::) A}. Lemma 1. Let A; B F. Then lim n P 2 ( 1 (A) n (B)) = P(A)P(B). Proof. For any 0, there exists k and B k (X 1 ;:::;X k ) such that P(BB k ). This implies P(B) P(B k ). Then P 2 ( 1 (A) n (B)) P(A)P(B) 6 P 2 ( 1 (A) n (B)) P 2 ( 1 (A) n (B k )) + P 2 ( 1 (A) n (B k )) P(A)P(B k ) + P(A)P(B k ) P(A)P(B) : (2) The second term on the right-hand side of (2) equals zero if n k +1. The third term is bounded by P(A)6. The rst term equals E 2 (1 1(A)1 n(b)) E 2 (1 1(A)1 n(b k )) 6E 2 1 n(b) 1 n(b k ) =P(BB k ) because P 2 ( n (C)) =P(C) for any C F.

Theorem 1. For any A; B F; (a) P 2 ( 1 (A) k (B)) = P 2 ( 1 (A) r (B)) W.-S. Yang, D. Klein / Statistics & Probability Letters 46 (2000) 203 209 205 r 1 + p b ()p b ()[P 2 ( 1 (A b (; )) b (B b (; ))) P 2 ( 1 (A b (; )) b (B b (; )))] for all r k: (b) P 2 ( 1 (A) k (B)) = P(A)P(B)+ b=k P 2 ( 1 (A b (; )) b (B b (; )))]: p b ()p b ()[P 2 ( 1 (A b (; )) b (B b (; ))) (c) In particular, P(AB)=P(A)P(B) + b=1 p b ()p b ()[P 2 ( 1 (A b (; )) b (B b (; ))) P 2 ( 1 (A b (; )) b (B b (; )))]: Remark 1. If A or B is in the -algebra, -(X 1 ;X 2 ;:::;X n ) for some n, then the sums in parts (b) and (c) of Theorem 1 involve only nitely many terms; the upper limit may be replaced by n. Proof. Part (b) follows from Lemma 1 and part a by letting r. Part (c) follows from part (b) by choosing k = 1. To prove part (a), write r 1 P 2 ( 1 (A) k (B)) P 2 ( 1 (A) r (B)) = [P 2 ( 1 (A) b (B)) P 2 ( 1 (A) b+1 (B))] = 1 r 1 [E 2 (1 A 1 B )+E 2 (1 A 1 B ) E 2 (1 A 1 B ) E 2 (1 A 1 B )]; (3) 2 where A = 1 (A); B = b (B);A = {(x; y): (x 1 ;x 2 ;:::;x b 1 ;y b ;x b+1 ;:::) A}, and B = b+1 (B) ={(x; y): (y 1 ;y 2 ;:::;y b ;x b+1 ;x b+2 ;:::) B}. The last equality follows from the symmetry of x b and y b. Thus, E 2 (1 A 1 B )=E 2 (1 A 1 B )=P 2 ( 1 (A) b (B)) and E 2 (1 A 1 B )=E 2 (1 A 1 B )=P 2 ( 1 (A) b+1 (B)): The right-hand side of (3) equals 1 r 1 E 2 [(1 A 1 A )(1 B 1 B )]: (4) 2 To evaluate (4), we consider the following four disjoint sets: C ij = {1 A 1 A = i and 1 B 1 B = j};

206 W.-S. Yang, D. Klein / Statistics & Probability Letters 46 (2000) 203 209 where i and j = ±1. Then (4) may be rewritten as 1 r 1 [P 2 (C 11 )+P 2 (C 1 1 ) P 2 (C 1 1 ) P 2 (C 11 )]: (5) 2 Let Then, C ij C ij = = C ij {(x; y): x b = ; y b = }: C ij ; is a disjoint union. Observe that (x; y) C 11 if and only if [x A b (; ) and x b = ; y b = ] and [(y 1 ;y 2 ;:::;y b 1 ;x b ;x b+1 ;:::) B b (; ) and x b = ; y b = ]. In other words, C 11 = 1(A b (; )) b (B b (; )) {x b =; y b = }: Thus, P 2 (C 11 )= p b ()p b ()P 2 ( 1 (A b (; )) b (B b (; ))); where we have used the fact, for example, that 1(A b (; )) b (B b (; )) does not depend on x b and y b. The same argument shows that P 2 (C 1 1 )=p b()p b ()P 2 ( 1 (A b (; )) b (B b (; ))): Summing on and yields P 2 (C 1 1 )=P 2 (C 11 ). A similar argument shows that P 2 (C 11 )=P 2 (C 1 1 )= p b ()p b ()P 2 ( 1 (A b (; )) b (B b (; ))): Substituting these last expressions into (5) yields the desired result. Remark 2. Statement (c) is what we refer to as an expansion of correlations. Note that if A is an increasing event, then A b (; ) =H when Therefore (c) immediately implies the following FKG inequality for nitely-valued, independent random variables. Corollary 1. Let A; B F. If A and B are increasing events; then P(AB) P(A)P(B). Statement (c) of Theorem 1 also gives a characterization for A and B to be independent, viz., A and B F are independent if and only if p b ()p b ()[P 2 ( 1 (A b (; )) b (B b (; ))) P 2 ( 1 (A b (; )) b (B b (; )))] = 0: b=1 Denition. An event A F is said to be nonpivotal if P(A b (; )) = 0 for all b and all ; S. Let U be the class of all nonpivotal events. Lemma 2. U is a -algebra and the tail eld T U.

W.-S. Yang, D. Klein / Statistics & Probability Letters 46 (2000) 203 209 207 Proof. Since b and H b, are both empty, and H are elements of U. Let b be xed. We write x for x b and x for x b.by(1)a Uif and only if for P-a.e. x, 1 Ab (; )(x)=1 A (x )1 A c(x )=0: Now suppose A; B U. Then 1 (A B)b (; )(x)=1 A B (x )1 (A B) c(x ) (6) =[1 A (x )+1 B (x ) 1 A (x )1 B (x )]1 A c(x )1 B c(x )=0 P-a.s., by (6). Therefore A; B U implies A B U. Since (A c ) b (; )=(A) b (; ), we have A c U if and only if A U. Therefore U is an algebra. It now suces to show that U is a monotone class. Let A n U; A n A n+1. Then 1 (x )1 ( A n) b (; ) (x)=1 (x ) A n A c n n=1 = lim [1 (x )1 N N N (x )] A n A c n = lim 1 (x)=0; N ( N A n) b (; ) P-a.e., where we have used the result that U is an algebra. By the Monotone Class Theorem, U is a -algebra. Let A T. Then A does not depend on x b for any b. Therefore A b (; ) =I for all ;. Hence T U. Theorem 2. A U if and only if P(A)=0 or 1. Proof. Assume that A U. Then P 2 ( 1 (A b (; )) b (B b (; )))6P 2 ( 1 (A b (; )))=P(A b (; ))=0: By Theorem 1(c), with B = A; P(A)=P(A) 2. Conversely, assume P(A) = 0 or 1. If P(A) = 1, then P(A c ) = 0, and if A c U, then A U by Lemma 2. It therefore suces to consider only the case P(A) = 0. By denition, A b (; ) {x b =} A. Therefore, P(A b (; )) = P(A b (; ) {x b =})+P(A b (; ) {x b }) =P(A b (; ) {x b })=P(A b (; ))P({x b }): The last equality follows because A b (; ) does not depend on x b. Since P({x b = }) 0, P(A b (; ))=0: Thus, A U. We next consider a sequence of independent random variables X 1 ;X 2 ;X 3 ;:::, where {P(X i = )} = {p i ()} is also a sequence of independent random variables. Let i be a probability measure on the portion Q of the hyperplane in R m determined by m =1 p i() = 1 with each p i () 0. Dene I = Q and let (dp)= i (dp i) where p i =(p i (1);:::;p i (m)). For each xed p =(p i : i ) I, let P p be the product measure on for the independent random variables X 1 ;X 2 ;X 3 ;::: satisfying {P p (X i = )} = p i (). Let = I and let P be the probability measure on dened by P(d(x; p)) = (dp) P p (dx). Note that under P; {X i } is a sequence of independent random variables with P(X i = )= p i ()d i, and Theorem 2 applies to any event A with A (X i ;i ).

208 W.-S. Yang, D. Klein / Statistics & Probability Letters 46 (2000) 203 209 Corollary 2. Let A and A (X i ;i ). If for almost every p; P p (A)=0 or 1; then either P p (A)=0 for almost every p; or P p (A)=1 for almost every p. Proof. If for almost every p; P p (A) = 0 or 1, then for all b; P p (A b (; )) = 0, by Theorem 2. For any set B (X i ;i ), dene B = {(x; p): x B}. Let ( ) p = p 1 d 1 ; p 2 d 2 ; p 3 d 3 ;::: : Then P p (A b (; )) = P( A b (; )) = d(p)p p (A b (; ))=0: From Theorem 2, it follows that P p (A)= P( A) = 0 or 1. If P( A)= d(p)p p (A) = 0, then P p (A)=0 for almost every p. Similarly, if P( A) = 1, then P p (A)=1 for almost every p. Consider a function f which is -(X 1 ;X 2 ;:::) measurable. Let Var p (f)=e p (f 2 ) (E p (f)) 2 where E p denotes expectation with respect to P p. It is easily shown that Var(f)=E(Var p (f)) + Var(E p (f)). Therefore, 06E(Var p (f))6var(f). Corollary 3. With the notation above; E(Var p (f))=0 if and only if Var(f)=0. Proof. If E(Var p (f)) = d(p)var p (f) = 0, then Var p (f) =0 for almost every p. This implies that f(x) =c(p), a constant for almost every p. Therefore, P p {x: f(x) }= 0 or 1 for P p almost every x and all real. As in the proof of Corollary 2, it follows that P p {x: f(x) }= 0 or 1. Thus, f(x) equals a constant c = E p (f(x)) almost surely, P p. Since d(p)e p ( f(x) c 2 )=0; Var(f) = 0. The other direction is clear. Corollary 4. Let S n = X 1 + +X n. If for almost every p; lim n S n =n = c(p), a constant P p -a.e.; then lim n S n =n is constant P almost everywhere. Proof. Let f(x) = lim n S n =n. By assumption, for almost every p; f(x)=c(p), a constant depending on p, for P p almost every x. By Corollary 3, f(x) is constant P almost everywhere. Remark 3. It is possible to prove Corollary 4 using conditioning arguments and Kolmogorov s criterion for the convergence of averaged sums of independent random variables with appropriate restrictions on the variances. We give an application of Corollary 4 relating to human sex ratios at birth, and Bayesian statistics. It is fairly well established that the ratio of males to all infants at birth is approximately 0.51. This ratio appears to be stable with respect to locality, ethnicity, and time for suciently large populations (US Census Bureau, 1998; US National Center for Health Statistics, 1998). In spite of the global stability of this ratio, there is evidence that the probability p for a male infant varies among individual couples. Evidence exists (James, 1990) that p may range as widely as 0.31 0.83. How might this phenomenon be explained? Let us assume a population of n births and let X i be a random variable which takes the value 1 if the ith birth is male and 0 if it is female. We will assume that the collection {X i } is independent. As above, let {P(X i =1)}={p i } also be a collection of random variables. Assume the p i s are equal to the common random variable P, whose prior distribution we denote by (p).

W.-S. Yang, D. Klein / Statistics & Probability Letters 46 (2000) 203 209 209 Conditioning on the event {P = p}, for a xed p, and letting n go to innity gives almost surely, X 1 + +X n p n and, in general, without conditioning, X 1 + +X n P: n But assuming the validity of the demographic and medical references cited above, we are forced to conclude that this approach does not provide a good model for sex ratios, in the sense that large sample means do not approximate the limiting ratio (unless P were constant, which is not supported by the evidence). A better model results from the assumption that p 1 ;p 2 ;p 3 ;::: are i.i.d. random variables with distribution (p). In this case, intuitively, the sample mean of the collection {X i } should tend to ˆp, the mean of (p), for almost every given p 1 ;p 2 ;p 3 ;:::. By Corollary 4 this indeed follows; the sample means approximate for almost all p i s the limiting global constant ratio of males to all infants. Thus our Corollary 4 provides an explanation for the individual variability of the probabilities for gender as well as the global stability of the sex ratio. Acknowledgements The authors would like to thank Professor J.T. Gene Hwang for illuminating discussions on the applications of our Corollary 4. W.S. Yang was partially supported during the course of this research by the National Center for Theoretical Sciences, Taiwan, ROC, and by Temple University s Study Leave Program. References Fortuin, C.M., Kasteleyn, P.W., Ginibre, J., 1971. Correlation inequalities on some partially ordered sets. Commun. Math. Phys. 22, 89 103. Harris, T.E., 1960. A lower bound for the critical probability in a certain percolation process. Proceedings of the Cambridge Philosophical Society, vol. 56, pp. 12 20. James, W.H., 1990. Variations of the human sex ratio at birth. Fertil Steril. 54 (5), 956 957. Russo, L., 1981. On the critical percolation probabilities. Probab. Theory Related Fields 56, 229 237. U.S. Census Bureau, 1998. World wide statistics for 0 to 4 year olds. http:==www.census.gov=ftp=pub=ipc=www=idbsum.html. U.S. National Center for Health Statistics, 1998. Private communication. ( Per your request, NCHS reports (for the United States) that in 1995, there were 1,996,355 male live births compared with 1,903,234 female live births. These numbers yielded a sex ratio of 1049 male per 1000 female live birth, similar to the sex ratio reported in 1994 and for the last 50 years.) Yang, W.S., Zhang, Y., 1992. A note on dierentiability of the cluster density for independent percolation in high dimensions. J. Statist. Phys. 66, 1123 1138.