Bayesian Estimation of Small Proportions Using Binomial Group Test


Florida International University
FIU Digital Commons
FIU Electronic Theses and Dissertations — University Graduate School

Bayesian Estimation of Small Proportions Using Binomial Group Test
Shihua Luo, Florida International University
DOI: /etd.FI

Recommended Citation: Luo, Shihua, "Bayesian Estimation of Small Proportions Using Binomial Group Test" (2012). FIU Electronic Theses and Dissertations.

This work is brought to you for free and open access by the University Graduate School at FIU Digital Commons. It has been accepted for inclusion in FIU Electronic Theses and Dissertations by an authorized administrator of FIU Digital Commons.

FLORIDA INTERNATIONAL UNIVERSITY
Miami, Florida

BAYESIAN ESTIMATION OF SMALL PROPORTIONS USING BINOMIAL GROUP TEST

A thesis submitted in partial fulfillment of the requirements for the degree of MASTER OF SCIENCE in STATISTICS by Shihua Luo, 2012

To: Dean Kenneth Furton, College of Arts and Sciences

This thesis, written by Shihua Luo, and entitled Bayesian Estimation of Small Proportions Using Binomial Group Test, having been approved in respect to style and intellectual content, is referred to you for judgement. We have read this thesis and recommend that it be approved.

Florence George
Kai Huang, Co-Major Professor
Jie Mi, Co-Major Professor

Date of Defense: November 9, 2012

The thesis of Shihua Luo is approved.

Dean Kenneth Furton, College of Arts and Sciences
Dean Lakshmi N. Reddi, University Graduate School

Florida International University, 2012

ACKNOWLEDGMENTS

First of all, I would like to express my sincere thanks to my major professor, Dr. Jie Mi, for his patient guidance, enthusiasm, encouragement and friendship throughout this whole study. I couldn't have finished my research without his great support. I would like to express my deepest appreciation to my co-major professor, Dr. Kai Huang, for his technical support and patient guidance in LaTeX and MATLAB. I would also like to thank the member of my committee, Dr. Florence George, for her time, valuable advice and great encouragement. In addition, I would like to thank the Department of Mathematics and Statistics and all the professors who supported and encouraged me throughout my life at FIU. I wish to thank aunt Miao, Dr. Jie Mi's wife, who gave me substantial help in my life. I also thank my classmates and friends; I am so happy to have them around me. Finally, I wish to thank my family for their strong support and their absolute confidence in me.

ABSTRACT OF THE THESIS

BAYESIAN ESTIMATION OF SMALL PROPORTIONS USING BINOMIAL GROUP TEST

by Shihua Luo
Florida International University, 2012, Miami, Florida
Professor Jie Mi, Co-Major Professor
Professor Kai Huang, Co-Major Professor

Group testing has long been considered a safe and sensible alternative to one-at-a-time testing in applications where the prevalence rate p is small. In this thesis, we applied the Bayes approach to estimate p using Beta-type prior distributions. First, we derived two Bayes estimators of p from a prior on p under two different loss functions. Second, we presented two more Bayes estimators of p from a prior on π, again under two loss functions. We also presented credible and HPD intervals for p. In addition, we carried out intensive numerical studies. All results showed that the Bayes estimator is preferred over the usual maximum likelihood estimator (MLE) for small p. We also presented the optimal β for different p, m, and k.

Keywords: Group Test, Bayes Estimator, Beta Distribution, MLE, MSE.

TABLE OF CONTENTS

1 Introduction
2 Bayes Inferences from Prior on p
  2.1 Bayes Point Estimators of p
  2.2 Approximation to Bayes Estimators of p
  2.3 Comparison in the Special Case of Y = 0
  2.4 Credible and HPD Interval for p
3 Bayes Estimators of p from Prior on π
4 Numerical Studies
5 Conclusion
REFERENCES
APPENDIX

LIST OF TABLES

1 Optimal β for k = 5, α = 1, N = 1
2 Optimal β for k = 10, α = 1, N = 1
3 Optimal β for k = 15, α = 1, N = 1
4 MSE for p = 0.005, k = 5, m = 5, α = 1, N = 1
5 MSE for p = 0.01, k = 5, m = 5, α = 1, N = 1
6 MSE for p = 0.005, k = 5, m = 15, α = 1, N = 1
7 MSE for p = 0.01, k = 5, m = 15, α = 1, N = 1
8 MSE for p = 0.005, 0.006, 0.007, 0.008, k = 5, α = 1, N = 1
9 MSE for p = 0.009, 0.01, 0.02, 0.03, k = 5, α = 1, N = 1
10 MSE for p = 0.04, 0.05, 0.06, 0.07, k = 5, α = 1, N = 1
11 MSE for p = 0.08, 0.09, 0.1, k = 5, α = 1, N = 1
12 MSE for p = 0.005, 0.006, 0.007, 0.008, k = 10, α = 1, N = 1
13 MSE for p = 0.009, 0.01, 0.02, 0.03, k = 10, α = 1, N = 1
14 MSE for p = 0.04, 0.05, 0.06, 0.07, k = 10, α = 1, N = 1
15 MSE for p = 0.08, 0.09, 0.1, k = 10, α = 1, N = 1
16 MSE for p = 0.005, 0.006, 0.007, 0.008, k = 15, α = 1, N = 1
17 MSE for p = 0.009, 0.01, 0.02, 0.03, k = 15, α = 1, N = 1
18 MSE for p = 0.04, 0.05, 0.06, 0.07, k = 15, α = 1, N = 1
19 MSE for p = 0.08, 0.09, 0.1, k = 15, α = 1, N = 1

LIST OF FIGURES

1 Optimal β for k = 5, m =
2 Optimal β for k = 5, m =
3 Optimal β for k = 5, m = 10 or
4 Optimal β for k = 10, m =
5 Optimal β for k = 5 or 10, m =

1 Introduction

Almost daily, the news media report the results of some poll. Pollsters often give some data, for example, the percentage of people in favor of the current president, the rate of people who smoke cigarettes, the death rate of a certain disease, the percentage of people who have hepatitis C virus or a rare disease, the rate of contaminated fish in a river, the proportion of defective products, et cetera. All of these issues have a common property: the ratio of a part to a whole. How can we estimate this proportion p?

In estimating the proportion of subjects that have a certain characteristic in a population in which each individual can be classified into two categories, say with or without the characteristic, one logical method of estimating p for the population is to use the proportion of successes, meaning those with that characteristic, in the sample. For example, how would we estimate the true fraction of all U.S. citizens who trust the president? We can estimate p by calculating p̂ = x/n, where x is the number of people in the sample who trust the president and n is the number of people sampled. Experimenters have used this method, which yields the maximum likelihood estimator (MLE), for many years. But this method has some disadvantages when p is very small. For instance, if researchers want to estimate the prevalence of human immunodeficiency virus (HIV), the ratio p̂ is extremely small, so the MLE is not stable, and every sample unit must be tested to obtain the estimate of p. In addition, the examination of the individual members of a large sample is expensive and tedious work. To overcome these shortcomings, Dorfman (1943) first proposed the method of group testing and applied it to testing

blood samples for the syphilis antigen.

The idea of group testing is as follows. Assume that a random sample of size n is drawn from a target population in which some units have a certain characteristic that we are interested in; we then divide this sample into m groups with k units in each group. It is assumed that if at least one unit in a group has that characteristic (is positive), then this group will test positive; on the other hand, if none of the units in a group is positive, then the group will test negative. This key assumption is valid in many biological, environmental and medical studies. Let p denote the probability of a unit being positive in a certain population; then the probability of a group being positive, denoted as π, can be expressed in terms of p as

π = 1 − (1 − p)^k.

Instead of directly estimating p by examining each unit in a sample, group testing estimates π and then p. Instead of conducting the N chemical analyses required to check all members of a population of size N, Dorfman (1943) applied group testing for checking the syphilis antigen. Suppose that after the blood serum is drawn from each individual, all the blood samples are pooled in groups of five, and the groups rather than the individual sera are subjected to chemical analysis. If none of the five sera contains syphilis antigen, the result for this group will be negative. If the result of a group is positive, it means that this group includes at least one individual with syphilis antigen; then the individuals making up the pool must be retested to decide which of the members are infected. He also discussed related costs for selected prevalence rates and optimum group sizes. He suggested that the prevalence rate be sufficiently

small to make worthwhile economies possible. Afterwards, many scientists used group testing in their fields of study. Gibbs and Gower (1960) applied this method to estimate the frequency of success of attempts to transmit a virus disease from one plant to another. Chiang and Reeves (1962) used this approach to estimate the infection rates of mosquitoes. Bhattacharyya et al. (1979) employed group testing to give point estimates and confidence intervals for infection rates. Worlund and Taylor (1983) applied the procedure to evaluate the disease incidence in a large population of fish. Swallow (1985) used this method to estimate infection rates and probabilities of pathogen transmission by a single vector. To evaluate human immunodeficiency virus (HIV) seroprevalence in population surveys, pooled sera were used by Kline et al. (1989). Rodoni et al. (1994) applied a sequential batch testing procedure combined with Enzyme-Linked Immunosorbent Assay (ELISA) to estimate levels of virus incidence in Victorian cut-flower Sim carnations. Hepworth (1996) constructed confidence intervals for the proportion of infected units in a population involving unequal-sized groups. Gastwirth (2000) proposed that mutations in individual patients could be tested more fruitfully by being checked in pools. Xie et al. (2001) used group testing to develop models and estimation procedures in order to obtain quantitative information from data in the process of discovery and development of a new drug. Katholi and Unnasch (2006) discussed suitable sampling protocols over important experimental parameters for estimating infection rates. Because of its wide applications, group testing is also known as pooled testing, or composite sampling, in different fields. In recent years, more and more scholars have recognized the advantage of the group testing

experimental design. It is well known that this method is economical, time saving, and efficient, particularly when it is applied to estimating the small proportion of a disease or some other rare attribute.

The traditional way to estimate π or p is the method of maximum likelihood (MLE). The definition of the MLE is as follows. If x_1, x_2, ..., x_n are independent and identically distributed observations from a population with pdf or pmf f(x; θ), where x = (x_1, x_2, ..., x_n) is the fixed sample point and θ is the unknown parameter of interest, the likelihood function is defined as the product of all the pdfs or pmfs f(x_i; θ). For the sample point x, let θ̂(x) be a parameter value at which the likelihood function attains its maximum as a function of θ, with x held fixed. Then θ̂(x) is the MLE of θ.

An alternative way to estimate π or p is to apply the Bayes method. The Bayesian approach is totally different from the approach used in MLE analysis. In classical methods the parameter to be estimated is thought of as an unknown but fixed quantity. The Bayesian approach treats the unknown parameter as a random variable. In many problems the investigator has some prior information about the unknown parameter. For instance, in sampling inspection, the quality engineer has some idea about the true defective rate, the unknown parameter, of a production process from past experience. The Bayesian approach assumes that this prior knowledge can be summarized in the form of a probability distribution on the unknown parameter, called the prior distribution. This is a subjective distribution, based on the experimenter's belief, and is formulated before the data are seen. A sample is then taken

from a population indexed by the unknown parameter θ, and the prior distribution is updated with this sample information. The updated prior distribution is named the posterior distribution. The posterior distribution is then used to make inferences about θ. To use the Bayes method, we also need a loss function. The most common loss function is the square loss function; if we use it, a natural estimator for θ is the mean of the posterior distribution. If necessary, of course, we can use an alternative loss function, but the square loss function is the most popular one.

Usually, the posterior distribution becomes very complex with each added measurement. However, if we choose a conjugate prior, then the posterior distribution is easier to obtain. A conjugate prior is defined as a prior distribution belonging to some parametric family for which the resulting posterior distribution also belongs to the same family. This is an important property, and such prior distributions have been identified for some standard distributions f(x | θ). When we want a Bayes estimator for the proportion θ = p, we assume the prior distribution on the parameter θ is Beta(α, β). The posterior distribution is then also a Beta distribution, which makes the estimation of the parameter p much more convenient.

Some researchers prefer Bayesian approaches to classical ones because the Bayesian method gives a better point estimator of p, especially when p is very small. Gastwirth et al. (1991) suggested classical Bayesian approaches in group testing. Chaubey and Li (1995) researched the difference between the MLE and the Bayes method for estimation of a binomial probability with group testing. They observed that the Bayes methodology

gave alternative choices to the experimenter, with possible reductions in cost as well as error. Chick (1996) also used this method in his studies.

In the current study, we want to study further the estimation of p by Bayes methods. We will confine our inference about p to the range (0, 0.1). We will try two Bayesian estimators corresponding to two different loss functions, under the assumption that p has a Beta prior distribution. In addition, we will try to obtain credible and HPD intervals for p. Assuming a Beta prior on the probability π of a group being positive, we will investigate two more Bayes estimators corresponding to the two loss functions. The criterion for comparing the accuracy of estimators is the MSE (mean squared error). The performance of the proposed estimators will be studied based on intensive Monte Carlo simulation. The proposed Bayes estimators will also be compared with other estimators existing in the literature.
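The group-testing design and the MLE p̂ = 1 − (1 − Y/m)^(1/k) implied by π = 1 − (1 − p)^k can be sketched as follows. This is a minimal Python illustration (the thesis's own simulations are in MATLAB, and the function names here are ours):

```python
import random

def simulate_group_test(p, m, k, rng):
    """Draw m*k Bernoulli(p) subjects, pool them into m groups of size k,
    and return Y, the number of groups containing at least one positive."""
    y = 0
    for _ in range(m):
        if any(rng.random() < p for _ in range(k)):
            y += 1
    return y

def mle_from_groups(y, m, k):
    """Invert pi = 1 - (1 - p)^k at the group-level MLE pi_hat = y/m."""
    return 1.0 - (1.0 - y / m) ** (1.0 / k)

rng = random.Random(0)
y = simulate_group_test(0.01, 200, 5, rng)
print(y, mle_from_groups(y, 200, 5))
```

With p = 0.01 and k = 5, a group is positive with probability π = 1 − 0.99^5 ≈ 0.049, so roughly ten of the 200 groups should test positive.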

2 Bayes Inferences from Prior on p

In this section, we will consider Bayes inferences about p. It is assumed that p has the Beta prior distribution Beta(α, β) with density function

f_P(p) = (1/B(α, β)) p^(α−1) (1−p)^(β−1),  0 < p < 1,  α > 0, β > 0.

2.1 Bayes Point Estimators of p

Suppose that a random sample of size n = mk is obtained from a population of screened subjects, where m, k are positive integers. We assume that p is the probability that a randomly selected individual tests positive. In this research our concern is inference about small p, say 0 < p ≤ 0.1. The n subjects are classified into m groups, each of which includes k subjects. Throughout this research it is assumed that any one of the m groups tests positive if and only if it includes at least one positive subject. Let Y be the number of positive groups. It is known that Y ∼ B(m, π), where π = 1 − (1−p)^k, i.e.,

P(Y = y) = C(m, y) π^y (1−π)^(m−y).

Clearly, the joint distribution of (Y, p) is

f_{Y,P}(y, p) = C(m, y) (1/B(α, β)) p^(α−1) (1−p)^(β−1) π^y (1−π)^(m−y)
            = [C(m, y)/B(α, β)] p^(α−1) (1−p)^(β−1) (1−(1−p)^k)^y (1−p)^(k(m−y)).   (2.1)

Thus the marginal distribution of Y is given as

f_Y(y) = ∫_0^1 f_{Y,P}(y, p) dp

       = [C(m, y)/B(α, β)] ∫_0^1 p^(α−1) (1−p)^(k(m−y)+β−1) (1−(1−p)^k)^y dp
       = [C(m, y)/B(α, β)] Σ_{j=0}^{y} (−1)^j C(y, j) ∫_0^1 p^(α−1) (1−p)^(k(m+j−y)+β−1) dp
       = [C(m, y)/B(α, β)] Σ_{j=0}^{y} (−1)^j C(y, j) B(α, k(m+j−y)+β).   (2.2)

Therefore, the posterior distribution of p is

f_{P|Y}(p|y) = f_{Y,P}(y, p) / f_Y(y)
            = p^(α−1) (1−p)^(k(m−y)+β−1) (1−(1−p)^k)^y / [Σ_{j=0}^{y} (−1)^j C(y, j) B(α, k(m+j−y)+β)].   (2.3)

The Bayes estimator of p in the case of the square loss function L₁(p, a) = (p − a)² was derived by Chaubey and Li (1995) as

p̂_B1 = [Σ_{j=0}^{y} (−1)^j C(y, j) B(α+1, km+kj−ky+β)] / [Σ_{j=0}^{y} (−1)^j C(y, j) B(α, km+kj−ky+β)].   (2.4)

Specifically, if y = 0, then p̂_B1 = α/(n+α+β).

Now consider the loss function L₂(p, a) = p⁻¹(p − a)²; we will derive the corresponding Bayes estimator p̂_B2.
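Formula (2.4) is straightforward to evaluate numerically. A sketch in Python (not the thesis's code; `log_beta` uses `math.lgamma` for numerical stability, and the function names are ours):

```python
from math import comb, exp, lgamma

def log_beta(a, b):
    """Logarithm of the Beta function B(a, b)."""
    return lgamma(a) + lgamma(b) - lgamma(a + b)

def bayes_p1(y, m, k, alpha, beta):
    """Bayes estimator (2.4) of p under square loss, Beta(alpha, beta) prior on p."""
    num = sum((-1) ** j * comb(y, j) * exp(log_beta(alpha + 1, k*m + k*j - k*y + beta))
              for j in range(y + 1))
    den = sum((-1) ** j * comb(y, j) * exp(log_beta(alpha, k*m + k*j - k*y + beta))
              for j in range(y + 1))
    return num / den

# With y = 0 the formula collapses to alpha / (n + alpha + beta), n = mk:
print(bayes_p1(0, 50, 5, 1.0, 100.0))  # 1/351
```

Note that the alternating sums cancel heavily when y is large, so this direct evaluation is only reliable for the small y typical of small p.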

Note that if the loss function has the form L(p, a) = w(p)(p − a)², then the corresponding Bayes estimator is given by

p̂_BL = ∫_0^1 w(p) p f_{P|Y}(p|y) dp / ∫_0^1 w(p) f_{P|Y}(p|y) dp = E_{P|Y}(w(p)p | y) / E_{P|Y}(w(p) | y).   (2.5)

According to equation (2.5) we have to evaluate ∫ w(p) p f_{P|Y}(p|y) dp and ∫ w(p) f_{P|Y}(p|y) dp. For w(p) = p⁻¹ in the loss function L₂(p, a) we have

∫_0^1 w(p) p f_{P|Y}(p|y) dp = ∫_0^1 f_{P|Y}(p|y) dp = 1

and

∫_0^1 w(p) f_{P|Y}(p|y) dp = ∫_0^1 p^(α−2) (1−p)^(k(m−y)+β−1) (1−(1−p)^k)^y dp / [Σ_{j=0}^{y} (−1)^j C(y, j) B(α, k(m+j−y)+β)].   (2.6)

For y > 0, the integral in the numerator of (2.6) is evaluated by substituting u = 1 − p and using 1 − u^k = (1 − u)(1 + u + ... + u^(k−1)):

∫_0^1 p^(α−2) (1−p)^(k(m−y)+β−1) (1−(1−p)^k)^y dp   (2.7)
 = ∫_0^1 (1−u)^(α−2) u^(k(m−y)+β−1) (1−u^k)^y du
 = Σ_{i=0}^{k−1} ∫_0^1 u^(k(m−y)+i+β−1) (1−u^k)^(y−1) (1−u)^(α−1) du

 = Σ_{i=0}^{k−1} Σ_{j=0}^{y−1} (−1)^j C(y−1, j) ∫_0^1 u^(k(m−y)+i+β−1+kj) (1−u)^(α−1) du
 = Σ_{i=0}^{k−1} Σ_{j=0}^{y−1} (−1)^j C(y−1, j) B(km+kj+i+β−ky, α).

Therefore, if y > 0 we have

p̂_B2 = [Σ_{j=0}^{y} (−1)^j C(y, j) B(α, km+kj+β−ky)] / [Σ_{i=0}^{k−1} Σ_{j=0}^{y−1} (−1)^j C(y−1, j) B(km+kj+i+β−ky, α)].   (2.8)

In the case y = 0, we let α > 1 and notice that equation (2.7) becomes

∫_0^1 p^(α−2) (1−p)^(km+β−1) dp = B(α−1, km+β),  (α > 1).

Summarizing the above, we obtain

Theorem 2.1. Let p have prior distribution Beta(α, β). For the loss function L₂(p, a) = p⁻¹(p − a)², the Bayes estimator of p, denoted as p̂_B2, is given as

p̂_B2 = [Σ_{j=0}^{y} (−1)^j C(y, j) B(α, km+kj+β−ky)] / [Σ_{i=0}^{k−1} Σ_{j=0}^{y−1} (−1)^j C(y−1, j) B(km+kj+β+i−ky, α)]   if y > 0;

p̂_B2 = B(α, km+β) / B(α−1, km+β)   if y = 0 and α > 1.
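Theorem 2.1 can likewise be coded directly. A hedged Python sketch (names ours, not the thesis's code):

```python
from math import comb, exp, lgamma

def log_beta(a, b):
    """Logarithm of the Beta function B(a, b)."""
    return lgamma(a) + lgamma(b) - lgamma(a + b)

def bayes_p2(y, m, k, alpha, beta):
    """Bayes estimator of Theorem 2.1 under the loss L2(p, a) = (p - a)^2 / p."""
    if y == 0:
        if alpha <= 1:
            raise ValueError("the case y = 0 requires alpha > 1")
        return exp(log_beta(alpha, k*m + beta) - log_beta(alpha - 1, k*m + beta))
    num = sum((-1) ** j * comb(y, j) * exp(log_beta(alpha, k*m + k*j + beta - k*y))
              for j in range(y + 1))
    den = sum((-1) ** j * comb(y - 1, j) * exp(log_beta(k*m + k*j + beta + i - k*y, alpha))
              for i in range(k) for j in range(y))
    return num / den

# For y = 0 the ratio of Beta functions equals (alpha - 1)/(n + alpha + beta - 1):
print(bayes_p2(0, 50, 5, 2.0, 100.0))  # 1/351
```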

2.2 Approximation to Bayes Estimators of p

It is easy to show that for sufficiently large N it holds that

Γ(N + a) / Γ(N + b) ≈ N^(a−b).   (2.9)

As mentioned in the previous section, the group test is applied in the case of a large sample size n = mk. Thus we can simplify the expressions of p̂_B1 and p̂_B2.

First we examine the expression (2.4) for p̂_B1. We have

B(α, km+kj−ky+β) = Γ(α)Γ(km+kj−ky+β) / Γ(km+kj−ky+β+α) ≈ Γ(α)(km+kj−ky+β)^(−α),

since y is usually a small number and n = km is large. Similarly we have

B(α+1, km+kj−ky+β) ≈ Γ(α+1)(km+kj−ky+β)^(−α−1).

Hence we obtain

p̂_B1 = [Σ_{j=0}^{y} (−1)^j C(y, j) B(α+1, km+kj−ky+β)] / [Σ_{j=0}^{y} (−1)^j C(y, j) B(α, km+kj−ky+β)]
     ≈ [Γ(α+1)/Γ(α)] [Σ_{j=0}^{y} (−1)^j C(y, j) (km+kj−ky+β)^(−α−1)] / [Σ_{j=0}^{y} (−1)^j C(y, j) (km+kj−ky+β)^(−α)]

 = α [Σ_{j=0}^{y} (−1)^j C(y, j) (km+kj−ky+β)^(−α−1)] / [Σ_{j=0}^{y} (−1)^j C(y, j) (km+kj−ky+β)^(−α)] ≡ p̂*_B1,   (2.10)

where p̂*_B1 is an approximation to the Bayes estimator p̂_B1.

As for p̂_B2, notice that if y ≥ 1, then

B(km+kj+i+β−ky, α) = Γ(α)Γ(km+kj+i+β−ky) / Γ(km+kj+i+β−ky+α) ≈ Γ(α)(km+kj+i+β−ky)^(−α).

Therefore, if y ≥ 1 it holds from Theorem 2.1 that

p̂_B2 ≈ [Σ_{j=0}^{y} (−1)^j C(y, j) (km+kj−ky+β)^(−α)] / [Σ_{i=0}^{k−1} Σ_{j=0}^{y−1} (−1)^j C(y−1, j) (km+kj+i+β−ky)^(−α)] ≡ p̂*_B2.   (2.11)

On the other hand, if y = 0 and α > 1, then we have

p̂_B2 = B(α, km+β) / B(α−1, km+β)
     = [Γ(α)Γ(km+β)/Γ(km+α+β)] · [Γ(km+α+β−1)/(Γ(α−1)Γ(km+β))]
     = (α−1)/(km+α+β−1) = (α−1)/(n+α+β−1),  if α > 1.   (2.12)

From (2.11) and (2.12) we see that the Bayes estimator p̂_B2 can be approximated by

p̂*_B2 = [Σ_{j=0}^{y} (−1)^j C(y, j) (km+kj−ky+β)^(−α)] / [Σ_{i=0}^{k−1} Σ_{j=0}^{y−1} (−1)^j C(y−1, j) (km+kj+i+β−ky)^(−α)]  if y ≥ 1;
p̂*_B2 = (α−1)/(n+α+β−1)  if y = 0 and α > 1.   (2.13)

2.3 Comparison in the Special Case of Y = 0

Note that the event (Y = 0) is the same as the event (X = 0), where X is the number of positive subjects when subjects are tested individually. So it is interesting to compare the Bayes estimators based on X = 0 and Y = 0. Clearly X ∼ B(n, p) = B(mk, p). Assuming that the proportion p has Beta(α, β) as its prior distribution, it is well known that the posterior distribution of p given X = x is Beta(x+α, n−x+β). Suppose that the square loss function L₁(p, a) = (p − a)² is used; then the Bayes estimator p̂_B1 based on X = 0 is

p̂_B1 = E_{P|X}(p | x = 0) = α/(n+α+β).

In the meantime, the Bayes estimator p̂_B1 based on Y = 0, given following (2.4), is exactly the same.

Now consider the loss function L₂(p, a) = p⁻¹(p − a)². For this loss function the Bayes estimator p̂_B2 based on X can be obtained as follows. We have

∫_0^1 w(p) p f_{P|X}(p|x) dp = ∫_0^1 f_{P|X}(p|x) dp = 1

and

∫_0^1 w(p) f_{P|X}(p|x) dp = (B(x+α, n−x+β))⁻¹ ∫_0^1 p^(x+α−2) (1−p)^(n−x+β−1) dp.

Consider two cases. First, let x ≥ 1, which is equivalent to y ≥ 1. In this case

∫_0^1 w(p) f_{P|X}(p|x) dp = B(x+α−1, n−x+β) / B(x+α, n−x+β)
 = [Γ(x+α−1)Γ(n−x+β)/Γ(n+α+β−1)] · [Γ(n+α+β)/(Γ(x+α)Γ(n−x+β))]
 = [(x+α−1)/(n+α+β−1)]⁻¹.

Thus

p̂_B2 = (x+α−1)/(n+α+β−1).

However, if x = 0, then it must be assumed that α > 1, and so

∫_0^1 w(p) f_{P|X}(p|0) dp = (B(α, n+β))⁻¹ ∫_0^1 p^(α−2) (1−p)^(n+β−1) dp
 = B(α−1, n+β) / B(α, n+β)
 = [Γ(α−1)Γ(n+β)/Γ(n+α+β−1)] · [Γ(n+α+β)/(Γ(α)Γ(n+β))]
 = (n+α+β−1)/(α−1).

Consequently, it follows that

p̂_B2 = (α−1)/(n+α+β−1),  if x = 0 and α > 1.

Therefore

p̂_B2 = (x+α−1)/(n+α+β−1)  if x ≥ 1;  (α−1)/(n+α+β−1)  if x = 0 and α > 1.

From this and equation (2.13), i.e.,

p̂*_B2 = (α−1)/(n+α+β−1)  if y = 0 and α > 1,

we see that in this case p̂*_B2 and the X-based p̂_B2 are exactly the same when there is no positive subject in the entire sample.

2.4 Credible and HPD Interval for p

Suppose that p has prior distribution Beta(α, β). From equation (2.3) it is known that the posterior distribution of p given Y = y is

f_{P|Y}(p|y) = p^(α−1) (1−p)^(k(m−y)+β−1) (1−(1−p)^k)^y / [Σ_{j=0}^{y} (−1)^j C(y, j) B(α, k(m+j−y)+β)].

Then for any set A ⊆ (0, 1), the credible probability of A is

P(p ∈ A) = ∫_A f_{P|Y}(p|y) dp,

and A is a credible set for p. By numerical computation it is not difficult to obtain a 1−γ credible interval for p.
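The numerical computation just mentioned can be sketched as a grid approximation of the posterior (2.3) followed by reading off the equal-tail quantiles. The Python below is an illustration (names, grid size, and midpoint-rule integration are our choices):

```python
from math import comb, exp, lgamma

def log_beta(a, b):
    """Logarithm of the Beta function B(a, b)."""
    return lgamma(a) + lgamma(b) - lgamma(a + b)

def posterior(p, y, m, k, alpha, beta):
    """Posterior density (2.3) of p given Y = y, Beta(alpha, beta) prior on p."""
    norm = sum((-1) ** j * comb(y, j) * exp(log_beta(alpha, k * (m + j - y) + beta))
               for j in range(y + 1))
    return (p ** (alpha - 1) * (1 - p) ** (k * (m - y) + beta - 1)
            * (1 - (1 - p) ** k) ** y) / norm

def credible_interval(y, m, k, alpha, beta, gamma=0.05, grid=20000):
    """Equal-tail 1 - gamma credible interval via midpoint-rule integration."""
    h = 1.0 / grid
    ps = [(i + 0.5) * h for i in range(grid)]
    cdf, acc = [], 0.0
    for p in ps:
        acc += posterior(p, y, m, k, alpha, beta) * h
        cdf.append(acc)
    lo = next(p for p, c in zip(ps, cdf) if c >= gamma / 2)
    hi = next(p for p, c in zip(ps, cdf) if c >= 1 - gamma / 2)
    return lo, hi, acc  # acc checks that the posterior integrates to ~1

print(credible_interval(1, 20, 5, 1.0, 5.0))
```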

In the special case when α = 1, i.e., p has Beta(1, β) as its prior distribution, by equation (2.2) the marginal distribution of Y has density

f_Y(y) = ∫_0^1 β(1−p)^(β−1) C(m, y) (1−(1−p)^k)^y (1−p)^(k(m−y)) dp = (β/k) C(m, y) B(m−y+β/k, y+1),   (2.14)

and consequently the posterior distribution of p given Y = y has density

f_{P|Y}(p|y) = β C(m, y) (1−p)^(k(m−y)+β−1) (1−(1−p)^k)^y / [(β/k) C(m, y) B(m−y+β/k, y+1)]
            = k (1−p)^(k(m−y)+β−1) (1−(1−p)^k)^y / B(m−y+β/k, y+1).   (2.15)

We further let β ≥ 1. This assumption is reasonable because we are dealing with small p, and the majority of the probability mass of Beta(1, β) is close to zero when β is relatively large. With this assumption it holds that m − y + (β−1)/k > 0. It can be shown easily that the posterior f_{P|Y}(p|y) is unimodal. Therefore, the HPD interval for p can be obtained as {p : f_{P|Y}(p|y) ≥ a}, where a is chosen so that

∫_{{p : f_{P|Y}(p|y) ≥ a}} f_{P|Y}(p|y) dp = 1 − γ

for any given γ ∈ (0, 1).

For the case 0 < β < 1, the posterior density f_{P|Y}(p|y) strictly decreases in p ∈ (0, 1), so the HPD interval for p can be obtained as

{p : f_{P|Y}(p|y) ≥ a}, where ∫_{{p : f_{P|Y}(p|y) ≥ a}} f_{P|Y}(p|y) dp = 1 − γ.

3 Bayes Estimators of p from Prior on π

In the present section we will assume that π = 1 − (1−p)^k has a Beta(α, β) prior distribution. On the basis of the observation Y = y we first calculate the Bayes estimator π̂_B, and then derive the Bayes estimator of p. Following Chaubey and Li (1995), this procedure is called the indirect Bayes procedure; the produced estimators are called indirect Bayes estimators and are denoted as p̂_IB1 or p̂_IB2, depending on the loss function L₁(p, a) = (p−a)² or L₂(p, a) = p⁻¹(p−a)². Obviously the posterior distribution of π given Y = y is Beta(y+α, m−y+β), i.e.,

f_{π|Y}(π|y) = (B(y+α, m−y+β))⁻¹ π^(y+α−1) (1−π)^(m−y+β−1).

From π = 1 − (1−p)^k = π(p) we see that p = p(π) = 1 − (1−π)^(1/k). Consider the loss function L(p, a) = w(p)(p−a)². It is easy to see that the corresponding indirect Bayes estimator p̂_IB is given as

p̂_IB = E(p w(p) | Y = y) / E(w(p) | Y = y).

For the case L₁(p, a) = (p−a)², the indirect Bayes estimator p̂_IB1 is given in Chaubey and Li (1995) as

p̂_IB1 = 1 − Γ(m+α+β)Γ(m−y+β+1/k) / [Γ(m−y+β)Γ(m+α+β+1/k)],   (3.1)

which can be further approximated as

p̂*_IB1 = 1 − ((m+β−y)/(m+α+β))^(1/k).   (3.2)

Now let us consider the loss function L₂(p, a) = p⁻¹(p−a)². Clearly

E(p w(p) | Y = y) = ∫_0^1 p p⁻¹ f_{π|Y}(π|y) dπ = ∫_0^1 f_{π|Y}(π|y) dπ = 1

and

E(w(p) | Y = y) = ∫_0^1 p⁻¹ f_{π|Y}(π|y) dπ
               = ∫_0^1 (1−(1−π)^(1/k))⁻¹ π^(y+α−1) (1−π)^(m−y+β−1) dπ / B(y+α, m−y+β).   (3.3)

To evaluate the integral on the right side of (3.3) we let u = (1−π)^(1/k) and have

∫_0^1 (1−(1−π)^(1/k))⁻¹ π^(y+α−1) (1−π)^(m−y+β−1) dπ = ∫_0^1 (1−u)⁻¹ (1−u^k)^(y+α−1) u^(k(m−y+β−1)) k u^(k−1) du

Using (1−u^k)/(1−u) = 1 + u + ... + u^(k−1), and then substituting v = u^k:

 = k Σ_{i=0}^{k−1} ∫_0^1 u^(k(m−y+β)+i−1) (1−u^k)^(y+α−2) du
 = Σ_{i=0}^{k−1} ∫_0^1 u^(k(m−y+β)+i−k) (1−u^k)^(y+α−2) k u^(k−1) du
 = Σ_{i=0}^{k−1} ∫_0^1 v^(m−y+β+i/k−1) (1−v)^(y+α−2) dv
 = Σ_{i=0}^{k−1} B(m−y+β+i/k, y+α−1),

where it is required that α > 1 if y = 0. Therefore, the indirect Bayes estimator p̂_IB2 based on the observation Y = y is

p̂_IB2 = B(y+α, m−y+β) / Σ_{i=0}^{k−1} B(m−y+β+i/k, y+α−1),   (3.4)

where α > 1 if y = 0. We can rewrite p̂_IB2 as

p̂_IB2 = [Γ(y+α)Γ(m−y+β)/Γ(m+α+β)] / Σ_{i=0}^{k−1} [Γ(m−y+β+i/k)Γ(y+α−1)/Γ(m+α+β+i/k−1)]
      = (y+α−1) [Γ(m−y+β)/Γ(m+α+β)] / Σ_{i=0}^{k−1} [Γ(m−y+β+i/k)/Γ(m+α+β+i/k−1)].   (3.5)

From (3.5) and (2.9) we see that p̂_IB2 can be approximated by

p̂*_IB2 = (y+α−1) / Σ_{i=0}^{k−1} (m−y+β)^(i/k) (m+α+β)^(−i/k+1)
       = [(y+α−1)/(m+α+β)] / Σ_{i=0}^{k−1} ((m−y+β)/(m+α+β))^(i/k),   (3.6)

where, again, α > 1 if y = 0.
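Formulas (3.1), (3.2) and (3.4) can be sketched as follows (a Python illustration, names ours). For moderate m the approximation (3.2) is already very close to the exact (3.1):

```python
from math import exp, lgamma

def log_beta(a, b):
    """Logarithm of the Beta function B(a, b)."""
    return lgamma(a) + lgamma(b) - lgamma(a + b)

def p_ib1(y, m, k, alpha, beta):
    """Indirect Bayes estimator (3.1): square loss, Beta(alpha, beta) prior on pi."""
    return 1.0 - exp(lgamma(m + alpha + beta) + lgamma(m - y + beta + 1.0 / k)
                     - lgamma(m - y + beta) - lgamma(m + alpha + beta + 1.0 / k))

def p_ib1_approx(y, m, k, alpha, beta):
    """Approximation (3.2), via Gamma(N + a)/Gamma(N + b) ~ N^(a - b)."""
    return 1.0 - ((m + beta - y) / (m + alpha + beta)) ** (1.0 / k)

def p_ib2(y, m, k, alpha, beta):
    """Indirect Bayes estimator (3.4) under the loss L2; needs alpha > 1 if y = 0."""
    den = sum(exp(log_beta(m - y + beta + i / k, y + alpha - 1)) for i in range(k))
    return exp(log_beta(y + alpha, m - y + beta)) / den

print(p_ib1(2, 50, 10, 1.0, 5.0), p_ib1_approx(2, 50, 10, 1.0, 5.0))
```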

4 Numerical Studies

A MATLAB simulation is performed in order to analyze the behavior of the different estimators under several combinations. I chose m = 5, 10, 15, 20, 25 and k = 5, 10, 15. Fifteen values of p between 0.005 and 0.1 are considered. First, I simulate a sample of independent, identically distributed Bernoulli observations and divide the observations into m groups with k units in each group. I use this grouped sample to calculate the value of the MLE. Second, I use the same sample to calculate the values of the Bayes estimator and the approximate Bayes estimator under different values of β. The process is repeated N times, which yields the MSE of these estimators; the MSE is used as the criterion of comparison.

Tables 1-3 display the optimal βs, which yield the smallest MSE of the Bayes estimator associated with the square loss function. From these tables, we can see that the optimal β decreases as p increases, and increases with m and k. Figures 1-5 also indicate these trends. Tables 4-7 show how the optimal βs were obtained. Tables 8-19 compare the MSE of the Bayes estimator under the optimal βs with that of the approximate Bayes estimator as well as the MLE. From these tables, we can draw the conclusion that the Bayes estimator is the best one as far as the MSE is concerned. The MLE is not a good estimator when p is small. Moreover, the approximate Bayes estimator is also much better than the MLE.
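The simulation described above can be sketched in Python (the thesis used MATLAB; the specific m, k, β, and replication count below are illustrative choices of ours, not the thesis's optimal settings):

```python
import random
from math import comb, exp, lgamma

def log_beta(a, b):
    """Logarithm of the Beta function B(a, b)."""
    return lgamma(a) + lgamma(b) - lgamma(a + b)

def mle(y, m, k):
    """Group-testing MLE: invert pi = 1 - (1 - p)^k at pi_hat = y/m."""
    return 1.0 - (1.0 - y / m) ** (1.0 / k)

def bayes(y, m, k, alpha, beta):
    """Bayes estimator (2.4) under square loss."""
    num = sum((-1) ** j * comb(y, j) * exp(log_beta(alpha + 1, k*m + k*j - k*y + beta))
              for j in range(y + 1))
    den = sum((-1) ** j * comb(y, j) * exp(log_beta(alpha, k*m + k*j - k*y + beta))
              for j in range(y + 1))
    return num / den

def mse(estimate, p, m, k, reps, rng):
    """Monte Carlo MSE of an estimator y -> p_hat at true prevalence p."""
    tot = 0.0
    for _ in range(reps):
        y = sum(any(rng.random() < p for _ in range(k)) for _ in range(m))
        tot += (estimate(y) - p) ** 2
    return tot / reps

rng = random.Random(1)
p, m, k = 0.005, 50, 5
mse_mle = mse(lambda y: mle(y, m, k), p, m, k, 2000, rng)
mse_bayes = mse(lambda y: bayes(y, m, k, 1.0, 200.0), p, m, k, 2000, rng)
print(mse_mle, mse_bayes)
```

With a prior whose mean α/(α+β) sits near the true p, the Bayes estimator's MSE comes out well below the MLE's, matching the qualitative pattern reported in Tables 8-19.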

Table 1: Optimal β for k = 5, α = 1, N = 1. This is the optimal β based on the smallest MSE of the Bayes estimator, using k = 5 with combinations of different m and p.

Table 2: Optimal β for k = 10, α = 1, N = 1. This is the optimal β based on the smallest MSE of the Bayes estimator, using k = 10 with combinations of different m and p.

Table 3: Optimal β for k = 15, α = 1, N = 1. This is the optimal β based on the smallest MSE of the Bayes estimator, using k = 15 with combinations of different m and p.

Table 4: k = 5, m = 5, p = 0.005, α = 1, N = 1.
Table 5: k = 5, m = 5, p = 0.01, α = 1, N = 1.
Both tables compare the MSE of the Bayes estimator and of the approximation to the Bayes estimator under different βs.

Table 6: k = 5, m = 15, p = 0.005, α = 1, N = 1.
Table 7: k = 5, m = 15, p = 0.01, α = 1, N = 1.
Both tables compare the MSE of the Bayes estimator and of the approximation to the Bayes estimator under different βs.

Table 8: k = 5, p = 0.005, 0.006, 0.007, 0.008, α = 1, N = 1. Table 8 compares the MSE of the Bayes estimator and of the approximation to the Bayes estimator with the MSE of the MLE, using k = 5 with combinations of p = 0.005, 0.006, 0.007, 0.008 and m = 5, 10, 15, 20, 25.

Table 9: k = 5, p = 0.009, 0.01, 0.02, 0.03, α = 1, N = 1. Table 9 compares the MSE of the Bayes estimator and of the approximation to the Bayes estimator with the MSE of the MLE, using k = 5 with combinations of p = 0.009, 0.01, 0.02, 0.03 and m = 5, 10, 15, 20, 25.

Table 10: k = 5, p = 0.04, 0.05, 0.06, 0.07, α = 1, N = 1. Table 10 compares the MSE of the Bayes estimator and of the approximation to the Bayes estimator with the MSE of the MLE, using k = 5 with combinations of p = 0.04, 0.05, 0.06, 0.07 and m = 5, 10, 15, 20, 25.

Table 11: k = 5, p = 0.08, 0.09, 0.1, α = 1, N = 1. Table 11 compares the MSE of the Bayes estimator and of the approximation to the Bayes estimator with the MSE of the MLE, using k = 5 with combinations of p = 0.08, 0.09, 0.1 and m = 5, 10, 15, 20, 25.

Table 12: k = 10, p = 0.005, 0.006, 0.007, 0.008, α = 1, N = 1. Table 12 compares the MSE of the Bayes estimator and of the approximation to the Bayes estimator with the MSE of the MLE, using k = 10 with combinations of p = 0.005, 0.006, 0.007, 0.008 and m = 5, 10, 15, 20, 25.

Table 13: k = 10, p = 0.009, 0.01, 0.02, 0.03, α = 1, N = 1. Table 13 compares the MSE of the Bayes estimator and of the approximation to the Bayes estimator with the MSE of the MLE, using k = 10 with combinations of p = 0.009, 0.01, 0.02, 0.03 and m = 5, 10, 15, 20, 25.

Table 14: k = 10, p = .04, .05, .06, .07, α = 1, N = 1
[MSE(p̂_B1), MSE(p̂*_B1), and MSE(p̂_MLE) at each p for m = 5, 10, 15, 20, 25; the numeric entries are not recoverable from this transcription.]
Table 14 compares the MSE of the Bayes estimator and the approximation to the Bayes estimator with the MSE of the MLE, using k = 10 with different combinations of p = .04, .05, .06, .07 and m = 5, 10, 15, 20, 25.

Table 15: k = 10, p = .08, .09, .10, α = 1, N = 1
[MSE(p̂_B1), MSE(p̂*_B1), and MSE(p̂_MLE) at each p for m = 5, 10, 15, 20, 25; the numeric entries are not recoverable from this transcription.]
Table 15 compares the MSE of the Bayes estimator and the approximation to the Bayes estimator with the MSE of the MLE, using k = 10 with different combinations of p = .08, .09, .10 and m = 5, 10, 15, 20, 25.

Table 16: k = 15, p = .005, .006, .007, .008, α = 1, N = 1
[MSE(p̂_B1), MSE(p̂*_B1), and MSE(p̂_MLE) at each p for m = 5, 10, 15, 20, 25; the numeric entries are not recoverable from this transcription.]
Table 16 compares the MSE of the Bayes estimator and the approximation to the Bayes estimator with the MSE of the MLE, using k = 15 with different combinations of p = .005, .006, .007, .008 and m = 5, 10, 15, 20, 25.

Table 17: k = 15, p = .009, .01, .02, .03, α = 1, N = 1
[MSE(p̂_B1), MSE(p̂*_B1), and MSE(p̂_MLE) at each p for m = 5, 10, 15, 20, 25; the numeric entries are not recoverable from this transcription.]
Table 17 compares the MSE of the Bayes estimator and the approximation to the Bayes estimator with the MSE of the MLE, using k = 15 with different combinations of p = .009, .01, .02, .03 and m = 5, 10, 15, 20, 25.

Table 18: k = 15, p = .04, .05, .06, .07, α = 1, N = 1
[MSE(p̂_B1), MSE(p̂*_B1), and MSE(p̂_MLE) at each p for m = 5, 10, 15, 20, 25; the numeric entries are not recoverable from this transcription.]
Table 18 compares the MSE of the Bayes estimator and the approximation to the Bayes estimator with the MSE of the MLE, using k = 15 with different combinations of p = .04, .05, .06, .07 and m = 5, 10, 15, 20, 25.

Table 19: k = 15, p = .08, .09, .10, α = 1, N = 1
[MSE(p̂_B1), MSE(p̂*_B1), and MSE(p̂_MLE) at each p for m = 5, 10, 15, 20, 25; the numeric entries are not recoverable from this transcription.]
Table 19 compares the MSE of the Bayes estimator and the approximation to the Bayes estimator with the MSE of the MLE, using k = 15 with different combinations of p = .08, .09, .10 and m = 5, 10, 15, 20, 25.
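The MSE entries in the tables above come from Monte Carlo simulation. As a rough sketch of how such an entry could be reproduced (assuming the standard group-testing model, in which each of m pooled groups of size k tests positive with probability 1 − (1 − p)^k and the MLE is p̂_MLE = 1 − (1 − y/m)^(1/k); the Python code, replication count, and parameter values below are illustrative assumptions, not the thesis's original Matlab program):

```python
import random

def simulate_mse_mle(p, k, m, n_rep=10000, seed=1):
    # Monte Carlo estimate of MSE(p_hat_MLE) for group testing:
    # y ~ Binomial(m, pi) with pi = 1 - (1 - p)^k, and p_hat = 1 - (1 - y/m)^(1/k)
    rng = random.Random(seed)                  # fixed seed for reproducibility
    pi = 1.0 - (1.0 - p) ** k                  # probability a pooled group tests positive
    total_sq_err = 0.0
    for _ in range(n_rep):
        y = sum(rng.random() < pi for _ in range(m))   # one Binomial(m, pi) draw
        p_hat = 1.0 - (1.0 - y / m) ** (1.0 / k)       # MLE of the individual prevalence
        total_sq_err += (p_hat - p) ** 2
    return total_sq_err / n_rep

# Illustrative settings: small prevalence, groups of size 5, m = 10 groups per sample
mse_small = simulate_mse_mle(p=0.05, k=5, m=10)
mse_tiny = simulate_mse_mle(p=0.005, k=5, m=10)
```

Running the same routine over a grid of p and m values, and replacing the MLE by the Bayes estimator or its approximation, would fill one row of a table like those above.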

Figure 1: K = 5, m = 10 [figure not reproduced in this transcription]
Figure 2: K = 5, m = 20 [figure not reproduced in this transcription]

Figure 3: K = 5, m = 10 or 20 [figure not reproduced in this transcription]
Figure 4: K = 10, m = 10 [figure not reproduced in this transcription]

Figure 5: K = 5 or 10, m = 10 [figure not reproduced in this transcription]

5 Conclusion

According to the analytical and numerical results obtained above, the MSEs of both the Bayes estimators and the approximations to the Bayes estimators are smaller than that of the MLE when p is small. The indirect Bayes estimators perform very well too. We conclude that the Bayes estimators are much better than the MLE when p is small.


APPENDIX

The equations used in this study are listed here for convenience of reference.

1 Bayes Inferences from Prior on p

Loss function $L_1(p, a) = (p - a)^2$. Bayes estimator (Equation 2.4, page 8), for $y \ge 1$:

$$\hat{p}_{B_1} = \frac{\sum_{j=0}^{y} (-1)^j \binom{y}{j} B(\alpha + 1,\, km + kj - ky + \beta)}{\sum_{j=0}^{y} (-1)^j \binom{y}{j} B(\alpha,\, km + kj - ky + \beta)}$$

Approximation to the Bayes estimator $\hat{p}_{B_1}$ (Equation 2.1, page 12), for $y \ge 1$:

$$\hat{p}^{*}_{B_1} = \alpha \cdot \frac{\sum_{j=0}^{y} (-1)^j \binom{y}{j} (km + kj - ky + \beta)^{-(\alpha + 1)}}{\sum_{j=0}^{y} (-1)^j \binom{y}{j} (km + kj - ky + \beta)^{-\alpha}}$$

Loss function $L_2(p, a) = p^{-1}(p - a)^2$. Bayes estimator (Theorem 1, page 10):

$$\hat{p}_{B_2} = \begin{cases} \dfrac{\sum_{j=0}^{y} (-1)^j \binom{y}{j} B(\alpha,\, km + kj - ky + \beta)}{\sum_{i=0}^{k-1} \sum_{j=0}^{y-1} (-1)^j \binom{y-1}{j} B(\alpha,\, km + kj - ky + \beta + i)}, & \text{if } y \ge 1 \\[2ex] \dfrac{B(\alpha,\, km + \beta)}{B(\alpha - 1,\, km + \beta)}, & \text{if } y = 0 \text{ and } \alpha > 1 \end{cases}$$

Approximation to the Bayes estimator $\hat{p}_{B_2}$ (Equation 2.13, page 13):

$$\hat{p}^{*}_{B_2} = \begin{cases} \dfrac{\sum_{j=0}^{y} (-1)^j \binom{y}{j} (km + kj - ky + \beta)^{-\alpha}}{\sum_{i=0}^{k-1} \sum_{j=0}^{y-1} (-1)^j \binom{y-1}{j} (km + kj - ky + \beta + i)^{-\alpha}}, & \text{if } y \ge 1 \\[2ex] \dfrac{\alpha - 1}{km + \alpha + \beta - 1}, & \text{if } y = 0 \text{ and } \alpha > 1 \end{cases}$$
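The formulas in this section can be evaluated directly. The following Python sketch (an illustration added to this transcription, not the thesis's own code; the Beta function is computed through log-gammas to avoid overflow, and the parameter values are hypothetical) computes $\hat{p}_{B_1}$ and its approximation $\hat{p}^{*}_{B_1}$:

```python
import math

def beta_fn(a, b):
    # Beta function B(a, b) via log-gammas for numerical stability
    return math.exp(math.lgamma(a) + math.lgamma(b) - math.lgamma(a + b))

def p_hat_B1(y, k, m, alpha, beta):
    # Exact Bayes estimator of p under squared-error loss L1, with a
    # Beta(alpha, beta) prior on p; alternating-sum form, valid for y >= 1
    if y < 1:
        raise ValueError("the alternating-sum formula assumes y >= 1")
    num = sum((-1) ** j * math.comb(y, j) * beta_fn(alpha + 1, k * m + k * j - k * y + beta)
              for j in range(y + 1))
    den = sum((-1) ** j * math.comb(y, j) * beta_fn(alpha, k * m + k * j - k * y + beta)
              for j in range(y + 1))
    return num / den

def p_hat_B1_approx(y, k, m, alpha, beta):
    # Approximation based on B(a, n) ~ Gamma(a) * n**(-a) for large n
    num = sum((-1) ** j * math.comb(y, j) * (k * m + k * j - k * y + beta) ** (-(alpha + 1))
              for j in range(y + 1))
    den = sum((-1) ** j * math.comb(y, j) * (k * m + k * j - k * y + beta) ** (-alpha)
              for j in range(y + 1))
    return alpha * num / den

# Illustrative values: m = 10 groups of size k = 5, y = 1 positive group, Beta(1, 1) prior
exact = p_hat_B1(1, 5, 10, 1.0, 1.0)
approx = p_hat_B1_approx(1, 5, 10, 1.0, 1.0)
```

For these settings the exact and approximate estimators agree closely, which is the behavior the MSE tables reflect; $\hat{p}_{B_2}$ and $\hat{p}^{*}_{B_2}$ can be evaluated the same way with the additional sum over $i = 0, \ldots, k-1$.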

2 Bayes Estimators of p from Prior on π

Loss function $L_1(p, a) = (p - a)^2$. Bayes estimator (Equation 3.1, page 18):

$$\hat{p}_{IB_1} = 1 - \frac{\Gamma(m + \alpha + \beta)\,\Gamma(m - y + \beta + 1/k)}{\Gamma(m - y + \beta)\,\Gamma(m + \alpha + \beta + 1/k)}$$

Approximation to the Bayes estimator $\hat{p}_{IB_1}$ (Equation 3.2, page 18):

$$\hat{p}^{*}_{IB_1} = 1 - \left(\frac{m - y + \beta}{m + \alpha + \beta}\right)^{1/k}$$

Loss function $L_2(p, a) = p^{-1}(p - a)^2$. Bayes estimator (Equation 3.5, page 19):

$$\hat{p}_{IB_2} = (y + \alpha - 1)\left[\sum_{i=0}^{k-1} \frac{\Gamma(m - y + \beta + i/k)\,\Gamma(m + \alpha + \beta)}{\Gamma(m - y + \beta)\,\Gamma(m + \alpha + \beta + i/k - 1)}\right]^{-1}$$

Approximation to the Bayes estimator $\hat{p}_{IB_2}$ (Equation 3.6, page 20):

$$\hat{p}^{*}_{IB_2} = \frac{y + \alpha - 1}{m + \alpha + \beta}\left[\sum_{i=0}^{k-1} \left(\frac{m - y + \beta}{m + \alpha + \beta}\right)^{i/k}\right]^{-1}$$
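These indirect estimators are simple closed forms. A short Python sketch (again an illustrative addition with hypothetical parameter values, not the original code; the gamma ratios are computed via math.lgamma for numerical stability) evaluates $\hat{p}_{IB_1}$ and its approximation:

```python
import math

def p_hat_IB1(y, k, m, alpha, beta):
    # Exact Bayes estimator of p under squared-error loss when the
    # Beta(alpha, beta) prior is placed on pi = 1 - (1 - p)^k
    lg = math.lgamma
    ratio = math.exp(lg(m + alpha + beta) + lg(m - y + beta + 1.0 / k)
                     - lg(m - y + beta) - lg(m + alpha + beta + 1.0 / k))
    return 1.0 - ratio

def p_hat_IB1_approx(y, k, m, alpha, beta):
    # Approximation: 1 - ((m - y + beta) / (m + alpha + beta))**(1/k)
    return 1.0 - ((m - y + beta) / (m + alpha + beta)) ** (1.0 / k)

# Illustrative values: m = 10 groups of size k = 5, y = 1 positive group, Beta(1, 1) prior
exact = p_hat_IB1(1, 5, 10, 1.0, 1.0)
approx = p_hat_IB1_approx(1, 5, 10, 1.0, 1.0)
```

The approximation replaces each gamma ratio by its leading-order power, which is why $\hat{p}^{*}_{IB_1}$ tracks $\hat{p}_{IB_1}$ so closely for moderate m.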


More information

Estimation of Quantiles

Estimation of Quantiles 9 Estimation of Quantiles The notion of quantiles was introduced in Section 3.2: recall that a quantile x α for an r.v. X is a constant such that P(X x α )=1 α. (9.1) In this chapter we examine quantiles

More information

Bayesian Inference. STA 121: Regression Analysis Artin Armagan

Bayesian Inference. STA 121: Regression Analysis Artin Armagan Bayesian Inference STA 121: Regression Analysis Artin Armagan Bayes Rule...s! Reverend Thomas Bayes Posterior Prior p(θ y) = p(y θ)p(θ)/p(y) Likelihood - Sampling Distribution Normalizing Constant: p(y

More information

Bayesian Statistics Part III: Building Bayes Theorem Part IV: Prior Specification

Bayesian Statistics Part III: Building Bayes Theorem Part IV: Prior Specification Bayesian Statistics Part III: Building Bayes Theorem Part IV: Prior Specification Michael Anderson, PhD Hélène Carabin, DVM, PhD Department of Biostatistics and Epidemiology The University of Oklahoma

More information

Introduction to Machine Learning. Lecture 2

Introduction to Machine Learning. Lecture 2 Introduction to Machine Learning Lecturer: Eran Halperin Lecture 2 Fall Semester Scribe: Yishay Mansour Some of the material was not presented in class (and is marked with a side line) and is given for

More information

Introduction to Bayesian Statistics 1

Introduction to Bayesian Statistics 1 Introduction to Bayesian Statistics 1 STA 442/2101 Fall 2018 1 This slide show is an open-source document. See last slide for copyright information. 1 / 42 Thomas Bayes (1701-1761) Image from the Wikipedia

More information

Bernoulli and Poisson models

Bernoulli and Poisson models Bernoulli and Poisson models Bernoulli/binomial models Return to iid Y 1,...,Y n Bin(1, ). The sampling model/likelihood is p(y 1,...,y n ) = P y i (1 ) n P y i When combined with a prior p( ), Bayes rule

More information

The extreme points of symmetric norms on R^2

The extreme points of symmetric norms on R^2 Graduate Theses and Dissertations Iowa State University Capstones, Theses and Dissertations 2008 The extreme points of symmetric norms on R^2 Anchalee Khemphet Iowa State University Follow this and additional

More information

Multivariate Survival Analysis

Multivariate Survival Analysis Multivariate Survival Analysis Previously we have assumed that either (X i, δ i ) or (X i, δ i, Z i ), i = 1,..., n, are i.i.d.. This may not always be the case. Multivariate survival data can arise in

More information

10. Exchangeability and hierarchical models Objective. Recommended reading

10. Exchangeability and hierarchical models Objective. Recommended reading 10. Exchangeability and hierarchical models Objective Introduce exchangeability and its relation to Bayesian hierarchical models. Show how to fit such models using fully and empirical Bayesian methods.

More information

Bayesian Estimation An Informal Introduction

Bayesian Estimation An Informal Introduction Mary Parker, Bayesian Estimation An Informal Introduction page 1 of 8 Bayesian Estimation An Informal Introduction Example: I take a coin out of my pocket and I want to estimate the probability of heads

More information

Bayesian Statistical Methods. Jeff Gill. Department of Political Science, University of Florida

Bayesian Statistical Methods. Jeff Gill. Department of Political Science, University of Florida Bayesian Statistical Methods Jeff Gill Department of Political Science, University of Florida 234 Anderson Hall, PO Box 117325, Gainesville, FL 32611-7325 Voice: 352-392-0262x272, Fax: 352-392-8127, Email:

More information

Joint, Conditional, & Marginal Probabilities

Joint, Conditional, & Marginal Probabilities Joint, Conditional, & Marginal Probabilities The three axioms for probability don t discuss how to create probabilities for combined events such as P [A B] or for the likelihood of an event A given that

More information

Bayesian Statistics. Debdeep Pati Florida State University. February 11, 2016

Bayesian Statistics. Debdeep Pati Florida State University. February 11, 2016 Bayesian Statistics Debdeep Pati Florida State University February 11, 2016 Historical Background Historical Background Historical Background Brief History of Bayesian Statistics 1764-1838: called probability

More information

Midterm Examination. STA 215: Statistical Inference. Due Wednesday, 2006 Mar 8, 1:15 pm

Midterm Examination. STA 215: Statistical Inference. Due Wednesday, 2006 Mar 8, 1:15 pm Midterm Examination STA 215: Statistical Inference Due Wednesday, 2006 Mar 8, 1:15 pm This is an open-book take-home examination. You may work on it during any consecutive 24-hour period you like; please

More information

NIH Public Access Author Manuscript Commun Stat Theory Methods. Author manuscript; available in PMC 2011 January 7.

NIH Public Access Author Manuscript Commun Stat Theory Methods. Author manuscript; available in PMC 2011 January 7. NIH Public Access Author Manuscript Published in final edited form as: Commun Stat Theory Methods. 2011 January 1; 40(3): 436 448. doi:10.1080/03610920903391303. Optimal Configuration of a Square Array

More information

ECE521 W17 Tutorial 6. Min Bai and Yuhuai (Tony) Wu

ECE521 W17 Tutorial 6. Min Bai and Yuhuai (Tony) Wu ECE521 W17 Tutorial 6 Min Bai and Yuhuai (Tony) Wu Agenda knn and PCA Bayesian Inference k-means Technique for clustering Unsupervised pattern and grouping discovery Class prediction Outlier detection

More information

LECTURE 5 NOTES. n t. t Γ(a)Γ(b) pt+a 1 (1 p) n t+b 1. The marginal density of t is. Γ(t + a)γ(n t + b) Γ(n + a + b)

LECTURE 5 NOTES. n t. t Γ(a)Γ(b) pt+a 1 (1 p) n t+b 1. The marginal density of t is. Γ(t + a)γ(n t + b) Γ(n + a + b) LECTURE 5 NOTES 1. Bayesian point estimators. In the conventional (frequentist) approach to statistical inference, the parameter θ Θ is considered a fixed quantity. In the Bayesian approach, it is considered

More information

Joint, Conditional, & Marginal Probabilities

Joint, Conditional, & Marginal Probabilities Joint, Conditional, & Marginal Probabilities Statistics 110 Summer 2006 Copyright c 2006 by Mark E. Irwin Joint, Conditional, & Marginal Probabilities The three axioms for probability don t discuss how

More information

A CONDITION TO OBTAIN THE SAME DECISION IN THE HOMOGENEITY TEST- ING PROBLEM FROM THE FREQUENTIST AND BAYESIAN POINT OF VIEW

A CONDITION TO OBTAIN THE SAME DECISION IN THE HOMOGENEITY TEST- ING PROBLEM FROM THE FREQUENTIST AND BAYESIAN POINT OF VIEW A CONDITION TO OBTAIN THE SAME DECISION IN THE HOMOGENEITY TEST- ING PROBLEM FROM THE FREQUENTIST AND BAYESIAN POINT OF VIEW Miguel A Gómez-Villegas and Beatriz González-Pérez Departamento de Estadística

More information

The Bayesian Approach to Multi-equation Econometric Model Estimation

The Bayesian Approach to Multi-equation Econometric Model Estimation Journal of Statistical and Econometric Methods, vol.3, no.1, 2014, 85-96 ISSN: 2241-0384 (print), 2241-0376 (online) Scienpress Ltd, 2014 The Bayesian Approach to Multi-equation Econometric Model Estimation

More information

Pairwise rank based likelihood for estimating the relationship between two homogeneous populations and their mixture proportion

Pairwise rank based likelihood for estimating the relationship between two homogeneous populations and their mixture proportion Pairwise rank based likelihood for estimating the relationship between two homogeneous populations and their mixture proportion Glenn Heller and Jing Qin Department of Epidemiology and Biostatistics Memorial

More information

Dropping the Lowest Score: A Mathematical Analysis of a Common Grading Practice

Dropping the Lowest Score: A Mathematical Analysis of a Common Grading Practice Western Oregon University Digital Commons@WOU Honors Senior Theses/Projects Student Scholarship - 6-1-2013 Dropping the Lowest Score: A Mathematical Analysis of a Common Grading Practice Rosie Brown Western

More information

Discrete Binary Distributions

Discrete Binary Distributions Discrete Binary Distributions Carl Edward Rasmussen November th, 26 Carl Edward Rasmussen Discrete Binary Distributions November th, 26 / 5 Key concepts Bernoulli: probabilities over binary variables Binomial:

More information

Weakness of Beta priors (or conjugate priors in general) They can only represent a limited range of prior beliefs. For example... There are no bimodal beta distributions (except when the modes are at 0

More information

CS 340 Fall 2007: Homework 3

CS 340 Fall 2007: Homework 3 CS 34 Fall 27: Homework 3 1 Marginal likelihood for the Beta-Bernoulli model We showed that the marginal likelihood is the ratio of the normalizing constants: p(d) = B(α 1 + N 1, α + N ) B(α 1, α ) = Γ(α

More information

STAT:5100 (22S:193) Statistical Inference I

STAT:5100 (22S:193) Statistical Inference I STAT:5100 (22S:193) Statistical Inference I Week 10 Luke Tierney University of Iowa Fall 2015 Luke Tierney (U Iowa) STAT:5100 (22S:193) Statistical Inference I Fall 2015 1 Monday, October 26, 2015 Recap

More information