Empirical phi-divergence test-statistics for the equality of means of two populations

Size: px
Start display at page:

Download "Empirical phi-divergence test-statistics for the equality of means of two populations"

Transcription

1 Epirical phi-divergence test-statistics for the equality of eans of two populations N. Balakrishnan, N. Martín and L. Pardo 3 Departent of Matheatics and Statistics, McMaster University, Hailton, Canada arxiv: v [stat.me] 4 Jan 06 Departent of Statistics and O.R. II Decision Methods), Coplutense University of Madrid, 8003 Madrid, Spain 3 Departent of Statistics and O.R., Coplutense University of Madrid, 8040 Madrid, Spain June 3, 08 Abstract Epirical phi-divergence test-statistics have deostrated to be a useful technique for the siple null hypothesis to iprove the finite saple behaviour of the classical likelihood ratio test-statistic, as well as for odel isspecification probles, in both cases for the one population proble. This paper introduces this ethodology for two saple probles. A siulation study illustrates situations in which the new teststatistics becoe a copetitive tool with respect to the classical z-test and the likelihood ratio test-statistic. AMS 00 Subject Classification: 6F03, 6F5. Keywords and phrases: Epirical likelihood, Epirical phi-divergence test statistics, Phi-divergence easures, Power function. Introduction The ethod of likelihood introduced by Fisher is certainly one of the ost coonly used techniques for paraetric odels. The likelihood has been also shown to be very useful in non-paraetric context. More concretely Owen 988, 990, 99) introduced the epirical likelihood ratio statistics for non-paraetric probles. Two saple probles are frequently encountered in any areas of statistics, generally perfored under the assuption of norality. The ost coonly used test in this connection is the two saple t- test for the equality of eans, perfored under the assuption of equality of variances. If the variances are unknown, we have the so-called Behrens-Fisher proble. It is well-known that the two saple t-test has cone ajor drawback; it is highly sensitive to deviations fro the ideal conditions, and ay perfor iserably under odel isspecification and the presence of outliers. Recently Basu et al. 04) presented a new faily of test statistics to overcoe the proble of non-robustness of the t-statistic.

2 Epirical likelihood ethods for two-saple probles have been studied by different researchers since Owen 988) introduced the epirical likelihood as a non-paraetric likelihood-based alternative approach to inference on the ean of a single population. The onograph of Owen 00) is an excellent overview of developents on epirical likelihood and considers a ulti-saple epirical likelihood theore, which includes the two-saple proble as a special case. Soe iportant contributions for the two-saple proble are given in Owen 99), Adiiri 995), Jin 995), Qin 994, 998), Qin and Zhao 000), Zhang 000), Liu et al. 008), Baklizi and Kibria 009), Wu and Yan 0) and references therein. Consider two independent unidiensional rando variables X with unknown ean µ and variance σ and Y with unknown ean µ and variance σ. Let X,...,X be a rando saple of size fro the population denoted by X, with distribution function F, and Y,...,Y n be a rando saple of size n fro the population denoted by Y, with distribution function G. We shall assue that F and G are unknown, therefore we are interested in a non-paraetric approach, ore concretely we shall use epirical likelihood ethods. If we denote µ = µ and µ = µ+δ, our interest will be in testing H 0 : δ = δ 0 vs. H : δ δ 0, ) being δ 0 a known real nuber. Since δ = µ µ becoes the paraeter of interest, apart fro testing ), we ight also be interested in constructing the confidence interval for δ. In this paper we are going to introduce a new faily of epirical test statistics for the two-saple proble introduced in ): Epirical phi-divergence test statistics. This faily of test statistics is based on phidivergence easures and it contains the epirical log-likelihood ratio test statistic as a particular case. In this sense, we can think that the faily of epirical phi-divergence test statistics presented and studied in this paper is a generalization of the epirical log-likelihood ratio statistic. Let N = +n, assue that and x,...,x, y,...,y n a realization of X,...,X, Y,...,Y n. We denote and Lδ) = n p i q j s.t. Lp,q) = N ν 0,), ),n p i x i µ) = 0, p i =, n p i q j s.t. n q j y j µ δ) = 0, p i = n q j =, p i,q j 0, n q j =, with p i = p i µ) = F x i ) F x ) i, qj = q j µ,δ) = Gy j ) Gyj ) and p = p,...,p ) T, q = q,...,q n ) T. The epirical log-likelihood ratio statistic for testing ) is given by lδ 0 ) = log sup p,qlδ 0 ) sup p,q Lp,q). 3) Using the standard Lagrange ultiplier ethod we ight obtain sup p,q Lδ 0 ), as well as sup p,q Lp,q). For sup p,q Lδ 0 ), taking derivatives on ) ) L = logp i + n logq j +s p i +s n q j λ p i x i µ) λ n n q j y j µ δ 0 ),

3 we obtain and L = 0 p i =, i =,...,, p i +λ x i µ) 4) L = 0 q j =, j =,...,n, q j n+λ y j µ δ 0 ) 5) L µ = 0 λ +nλ = 0. Therefore, the epirical axiu likelihood estiates λ, λ and µ of λ, λ and µ, under H 0, are obtained as the solution of the equations and +λ = x i µ) n n +λ = y j µ δ 0) λ +nλ = 0, 6) log suplδ 0 ) = log log + λ x i µ) ) nlogn n ) log + λ y j µ δ 0 ). 7) p,q In relation sup p,q Lp,q), taking derivatives on we have and ) ) L = logp i + n logq j +s p i +s n q j, 8) L = 0 p i = p i, i =,..., and L = 0 q j =, j =,...,n, 9) q j n log suplp,q) = log nlogn. 0) p,q Therefore, the epirical log-likelihood ratio statistic 3), for testing ), can be written as lδ 0 ) = log ) log + λ X i µ) nlogn n { = ) log + λ Y j µ δ 0 ) Under soe regularity conditions, Jing 995) established that +log+nlogn log + λ X i µ) )+ n log + λ Y j µ δ 0 )) }. ) ) Pr lδ 0 ) > χ,α) = α+on ), where χ,α is the 00 α)-th percentile of the χ distribution. Our interest in this paper is to study the proble of testing given in ) and at the sae tie to construct confidence intervals for δ on the basis of the epirical phi-divergence test statistics. Epirical phi-divergence test statistics in the context of the epirical likelihood have studied by Baggerly 998), Broniatowski and Keizou 3

4 0), Balakhrishnan et al. 03), Felipe et al. 05) and references therein. The faily of epirical phidivergence test statistics, considered in this paper, contains the classical epirical log-likelihood ratio statistic as a particular case. In Section, the epirical phi-divergence test statistics are introduced and the corresponding asyptotic distributions are obtained. A siulation study is carried out in Section 3. Section 4 is devoted to develop a nuerical exaple. In Section 5 the previous results, devoted to univariate populations, are extended to k-diensional populations. Epirical phi-divergence test statistics For the hypothesis testing considered in ), in this section the faily of epirical phi-divergence test statistics are introduced as a natural extension of the epirical log-likelihood ratio statistic given in 3). We consider the N-diensional probability vectors U = N,..., N N )T ) and P = p ν,...,p ν,q ν),...,q n ν)) T 3) where p i, i =,...,, q j, j =,...,n were defined in 4) and 5), respectively, and ν in ). Let P be the N-diensional vector obtained fro P with p i, q j replaced by the corresponding epirical axiu likelihood estiators p i, q j and ν by N. The Kullback-Leibler divergence between the probability vectors U and P is given by D Kullback U, P) = N log N + n p i N N log N ) q j N { } = log p+ n logn q j N { = log + λ x i µ) )+ n log + λ y j µ δ 0 )) }, N where p i =, i =,...,, + λ x i µ) 4) q j =, j =,...,n. n+ λ y j µ δ 0 ) 5) Therefore, the relationship between lδ 0 ) and D Kullback U, P) is lδ 0 ) = ND Kullback U, P). 6) Based on 6), in this paper the epirical phi-divergence test statistics for ) are introduced for the first tie. This faily of epirical phi-divergence test statistics is obtained replacing the Kullback-Leibler divergence by a phi-divergence easure in 6), i.e., T φ δ 0 ) = N φ ) D φu, P), 7) 4

5 where D φ U, P) = ) ) p i N φ N + n ) q j φ N p i N N, ) qj with φ : R + R being anyconvexfunction such that at x =, φ) = 0, φ ) > 0and at x = 0, 0φ0/0) = 0 φu) and 0φp/0) = pli u u. For ore details see Cressie and Pardo 00) and Pardo 006). Therefore, 7) can be rewritten as { T φ δ 0 ) = ) φ p i φ + n ) p i { = φ ) + λ x i µ) φ n q j φ n q j + λ x i µ) )+ n N ) } 8) + λ y j µ δ 0 ) φ + λ y j µ+δ 0 ))) }. If φx) = xlogx x+ is chosen in D φ U, P), we get the Kullback-Leibler divergence and T φ δ 0 ) coincides with the epirical log-likelihood ratio statistic lδ 0 ) given in 6). Let µσ,σ ) be the optial estiator of µ under the assuption of having the known values of σ, σ, i.e. it is given by the shape πx + π)y and has iniu variance. It is well-known that ) X Y δ0 µσ,σ ) = σ +n σ. 9) σ + n σ Siilarly, an asyptotically optial estiator of µ having unknown values of σ, σ, is given by ) X Y δ0 µs,s ) = S +n S, S + n S where S = X i X), S = n n Y j Y) are consistent estiators of σ, σ respectively. In the following lea an iportant relationship is established, useful to get the asyptotic distribution of T φ δ 0 ). Lea Let µ the epirical likelihood estiator of µ. Then, we have µ = µσ,σ )+O p) = µs,s )+O p). Proof. See Appendix 5.3. Theore Suppose that σ <, σ < and ). Then, Proof. See Appendix 5.4. T φ δ 0 ) L n, χ. Reark 3 A α)-level confidence interval on δ can be constructed as CI α δ) = { δ : T φ δ) χ,α}. 5

6 The lower and upper bounds of the interval CI α δ) require a bisection search algorith. This is a coputationally challenging task, because for every selected grid point on δ, one needs to axiize the epirical phi-divergence T φ δ) over the nuisance paraeter, µ, and there is no closed-for solution to the axiu point µ for any given δ. The coputational difficulties under the standard two-saple epirical likelihood forulation are due to the fact that the involved Lagrange ultipliers, which are deterined through the set of equations 6), have to be coputed based on two separate saples with an added nuisance paraeter µ. Such difficulties can be avoided through an alternative forulation of the epirical likelihood function, for which coputation procedures are virtually identical to those for one-saple of size N = + n epirical likelihood probles. Through the transforations v i = w j = ω, x i ω, y j ω δ ω = ω =, 4) and 5) can be alternatively obtained as ) T δ, i =,...,, ω ) T, j =,...,n, p i = + λ T, i =,...,, v i 0) q j = n + λ T, j =,...,n, w j ) where the estiates of the Lagrange ultipliers λ = λ,, λ, ) T are the solution in λ of v i +λ T + n v i w j +λ T w j = 0. Reark 4 In the particular case that = n, the two saples ight be understood as a rando saple of size n fro a unique bidiensional population. In this setting the two saple proble can be considered to be a particular case of Balakrishnan et al. 05). Reark 5 Fu et al. 009), Yan 00) and Wu and Yan 0) pointed out that epirical log-likelihood ratio statistic, lδ 0 ), given in ) for testing ), does not perfor well when the distribution associated to the saples are quite skewed or saples sizes are not large or saple sizes fro each population are quite different. To overcoe this proble Fu et al. 009) considered the weighted epirical log-likelihood function defined by l w p,q) = ω logp i + ω n n logq j, ) with ω = ω =, and obtained the weighted epirical likelihood WEL) estiator as well as the weighted epirical log-likelihood ratio statistic. In order to get the WEL estiator, it is necessary to axiize ) subject to p i = n q j =, 3) p i x i n y j q j = δ 0. 4) 6

7 They obtained that the WEL estiates of p i and q j are given by w p i = + w λ T, i =,...,, v i w q j = n + w λ T, j =,...,n, w j where v i and w j are the sae transforations given in Reark 3 with δ = δ 0 and the estiates of the Lagrange ultipliers w λ = w λ,, w λ, ) T are the solution in w λ of ω v i + w λ T v i + ω n n w j + w λ T w j = 0. Now, if we define the probability vectors w U = ω,..., ),ω n,, n...,, n ) ) T, w P = ω p T,ω q T) T = ω p,...,p ),ω q,...,q n )) T, the weighted epirical log-likelihood ratio test l w δ 0 ) presented in Wu and Yan 0) can be written as l w δ 0 ) = D Kullback w U, w P). 5) The weighted epirical log-likelihood ratio test can be extended by defining the faily of weighted epirical phi-divergence test statistics as S φ δ 0 ) = D φ w U, w P) φ, ) where D φ w U, w P) is the phi-divergence easure between the probability vectors w U and w P, i.e., ) ) n D φ w U, w P) = ω p i φ +ω q j φ p i n q j = ω φ + w λ T ) v i + w λ T + ω n φ + w λ T ) w j n v i + w λ T. w j Taking into account D φ w U, w P) = w λt D w λ +o p N), where D = ω v i v T i + ω n w j w T j n, and based on Theore.. in Wu and Yan 0), we have that S φ δ 0 ) c where c is the second diagonal eleent of the atrix D. L n, χ, 7

8 3 Siulation Study The square of the classical z-test statistic for two saple probles, ) X Y +δ0 tδ 0 ) =, S + n S hasasyptoticallyχ distribution,thesaeastheepiricalphi-divergenceteststatistics,accordingtotheore. In order to copare the finite saple perforance of the confidence interval CI) of δ based on T φ δ) with respect to the ones based on tδ) as well as the epirical log-likelihood ratio test-statistic lδ) given in 3), we count on a subfaily of phi-divergence easures, the so-called power divergence easures φ γ x) = x +γ x γx ) γ+γ), dependent of tuning paraeter γ R, i.e. n p i ) γ + n q j ) γ N γγ +) T γ δ) = log+nlogn+ log p i +n, γ R {0, }, n log q j, γ = 0, n log+nlogn+ p i log p i +n q j log q j, γ =, where p i and q j can be obtained fro 0)-). We analyzed five new test-statistics, the epirical powerdivergence test statistics taking γ {, 0.5, 3,,}. The case of γ = 0 is not new, since the epirical log-likelihood ratio test-statistic lδ) is a eber of the epirical power-divergence test statistics, i.e. lδ) = T γ=0 δ). The CI of δ based on tδ 0 ) with 00 α)% confidence level is essentially the CI of z-test statistic, δ L, δ U ) = x y z α s + n s,x y+z α s + n s ). For T {T γ δ)} γ Λ, Λ = {, 0.5,0, 3,,}, as entioned in Reark 3, since there is no explicit expression for δ L, δ U ) the bisection ethod should be followed. The siulated coverage probabilities of the CI of δ based on T {tδ)} {T γ δ)} γ Λ were obtained with R = 5,000 replications by 00 R R IT r) χ,α), r= with I ) being the indicator function. The siulated expected width of the CI of δ based on T {tδ)} {T γ δ)} γ Λ were obtained with R = 3,000 replications by 00 R R δ r) U r= δ r) L ). The reason why two different values of R were followed is twofold. On one hand calculating δ r) U δ r) L is uch ore tie consuing than IT r) χ,α) and on the other hand for the designed siulation experient the replications needed to obtain a good precision is less for the expected width than for the coverage probability. The siulation experient is designed in a siilar anner as in Wu and Yan 0). The true distributions, unknown in practice, are generated fro: i) X Nµ,σ ), Y Nµ+δ 0,σ ), with µ =, σ = σ =.5, δ 0 = 0; 8

9 ii) X lognoralϑ,θ ), Y lognoralϑ,θ ), with ϑ =., θ = 0.4, ϑ =., θ = 0.. Notice that in case ii) δ 0 = 0 since E[X] = E[Y]. Depending on the saple sizes, six scenarios were considered,,n) {5,30),30,5),30,30),30,60),60,30),60,60)}. Table suarizes the results of the described siulation experient with α = In all the cases and scenarios the narrower width is obtained with T γ= δ), but the coverage probabilities closest to 95% depends on the case or scenario. For the case of the lognoral distribution the CI based on tδ) test-statistic has the closest coverage probability to 95%, but for the case of the noral distribution T γ=/3 δ) and T γ= δ) power divergence based tend to have the closest coverage probability to 95%. In order to copleent this study, the power functions have been drawn through R = 5, 000 replications and taking δ as abscissa. For case i) the power functions exhibit a syetric shape with respect to the center and also a parallel shape, in such a way that the test statistics with better approxiation of the size have worse power. For case ii), fixing the values of the two paraeters of X and changing the two paraeter of Y as ϑ = kϑ, θ = kθ, k = logδ +exp{ϑ + θ }) ϑ + θ, δ is displaced fro δ 0 = 0 to the right when k > and fro δ 0 = 0 to the left when 0 < k < δ > exp{ϑ + θ }). Unlike case i), the power function of case ii) exhibits a different shape on both sides fro the center of abscissa, and the ost proinent differences are on the left hand size. Clearly in case ii), even though the approxiated size for tδ 0 ) is the best one, it has the worst approxiated power function, in particular there is an area of the approxiated power function on the left hand side of δ 0 = 0 with saller value than the approxiated size. Hence, in case ii) the power functions of T {T γ δ)} γ Λ are ore acceptable than the power function of tδ 0 ). Taking into account the strong and weak point of tδ 0 ) in case ii), T γ=/3 δ 0 ) could be a good choice for oderate saple sizes and lδ 0 ) = T γ=0 δ 0 ) for sall saple sizes. 9

10 case i): noral populations n CI coverage width 5 30 T γ= δ) T γ= 0.5 δ) T γ=0 δ) T γ=/3 δ) T γ= δ) T γ= δ) tδ) T γ= δ) T γ= 0.5 δ) T γ=0 δ) T γ=/3 δ) T γ= δ) T γ= δ) tδ) T γ= δ) T γ= 0.5 δ) T γ=0 δ) T γ=/3 δ) T γ= δ) T γ= δ) tδ) T γ= δ) T γ= 0.5 δ) T γ=0 δ) T γ=/3 δ) T γ= δ) T γ= δ) tδ) T γ= δ) T γ= 0.5 δ) T γ=0 δ) T γ=/3 δ) T γ= δ) T γ= δ) tδ) T γ= δ) T γ= 0.5 δ) T γ=0 δ) T γ=/3 δ) T γ= δ) T γ= δ) tδ) case ii): lognoral populations n CI coverage width 5 30 T γ= δ) T γ= 0.5 δ) T γ=0 δ) T γ=/3 δ) T γ= δ) T γ= δ) tδ) T γ= δ) T γ= 0.5 δ) T γ=0 δ) T γ=/3 δ) T γ= δ) T γ= δ) tδ) T γ= δ) T γ= 0.5 δ) T γ=0 δ) T γ=/3 δ) T γ= δ) T γ= δ) tδ) T γ= δ) T γ= 0.5 δ) T γ=0 δ) T γ=/3 δ) T γ= δ) T γ= δ) tδ) T γ= δ) T γ= 0.5 δ) T γ=0 δ) T γ=/3 δ) T γ= δ) T γ= δ) tδ) T γ= δ) T γ= 0.5 δ) T γ=0 δ) T γ=/3 δ) T γ= δ) T γ= δ) tδ) Table : Siulated coverage probability and expected width of 0.95 level CIs of δ for two pupulations. 0

11 0.4 Power function: case noral, =5, n= β d λ=-0.5 λ=0 λ=/3 λ= z-test s.l. 0.4 Power function: case noral, =60, n= β d λ=-0.5 λ=0 λ=/3 λ= z-test s.l. Figure : Power functions for two noral pupulations.

12 0.4 Power function: case log-noral, =5, n= β d λ=-0.5 λ=0 λ=/3 λ= z-test s.l. 0.4 Power function: case log-noral, =60, n= β d λ=-0.5 λ=0 λ=/3 λ= z-test s.l. Figure : Power functions for two log-noral pupulations.

13 4 Nuerical Exaple Yu et al. 00) presented a data set on evaluating gasoline quality based on what is known as Reid vapor pressure, collected by the Environental Protection Agency of the United States. Two types of Reid vapor pressure easureents X and Y are included in the data set. Values of X are obtained by an Agency inspector who visits gas pups in a city, takes saples of gasoline of a particular brand, and easures the Reid vapor pressure right on the spot; values of Y, on the other hand, are produced by shipping gasoline saples to the laboratory for easureents of presuably higher precision at a high cost. The original data set has a double sapling structure, with a subset of the saple units having easureents on both X and Y. Table contains two independent saples of a new reforulated gasoline, one related to X with saple size 30 and the other, to Y with saple size 5. X Field) Y Lab) Table : Field and lab data on Reid vapor pressure for newly reforulated gasoline. One of the assuptions of Yu et al. 00) is that the field easureent X and the lab easureent Y have coon ean µ. The two types of easureents differ, however, in ters of precision. Yu et al. 00) also assued that X, Y) was bivariate noral, which would not be required under our proposed epirical likelihood approach. In Tsao and Wu 006) this exaple was studied on the basis of the epirical log-likelihood ratio test. The 95% CIs of δ based on T {tδ)} {T γ δ)} γ Λ are suarized in Table 3. As in the siulation study, the narrowest CI width is obtained with T γ= δ). In all the test-statistics used to construct the CIs δ 0 = 0 is not contained, so the null hypothesis of equal eans is rejected with 0.05 significance level. CI lower bound upper bound width T γ= δ) T γ= 0.5 δ) T γ=0 δ) T γ=/3 δ) T γ= δ) T γ= δ) tδ) Table 3: Power divergence and z-test based 0.95 level CIs for field and lab data on Reid vapor pressure for newly reforulated gasoline. 3

14 5 Further extensions 5. Extension of the diension for the rando variable Let X,...,X and Y,...,Y n be two utually independent rando saples with coon distribution function F and G respectively. Assuing that X i and Y j take values in R k and E[X i ] = µ, Cov[X i ] = Σ, i =,...,, E[Y j ] = µ, Cov[Y j ] = Σ, j =,...,n, with µ = µ and µ = µ+δ, our interest is in testing H 0 : δ = δ 0 vs. H : δ δ 0, 6) where δ 0 R k and known. The epirical likelihood under H 0 is Lδ) = n p i q j s.t. p i x i µ) = 0 k, p i =, n q j y j µ δ) = 0 k, n q j =, and in the whole paraeter space, Lp,q) = n p i q j : p i = n q j =, p i,q j 0, with p i = F x i ) F x ) i, qj = Gy j ) Gy j ) and p = p,...,p ) T, q = q,...,q n ) T. The epirical log-likelihood ratio statistic for testing 6) is given by lδ 0 ) = log sup p,qlδ 0 ) sup p,q Lp,q). Based on Lagrange ultiplier ethods, sup p,q Lδ 0 ) is obtained for p i = +λ T x, i =,...,, 7) i µ) q j = n+λ T, j =,...,n, 8) y j µ δ 0 ) where λ T +nλt = 0. The epirical axiu likelihood estiates λ, λ and µ of λ, λ and µ, under H 0, can be obtained as the solution of x i µ) +λ T xi µ) = 0 k n y j µ δ) +λ T y j µ δ) = 0 k n On the other hand sup p,q Lp,q) is obtained for λ T +nλ T = 0 k. p i =, i =,..., and q j =, j =,...,n. 9) n 4

15 After soe algebra, we obtain lδ 0 ) = log suplδ 0) suplp,q) = n log + λ T ) Y j µ δ 0 ) = Under soe regularity conditions, it follows that log +log+nlogn log + λ T ) X i µ) nlogn log + λ T X i µ) )+ n log + λ T Y j µ δ)) ). 30) Pr lδ 0 ) < χ k,α) = α+on ), where χ k,α is the α-th order quantile of the χ k distribution. Let P = be the estiate the probability vector p N,..., p ν, q N ),..., q n N ) ) T P = p ν,...,p ν,q ν),...,q n ν)) T, where p i and q j are obtained fro 7) and 8) replacing λ, λ and µ by λ, λ and µ, respectively. In this k-diensional case, the Kullback-Leibler divergence between the probability vectors U and P is given by D Kullback U, P) = N log N p i N { = log N + n N log N n q j N + λ T X i µ) )+ n Therefore, the relationship between lδ 0 ) and the Kullback-Leibler divergence is ) log + λ T Y j µ δ)) }. lδ 0 ) = ND Kullback U, P). 3) Based on 3) the faily of epirical phi-divergence test statistics are defined as with Therefore the expression of T φ δ 0 ) is T φ δ 0 ) = N φ ) D φu, P), ) D φ U, P) = p i +n φ N + n p +n q n jφ i +n T φ δ 0 ) = φ ) = φ ) { { p i φ p i ) + n + λ T X i µ) n q j φ n q j N n +n q j ) } φ + λ T ) X i µ) + n + λ T φ + λ T Y j µ δ 0 )) }. 3) Y j µ δ) ). 5

16 A result siilar to the one given in Lea for the k-diensional case is µ = Σ +nσ ) Σ X +nσ Y δ )) +Op k ), where X = X i and Y = n n Y i. Finally, based in this result it is possible to establish T φ δ 0 ) L n, χ k. 5. Extension of the test-statistic using the Rényi s divergence Rényi 96) introduced the Rényi s divergence easure as an extension of the Kullback-Leibler divergence. Unfortunately this divergence easure is not a eber of the faily of phi-divergence easures considered in this paper. Menéndez et al. 995, 997) introduced and studied theh,phi)-divergence easures in order to have a faily of divergence easures in which the phi-divergence easures as well as the Rényi divergence easure are included. But not only the Rényi divergence easure is included in this new faily but another iportant divergence easures not include in the faily of phi-divergence easures are included. For ore details about the different divergence easures included in the h,phi)-divergence see for instance, Pardo 006). Based on the h,phi)-divergence easures between the probability vectors U and P, defined in ) and 3) respectively, we can consider the following faily of epirical h,phi)-divergence test statistics for the two-saple proble considered in ) T h φ δ 0 ) = ht φ δ 0 )), 33) where h is a differentiable increasing function fro [0, ) onto [0, ) with h0) = 0 and h 0) > 0. If we consider in 33), and we get hx) = logaa )x+), a 0, aa ) φx) = xa ax ), a 0,, aa ) T Rényi δ 0 ) = Tφ h δ 0) = aa ) logaa )T φδ 0 )+) { = aa ) log ) a n + λ x i µ) + } ) a + λ y j µ+δ 0 )), i.e., the epirical Rényi s divergence test statistics for testing ). For a = 0 and a =, we get li T Rényiδ 0 ) = ND Kullback U,P) a 0 and It is clear that li T Rényiδ 0 ) = ND Kullback P,U). a ht φ δ 0 )) = h0)+h 0)T φ δ 0 )+ot φ δ 0 )). 6

17 Therefore T h φ δ 0) h 0) L n, χ. In the sae way can be established for the proble considered in 6) that where with T φ δ 0 ) defined in 3). T h φ δ 0) h 0) L n, χ k, T h φ δ 0) = ht φ δ 0 )) Acknowledgeent. This research is partially supported by Grants MTM fro Ministerio de Econoia y Copetitividad Spain). References [] Adiari, G. 995). Epirical likelihood confidence intervals for the difference between eans Statistica, 55, [] Baggerly, K. A. 998). Epirical likelihood as a goodness-of-fit easure. Bioetrika, 85, [3] Baklizi, A. and Kibria, B.M. G. 009). One and two saple confidence intervals for estiating the ean of skewed populations: an epirical coparative study. Journal of Applied Statistics, 36, 6, [4] Balakrishnan, N, Martin, N. and Pardo, L. 05). Epirical phi-divergence test statistics for testing siple and coposite null hypotheses. Statistics, 49, [5] Basu, A., Mandal, A. Martin, N. and Pardo, L. 05). Robust tests for the equality of two noral eans based on the density power divergence. Metrika, 78, [6] Bhattacharyya, A. 943). On a easure of divergence between two statistical populations defined by their probability distributions. Bulletin of the Calcutta Matheatical Society, 35, [7] Broniatowski, M. and Keziou, A. 0). Divergences and duality for estiating and test under oent condition odels. Journal of Statistical Planning and Inference, 4, [8] Cressie, N. and Pardo, L. 00). Phi-divergence statisitcs. In: Elshaarawi, A.H., Plegorich, W.W. editors. Encyclopedia of environetrics, vol. 3. pp: , John Wiley and sons, New York. [9] Cressie, N. and Read, T. R. C. 984). Multinoial goodness-of-fit tests. Journal of the Royal Statistical Society, Series B, 46, [0] Felipe, A. Martín, N., Miranda, P. and Pardo, L. 05). Epirical phi-divergence test statistics for testing siple null hypotheses based on exponentially tilted epirical likelihood estiators. arxiv preprint arxiv:

18 [] Fu, Y., Wang, X. and Wu, C. 008). Weighted epirical likelihood inference for ultiple saples..journal of Statistical Planning and Inference, 39, [] Jing, B. Y. 995). Two-saple epirical likelihood ethod. Statistics and Probability Letters, 4, [3] Liu, Y., C. Zou, and Zhang, R. 008). Statistics and Probability Letters, 78, [4] Menéndez, M. L., Morales, D., Pardo, L. and Salicrú, M. 995). Asyptotic behavior and statistical applications of divergence easures in ultinoial populations: A unified study. Statistical Papers, 36, 9. [5] Menéndez, M. L., Pardo, J. A., Pardo, L. and Pardo, M. C. 997). Asyptotic approxiations for the distributions of theh, φ)-divergence goodness-of-fit statistics: Applications to Rényi s statistic. Kybernetes, 6, [6] Owen, A. B. 988). Epirical likelihood ratio confidence interval for a single functional. Bioetrika, 75, [7] Owen, A. B. 990). Epirical likelihood confidence regions. The Annals of Statistics, 8, [8] Owen, A. B. 99). Epirical Likelihood for linear odels. The Annals of Statistics, 9, [9] Owen, A. B. 00). Epirical Likelihood, Chapan and Hall/CRC. [0] Pardo, L. 006). Statistical Inference Based on Divergence Measures. Chapan & Hall/ CRC Press, Boca Raton, Florida. [] Qin, J. 994). Sei-paraetric likelihood ratio confidence intervals for the difference of two saple eans. Annals of the Institute of Statitical Matheatics, 46, 7 6. [] Qin, J.998). Inferences for case-control and seiparaetric two-saple densisty ratio odels. Bioetrika, 85, [3] Qin, J. and Lawless, J. 994). Epirical likelihood and general estiating equations. The Annals of Statistics,, [4] Qin, Y. and Zhao, L. 000). Epirical likelihood ratio intervals for various differences of two populations. Chinese Systes Sci. Math. Sci., 3, [5] Rényi, A. 96). On easures of entropy and inforation. Proceedings of the Fourth Berkeley Syposiu on Matheatical Statistics and Probability,, [6] Shara, B. D. and Mittal, D. P. 997). New non-additive easures of relative inforation. Journal of Cobinatorics, Inforation & Systes Science,, 33. [7] Tsao, M. and Wu, C. 006). Epirical likelihood inference for a coon ean in the presence of heteroscedasticity. The Canadian Journal of Statistics, 34,,

19 [8] Yan, Y. 00). Epirical likelihood inference for two-saple probles. Unpublished aster s thesis, Departent of Statisticsd and Actuarial Science, University of Waterloo, Canada. [9] Yu, P.L.H., Su, Y. and Sinha, B. 00). Estiation of the coon ean of a bivariate noral population. Annals of the Institute of Statistical Matheatics, 54, [30] Wu, C. and Yan, Y. 0) Epirical likelihood inference for two-saple problles. Statistics and Its Inference, 5, [3] Zhang, B. 000). Estiating the treatent effect in the two-saple proble with auxiliary inforation. Nonparaetric Statistics,, Appendix 5.3 Proof of Lea In a siilar way as in Hall and Scala 990), we can establish λ = λ µ) = σ ) X µ +Op ) and λ = λ µ) = σ ) Y µ δ0 +Op n ). Now applying that we have σ λ µ)+nλ µ) = 0, ) X µ +Op )+nσ ) Y µ δ0 +Op ) = 0. 34) Solving the equation for µ we have the enunciated result. 5.4 Proof of Theore First we are going to establish ) ) X µ φ p i φ = +o p ) 35) ) p i σ ) ) n Y µ δ0 φ nq j φ = n +o p ). 36) ) nq j σ ) If we denote W i = λ µ)x i µ) we have φ p i = φ+w i ). A Taylor expansion gives φ+w i ) = φ)+φ )W i + φ )Wi +o Wi ). On the other hand p i = +λ µ)x i µ) = = W i +Wi +W +o W ) i. i 9

20 Then But φ ) ) φ p i φ = Wi ) p i φ +Wi ) +o W )) i φ )Wi +o Wi ) ) = φ ) φ ) Wi + o W ) i φ ) Wi 3 o W ) i Wi + φ ) Wi 4 + o W ) i W i + φ ) o Wi ) W i + o Wi ) ) ) o W i o ) W = i φ ) o see page 0 in Owen 00)), and = Wi + φ o W ) i W 3 i o W ) i Wi + Wi 4 ) + φ o W ) i + o W ) i W ) i + φ o W ) ) i o W ) i. λ µ)x i µ) ) = φ ) = o p )o O p )) = o p ), because λ µ) = O p / ) X i µ) = o p ) X i µ) o λ µ) ) applying the strong law of large nubers. Wi 3 λ µ) 3 X i µ 3 = O p 3/ )o /) = o p ), because X i µ 3 = o /), by Lea.3 in page 8 in Owen 00). o ) Wi Wi o λ µ)) λ µ) X i µ 3 = o O p ) O p / )o 3/ ) = o p ). Wi 4 λ µ) 4 X i µ 4 O ) p Z X i µ 3 = O ) p o / )O / ) = o p ), because Z = ax i X i µ = O / ) applying Lea. in page 8 in Owen 00). φ ) o ) ) Wi o W i φ ) λ µ) 4 X i µ 4 = o p ). Therefore ) φ p i φ = Wi ) p +o p) i = λ µ)x i µ) +o p ) = σ 4 ) X µ X i µ) +o p ) ) X µ = +o p ). σ 0

21 In a siilar way we can get Therefore, n φ nq j φ ) nq j ) ) Y µ δ0 = n +o p ). { T φ δ 0 ) = ) φ p i φ + n n q j φ ) p i n q j σ ) } Applying 34), and nσ ) ) X µ Y µ δ0 = +n +o p ). σ σ ) ) Y µ δ0 = σ X µ +Op ) ) X µ T φ δ 0 ) = σ ) ) X µ Y µ δ0 +op ) σ ) X µ X ) = µ Y + µ+δ0 +op ) σ ) X µ X ) = Y +δ0 +op ) = nσ σ Y µ δ0 ) X Y +δ0 ) +op ). Fro 9) we have Y µ δ 0 = Y X σ +n Y δ0 ) σ σ + n σ δ 0 Therefore, Now we have, = σ ) T φ δ 0 ) = nσ Y X δ0 σ + σ n X +Y δ0 ) σ + σ n. = n )) σ X Y δ0 +n σ = +n σ + n +n σ n +n ) ) X Y +δ0 σ σ +nσ +n X Y δ0 )) +op ). ) +o p ) X µ ) L N0,σ ), n Y µ δ0 ) ) L n N0,σ ). and n +n n +n X µ ) L n, N0, ν)σ ). Y µ+δ0 ) ) L n, N0,νσ ),

22 where is such that ). Hence n ) L X Y +δ0 +n n, N0, ν)σ +νσ), fro which is obtained +n σ + n +n σ n ) L X Y +δ0 +n N0,) n, and now the result follows.

Keywords: Estimator, Bias, Mean-squared error, normality, generalized Pareto distribution

Keywords: Estimator, Bias, Mean-squared error, normality, generalized Pareto distribution Testing approxiate norality of an estiator using the estiated MSE and bias with an application to the shape paraeter of the generalized Pareto distribution J. Martin van Zyl Abstract In this work the norality

More information

TEST OF HOMOGENEITY OF PARALLEL SAMPLES FROM LOGNORMAL POPULATIONS WITH UNEQUAL VARIANCES

TEST OF HOMOGENEITY OF PARALLEL SAMPLES FROM LOGNORMAL POPULATIONS WITH UNEQUAL VARIANCES TEST OF HOMOGENEITY OF PARALLEL SAMPLES FROM LOGNORMAL POPULATIONS WITH UNEQUAL VARIANCES S. E. Ahed, R. J. Tokins and A. I. Volodin Departent of Matheatics and Statistics University of Regina Regina,

More information

Non-Parametric Non-Line-of-Sight Identification 1

Non-Parametric Non-Line-of-Sight Identification 1 Non-Paraetric Non-Line-of-Sight Identification Sinan Gezici, Hisashi Kobayashi and H. Vincent Poor Departent of Electrical Engineering School of Engineering and Applied Science Princeton University, Princeton,

More information

Probability Distributions

Probability Distributions Probability Distributions In Chapter, we ephasized the central role played by probability theory in the solution of pattern recognition probles. We turn now to an exploration of soe particular exaples

More information

are equal to zero, where, q = p 1. For each gene j, the pairwise null and alternative hypotheses are,

are equal to zero, where, q = p 1. For each gene j, the pairwise null and alternative hypotheses are, Page of 8 Suppleentary Materials: A ultiple testing procedure for ulti-diensional pairwise coparisons with application to gene expression studies Anjana Grandhi, Wenge Guo, Shyaal D. Peddada S Notations

More information

Testing equality of variances for multiple univariate normal populations

Testing equality of variances for multiple univariate normal populations University of Wollongong Research Online Centre for Statistical & Survey Methodology Working Paper Series Faculty of Engineering and Inforation Sciences 0 esting equality of variances for ultiple univariate

More information

Block designs and statistics

Block designs and statistics Bloc designs and statistics Notes for Math 447 May 3, 2011 The ain paraeters of a bloc design are nuber of varieties v, bloc size, nuber of blocs b. A design is built on a set of v eleents. Each eleent

More information

ESTIMATING AND FORMING CONFIDENCE INTERVALS FOR EXTREMA OF RANDOM POLYNOMIALS. A Thesis. Presented to. The Faculty of the Department of Mathematics

ESTIMATING AND FORMING CONFIDENCE INTERVALS FOR EXTREMA OF RANDOM POLYNOMIALS. A Thesis. Presented to. The Faculty of the Department of Mathematics ESTIMATING AND FORMING CONFIDENCE INTERVALS FOR EXTREMA OF RANDOM POLYNOMIALS A Thesis Presented to The Faculty of the Departent of Matheatics San Jose State University In Partial Fulfillent of the Requireents

More information

DEPARTMENT OF ECONOMETRICS AND BUSINESS STATISTICS

DEPARTMENT OF ECONOMETRICS AND BUSINESS STATISTICS ISSN 1440-771X AUSTRALIA DEPARTMENT OF ECONOMETRICS AND BUSINESS STATISTICS An Iproved Method for Bandwidth Selection When Estiating ROC Curves Peter G Hall and Rob J Hyndan Working Paper 11/00 An iproved

More information

Model Fitting. CURM Background Material, Fall 2014 Dr. Doreen De Leon

Model Fitting. CURM Background Material, Fall 2014 Dr. Doreen De Leon Model Fitting CURM Background Material, Fall 014 Dr. Doreen De Leon 1 Introduction Given a set of data points, we often want to fit a selected odel or type to the data (e.g., we suspect an exponential

More information

Estimation of the Mean of the Exponential Distribution Using Maximum Ranked Set Sampling with Unequal Samples

Estimation of the Mean of the Exponential Distribution Using Maximum Ranked Set Sampling with Unequal Samples Open Journal of Statistics, 4, 4, 64-649 Published Online Septeber 4 in SciRes http//wwwscirporg/ournal/os http//ddoiorg/436/os4486 Estiation of the Mean of the Eponential Distribution Using Maiu Ranked

More information

Compression and Predictive Distributions for Large Alphabet i.i.d and Markov models

Compression and Predictive Distributions for Large Alphabet i.i.d and Markov models 2014 IEEE International Syposiu on Inforation Theory Copression and Predictive Distributions for Large Alphabet i.i.d and Markov odels Xiao Yang Departent of Statistics Yale University New Haven, CT, 06511

More information

Extension of CSRSM for the Parametric Study of the Face Stability of Pressurized Tunnels

Extension of CSRSM for the Parametric Study of the Face Stability of Pressurized Tunnels Extension of CSRSM for the Paraetric Study of the Face Stability of Pressurized Tunnels Guilhe Mollon 1, Daniel Dias 2, and Abdul-Haid Soubra 3, M.ASCE 1 LGCIE, INSA Lyon, Université de Lyon, Doaine scientifique

More information

The proofs of Theorem 1-3 are along the lines of Wied and Galeano (2013).

The proofs of Theorem 1-3 are along the lines of Wied and Galeano (2013). A Appendix: Proofs The proofs of Theore 1-3 are along the lines of Wied and Galeano (2013) Proof of Theore 1 Let D[d 1, d 2 ] be the space of càdlàg functions on the interval [d 1, d 2 ] equipped with

More information

1 Bounding the Margin

1 Bounding the Margin COS 511: Theoretical Machine Learning Lecturer: Rob Schapire Lecture #12 Scribe: Jian Min Si March 14, 2013 1 Bounding the Margin We are continuing the proof of a bound on the generalization error of AdaBoost

More information

CS Lecture 13. More Maximum Likelihood

CS Lecture 13. More Maximum Likelihood CS 6347 Lecture 13 More Maxiu Likelihood Recap Last tie: Introduction to axiu likelihood estiation MLE for Bayesian networks Optial CPTs correspond to epirical counts Today: MLE for CRFs 2 Maxiu Likelihood

More information

Biostatistics Department Technical Report

Biostatistics Department Technical Report Biostatistics Departent Technical Report BST006-00 Estiation of Prevalence by Pool Screening With Equal Sized Pools and a egative Binoial Sapling Model Charles R. Katholi, Ph.D. Eeritus Professor Departent

More information

AN EFFICIENT CLASS OF CHAIN ESTIMATORS OF POPULATION VARIANCE UNDER SUB-SAMPLING SCHEME

AN EFFICIENT CLASS OF CHAIN ESTIMATORS OF POPULATION VARIANCE UNDER SUB-SAMPLING SCHEME J. Japan Statist. Soc. Vol. 35 No. 005 73 86 AN EFFICIENT CLASS OF CHAIN ESTIMATORS OF POPULATION VARIANCE UNDER SUB-SAMPLING SCHEME H. S. Jhajj*, M. K. Shara* and Lovleen Kuar Grover** For estiating the

More information

Ensemble Based on Data Envelopment Analysis

Ensemble Based on Data Envelopment Analysis Enseble Based on Data Envelopent Analysis So Young Sohn & Hong Choi Departent of Coputer Science & Industrial Systes Engineering, Yonsei University, Seoul, Korea Tel) 82-2-223-404, Fax) 82-2- 364-7807

More information

Kernel-Based Nonparametric Anomaly Detection

Kernel-Based Nonparametric Anomaly Detection Kernel-Based Nonparaetric Anoaly Detection Shaofeng Zou Dept of EECS Syracuse University Eail: szou@syr.edu Yingbin Liang Dept of EECS Syracuse University Eail: yliang6@syr.edu H. Vincent Poor Dept of

More information

Feature Extraction Techniques

Feature Extraction Techniques Feature Extraction Techniques Unsupervised Learning II Feature Extraction Unsupervised ethods can also be used to find features which can be useful for categorization. There are unsupervised ethods that

More information

E0 370 Statistical Learning Theory Lecture 6 (Aug 30, 2011) Margin Analysis

E0 370 Statistical Learning Theory Lecture 6 (Aug 30, 2011) Margin Analysis E0 370 tatistical Learning Theory Lecture 6 (Aug 30, 20) Margin Analysis Lecturer: hivani Agarwal cribe: Narasihan R Introduction In the last few lectures we have seen how to obtain high confidence bounds

More information

AN OPTIMAL SHRINKAGE FACTOR IN PREDICTION OF ORDERED RANDOM EFFECTS

AN OPTIMAL SHRINKAGE FACTOR IN PREDICTION OF ORDERED RANDOM EFFECTS Statistica Sinica 6 016, 1709-178 doi:http://dx.doi.org/10.5705/ss.0014.0034 AN OPTIMAL SHRINKAGE FACTOR IN PREDICTION OF ORDERED RANDOM EFFECTS Nilabja Guha 1, Anindya Roy, Yaakov Malinovsky and Gauri

More information

Polygonal Designs: Existence and Construction

Polygonal Designs: Existence and Construction Polygonal Designs: Existence and Construction John Hegean Departent of Matheatics, Stanford University, Stanford, CA 9405 Jeff Langford Departent of Matheatics, Drake University, Des Moines, IA 5011 G

More information

Machine Learning Basics: Estimators, Bias and Variance

Machine Learning Basics: Estimators, Bias and Variance Machine Learning Basics: Estiators, Bias and Variance Sargur N. srihari@cedar.buffalo.edu This is part of lecture slides on Deep Learning: http://www.cedar.buffalo.edu/~srihari/cse676 1 Topics in Basics

More information

Fairness via priority scheduling

Fairness via priority scheduling Fairness via priority scheduling Veeraruna Kavitha, N Heachandra and Debayan Das IEOR, IIT Bobay, Mubai, 400076, India vavitha,nh,debayan}@iitbacin Abstract In the context of ulti-agent resource allocation

More information

The Distribution of the Covariance Matrix for a Subset of Elliptical Distributions with Extension to Two Kurtosis Parameters

The Distribution of the Covariance Matrix for a Subset of Elliptical Distributions with Extension to Two Kurtosis Parameters journal of ultivariate analysis 58, 96106 (1996) article no. 0041 The Distribution of the Covariance Matrix for a Subset of Elliptical Distributions with Extension to Two Kurtosis Paraeters H. S. Steyn

More information

Soft Computing Techniques Help Assign Weights to Different Factors in Vulnerability Analysis

Soft Computing Techniques Help Assign Weights to Different Factors in Vulnerability Analysis Soft Coputing Techniques Help Assign Weights to Different Factors in Vulnerability Analysis Beverly Rivera 1,2, Irbis Gallegos 1, and Vladik Kreinovich 2 1 Regional Cyber and Energy Security Center RCES

More information

arxiv: v1 [cs.ds] 3 Feb 2014

arxiv: v1 [cs.ds] 3 Feb 2014 arxiv:40.043v [cs.ds] 3 Feb 04 A Bound on the Expected Optiality of Rando Feasible Solutions to Cobinatorial Optiization Probles Evan A. Sultani The Johns Hopins University APL evan@sultani.co http://www.sultani.co/

More information

A Simple Regression Problem

A Simple Regression Problem A Siple Regression Proble R. M. Castro March 23, 2 In this brief note a siple regression proble will be introduced, illustrating clearly the bias-variance tradeoff. Let Y i f(x i ) + W i, i,..., n, where

More information

OBJECTIVES INTRODUCTION

OBJECTIVES INTRODUCTION M7 Chapter 3 Section 1 OBJECTIVES Suarize data using easures of central tendency, such as the ean, edian, ode, and idrange. Describe data using the easures of variation, such as the range, variance, and

More information

Proceedings of the 2016 Winter Simulation Conference T. M. K. Roeder, P. I. Frazier, R. Szechtman, E. Zhou, T. Huschka, and S. E. Chick, eds.

Proceedings of the 2016 Winter Simulation Conference T. M. K. Roeder, P. I. Frazier, R. Szechtman, E. Zhou, T. Huschka, and S. E. Chick, eds. Proceedings of the 2016 Winter Siulation Conference T. M. K. Roeder, P. I. Frazier, R. Szechtan, E. Zhou, T. Huschka, and S. E. Chick, eds. THE EMPIRICAL LIKELIHOOD APPROACH TO SIMULATION INPUT UNCERTAINTY

More information

IN modern society that various systems have become more

IN modern society that various systems have become more Developent of Reliability Function in -Coponent Standby Redundant Syste with Priority Based on Maxiu Entropy Principle Ryosuke Hirata, Ikuo Arizono, Ryosuke Toohiro, Satoshi Oigawa, and Yasuhiko Takeoto

More information

Introduction to Machine Learning. Recitation 11

Introduction to Machine Learning. Recitation 11 Introduction to Machine Learning Lecturer: Regev Schweiger Recitation Fall Seester Scribe: Regev Schweiger. Kernel Ridge Regression We now take on the task of kernel-izing ridge regression. Let x,...,

More information

Bayesian Approach for Fatigue Life Prediction from Field Inspection

Bayesian Approach for Fatigue Life Prediction from Field Inspection Bayesian Approach for Fatigue Life Prediction fro Field Inspection Dawn An and Jooho Choi School of Aerospace & Mechanical Engineering, Korea Aerospace University, Goyang, Seoul, Korea Srira Pattabhiraan

More information

Course Notes for EE227C (Spring 2018): Convex Optimization and Approximation

Course Notes for EE227C (Spring 2018): Convex Optimization and Approximation Course Notes for EE227C (Spring 2018): Convex Optiization and Approxiation Instructor: Moritz Hardt Eail: hardt+ee227c@berkeley.edu Graduate Instructor: Max Sichowitz Eail: sichow+ee227c@berkeley.edu October

More information

A Note on the Applied Use of MDL Approximations

A Note on the Applied Use of MDL Approximations A Note on the Applied Use of MDL Approxiations Daniel J. Navarro Departent of Psychology Ohio State University Abstract An applied proble is discussed in which two nested psychological odels of retention

More information

Intelligent Systems: Reasoning and Recognition. Perceptrons and Support Vector Machines

Intelligent Systems: Reasoning and Recognition. Perceptrons and Support Vector Machines Intelligent Systes: Reasoning and Recognition Jaes L. Crowley osig 1 Winter Seester 2018 Lesson 6 27 February 2018 Outline Perceptrons and Support Vector achines Notation...2 Linear odels...3 Lines, Planes

More information

Moments of the product and ratio of two correlated chi-square variables

Moments of the product and ratio of two correlated chi-square variables Stat Papers 009 50:581 59 DOI 10.1007/s0036-007-0105-0 REGULAR ARTICLE Moents of the product and ratio of two correlated chi-square variables Anwar H. Joarder Received: June 006 / Revised: 8 October 007

More information

Estimation of the Population Mean Based on Extremes Ranked Set Sampling

Estimation of the Population Mean Based on Extremes Ranked Set Sampling Aerican Journal of Matheatics Statistics 05, 5(: 3-3 DOI: 0.593/j.ajs.05050.05 Estiation of the Population Mean Based on Extrees Ranked Set Sapling B. S. Biradar,*, Santosha C. D. Departent of Studies

More information

Support Vector Machine Classification of Uncertain and Imbalanced data using Robust Optimization

Support Vector Machine Classification of Uncertain and Imbalanced data using Robust Optimization Recent Researches in Coputer Science Support Vector Machine Classification of Uncertain and Ibalanced data using Robust Optiization RAGHAV PAT, THEODORE B. TRAFALIS, KASH BARKER School of Industrial Engineering

More information

TABLE FOR UPPER PERCENTAGE POINTS OF THE LARGEST ROOT OF A DETERMINANTAL EQUATION WITH FIVE ROOTS. By William W. Chen

TABLE FOR UPPER PERCENTAGE POINTS OF THE LARGEST ROOT OF A DETERMINANTAL EQUATION WITH FIVE ROOTS. By William W. Chen TABLE FOR UPPER PERCENTAGE POINTS OF THE LARGEST ROOT OF A DETERMINANTAL EQUATION WITH FIVE ROOTS By Willia W. Chen The distribution of the non-null characteristic roots of a atri derived fro saple observations

More information

A note on the multiplication of sparse matrices

A note on the multiplication of sparse matrices Cent. Eur. J. Cop. Sci. 41) 2014 1-11 DOI: 10.2478/s13537-014-0201-x Central European Journal of Coputer Science A note on the ultiplication of sparse atrices Research Article Keivan Borna 12, Sohrab Aboozarkhani

More information

Kernel Methods and Support Vector Machines

Kernel Methods and Support Vector Machines Intelligent Systes: Reasoning and Recognition Jaes L. Crowley ENSIAG 2 / osig 1 Second Seester 2012/2013 Lesson 20 2 ay 2013 Kernel ethods and Support Vector achines Contents Kernel Functions...2 Quadratic

More information

Qualitative Modelling of Time Series Using Self-Organizing Maps: Application to Animal Science

Qualitative Modelling of Time Series Using Self-Organizing Maps: Application to Animal Science Proceedings of the 6th WSEAS International Conference on Applied Coputer Science, Tenerife, Canary Islands, Spain, Deceber 16-18, 2006 183 Qualitative Modelling of Tie Series Using Self-Organizing Maps:

More information

Experimental Design For Model Discrimination And Precise Parameter Estimation In WDS Analysis

Experimental Design For Model Discrimination And Precise Parameter Estimation In WDS Analysis City University of New York (CUNY) CUNY Acadeic Works International Conference on Hydroinforatics 8-1-2014 Experiental Design For Model Discriination And Precise Paraeter Estiation In WDS Analysis Giovanna

More information

A Small-Sample Estimator for the Sample-Selection Model

A Small-Sample Estimator for the Sample-Selection Model A Sall-Saple Estiator for the Saple-Selection Model by Aos Golan, Enrico Moretti, and Jeffrey M. Perloff October 2000 ABSTRACT A seiparaetric estiator for evaluating the paraeters of data generated under

More information

In this chapter, we consider several graph-theoretic and probabilistic models

In this chapter, we consider several graph-theoretic and probabilistic models THREE ONE GRAPH-THEORETIC AND STATISTICAL MODELS 3.1 INTRODUCTION In this chapter, we consider several graph-theoretic and probabilistic odels for a social network, which we do under different assuptions

More information

Combining Classifiers

Combining Classifiers Cobining Classifiers Generic ethods of generating and cobining ultiple classifiers Bagging Boosting References: Duda, Hart & Stork, pg 475-480. Hastie, Tibsharini, Friedan, pg 246-256 and Chapter 10. http://www.boosting.org/

More information

ON THE TWO-LEVEL PRECONDITIONING IN LEAST SQUARES METHOD

ON THE TWO-LEVEL PRECONDITIONING IN LEAST SQUARES METHOD PROCEEDINGS OF THE YEREVAN STATE UNIVERSITY Physical and Matheatical Sciences 04,, p. 7 5 ON THE TWO-LEVEL PRECONDITIONING IN LEAST SQUARES METHOD M a t h e a t i c s Yu. A. HAKOPIAN, R. Z. HOVHANNISYAN

More information

Estimating Parameters for a Gaussian pdf

Estimating Parameters for a Gaussian pdf Pattern Recognition and achine Learning Jaes L. Crowley ENSIAG 3 IS First Seester 00/0 Lesson 5 7 Noveber 00 Contents Estiating Paraeters for a Gaussian pdf Notation... The Pattern Recognition Proble...3

More information

PORTMANTEAU TESTS FOR ARMA MODELS WITH INFINITE VARIANCE

PORTMANTEAU TESTS FOR ARMA MODELS WITH INFINITE VARIANCE doi:10.1111/j.1467-9892.2007.00572.x PORTMANTEAU TESTS FOR ARMA MODELS WITH INFINITE VARIANCE By J.-W. Lin and A. I. McLeod The University of Western Ontario First Version received Septeber 2006 Abstract.

More information

ASSUME a source over an alphabet size m, from which a sequence of n independent samples are drawn. The classical

ASSUME a source over an alphabet size m, from which a sequence of n independent samples are drawn. The classical IEEE TRANSACTIONS ON INFORMATION THEORY Large Alphabet Source Coding using Independent Coponent Analysis Aichai Painsky, Meber, IEEE, Saharon Rosset and Meir Feder, Fellow, IEEE arxiv:67.7v [cs.it] Jul

More information

A remark on a success rate model for DPA and CPA

A remark on a success rate model for DPA and CPA A reark on a success rate odel for DPA and CPA A. Wieers, BSI Version 0.5 andreas.wieers@bsi.bund.de Septeber 5, 2018 Abstract The success rate is the ost coon evaluation etric for easuring the perforance

More information

Lower Bounds for Quantized Matrix Completion

Lower Bounds for Quantized Matrix Completion Lower Bounds for Quantized Matrix Copletion Mary Wootters and Yaniv Plan Departent of Matheatics University of Michigan Ann Arbor, MI Eail: wootters, yplan}@uich.edu Mark A. Davenport School of Elec. &

More information

Principal Components Analysis

Principal Components Analysis Principal Coponents Analysis Cheng Li, Bingyu Wang Noveber 3, 204 What s PCA Principal coponent analysis (PCA) is a statistical procedure that uses an orthogonal transforation to convert a set of observations

More information

Chapter 6 1-D Continuous Groups

Chapter 6 1-D Continuous Groups Chapter 6 1-D Continuous Groups Continuous groups consist of group eleents labelled by one or ore continuous variables, say a 1, a 2,, a r, where each variable has a well- defined range. This chapter explores:

More information

MSEC MODELING OF DEGRADATION PROCESSES TO OBTAIN AN OPTIMAL SOLUTION FOR MAINTENANCE AND PERFORMANCE

MSEC MODELING OF DEGRADATION PROCESSES TO OBTAIN AN OPTIMAL SOLUTION FOR MAINTENANCE AND PERFORMANCE Proceeding of the ASME 9 International Manufacturing Science and Engineering Conference MSEC9 October 4-7, 9, West Lafayette, Indiana, USA MSEC9-8466 MODELING OF DEGRADATION PROCESSES TO OBTAIN AN OPTIMAL

More information

Iterative Decoding of LDPC Codes over the q-ary Partial Erasure Channel

Iterative Decoding of LDPC Codes over the q-ary Partial Erasure Channel 1 Iterative Decoding of LDPC Codes over the q-ary Partial Erasure Channel Rai Cohen, Graduate Student eber, IEEE, and Yuval Cassuto, Senior eber, IEEE arxiv:1510.05311v2 [cs.it] 24 ay 2016 Abstract In

More information

A MESHSIZE BOOSTING ALGORITHM IN KERNEL DENSITY ESTIMATION

A MESHSIZE BOOSTING ALGORITHM IN KERNEL DENSITY ESTIMATION A eshsize boosting algorith in kernel density estiation A MESHSIZE BOOSTING ALGORITHM IN KERNEL DENSITY ESTIMATION C.C. Ishiekwene, S.M. Ogbonwan and J.E. Osewenkhae Departent of Matheatics, University

More information

Fundamental Limits of Database Alignment

Fundamental Limits of Database Alignment Fundaental Liits of Database Alignent Daniel Cullina Dept of Electrical Engineering Princeton University dcullina@princetonedu Prateek Mittal Dept of Electrical Engineering Princeton University pittal@princetonedu

More information

Meta-Analytic Interval Estimation for Bivariate Correlations

Meta-Analytic Interval Estimation for Bivariate Correlations Psychological Methods 2008, Vol. 13, No. 3, 173 181 Copyright 2008 by the Aerican Psychological Association 1082-989X/08/$12.00 DOI: 10.1037/a0012868 Meta-Analytic Interval Estiation for Bivariate Correlations

More information

3.3 Variational Characterization of Singular Values

3.3 Variational Characterization of Singular Values 3.3. Variational Characterization of Singular Values 61 3.3 Variational Characterization of Singular Values Since the singular values are square roots of the eigenvalues of the Heritian atrices A A and

More information

Generalized Augmentation for Control of the k-familywise Error Rate

Generalized Augmentation for Control of the k-familywise Error Rate International Journal of Statistics in Medical Research, 2012, 1, 113-119 113 Generalized Augentation for Control of the k-failywise Error Rate Alessio Farcoeni* Departent of Public Health and Infectious

More information

GEE ESTIMATORS IN MIXTURE MODEL WITH VARYING CONCENTRATIONS

GEE ESTIMATORS IN MIXTURE MODEL WITH VARYING CONCENTRATIONS ACTA UIVERSITATIS LODZIESIS FOLIA OECOOMICA 3(3142015 http://dx.doi.org/10.18778/0208-6018.314.03 Olesii Doronin *, Rostislav Maiboroda ** GEE ESTIMATORS I MIXTURE MODEL WITH VARYIG COCETRATIOS Abstract.

More information

Chaotic Coupled Map Lattices

Chaotic Coupled Map Lattices Chaotic Coupled Map Lattices Author: Dustin Keys Advisors: Dr. Robert Indik, Dr. Kevin Lin 1 Introduction When a syste of chaotic aps is coupled in a way that allows the to share inforation about each

More information

Tail Estimation of the Spectral Density under Fixed-Domain Asymptotics

Tail Estimation of the Spectral Density under Fixed-Domain Asymptotics Tail Estiation of the Spectral Density under Fixed-Doain Asyptotics Wei-Ying Wu, Chae Young Li and Yiin Xiao Wei-Ying Wu, Departent of Statistics & Probability Michigan State University, East Lansing,

More information

Scale Invariant Conditional Dependence Measures

Scale Invariant Conditional Dependence Measures Sashank J. Reddi sjakkar@cs.cu.edu Machine Learning Departent, School of Coputer Science, Carnegie Mellon University Barnabás Póczos bapoczos@cs.cu.edu Machine Learning Departent, School of Coputer Science,

More information

SPECTRUM sensing is a core concept of cognitive radio

SPECTRUM sensing is a core concept of cognitive radio World Acadey of Science, Engineering and Technology International Journal of Electronics and Counication Engineering Vol:6, o:2, 202 Efficient Detection Using Sequential Probability Ratio Test in Mobile

More information

Inference in the Presence of Likelihood Monotonicity for Polytomous and Logistic Regression

Inference in the Presence of Likelihood Monotonicity for Polytomous and Logistic Regression Advances in Pure Matheatics, 206, 6, 33-34 Published Online April 206 in SciRes. http://www.scirp.org/journal/ap http://dx.doi.org/0.4236/ap.206.65024 Inference in the Presence of Likelihood Monotonicity

More information

Best Arm Identification: A Unified Approach to Fixed Budget and Fixed Confidence

Best Arm Identification: A Unified Approach to Fixed Budget and Fixed Confidence Best Ar Identification: A Unified Approach to Fixed Budget and Fixed Confidence Victor Gabillon Mohaad Ghavazadeh Alessandro Lazaric INRIA Lille - Nord Europe, Tea SequeL {victor.gabillon,ohaad.ghavazadeh,alessandro.lazaric}@inria.fr

More information

arxiv: v1 [stat.ot] 7 Jul 2010

arxiv: v1 [stat.ot] 7 Jul 2010 Hotelling s test for highly correlated data P. Bubeliny e-ail: bubeliny@karlin.ff.cuni.cz Charles University, Faculty of Matheatics and Physics, KPMS, Sokolovska 83, Prague, Czech Republic, 8675. arxiv:007.094v

More information

Warning System of Dangerous Chemical Gas in Factory Based on Wireless Sensor Network

Warning System of Dangerous Chemical Gas in Factory Based on Wireless Sensor Network 565 A publication of CHEMICAL ENGINEERING TRANSACTIONS VOL. 59, 07 Guest Editors: Zhuo Yang, Junie Ba, Jing Pan Copyright 07, AIDIC Servizi S.r.l. ISBN 978-88-95608-49-5; ISSN 83-96 The Italian Association

More information

Constructing Locally Best Invariant Tests of the Linear Regression Model Using the Density Function of a Maximal Invariant

Constructing Locally Best Invariant Tests of the Linear Regression Model Using the Density Function of a Maximal Invariant Aerican Journal of Matheatics and Statistics 03, 3(): 45-5 DOI: 0.593/j.ajs.03030.07 Constructing Locally Best Invariant Tests of the Linear Regression Model Using the Density Function of a Maxial Invariant

More information

A general forulation of the cross-nested logit odel Michel Bierlaire, Dpt of Matheatics, EPFL, Lausanne Phone: Fax:

A general forulation of the cross-nested logit odel Michel Bierlaire, Dpt of Matheatics, EPFL, Lausanne Phone: Fax: A general forulation of the cross-nested logit odel Michel Bierlaire, EPFL Conference paper STRC 2001 Session: Choices A general forulation of the cross-nested logit odel Michel Bierlaire, Dpt of Matheatics,

More information

Boosting with log-loss

Boosting with log-loss Boosting with log-loss Marco Cusuano-Towner Septeber 2, 202 The proble Suppose we have data exaples {x i, y i ) i =... } for a two-class proble with y i {, }. Let F x) be the predictor function with the

More information

Best Linear Unbiased and Invariant Reconstructors for the Past Records

Best Linear Unbiased and Invariant Reconstructors for the Past Records BULLETIN of the MALAYSIAN MATHEMATICAL SCIENCES SOCIETY http:/athusy/bulletin Bull Malays Math Sci Soc (2) 37(4) (2014), 1017 1028 Best Linear Unbiased and Invariant Reconstructors for the Past Records

More information

COS 424: Interacting with Data. Written Exercises

COS 424: Interacting with Data. Written Exercises COS 424: Interacting with Data Hoework #4 Spring 2007 Regression Due: Wednesday, April 18 Written Exercises See the course website for iportant inforation about collaboration and late policies, as well

More information

Statistics and Probability Letters

Statistics and Probability Letters Statistics and Probability Letters 79 2009 223 233 Contents lists available at ScienceDirect Statistics and Probability Letters journal hoepage: www.elsevier.co/locate/stapro A CLT for a one-diensional

More information

Sharp Time Data Tradeoffs for Linear Inverse Problems

Sharp Time Data Tradeoffs for Linear Inverse Problems Sharp Tie Data Tradeoffs for Linear Inverse Probles Saet Oyak Benjain Recht Mahdi Soltanolkotabi January 016 Abstract In this paper we characterize sharp tie-data tradeoffs for optiization probles used

More information

Research Article On the Isolated Vertices and Connectivity in Random Intersection Graphs

Research Article On the Isolated Vertices and Connectivity in Random Intersection Graphs International Cobinatorics Volue 2011, Article ID 872703, 9 pages doi:10.1155/2011/872703 Research Article On the Isolated Vertices and Connectivity in Rando Intersection Graphs Yilun Shang Institute for

More information

Using EM To Estimate A Probablity Density With A Mixture Of Gaussians

Using EM To Estimate A Probablity Density With A Mixture Of Gaussians Using EM To Estiate A Probablity Density With A Mixture Of Gaussians Aaron A. D Souza adsouza@usc.edu Introduction The proble we are trying to address in this note is siple. Given a set of data points

More information

Randomized Recovery for Boolean Compressed Sensing

Randomized Recovery for Boolean Compressed Sensing Randoized Recovery for Boolean Copressed Sensing Mitra Fatei and Martin Vetterli Laboratory of Audiovisual Counication École Polytechnique Fédéral de Lausanne (EPFL) Eail: {itra.fatei, artin.vetterli}@epfl.ch

More information

Course Notes for EE227C (Spring 2018): Convex Optimization and Approximation

Course Notes for EE227C (Spring 2018): Convex Optimization and Approximation Course Notes for EE7C (Spring 018: Convex Optiization and Approxiation Instructor: Moritz Hardt Eail: hardt+ee7c@berkeley.edu Graduate Instructor: Max Sichowitz Eail: sichow+ee7c@berkeley.edu October 15,

More information

Inspection; structural health monitoring; reliability; Bayesian analysis; updating; decision analysis; value of information

Inspection; structural health monitoring; reliability; Bayesian analysis; updating; decision analysis; value of information Cite as: Straub D. (2014). Value of inforation analysis with structural reliability ethods. Structural Safety, 49: 75-86. Value of Inforation Analysis with Structural Reliability Methods Daniel Straub

More information

An improved self-adaptive harmony search algorithm for joint replenishment problems

An improved self-adaptive harmony search algorithm for joint replenishment problems An iproved self-adaptive harony search algorith for joint replenishent probles Lin Wang School of Manageent, Huazhong University of Science & Technology zhoulearner@gail.co Xiaojian Zhou School of Manageent,

More information

Identical Maximum Likelihood State Estimation Based on Incremental Finite Mixture Model in PHD Filter

Identical Maximum Likelihood State Estimation Based on Incremental Finite Mixture Model in PHD Filter Identical Maxiu Lielihood State Estiation Based on Increental Finite Mixture Model in PHD Filter Gang Wu Eail: xjtuwugang@gail.co Jing Liu Eail: elelj20080730@ail.xjtu.edu.cn Chongzhao Han Eail: czhan@ail.xjtu.edu.cn

More information

Best Procedures For Sample-Free Item Analysis

Best Procedures For Sample-Free Item Analysis Best Procedures For Saple-Free Ite Analysis Benjain D. Wright University of Chicago Graha A. Douglas University of Western Australia Wright s (1969) widely used "unconditional" procedure for Rasch saple-free

More information

Least Squares Fitting of Data

Least Squares Fitting of Data Least Squares Fitting of Data David Eberly, Geoetric Tools, Redond WA 98052 https://www.geoetrictools.co/ This work is licensed under the Creative Coons Attribution 4.0 International License. To view a

More information

Computable Shell Decomposition Bounds

Computable Shell Decomposition Bounds Coputable Shell Decoposition Bounds John Langford TTI-Chicago jcl@cs.cu.edu David McAllester TTI-Chicago dac@autoreason.co Editor: Leslie Pack Kaelbling and David Cohn Abstract Haussler, Kearns, Seung

More information

On Constant Power Water-filling

On Constant Power Water-filling On Constant Power Water-filling Wei Yu and John M. Cioffi Electrical Engineering Departent Stanford University, Stanford, CA94305, U.S.A. eails: {weiyu,cioffi}@stanford.edu Abstract This paper derives

More information

CONTROL SYSTEMS, ROBOTICS, AND AUTOMATION Vol. IX Uncertainty Models For Robustness Analysis - A. Garulli, A. Tesi and A. Vicino

CONTROL SYSTEMS, ROBOTICS, AND AUTOMATION Vol. IX Uncertainty Models For Robustness Analysis - A. Garulli, A. Tesi and A. Vicino UNCERTAINTY MODELS FOR ROBUSTNESS ANALYSIS A. Garulli Dipartiento di Ingegneria dell Inforazione, Università di Siena, Italy A. Tesi Dipartiento di Sistei e Inforatica, Università di Firenze, Italy A.

More information

Support recovery in compressed sensing: An estimation theoretic approach

Support recovery in compressed sensing: An estimation theoretic approach Support recovery in copressed sensing: An estiation theoretic approach Ain Karbasi, Ali Horati, Soheil Mohajer, Martin Vetterli School of Coputer and Counication Sciences École Polytechnique Fédérale de

More information

Support Vector Machines. Maximizing the Margin

Support Vector Machines. Maximizing the Margin Support Vector Machines Support vector achines (SVMs) learn a hypothesis: h(x) = b + Σ i= y i α i k(x, x i ) (x, y ),..., (x, y ) are the training exs., y i {, } b is the bias weight. α,..., α are the

More information

Compressive Distilled Sensing: Sparse Recovery Using Adaptivity in Compressive Measurements

Compressive Distilled Sensing: Sparse Recovery Using Adaptivity in Compressive Measurements 1 Copressive Distilled Sensing: Sparse Recovery Using Adaptivity in Copressive Measureents Jarvis D. Haupt 1 Richard G. Baraniuk 1 Rui M. Castro 2 and Robert D. Nowak 3 1 Dept. of Electrical and Coputer

More information

Using a De-Convolution Window for Operating Modal Analysis

Using a De-Convolution Window for Operating Modal Analysis Using a De-Convolution Window for Operating Modal Analysis Brian Schwarz Vibrant Technology, Inc. Scotts Valley, CA Mark Richardson Vibrant Technology, Inc. Scotts Valley, CA Abstract Operating Modal Analysis

More information

Hybrid System Identification: An SDP Approach

Hybrid System Identification: An SDP Approach 49th IEEE Conference on Decision and Control Deceber 15-17, 2010 Hilton Atlanta Hotel, Atlanta, GA, USA Hybrid Syste Identification: An SDP Approach C Feng, C M Lagoa, N Ozay and M Sznaier Abstract The

More information

Algorithms for parallel processor scheduling with distinct due windows and unit-time jobs

Algorithms for parallel processor scheduling with distinct due windows and unit-time jobs BULLETIN OF THE POLISH ACADEMY OF SCIENCES TECHNICAL SCIENCES Vol. 57, No. 3, 2009 Algoriths for parallel processor scheduling with distinct due windows and unit-tie obs A. JANIAK 1, W.A. JANIAK 2, and

More information

What is Probability? (again)

What is Probability? (again) INRODUCTION TO ROBBILITY Basic Concepts and Definitions n experient is any process that generates well-defined outcoes. Experient: Record an age Experient: Toss a die Experient: Record an opinion yes,

More information

An Introduction to Meta-Analysis

An Introduction to Meta-Analysis An Introduction to Meta-Analysis Douglas G. Bonett University of California, Santa Cruz How to cite this work: Bonett, D.G. (2016) An Introduction to Meta-analysis. Retrieved fro http://people.ucsc.edu/~dgbonett/eta.htl

More information