Meta-Analytic Interval Estimation for Bivariate Correlations

Size: px
Start display at page:

Download "Meta-Analytic Interval Estimation for Bivariate Correlations"

Transcription

1 Psychological Methods 2008, Vol. 13, No. 3, Copyright 2008 by the Aerican Psychological Association X/08/$12.00 DOI: /a Meta-Analytic Interval Estiation for Bivariate Correlations Douglas G. Bonett Iowa State University The currently available eta-analytic ethods for correlations have restrictive assuptions. The fixed-effects ethods assue equal population correlations and exhibit poor perforance under correlation heterogeneity. The rando-effects ethods do not assue correlation hoogeneity but are based on an equally unrealistic assuption that the selected studies are a rando saple fro a well-defined superpopulation of study populations. The randoeffects ethods can accoodate correlation heterogeneity, but these ethods do not perfor properly in typical applications where the studies are nonrandoly selected. A new fixed-effects eta-analytic confidence interval for bivariate correlations is proposed that is easy to copute and perfors well under correlation heterogeneity and nonrandoly selected studies. Keywords: confidence interval, fixed effects, rando effects, research synthesis The purpose of this article is to describe an interval estiation procedure for a bivariate correlation using inforation obtained fro different studies with no participant belonging to ore than one study. In each of the studies, a rando saple is obtained fro the study population of interest. In each study, a correlation is coputed fro a rando saple of the study population. The saple correlation estiates the unknown value of the correlation ( i )in the specified study population of study i. The correlation estiate fro study i ay be used to ake a stateent regarding the possible values of i with soe specified level of confidence. Stateents of this type are called confidence intervals. For a given level of confidence, the width of the confidence interval depends on the saple size, and narrow confidence intervals often require large saple sizes. If easureents are costly or participants are difficult to obtain, a single researcher ay not have the resources that are needed to obtain a sufficiently large saple. One solution to this proble was recognized over 100 years ago by Karl Pearson, who cobined correlation estiates fro five different studies to obtain a ore precise estiate of the correlation between inoculation for typhoid fever and ortality (Pearson, 1904). The practice of cobining estiates fro ultiple studies, also called eta-analysis (Glass, 1976), is now a standard statistical tool in psychology, edicine, and the social sciences. Several different eta-analytic ethods for cobining saple correlations fro different studies have been Correspondence concerning this article should be addressed to Douglas G. Bonett, Departent of Statistics, Iowa State University, Aes, IA E ail: dgbonett@iastate.edu proposed. These ethods ay be classified into two general categories: rando-effects (RE) ethods and fixed-effects (FE) ethods. The RE ethods assue that study populations have been randoly sapled fro a specific and clearly defined superpopulation that consists of a large nuber of study populations. A Gaussian distribution is assued for the set of all i values in the superpopulation. The researcher s goal is to estiate the ean and standard deviation of the Gaussian distribution. Two RE ethods for correlations are described by Hedges and Vevea (1998) and Hunter and Schidt (2004). These ethods are referred to as HV-R and HS-R, respectively. Coputational forulas for the HV-R and HS-R ethods are given in Appendix A. The HV-R and HS-R ethods have been recoended because they do not ake the unrealistic assuption that all i values are identical. However, unless the study populations are randoly sapled fro a specific superpopulation, the researcher cannot ake a statistical inference fro the study populations to the superpopulation. A rando saple of studies has the property that every possible subset of study populations in the superpopulation has exactly the sae probability of being included in the eta-analysis. Additionally, to obtain a rando saple of studies, the selection of any particular study into the eta-analysis saple ust not differentially alter the probabilities of any other study fro the superpopulation being selected into the eta-analysis saple. In a typical eta-analysis, the studies selected for analysis are likely to have been published sequentially over tie, with each study intentionally designed to be siilar or dissiilar to one or ore previous studies, and this would be inconsistent with a rando selection process. The rando selection assuption of the RE ethods is a critical assuption that, as noted by Hedges 173

2 174 BONETT and Vevea (1998), will virtually never be justified in practice. The FE ethods do not ake the unrealistic assuption that the study populations have been randoly selected fro a superpopulation. With FE ethods, the researcher s goal is to estiate the average correlation ( )/. The ost frequently used FE ethod is described by Hedges and Olkin (1985, p. 231). A second ethod is described by Shadish and Haddock (1994, p. 268), who explained how the general FE approach ight be applied to correlations. These ethods are referred to as HO-F and SH-F, respectively. Coputational forulas for the HO-F and SH-F ethods are given in Appendix A. The HO-F and SH-F ethods assue Hunter and Schidt (2000) argued convincingly that this assuption will alost never be satisfied in practice. The HV-R, HS-R, HO-F, and SH-F ethods ay be used to test a hypothesis or construct a confidence interval for the unknown average correlation. Confidence intervals are alost always preferred to null hypothesis tests in scientific studies (Rozeboo, 1960), and a new confidence interval for an average correlation is proposed here. The proposed confidence interval is based on an FE approach. The proposed ethod does not assue that the studies have been randoly selected fro a superpopulation and does not assue Proposed Confidence Interval Let ˆ i denote a saple Pearson correlation fro study i based on a rando saple of size n i. Each saple correlation is an estiate of an unknown population correlation i, and the average value of these population correlations, denoted as, ay provide iportant practical or scientific inforation. The following point estiate of 1 i is proposed: 1 ˆi, (1) which is a type of analog estiate (Goldberger, 1991, p. 117). Note that Equation 1 is an unweighted average of the saple correlations. The HV-R, HS-R, HO-F, and SH-F ethods all use a weighted average of saple correlations (see Appendix A) to estiate the unweighted average of study population correlations. A weighted average will have a saller ean square error (MSE) than an unweighted average will under certain conditions. In a typical etaanalysis where the saple sizes vary across studies and correlation hoogeneity cannot be assued, a weighted average of saple correlations will not necessarily have a saller MSE than the MSE of, as explained in Appendix B. Fisher (1921) and Hotelling (1953) showed that the sall-saple variance of tanh 1 ( ˆ i) ln [(1 ˆ i)/(1 ˆ i)]/2 is closely approxiated by 1/(n i 3). Applying the delta ethod (see, e.g., Stuart & Ord, 1996, p. 350) gives (1 ˆ i2 ) 2 /(n i 3) as an approxiation to the variance of ˆ i tanh{ln [(1 ˆ i)/(1 ˆ i)]/2}. Preliinary coputer siulation results suggest that using (1 ˆ i2 ) 2 /(n i 3) to estiate var( ˆ i) in the proposed confidence interval for presented shortly yields slightly better results than the ore coonly used variance estiate (1 ˆ i2 ) 2 /(n i 1). Assuing independence aong the saple correlations, an estiate of the approxiate variance of is var 2 var( ˆi), (2) where var( ˆ i) (1 ˆ i2 ) 2 /(n i 3). The sapling distribution of is nonnoral partly because it is bounded on the interval 1 to 1. The sapling distribution of tanh 1 ( ) ln [(1 )/(1 )]/2 is unbounded and should ore closely approxiate a noral distribution than the sapling distribution of. Applying the delta ethod (see, e.g., Stuart & Ord, 1996, p. 350), an estiate of the approxiate variance of tanh 1 ( ) is var[tanh 1 ( )] var( )/ ˆi2 2 / n i 3 ]/(1 2 ) 2. (3) Assuing approxiate norality of tanh 1 ( ) and using the approxiate variance of tanh 1 ( ) given in Equation 3, the following approxiate 100(1 )% confidence interval for is proposed: tanh{tanh 1 z /2 var[tanh 1 ( )] 1/2 } (4) where tanh(x) [exp(2x) 1]/[exp(2x) 1] and z /2 is a two-sided critical value of the standard noral distribution. When 1, Equation 4 reduces to the confidence interval for a single Pearson correlation coefficient (see, e.g., Glass & Hopkins, 1996, pp ). Equation 4 also ay be used with partial Pearson correlations by replacing n i 3 with n i 3 q, where q is the nuber of control variables. The bivariate norality assuption ay be difficult to justify for certain quantitative variables, and the Spearan rank correlation ay be used in these situations. The Spearan correlation is siply a Pearson correlation applied to rank-transfored scores for each of the two quantitative variables. Spearan correlations ay be used in Equation 4 but var( ˆ i) should then be replaced with (1 ˆ i2 /2)(1 ˆ i2 ) 2 /(n i 3) on the basis of the results of Bonett and Wright (2000). If the Pearson or Spearan correlations have been

3 META-ANALYTIC INTERVAL ESTIMATION 175 coputed fro a contingency table, it is necessary to replace var( ˆ i) with the special variance estiates derived by Brown and Benedetti (1977). As explained by Brown and Benedetti (1977), the coonly used variance estiates for Pearson and Spearan correlations assue continuous bivariate distributions, and these variance estiates are inappropriate when the correlations have been estiated fro contingency tables. Siulation Study In a preliinary investigation for 5 and 10, it was found that Equation 4 had a coverage probability very close to 1 for every condition exained. However, the perforance of the other eta-analytic confidence intervals (HS-R, HV-R, HO-F, SH-F) had coverage probabilities that varied considerably depending on the pattern of saple sizes and the pattern of population correlations. To provide an accurate description of how the HS-R, HV-R, HO-F, SH-F ethods perfor under realistic conditions, it is necessary to exaine any different patterns of saple sizes and population correlations. Saple sizes fro 20 to 400 and population correlations fro 0 to 0.9 were exained. A coputer siulation study was perfored in which the coverage probabilities of the five eta-analytic confidence intervals were estiated under 1,500 different patterns of saple sizes and population correlations for 5 and 10. For each of the 1,500 conditions, the coverage probabilities and confidence interval widths for Equation 4, HS-R, HV-R, HO-F, and SH-F were estiated fro 50,000 Monte Carlo trials. Each trial involved the coputer generation of bivariate noral saples of size n i within each of the 5or 10 study populations. The siulation progra was written in GAUSS and executed on a Pentiu 4 coputer. The results of the siulation study are suarized in Tables 1 and 2. Table 1 describes the perforance of the five 95% etaanalytic confidence intervals broken down by six subdivisions of the 1,500 conditions, with 250 different patterns of saple sizes and population correlations within each subdivision. For instance, within the first subdivision, the saple sizes could range fro 20 to 100 and the population correlations could range fro 0 to 0.3. Within a given subdivision, the population correlations ranged fro 0 to 0.3, 0.3 to 0.6, or 0.6 to 0.9. These correlation ranges were selected to reflect the degree of heterogeneity that has been observed in published eta-analyses (see Field, 2005). The saple sizes and population correlations within a single condition were randoly selected with equal probability fro the specified ranges. The coputer siulations attepted to iic a typical eta-analysis where the saple sizes are alost always unequal, the population correlations vary across the study populations, and the studies have not been randoly selected fro a specific superpopulation with statistical inference applying only to the studies. Under conditions of unequal correlations, unequal saple sizes, and nonrandoly selected studies, none of the currently available ethods can be expected to perfor properly. Each nuber in Table 1 is a suary of 250 coverage probabilities for a 95% level of confidence. Table 2 extends the results to 90% and 99% confidence levels for saple sizes 20 to 100. Tables 1 and 2 present the average coverage probabilities across 250 conditions, the sallest coverage Table 1 Perforance Coparison of Five 95% Meta-Analytic Confidence Intervals Saple Population Average coverage Miniu coverage Average width sizes correlations Eq. 4 HS-R HV-R HO-F SH-F Eq. 4 HS-R HV-R HO-F SH-F Eq. 4 HS-R HV-R HO-F SH-F Note: Eq 4 new fixed-effects ethod; HS-R Hunter and Schidt (2004) rando-effects ethod; HV-R Hedges and Vevea (1998) rando-effects ethod; HO-F Hedges and Olkin (1985) fixed-effects ethod; SH-F Shadish and Haddock (1994) fixed-effects ethod; nuber of study populations.

4 176 BONETT Table 2 Perforance Coparison of Five 90% and 99% Meta-Analytic Confidence Intervals (n i ) Population Average coverage Miniu coverage Average width 1 correlations Eq. 4 HS-R HV-R HO-F SH-F Eq. 4 HS-R HV-R HO-F SH-F Eq. 4 HS-R HV-R HO-F SH-F Note: Eq 4 new fixed-effects ethod; HS-R Hunter and Schidt (2004) rando-effects ethod; HV-R Hedges and Vevea (1998) rando-effects ethod; HO-F Hedges and Olkin (1985) fixed-effects ethod; SH-F Shadish and Haddock (1994) fixed-effects ethod; nuber of study populations. probability across 250 conditions (iniu coverage probability), and the average interval width across 250 conditions. The best confidence interval ethod will have an average coverage probability close to 1 and a iniu coverage probability that is not too far below 1. Iftwo ethods have siilar average and iniu coverage probabilities, then the ethod with the sallest average interval width is preferred. Fro Tables 1 and 2, it is clear that Equation 4 is superior to all other ethods. The iniu coverage probability is perhaps the ost iportant perforance characteristic because it reveals how poorly a ethod ay perfor for certain patterns of saple sizes and population correlation. The pattern of population correlations will not be known to the researcher, and it is essential that a given ethod perfors well for any pattern of population correlations and saple sizes. In Tables 1 and 2, it can be seen that the HO-F and SH-F ethods have saller average interval widths than the other ethods. However, interval widths aong copeting ethods are not coparable unless the copeting ethods have siilar coverage probabilities. The HO-F and SH-F ethods can have true coverage probabilities that are far below the specified 1 value. The RE ethods have average widths that are wider than that of Equation 4 and also ay have coverage probabilities that are below the specified confidence level. Assessing Correlation Heterogeneity The results of the siulation study indicate that the proposed eta-analytic confidence interval (Equation 4) perfors rearkably well under the wide range of conditions exained. Additional siulation studies suggest that Equation 4 also perfors well under far ore extree degrees of correlation heterogeneity. For instance, using the ethodology that produced the results in Table 1, with 5, saple sizes that vary fro 20 to 100, and population correlations that vary fro 0.9 to 0.9, the average coverage probability for Equation 4 is.950 and the iniu coverage probability across the 250 conditions is.940. However, if the population correlations are highly heterogeneous, it could be argued that, the average of the population correlations, is not a eaningful paraeter to estiate. In applications where the population correlations are believed to be highly disparate, the researcher ay want to estiate the average of two or ore subsets of population correlations where the correlations within subsets are believed to be ore hoogeneous. The foration of subsets could be otivated by known deographic or design differences across the study populations that are believed to oderate the agnitude of the correlation under investigation. The traditional test of correlation hoogeneity (see, e.g., Hedges & Olkin, 1985, p. 153) has been isused in etaanalytic applications. Contrary to popular belief, failure to reject the null hypothesis of equal population correlations does not iply that the population correlations are equal, and rejection of the null hypothesis does not iply that there are eaningfully large differences aong the population correlations. To assess the agnitude of the difference between any two population correlations or subset averages of population correlations, the researcher should use a confidence interval approach. Let A denote a single population correlation or the av-

5 META-ANALYTIC INTERVAL ESTIMATION 177 erage of two or ore population correlations and let B denote a single population correlation or the average of two or ore population correlations. The correlations that define A ust be distinct fro those that define B. For instance, for 10, the researcher ight define A ( )/3 and B ( )/7. Equation 4 ay be used to obtain confidence intervals for A and B (recall that Equation 4 also works for a single correlation). The lower and upper 100(1 )% interval estiates fro Equation 4 for A are denoted as L A and U A ; the lower and upper 100(1 )% interval estiates fro Equation 4 for B are denoted as L B and U B. Following Zou (2007), lower and upper 100(1 )% interval estiates for A B ay be expressed as L ˆA ˆB ˆA L A 2 U B ˆB 2 1/2 and (5) U ˆA ˆB U A ˆA 2 ˆB L B 2 1/2, (6) where ˆ A and ˆ B represent saple correlations fro a single study or unweighted averages of saple correlations fro two or ore studies. Equations 5 and 6 also ay be applied with partial or Spearan correlations. Coputational Exaple The coputation of Equation 4 is illustrated using a siple hypothetical exaple where the Pearson correlation between students course grades and their course evaluations in four different studies are cobined. The four saple correlations are 0.40, 0.65, 0.60, and 0.45 fro saples of size 55, 190, 65, and 35, respectively. The point estiate of is ( )/ and tanh 1 (0.525) ln [( ) ln ( )]/ The estiated variance of is [( ) 2 /52 ( ) 2 /187 ( ) 2 /62 ( ) 2 /32]/ , and the estiated variance of tanh 1 ( ) is /( ) Inserting these values into Equation 4 and setting.05 gives tanh[ ( ) 1/2 ] [tanh(0.445), tanh(0.722)] (0.418, 0.618). Thus, the researcher can be 95% confident that the average correlation between students course grades and course evaluations in the four study populations is in the range of to The 95% confidence intervals for the HS-R, HV-R, HO-F, and SH-F ethods in this exaple are (0.484, 0.676), (0.425, 0.666), (0.515, 0.655), and (0.539, 0.673), respectively. It is not appropriate to copare the interval widths of these five ethods because they do not have the sae true levels of confidence. A sall siulation study with 4, n 1 55, n 2 190, n 3 65, and n 4 35 was conducted to get an idea of the approxiate levels of confidence for these five ethods. The population correlations for the siulation were set equal to the saple correlation values in this exaple. The estiated coverage probabilities for Equation 4, HS-R, HV-R, HO-F, and SH-F were found to be.950,.828,.924,.571, and.296, respectively. Although the HO-F and SH-F confidence intervals are narrower than the confidence interval of Equation 4 in this exaple, the interval (0.418, 0.619) fro Equation 4 is a 95.0% interval, whereas the interval (0.515, 0.655) fro the HO-F ethod is an approxiate 57.1% confidence interval and the interval (0.539, 0.673) fro the SH-F ethod is an approxiate 29.6% confidence interval. It was stated in the introduction that a priary goal of eta-analysis is to obtain a ore accurate estiate of the paraeter of interest by cobining saple inforation fro ultiple studies. If the saple sizes in the studies are siilar, Equation 4 will yield a confidence interval that is siilar to a confidence interval that would be obtained fro a single saple of size n 1 n 2... n. For instance, if a single saple of size produced a saple correlation of (the sae as above), the 95% confidence interval for the population correlation would be (0.444, 0.598), which is only slightly narrower than the eta-analytic confidence interval reported above using Equation 4. Suppose that the instructors in Studies 1 and 4 have little teaching experience whereas the instructors in Studies 2 and 3 have extensive teaching experience and the researcher believes that instructor experience ay oderate the correlation between students course grades and course evaluations. The point and 95% interval estiates for A ( 1 4 )/2 are and (0.231, 0.587), respectively; the point and 95% interval estiates for B ( 2 3 )/2 are and (0.527, 0.707), respectively. Applying Equations 5 and 6 gives a 95% confidence interval for B A equal to ( 0.011, 0.389). The results are inconclusive at the 95% level because the 95% confidence interval for B A includes zero. The confidence interval is wide priarily because of the sall saple sizes in Studies 1 and 4. Future studies that exaine the relation between students course grades and course evaluations could be cobined with the correlations in this eta-analysis to obtain a ore accurate estiate of B A. Concluding Rearks The classic FE eta-analytic confidence intervals (the HO-F and SH-F ethods) for an average Pearson correlation have unacceptable perforance characteristics under correlation heterogeneity and unequal saple sizes. Hunter and Schidt (2000) discussed the serious liitations of the classic FE ethods and warned of potentially isleading results in the any published studies that have used these ethods. The results in Tables 1 and 2 clearly support and reinforce the Hunter Schidt recoendation to discontinue use of the classic FE ethods. The National Research

6 178 BONETT Council (1992) also concluded that the classic FE ethods should rarely, if ever, be used because of their unacceptable perforance characteristics under effect-size heterogeneity. If researchers coply with the recoendations of Hunter and Schidt (2000) and the National Research Council (1992), researchers ust then choose between HS-R, HV-R, and Equation 4 when conducting a etaanalysis of bivariate correlations. None of the three ethods require the unrealistic assuption that The following four criteria ay then be used to choose between HS-R, HV-R, and Equation 4: (a) coverage probability bias, (b) confidence interval width, (c) perforance of heterogeneity assessent ethods, and (d) ability to generalize beyond the study populations. For the first criterion, Equation 4 is the clear winner because it has a true coverage probability very close to the specified 1 confidence level. The RE ethods can have true coverage probabilities that are far before the specified 1 confidence level. This fact alone is enough to recoend Equation 4 over the RE ethods. Even if the RE ethods did have true coverage probabilities close to 1, Equation 4 would be preferred to the RE ethods with respect to the second criterion of confidence interval width. One consequence of the wider RE confidence intervals is that the total saple size would typically need to be considerably larger if an RE ethod is used instead of Equation 4. Heterogeneity assessent, the third criterion, is fundaentally different in FE and RE ethods. With FE ethods, the researcher deliberately selects studies to cobine and each i is interesting in its own right. Equations 5 and 6 ay be used to copare any pair of individual i values or any pair of subset average values. Equations 5 and 6 ay be used for values as sall as 2 and n i values as sall as 15. Equations 5 and 6 yield a true coverage probability that is very close to 1 when the distributional assuptions within each study are satisfied. Heterogeneity assessent in RE ethods is fraught with serious probles. Unlike FE ethods, RE ethods assue that the studies have been randoly selected fro a superpopulation of studies and that the set of i values in the superpopulation follows a Gaussian distribution with unknown ean and unknown standard deviation. Heterogeneity assessent in an RE odel involves the estiation of, which describes the variability of the set of all i values in the superpopulation. Note that ˆ2 in Equation A3 is a point estiate of 2. A priary purpose of an RE etaanalysis is to obtain a confidence interval for (Equations A3 and A4) and also a confidence interval for (although soe eta-analysts inappropriately report only point estiates of and ). Even if the rando sapling and Gaussian assuptions of the RE odel could be satisfied, Field (2005) found that the HV-R and HS-R confidence intervals for have poor perforance even for very large values of. Confidence intervals for also have serious probles. Viechtbauer (2007) found that the currently available confidence interval ethods for exhibit poor perforance and their use cannot be recoended. Viechtbauer (2007) proposed a new ethod that works well if the effect-size variances within each study are treated as known constants. This assuption is not a proble in the HV-R ethod where the variance of tanh 1 ( ˆ i) is a function of known saple sizes. However, in the HV-R ethod, describes the standard deviation of tanh 1 ( i ) values and not the standard deviation of i values. The currently available confidence interval ethods for are not robust to violations of the Gaussian assuption, and ethods to detect a non-gaussian distribution are not effective. Viechtbauer (2007) also exained the perforance of a nonparaetric bootstrap confidence interval that does not assue a Gaussian distribution but found that this ethod did not perfor well. Furtherore, in any applications, ust be prohibitively large to obtain an acceptably narrow confidence interval for. Generalization beyond the study populations is the fourth criterion by which to copare Equation 4 with the RE ethods. The RE ethods assue that the studies are a rando saple fro a superpopulation of studies. If this assuption could be satisfied, then the SH-R and HV-R confidence intervals would provide stateents about the ean of all i values in the superpopulation. In contrast, Equation 4 only provides a stateent about the value of ( )/. The rando sapling assuption is critical: It cannot be taken lightly or ignored as is coonly done in RE eta-analyses. No researcher who understands eleentary statistics would take a nonrando saple of en, woan, and children fro a nearby neighborhood, apply soe inferential statistical ethod, and then clai that the results generalize to all en, woen, and children in the world. Fro eleentary statistics, we know that the researcher ust specify a study population fro which the rando saple has been obtained and that the results of inferential statistical ethods generalize fro the rando saple to the study population. In a single study, the study population is often a sall subset of soe target population that the researcher wants to investigate. For instance, the study population ay consist only of students enrolled in a large introductory psychology course. Stateents about the study population paraeter ay be generalized to a larger target population using logical arguents rather than statistical inference. If the researcher can argue convincingly that the study population paraeter should not be oderated by the deographic or design characteristics of the study population, then others ay be willing to accept the researcher s clai that the study population paraeter should be siilar to a corresponding paraeter in a particular target population. This sae process of logical arguentation ay be used with

7 META-ANALYTIC INTERVAL ESTIMATION 179 Equation 4 to extend the results beyond the study populations to soe clearly defined target population. If Equation 4 is used and generalization to a larger target population is desired, it is the responsibility of the etaanalyst to provide a detailed description of each study population and convincing reasons why ( )/ should be siilar to the average correlation in soe larger target population. This inforation should be ade public so that others ay decide for theselves if such a generalization is justified. Likewise, if RE ethods are used to describe a superpopulation, it is the responsibility of the eta-analyst to provide convincing evidence that the studies represent a legitiate rando saple fro a clearly defined superpopulation. If eta-analysts are held to this requireent by journal editors and reviewers, RE etaanalysis ethods will be used only on rare occasions. In applications where study population correlations are not expected to be equal and where the selected studies are not a rando saple fro a clearly defined superpopulation, Equation 4, which is siple to copute and perfors rearkably well, should be used instead of the HO-F, SH-F, HV-R and HS-R ethods. Equations 5 and 6 are recoended for assessing heterogeneity aong the study population correlations. References Aiken, A. C. (1935). On least squares and linear cobinations of observations. Proceedings of the Royal Statistical Society, 55, Bonett, D. G., & Wright, T. A. (2000). Saple size requireents for estiating Pearson, Kendall and Spearan correlations. Psychoetrika, 65, Brown, M. B., & Benedetti, J. K. (1977). Sapling behavior and tests for correlations in two-way contingency tables. Journal of the Aerican Statistical Association, 72, Field, A. P. (2005). Is eta-analysis of correlation coefficients accurate when population correlations vary? Psychological Methods, 10, Fisher, R. A. (1921). On the probable error of the coefficient of correlation deduced fro a sall saple. Metron, 1, Glass, G. V. (1976). Priary, secondary, and eta-analysis of research. Educational Researcher, 5, 3 8. Glass, G. V., & Hopkins, K. D. (1996). Statistical ethods in psychology and education (3rd ed.). Boston: Allyn & Bacon. Goldberger, A. S. (1991). A course in econoetrics. Cabridge, MA: Harvard University Press. Hedges, L. V., & Olkin, I. (1985). Statistical ethods in etaanalysis. Orlando, FL: Acadeic Press. Hedges, L. V., & Vevea, J. L. (1998). Fixed- and rando-effects odels in eta-analysis. Psychological Methods, 3, Hotelling, H. (1953). New light on the correlation coefficient and its transfors (with discussion). Journal of the Royal Statistical Society, Series B: Statistical Methodology, 15, Hunter, J. E., & Schidt, F. L. (2000). Fixed effects vs. rando effects eta-analysis odels: Iplications for cuulative knowledge in psychology. International Journal of Selection and Assessent, 8, Hunter, J. E., & Schidt, F. L. (2004). Methods of eta-analysis: Correcting errors and bias in research findings (2nd ed.). Newbury Park, CA: Sage. Judge, G. G., Griffiths, W. E., Hill, R. C., Lütkepohl, H., & Lee, T.-C. (1985). The theory and practice of econoetrics (2nd ed.). New York: Wiley National Research Council. (1992). Cobining inforation: Statistical issues and opportunities for research. Washington, DC: National Acadey Press. Olkin, I., & Pratt, J. W. (1958). Unbiased estiation of certain correlation coefficients. Annals of Matheatical Statistics, 29, Pearson, K. (1904). Report on certain enteric fever inoculation statistics. British Medical Journal, 3, Rozeboo, W. W. (1960). The fallacy of the null hypothesis significance test. Psychological Bulletin, 57, Shadish, W. R., & Haddock, C. K. (1994). Cobining estiates of effect size. In H. Cooper & L. V. Hedges (Eds.), Handbook of research synthesis (pp ). New York: Russell Sage Foundation. Stuart, A., & Ord, J. K. (1996). Kendall s advanced theory of statistics: Vol. 1. Distribution theory. London: Arnold. Viechtbauer, W. (2007). Confidence intervals for the aount of heterogeneity in eta-analysis. Statistics in Medicine, 26, Zou, G. Y. (2007). Toward using confidence intervals to copare correlations. Psychological Methods, 12, (Appendixes follow)

8 180 BONETT Appendix A Confidence Interval Forulas for the Hedges and Olkin (1985) Fixed-Effects Method (HO-F), the Shadish and Haddock (1994) Fixed-Effects Method (SH-F), the Hedges and Vevea (1998) Rando-Effects Method (HV-R), and the Hunter and Schidt (2004) Rando-Effects Method (HS-R) Method HO-F tanh w i tanh 1 ˆ i w i where w i n i 3. w i ˆ i w i Method SH-F where w i (n i 1)/(1 ˆ i2 ) 2. z /2 2 w / z /2 2 w / (A1) (A2) Method HV-R tanh i tanh 1 ˆ i i z /2 2 / (A3) where i [1/(n i 3) ˆ 2] 1, ˆ 2 (Q 1)/c, c w i w 2 i / w i, w i n i 3, and Q is the chi-square test statistic for a test of equal population correlations. where ˆ Method HS-R ˆ z /2 n i ˆ i ˆ 2 n i ˆ i/ i n n i. 1/2 (A4) Appendix B Coparison of Weighted and Unweighted Averages It is coon practice in eta-analysis to estiate an unweighted average of population effect sizes using a weighted average of saple effect sizes where the weights are inversely proportional to the squared standard errors. Let ˆi denote an estiator of effect size i with standard error SE( ˆi) obtained fro study i. The classic fixed-effects (FE) estiator of the average effect size 1 i is ˆ w i ˆi w i, (B1) where w i 1/SE( ˆ i) 2. The classic FE variance estiate of ˆ is var ˆ 1 i. (B2) The popularity of Equation B1 is based on the fact that, under certain conditions, it is a iniu variance linear unbiased (MVLU) estiator of, which iplies that ˆ has the sallest variance within a class of all possible unbiased estiators that are linear functions of ˆ i. The MVLU property of ˆ follows fro Aiken s theore (Aiken, 1935), which states that for Y X, (B3) where is an vector of unknown constants, E( ) 0, and cov( ) V with X and V containing known constants, then

9 META-ANALYTIC INTERVAL ESTIMATION 181 ˆ X V 1 X 1 X V 1 Y (B4) is an MVLU estiator of and cov( ˆ ) (X V 1 X) 1. Setting X to an 1 vector of ones, V 1 to a diagonal atrix with w i along the principal diagonal, and Y [ ˆ1 ˆ2... ˆ ], it is easy to show that, ˆ ˆ, and cov( ˆ ) var( ˆ). Thus ˆ (Equation B1) is an MVLU estiator of under the assuptions of Equation B3. Two key assuptions in Equation B3 require careful attention. The assuption that V is a atrix of known constants will not be satisfied when SE( ˆ i) contains estiated paraeters. For instance, the Shadish and Haddock (1994) FE ethod (SH-F) sets w i (n i 1)/(1 ˆ i2 ) 2 where ˆ i is a saple correlation and thus the weights are rando variables rather than known constants. When V contains paraeter estiates, Equation B4 is called an estiated generalized least squares estiator (see, e.g., Judge, Griffiths, Hill, Lütkepohl, & Lee, 1985, pp ) with sapling properties that, in general, ay be stated only asyptotically. Thus, when SE( ˆ i) contains estiated paraeters, Equation B2 is valid only asyptotically. Equation B4 is a weighted least squares estiator of. An unweighted least squares estiator of is X X 1 X Y and cov( ) (X X) 1 X VX(X X) 1. Setting X to an 1 vector of ones and Y [ ˆ1 ˆ2... ˆ ], it is easy to show that, 1 ˆ i, and var( ) 2 var( ˆ i). If V is a atrix of known constants, then var( ˆ) var( ) because cov( ) cov( ˆ ) is a positive seidefinite atrix (Judge et al., 1985, p. 170). These results show that a weighted average using inverse variance weighting with known values in the standard error will have a saller variance than an unweighted average. However, the choice between ˆ and should be based on the estiator ean squared error (MSE) rather than the variance. The MSE of an estiator is the su of its variance and squared bias. The bias of ˆ and also ust be taken into consideration. For a eta-analysis of saple Pearson correlations where ˆ i ˆ i and, it is known that E( ˆ i i ) rapidly approaches zero as the saple size increases (Olkin & Pratt, 1958). The saple Pearson correlation is nearly unbiased and we write E( ˆ i) i. The bias of the unweighted average of correlations is E 1 ˆi 1 E ˆi 0. Thus, (Equation 1) is a nearly unbiased estiator of 1 i. The bias of a weighted average of correlations, such as ˆ w i ˆ i/ w i, with known weights is E w i ˆi i w i 1 w i E ˆi w i w i i w i. If all weights are equal (and equal to w), the bias of the weighted average is w i i w i w 1 i w 0. Alternatively, if all population correlations are equal (and equal to ), the bias of the weighted average is w i i w i w i w i 0. A weighted average of correlations is nearly unbiased if the weights are equal or if the population correlations are equal. Another potential source of bias is introduced when transfored correlations are averaged. Specifically, tanh[ 1 tanh 1 ( i )] 1 i unless The bias in ay result in an MSE of that is considerably larger than the MSE of. The bias of a weighted average increases as the weights and the population correlations becoe ore heterogeneous. In Tables 1 and 2, it can be seen that the average widths of the Hedges and Olkin (1985) FE ethod (HO-F) and SH-F are saller than the average width of Equation 4 because var( ) var( ). However, the average coverage probabilities of the HO-F and SH-F ethods can be far below the specified level because of the bias of the weighted averages. Received May 16, 2007 Revision received March 18, 2008 Accepted March 18, 2008

An Introduction to Meta-Analysis

An Introduction to Meta-Analysis An Introduction to Meta-Analysis Douglas G. Bonett University of California, Santa Cruz How to cite this work: Bonett, D.G. (2016) An Introduction to Meta-analysis. Retrieved fro http://people.ucsc.edu/~dgbonett/eta.htl

More information

Keywords: Estimator, Bias, Mean-squared error, normality, generalized Pareto distribution

Keywords: Estimator, Bias, Mean-squared error, normality, generalized Pareto distribution Testing approxiate norality of an estiator using the estiated MSE and bias with an application to the shape paraeter of the generalized Pareto distribution J. Martin van Zyl Abstract In this work the norality

More information

TEST OF HOMOGENEITY OF PARALLEL SAMPLES FROM LOGNORMAL POPULATIONS WITH UNEQUAL VARIANCES

TEST OF HOMOGENEITY OF PARALLEL SAMPLES FROM LOGNORMAL POPULATIONS WITH UNEQUAL VARIANCES TEST OF HOMOGENEITY OF PARALLEL SAMPLES FROM LOGNORMAL POPULATIONS WITH UNEQUAL VARIANCES S. E. Ahed, R. J. Tokins and A. I. Volodin Departent of Matheatics and Statistics University of Regina Regina,

More information

Testing equality of variances for multiple univariate normal populations

Testing equality of variances for multiple univariate normal populations University of Wollongong Research Online Centre for Statistical & Survey Methodology Working Paper Series Faculty of Engineering and Inforation Sciences 0 esting equality of variances for ultiple univariate

More information

Biostatistics Department Technical Report

Biostatistics Department Technical Report Biostatistics Departent Technical Report BST006-00 Estiation of Prevalence by Pool Screening With Equal Sized Pools and a egative Binoial Sapling Model Charles R. Katholi, Ph.D. Eeritus Professor Departent

More information

ESTIMATING AND FORMING CONFIDENCE INTERVALS FOR EXTREMA OF RANDOM POLYNOMIALS. A Thesis. Presented to. The Faculty of the Department of Mathematics

ESTIMATING AND FORMING CONFIDENCE INTERVALS FOR EXTREMA OF RANDOM POLYNOMIALS. A Thesis. Presented to. The Faculty of the Department of Mathematics ESTIMATING AND FORMING CONFIDENCE INTERVALS FOR EXTREMA OF RANDOM POLYNOMIALS A Thesis Presented to The Faculty of the Departent of Matheatics San Jose State University In Partial Fulfillent of the Requireents

More information

Non-Parametric Non-Line-of-Sight Identification 1

Non-Parametric Non-Line-of-Sight Identification 1 Non-Paraetric Non-Line-of-Sight Identification Sinan Gezici, Hisashi Kobayashi and H. Vincent Poor Departent of Electrical Engineering School of Engineering and Applied Science Princeton University, Princeton,

More information

OBJECTIVES INTRODUCTION

OBJECTIVES INTRODUCTION M7 Chapter 3 Section 1 OBJECTIVES Suarize data using easures of central tendency, such as the ean, edian, ode, and idrange. Describe data using the easures of variation, such as the range, variance, and

More information

Model Fitting. CURM Background Material, Fall 2014 Dr. Doreen De Leon

Model Fitting. CURM Background Material, Fall 2014 Dr. Doreen De Leon Model Fitting CURM Background Material, Fall 014 Dr. Doreen De Leon 1 Introduction Given a set of data points, we often want to fit a selected odel or type to the data (e.g., we suspect an exponential

More information

Block designs and statistics

Block designs and statistics Bloc designs and statistics Notes for Math 447 May 3, 2011 The ain paraeters of a bloc design are nuber of varieties v, bloc size, nuber of blocs b. A design is built on a set of v eleents. Each eleent

More information

are equal to zero, where, q = p 1. For each gene j, the pairwise null and alternative hypotheses are,

are equal to zero, where, q = p 1. For each gene j, the pairwise null and alternative hypotheses are, Page of 8 Suppleentary Materials: A ultiple testing procedure for ulti-diensional pairwise coparisons with application to gene expression studies Anjana Grandhi, Wenge Guo, Shyaal D. Peddada S Notations

More information

Feature Extraction Techniques

Feature Extraction Techniques Feature Extraction Techniques Unsupervised Learning II Feature Extraction Unsupervised ethods can also be used to find features which can be useful for categorization. There are unsupervised ethods that

More information

3.3 Variational Characterization of Singular Values

3.3 Variational Characterization of Singular Values 3.3. Variational Characterization of Singular Values 61 3.3 Variational Characterization of Singular Values Since the singular values are square roots of the eigenvalues of the Heritian atrices A A and

More information

Lower Bounds for Quantized Matrix Completion

Lower Bounds for Quantized Matrix Completion Lower Bounds for Quantized Matrix Copletion Mary Wootters and Yaniv Plan Departent of Matheatics University of Michigan Ann Arbor, MI Eail: wootters, yplan}@uich.edu Mark A. Davenport School of Elec. &

More information

A Simple Regression Problem

A Simple Regression Problem A Siple Regression Proble R. M. Castro March 23, 2 In this brief note a siple regression proble will be introduced, illustrating clearly the bias-variance tradeoff. Let Y i f(x i ) + W i, i,..., n, where

More information

Machine Learning Basics: Estimators, Bias and Variance

Machine Learning Basics: Estimators, Bias and Variance Machine Learning Basics: Estiators, Bias and Variance Sargur N. srihari@cedar.buffalo.edu This is part of lecture slides on Deep Learning: http://www.cedar.buffalo.edu/~srihari/cse676 1 Topics in Basics

More information

Sampling How Big a Sample?

Sampling How Big a Sample? C. G. G. Aitken, 1 Ph.D. Sapling How Big a Saple? REFERENCE: Aitken CGG. Sapling how big a saple? J Forensic Sci 1999;44(4):750 760. ABSTRACT: It is thought that, in a consignent of discrete units, a certain

More information

Experimental Design For Model Discrimination And Precise Parameter Estimation In WDS Analysis

Experimental Design For Model Discrimination And Precise Parameter Estimation In WDS Analysis City University of New York (CUNY) CUNY Acadeic Works International Conference on Hydroinforatics 8-1-2014 Experiental Design For Model Discriination And Precise Paraeter Estiation In WDS Analysis Giovanna

More information

Inference in the Presence of Likelihood Monotonicity for Polytomous and Logistic Regression

Inference in the Presence of Likelihood Monotonicity for Polytomous and Logistic Regression Advances in Pure Matheatics, 206, 6, 33-34 Published Online April 206 in SciRes. http://www.scirp.org/journal/ap http://dx.doi.org/0.4236/ap.206.65024 Inference in the Presence of Likelihood Monotonicity

More information

Polygonal Designs: Existence and Construction

Polygonal Designs: Existence and Construction Polygonal Designs: Existence and Construction John Hegean Departent of Matheatics, Stanford University, Stanford, CA 9405 Jeff Langford Departent of Matheatics, Drake University, Des Moines, IA 5011 G

More information

A Note on the Applied Use of MDL Approximations

A Note on the Applied Use of MDL Approximations A Note on the Applied Use of MDL Approxiations Daniel J. Navarro Departent of Psychology Ohio State University Abstract An applied proble is discussed in which two nested psychological odels of retention

More information

RAFIA(MBA) TUTOR S UPLOADED FILE Course STA301: Statistics and Probability Lecture No 1 to 5

RAFIA(MBA) TUTOR S UPLOADED FILE Course STA301: Statistics and Probability Lecture No 1 to 5 Course STA0: Statistics and Probability Lecture No to 5 Multiple Choice Questions:. Statistics deals with: a) Observations b) Aggregates of facts*** c) Individuals d) Isolated ites. A nuber of students

More information

Estimation of the Mean of the Exponential Distribution Using Maximum Ranked Set Sampling with Unequal Samples

Estimation of the Mean of the Exponential Distribution Using Maximum Ranked Set Sampling with Unequal Samples Open Journal of Statistics, 4, 4, 64-649 Published Online Septeber 4 in SciRes http//wwwscirporg/ournal/os http//ddoiorg/436/os4486 Estiation of the Mean of the Eponential Distribution Using Maiu Ranked

More information

Online Publication Date: 19 April 2012 Publisher: Asian Economic and Social Society

Online Publication Date: 19 April 2012 Publisher: Asian Economic and Social Society Online Publication Date: April Publisher: Asian Econoic and Social Society Using Entropy Working Correlation Matrix in Generalized Estiating Equation for Stock Price Change Model Serpilılıç (Faculty of

More information

Ensemble Based on Data Envelopment Analysis

Ensemble Based on Data Envelopment Analysis Enseble Based on Data Envelopent Analysis So Young Sohn & Hong Choi Departent of Coputer Science & Industrial Systes Engineering, Yonsei University, Seoul, Korea Tel) 82-2-223-404, Fax) 82-2- 364-7807

More information

Extension of CSRSM for the Parametric Study of the Face Stability of Pressurized Tunnels

Extension of CSRSM for the Parametric Study of the Face Stability of Pressurized Tunnels Extension of CSRSM for the Paraetric Study of the Face Stability of Pressurized Tunnels Guilhe Mollon 1, Daniel Dias 2, and Abdul-Haid Soubra 3, M.ASCE 1 LGCIE, INSA Lyon, Université de Lyon, Doaine scientifique

More information

Topic 5a Introduction to Curve Fitting & Linear Regression

Topic 5a Introduction to Curve Fitting & Linear Regression /7/08 Course Instructor Dr. Rayond C. Rup Oice: A 337 Phone: (95) 747 6958 E ail: rcrup@utep.edu opic 5a Introduction to Curve Fitting & Linear Regression EE 4386/530 Coputational ethods in EE Outline

More information

Estimation of the Population Mean Based on Extremes Ranked Set Sampling

Estimation of the Population Mean Based on Extremes Ranked Set Sampling Aerican Journal of Matheatics Statistics 05, 5(: 3-3 DOI: 0.593/j.ajs.05050.05 Estiation of the Population Mean Based on Extrees Ranked Set Sapling B. S. Biradar,*, Santosha C. D. Departent of Studies

More information

Modeling the Structural Shifts in Real Exchange Rate with Cubic Spline Regression (CSR). Turkey

Modeling the Structural Shifts in Real Exchange Rate with Cubic Spline Regression (CSR). Turkey International Journal of Business and Social Science Vol. 2 No. 17 www.ijbssnet.co Modeling the Structural Shifts in Real Exchange Rate with Cubic Spline Regression (CSR). Turkey 1987-2008 Dr. Bahar BERBEROĞLU

More information

Best Procedures For Sample-Free Item Analysis

Best Procedures For Sample-Free Item Analysis Best Procedures For Saple-Free Ite Analysis Benjain D. Wright University of Chicago Graha A. Douglas University of Western Australia Wright s (1969) widely used "unconditional" procedure for Rasch saple-free

More information

Probability Distributions

Probability Distributions Probability Distributions In Chapter, we ephasized the central role played by probability theory in the solution of pattern recognition probles. We turn now to an exploration of soe particular exaples

More information

Proc. of the IEEE/OES Seventh Working Conference on Current Measurement Technology UNCERTAINTIES IN SEASONDE CURRENT VELOCITIES

Proc. of the IEEE/OES Seventh Working Conference on Current Measurement Technology UNCERTAINTIES IN SEASONDE CURRENT VELOCITIES Proc. of the IEEE/OES Seventh Working Conference on Current Measureent Technology UNCERTAINTIES IN SEASONDE CURRENT VELOCITIES Belinda Lipa Codar Ocean Sensors 15 La Sandra Way, Portola Valley, CA 98 blipa@pogo.co

More information

DEPARTMENT OF ECONOMETRICS AND BUSINESS STATISTICS

DEPARTMENT OF ECONOMETRICS AND BUSINESS STATISTICS ISSN 1440-771X AUSTRALIA DEPARTMENT OF ECONOMETRICS AND BUSINESS STATISTICS An Iproved Method for Bandwidth Selection When Estiating ROC Curves Peter G Hall and Rob J Hyndan Working Paper 11/00 An iproved

More information

13.2 Fully Polynomial Randomized Approximation Scheme for Permanent of Random 0-1 Matrices

13.2 Fully Polynomial Randomized Approximation Scheme for Permanent of Random 0-1 Matrices CS71 Randoness & Coputation Spring 018 Instructor: Alistair Sinclair Lecture 13: February 7 Disclaier: These notes have not been subjected to the usual scrutiny accorded to foral publications. They ay

More information

E0 370 Statistical Learning Theory Lecture 6 (Aug 30, 2011) Margin Analysis

E0 370 Statistical Learning Theory Lecture 6 (Aug 30, 2011) Margin Analysis E0 370 tatistical Learning Theory Lecture 6 (Aug 30, 20) Margin Analysis Lecturer: hivani Agarwal cribe: Narasihan R Introduction In the last few lectures we have seen how to obtain high confidence bounds

More information

Measures of average are called measures of central tendency and include the mean, median, mode, and midrange.

Measures of average are called measures of central tendency and include the mean, median, mode, and midrange. CHAPTER 3 Data Description Objectives Suarize data using easures of central tendency, such as the ean, edian, ode, and idrange. Describe data using the easures of variation, such as the range, variance,

More information

Bootstrapping Dependent Data

Bootstrapping Dependent Data Bootstrapping Dependent Data One of the key issues confronting bootstrap resapling approxiations is how to deal with dependent data. Consider a sequence fx t g n t= of dependent rando variables. Clearly

More information

Examining the Effects of Site Selection Criteria for Evaluating the Effectiveness of Traffic Safety Countermeasures

Examining the Effects of Site Selection Criteria for Evaluating the Effectiveness of Traffic Safety Countermeasures Exaining the Effects of Site Selection Criteria for Evaluating the Effectiveness of Traffic Safety Countereasures Doinique Lord* Associate Professor and Zachry Developent Professor I Zachry Departent of

More information

Sexually Transmitted Diseases VMED 5180 September 27, 2016

Sexually Transmitted Diseases VMED 5180 September 27, 2016 Sexually Transitted Diseases VMED 518 Septeber 27, 216 Introduction Two sexually-transitted disease (STD) odels are presented below. The irst is a susceptibleinectious-susceptible (SIS) odel (Figure 1)

More information

Interactive Markov Models of Evolutionary Algorithms

Interactive Markov Models of Evolutionary Algorithms Cleveland State University EngagedScholarship@CSU Electrical Engineering & Coputer Science Faculty Publications Electrical Engineering & Coputer Science Departent 2015 Interactive Markov Models of Evolutionary

More information

General Properties of Radiation Detectors Supplements

General Properties of Radiation Detectors Supplements Phys. 649: Nuclear Techniques Physics Departent Yarouk University Chapter 4: General Properties of Radiation Detectors Suppleents Dr. Nidal M. Ershaidat Overview Phys. 649: Nuclear Techniques Physics Departent

More information

When Short Runs Beat Long Runs

When Short Runs Beat Long Runs When Short Runs Beat Long Runs Sean Luke George Mason University http://www.cs.gu.edu/ sean/ Abstract What will yield the best results: doing one run n generations long or doing runs n/ generations long

More information

Analyzing Simulation Results

Analyzing Simulation Results Analyzing Siulation Results Dr. John Mellor-Cruey Departent of Coputer Science Rice University johnc@cs.rice.edu COMP 528 Lecture 20 31 March 2005 Topics for Today Model verification Model validation Transient

More information

Pattern Recognition and Machine Learning. Artificial Neural networks

Pattern Recognition and Machine Learning. Artificial Neural networks Pattern Recognition and Machine Learning Jaes L. Crowley ENSIMAG 3 - MMIS Fall Seester 2017 Lessons 7 20 Dec 2017 Outline Artificial Neural networks Notation...2 Introduction...3 Key Equations... 3 Artificial

More information

Correcting a Significance Test for Clustering in Designs With Two Levels of Nesting

Correcting a Significance Test for Clustering in Designs With Two Levels of Nesting Institute for Policy Research Northwestern University Working Paper Series WP-07-4 orrecting a Significance est for lustering in Designs With wo Levels of Nesting Larry V. Hedges Faculty Fellow, Institute

More information

A method to determine relative stroke detection efficiencies from multiplicity distributions

A method to determine relative stroke detection efficiencies from multiplicity distributions A ethod to deterine relative stroke detection eiciencies ro ultiplicity distributions Schulz W. and Cuins K. 2. Austrian Lightning Detection and Inoration Syste (ALDIS), Kahlenberger Str.2A, 90 Vienna,

More information

1 Proof of learning bounds

1 Proof of learning bounds COS 511: Theoretical Machine Learning Lecturer: Rob Schapire Lecture #4 Scribe: Akshay Mittal February 13, 2013 1 Proof of learning bounds For intuition of the following theore, suppose there exists a

More information

In this chapter, we consider several graph-theoretic and probabilistic models

In this chapter, we consider several graph-theoretic and probabilistic models THREE ONE GRAPH-THEORETIC AND STATISTICAL MODELS 3.1 INTRODUCTION In this chapter, we consider several graph-theoretic and probabilistic odels for a social network, which we do under different assuptions

More information

Generalized Augmentation for Control of the k-familywise Error Rate

Generalized Augmentation for Control of the k-familywise Error Rate International Journal of Statistics in Medical Research, 2012, 1, 113-119 113 Generalized Augentation for Control of the k-failywise Error Rate Alessio Farcoeni* Departent of Public Health and Infectious

More information

Soft Computing Techniques Help Assign Weights to Different Factors in Vulnerability Analysis

Soft Computing Techniques Help Assign Weights to Different Factors in Vulnerability Analysis Soft Coputing Techniques Help Assign Weights to Different Factors in Vulnerability Analysis Beverly Rivera 1,2, Irbis Gallegos 1, and Vladik Kreinovich 2 1 Regional Cyber and Energy Security Center RCES

More information

A Simplified Analytical Approach for Efficiency Evaluation of the Weaving Machines with Automatic Filling Repair

A Simplified Analytical Approach for Efficiency Evaluation of the Weaving Machines with Automatic Filling Repair Proceedings of the 6th SEAS International Conference on Siulation, Modelling and Optiization, Lisbon, Portugal, Septeber -4, 006 0 A Siplified Analytical Approach for Efficiency Evaluation of the eaving

More information

Kinetic Theory of Gases: Elementary Ideas

Kinetic Theory of Gases: Elementary Ideas Kinetic Theory of Gases: Eleentary Ideas 17th February 2010 1 Kinetic Theory: A Discussion Based on a Siplified iew of the Motion of Gases 1.1 Pressure: Consul Engel and Reid Ch. 33.1) for a discussion

More information

Support Vector Machine Classification of Uncertain and Imbalanced data using Robust Optimization

Support Vector Machine Classification of Uncertain and Imbalanced data using Robust Optimization Recent Researches in Coputer Science Support Vector Machine Classification of Uncertain and Ibalanced data using Robust Optiization RAGHAV PAT, THEODORE B. TRAFALIS, KASH BARKER School of Industrial Engineering

More information

arxiv: v1 [stat.ot] 7 Jul 2010

arxiv: v1 [stat.ot] 7 Jul 2010 Hotelling s test for highly correlated data P. Bubeliny e-ail: bubeliny@karlin.ff.cuni.cz Charles University, Faculty of Matheatics and Physics, KPMS, Sokolovska 83, Prague, Czech Republic, 8675. arxiv:007.094v

More information

Estimating Parameters for a Gaussian pdf

Estimating Parameters for a Gaussian pdf Pattern Recognition and achine Learning Jaes L. Crowley ENSIAG 3 IS First Seester 00/0 Lesson 5 7 Noveber 00 Contents Estiating Paraeters for a Gaussian pdf Notation... The Pattern Recognition Proble...3

More information

Pattern Recognition and Machine Learning. Artificial Neural networks

Pattern Recognition and Machine Learning. Artificial Neural networks Pattern Recognition and Machine Learning Jaes L. Crowley ENSIMAG 3 - MMIS Fall Seester 2016 Lessons 7 14 Dec 2016 Outline Artificial Neural networks Notation...2 1. Introduction...3... 3 The Artificial

More information

Uniform Approximation and Bernstein Polynomials with Coefficients in the Unit Interval

Uniform Approximation and Bernstein Polynomials with Coefficients in the Unit Interval Unifor Approxiation and Bernstein Polynoials with Coefficients in the Unit Interval Weiang Qian and Marc D. Riedel Electrical and Coputer Engineering, University of Minnesota 200 Union St. S.E. Minneapolis,

More information

Lost-Sales Problems with Stochastic Lead Times: Convexity Results for Base-Stock Policies

Lost-Sales Problems with Stochastic Lead Times: Convexity Results for Base-Stock Policies OPERATIONS RESEARCH Vol. 52, No. 5, Septeber October 2004, pp. 795 803 issn 0030-364X eissn 1526-5463 04 5205 0795 infors doi 10.1287/opre.1040.0130 2004 INFORMS TECHNICAL NOTE Lost-Sales Probles with

More information

Figure 1: Equivalent electric (RC) circuit of a neurons membrane

Figure 1: Equivalent electric (RC) circuit of a neurons membrane Exercise: Leaky integrate and fire odel of neural spike generation This exercise investigates a siplified odel of how neurons spike in response to current inputs, one of the ost fundaental properties of

More information

A Semi-Parametric Approach to Account for Complex. Designs in Multiple Imputation

A Semi-Parametric Approach to Account for Complex. Designs in Multiple Imputation A Sei-Paraetric Approach to Account for Coplex Designs in ultiple Iputation Hanzhi Zhou, Trivellore E. Raghunathan and ichael R. Elliott Progra in Survey ethodology, University of ichigan Departent of

More information

On Conditions for Linearity of Optimal Estimation

On Conditions for Linearity of Optimal Estimation On Conditions for Linearity of Optial Estiation Erah Akyol, Kuar Viswanatha and Kenneth Rose {eakyol, kuar, rose}@ece.ucsb.edu Departent of Electrical and Coputer Engineering University of California at

More information

Testing the lag length of vector autoregressive models: A power comparison between portmanteau and Lagrange multiplier tests

Testing the lag length of vector autoregressive models: A power comparison between portmanteau and Lagrange multiplier tests Working Papers 2017-03 Testing the lag length of vector autoregressive odels: A power coparison between portanteau and Lagrange ultiplier tests Raja Ben Hajria National Engineering School, University of

More information

Handwriting Detection Model Based on Four-Dimensional Vector Space Model

Handwriting Detection Model Based on Four-Dimensional Vector Space Model Journal of Matheatics Research; Vol. 10, No. 4; August 2018 ISSN 1916-9795 E-ISSN 1916-9809 Published by Canadian Center of Science and Education Handwriting Detection Model Based on Four-Diensional Vector

More information

Kinetic Theory of Gases: Elementary Ideas

Kinetic Theory of Gases: Elementary Ideas Kinetic Theory of Gases: Eleentary Ideas 9th February 011 1 Kinetic Theory: A Discussion Based on a Siplified iew of the Motion of Gases 1.1 Pressure: Consul Engel and Reid Ch. 33.1) for a discussion of

More information

1 Bounding the Margin

1 Bounding the Margin COS 511: Theoretical Machine Learning Lecturer: Rob Schapire Lecture #12 Scribe: Jian Min Si March 14, 2013 1 Bounding the Margin We are continuing the proof of a bound on the generalization error of AdaBoost

More information

Algorithms for parallel processor scheduling with distinct due windows and unit-time jobs

Algorithms for parallel processor scheduling with distinct due windows and unit-time jobs BULLETIN OF THE POLISH ACADEMY OF SCIENCES TECHNICAL SCIENCES Vol. 57, No. 3, 2009 Algoriths for parallel processor scheduling with distinct due windows and unit-tie obs A. JANIAK 1, W.A. JANIAK 2, and

More information

On the Maximum Likelihood Estimation of Weibull Distribution with Lifetime Data of Hard Disk Drives

On the Maximum Likelihood Estimation of Weibull Distribution with Lifetime Data of Hard Disk Drives 314 Int'l Conf. Par. and Dist. Proc. Tech. and Appl. PDPTA'17 On the Maxiu Likelihood Estiation of Weibull Distribution with Lifetie Data of Hard Disk Drives Daiki Koizui Departent of Inforation and Manageent

More information

arxiv: v1 [cs.ds] 3 Feb 2014

arxiv: v1 [cs.ds] 3 Feb 2014 arxiv:40.043v [cs.ds] 3 Feb 04 A Bound on the Expected Optiality of Rando Feasible Solutions to Cobinatorial Optiization Probles Evan A. Sultani The Johns Hopins University APL evan@sultani.co http://www.sultani.co/

More information

e-companion ONLY AVAILABLE IN ELECTRONIC FORM

e-companion ONLY AVAILABLE IN ELECTRONIC FORM OPERATIONS RESEARCH doi 10.1287/opre.1070.0427ec pp. ec1 ec5 e-copanion ONLY AVAILABLE IN ELECTRONIC FORM infors 07 INFORMS Electronic Copanion A Learning Approach for Interactive Marketing to a Custoer

More information

Comparing Probabilistic Forecasting Systems with the Brier Score

Comparing Probabilistic Forecasting Systems with the Brier Score 1076 W E A T H E R A N D F O R E C A S T I N G VOLUME 22 Coparing Probabilistic Forecasting Systes with the Brier Score CHRISTOPHER A. T. FERRO School of Engineering, Coputing and Matheatics, University

More information

Curious Bounds for Floor Function Sums

Curious Bounds for Floor Function Sums 1 47 6 11 Journal of Integer Sequences, Vol. 1 (018), Article 18.1.8 Curious Bounds for Floor Function Sus Thotsaporn Thanatipanonda and Elaine Wong 1 Science Division Mahidol University International

More information

What is Probability? (again)

What is Probability? (again) INRODUCTION TO ROBBILITY Basic Concepts and Definitions n experient is any process that generates well-defined outcoes. Experient: Record an age Experient: Toss a die Experient: Record an opinion yes,

More information

This article appeared in a journal published by Elsevier. The attached copy is furnished to the author for internal non-commercial research and

This article appeared in a journal published by Elsevier. The attached copy is furnished to the author for internal non-commercial research and This article appeared in a ournal published by Elsevier. The attached copy is furnished to the author for internal non-coercial research and education use, including for instruction at the authors institution

More information

COS 424: Interacting with Data. Written Exercises

COS 424: Interacting with Data. Written Exercises COS 424: Interacting with Data Hoework #4 Spring 2007 Regression Due: Wednesday, April 18 Written Exercises See the course website for iportant inforation about collaboration and late policies, as well

More information

A MESHSIZE BOOSTING ALGORITHM IN KERNEL DENSITY ESTIMATION

A MESHSIZE BOOSTING ALGORITHM IN KERNEL DENSITY ESTIMATION A eshsize boosting algorith in kernel density estiation A MESHSIZE BOOSTING ALGORITHM IN KERNEL DENSITY ESTIMATION C.C. Ishiekwene, S.M. Ogbonwan and J.E. Osewenkhae Departent of Matheatics, University

More information

Best Arm Identification: A Unified Approach to Fixed Budget and Fixed Confidence

Best Arm Identification: A Unified Approach to Fixed Budget and Fixed Confidence Best Ar Identification: A Unified Approach to Fixed Budget and Fixed Confidence Victor Gabillon Mohaad Ghavazadeh Alessandro Lazaric INRIA Lille - Nord Europe, Tea SequeL {victor.gabillon,ohaad.ghavazadeh,alessandro.lazaric}@inria.fr

More information

Bayesian Approach for Fatigue Life Prediction from Field Inspection

Bayesian Approach for Fatigue Life Prediction from Field Inspection Bayesian Approach for Fatigue Life Prediction fro Field Inspection Dawn An and Jooho Choi School of Aerospace & Mechanical Engineering, Korea Aerospace University, Goyang, Seoul, Korea Srira Pattabhiraan

More information

TABLE FOR UPPER PERCENTAGE POINTS OF THE LARGEST ROOT OF A DETERMINANTAL EQUATION WITH FIVE ROOTS. By William W. Chen

TABLE FOR UPPER PERCENTAGE POINTS OF THE LARGEST ROOT OF A DETERMINANTAL EQUATION WITH FIVE ROOTS. By William W. Chen TABLE FOR UPPER PERCENTAGE POINTS OF THE LARGEST ROOT OF A DETERMINANTAL EQUATION WITH FIVE ROOTS By Willia W. Chen The distribution of the non-null characteristic roots of a atri derived fro saple observations

More information

THE CONSTRUCTION OF GOOD EXTENSIBLE RANK-1 LATTICES. 1. Introduction We are interested in approximating a high dimensional integral [0,1]

THE CONSTRUCTION OF GOOD EXTENSIBLE RANK-1 LATTICES. 1. Introduction We are interested in approximating a high dimensional integral [0,1] MATHEMATICS OF COMPUTATION Volue 00, Nuber 0, Pages 000 000 S 0025-578(XX)0000-0 THE CONSTRUCTION OF GOOD EXTENSIBLE RANK- LATTICES JOSEF DICK, FRIEDRICH PILLICHSHAMMER, AND BENJAMIN J. WATERHOUSE Abstract.

More information

The proofs of Theorem 1-3 are along the lines of Wied and Galeano (2013).

The proofs of Theorem 1-3 are along the lines of Wied and Galeano (2013). A Appendix: Proofs The proofs of Theore 1-3 are along the lines of Wied and Galeano (2013) Proof of Theore 1 Let D[d 1, d 2 ] be the space of càdlàg functions on the interval [d 1, d 2 ] equipped with

More information

SPECTRUM sensing is a core concept of cognitive radio

SPECTRUM sensing is a core concept of cognitive radio World Acadey of Science, Engineering and Technology International Journal of Electronics and Counication Engineering Vol:6, o:2, 202 Efficient Detection Using Sequential Probability Ratio Test in Mobile

More information

Best Linear Unbiased and Invariant Reconstructors for the Past Records

Best Linear Unbiased and Invariant Reconstructors for the Past Records BULLETIN of the MALAYSIAN MATHEMATICAL SCIENCES SOCIETY http:/athusy/bulletin Bull Malays Math Sci Soc (2) 37(4) (2014), 1017 1028 Best Linear Unbiased and Invariant Reconstructors for the Past Records

More information

Upper bound on false alarm rate for landmine detection and classification using syntactic pattern recognition

Upper bound on false alarm rate for landmine detection and classification using syntactic pattern recognition Upper bound on false alar rate for landine detection and classification using syntactic pattern recognition Ahed O. Nasif, Brian L. Mark, Kenneth J. Hintz, and Nathalia Peixoto Dept. of Electrical and

More information

Generalized eigenfunctions and a Borel Theorem on the Sierpinski Gasket.

Generalized eigenfunctions and a Borel Theorem on the Sierpinski Gasket. Generalized eigenfunctions and a Borel Theore on the Sierpinski Gasket. Kasso A. Okoudjou, Luke G. Rogers, and Robert S. Strichartz May 26, 2006 1 Introduction There is a well developed theory (see [5,

More information

Weighted- 1 minimization with multiple weighting sets

Weighted- 1 minimization with multiple weighting sets Weighted- 1 iniization with ultiple weighting sets Hassan Mansour a,b and Özgür Yılaza a Matheatics Departent, University of British Colubia, Vancouver - BC, Canada; b Coputer Science Departent, University

More information

A proposal for a First-Citation-Speed-Index Link Peer-reviewed author version

A proposal for a First-Citation-Speed-Index Link Peer-reviewed author version A proposal for a First-Citation-Speed-Index Link Peer-reviewed author version Made available by Hasselt University Library in Docuent Server@UHasselt Reference (Published version): EGGHE, Leo; Bornann,

More information

arxiv: v1 [cs.ds] 29 Jan 2012

arxiv: v1 [cs.ds] 29 Jan 2012 A parallel approxiation algorith for ixed packing covering seidefinite progras arxiv:1201.6090v1 [cs.ds] 29 Jan 2012 Rahul Jain National U. Singapore January 28, 2012 Abstract Penghui Yao National U. Singapore

More information

Intelligent Systems: Reasoning and Recognition. Artificial Neural Networks

Intelligent Systems: Reasoning and Recognition. Artificial Neural Networks Intelligent Systes: Reasoning and Recognition Jaes L. Crowley MOSIG M1 Winter Seester 2018 Lesson 7 1 March 2018 Outline Artificial Neural Networks Notation...2 Introduction...3 Key Equations... 3 Artificial

More information

Pattern Recognition and Machine Learning. Learning and Evaluation for Pattern Recognition

Pattern Recognition and Machine Learning. Learning and Evaluation for Pattern Recognition Pattern Recognition and Machine Learning Jaes L. Crowley ENSIMAG 3 - MMIS Fall Seester 2017 Lesson 1 4 October 2017 Outline Learning and Evaluation for Pattern Recognition Notation...2 1. The Pattern Recognition

More information

E. Alpaydın AERFAISS

E. Alpaydın AERFAISS E. Alpaydın AERFAISS 00 Introduction Questions: Is the error rate of y classifier less than %? Is k-nn ore accurate than MLP? Does having PCA before iprove accuracy? Which kernel leads to highest accuracy

More information

Computable Shell Decomposition Bounds

Computable Shell Decomposition Bounds Coputable Shell Decoposition Bounds John Langford TTI-Chicago jcl@cs.cu.edu David McAllester TTI-Chicago dac@autoreason.co Editor: Leslie Pack Kaelbling and David Cohn Abstract Haussler, Kearns, Seung

More information

ADVANCES ON THE BESSIS- MOUSSA-VILLANI TRACE CONJECTURE

ADVANCES ON THE BESSIS- MOUSSA-VILLANI TRACE CONJECTURE ADVANCES ON THE BESSIS- MOUSSA-VILLANI TRACE CONJECTURE CHRISTOPHER J. HILLAR Abstract. A long-standing conjecture asserts that the polynoial p(t = Tr(A + tb ] has nonnegative coefficients whenever is

More information

Moments of the product and ratio of two correlated chi-square variables

Moments of the product and ratio of two correlated chi-square variables Stat Papers 009 50:581 59 DOI 10.1007/s0036-007-0105-0 REGULAR ARTICLE Moents of the product and ratio of two correlated chi-square variables Anwar H. Joarder Received: June 006 / Revised: 8 October 007

More information

Mathematical Models to Determine Stable Behavior of Complex Systems

Mathematical Models to Determine Stable Behavior of Complex Systems Journal of Physics: Conference Series PAPER OPEN ACCESS Matheatical Models to Deterine Stable Behavior of Coplex Systes To cite this article: V I Suin et al 08 J. Phys.: Conf. Ser. 05 0336 View the article

More information

Synchronization in large directed networks of coupled phase oscillators

Synchronization in large directed networks of coupled phase oscillators CHAOS 16, 015107 2005 Synchronization in large directed networks of coupled phase oscillators Juan G. Restrepo a Institute for Research in Electronics and Applied Physics, University of Maryland, College

More information

International Journal of Pure and Applied Mathematics Volume 37 No , IMPROVED DATA DRIVEN CONTROL CHARTS

International Journal of Pure and Applied Mathematics Volume 37 No , IMPROVED DATA DRIVEN CONTROL CHARTS International Journal of Pure and Applied Matheatics Volue 37 No. 3 2007, 423-438 IMPROVED DATA DRIVEN CONTROL CHARTS Wille Albers 1, Wilbert C.M. Kallenberg 2 1,2 Departent of Applied Matheatics Faculty

More information

AN EFFICIENT CLASS OF CHAIN ESTIMATORS OF POPULATION VARIANCE UNDER SUB-SAMPLING SCHEME

AN EFFICIENT CLASS OF CHAIN ESTIMATORS OF POPULATION VARIANCE UNDER SUB-SAMPLING SCHEME J. Japan Statist. Soc. Vol. 35 No. 005 73 86 AN EFFICIENT CLASS OF CHAIN ESTIMATORS OF POPULATION VARIANCE UNDER SUB-SAMPLING SCHEME H. S. Jhajj*, M. K. Shara* and Lovleen Kuar Grover** For estiating the

More information

Is the seismic moment frequency relation universal?

Is the seismic moment frequency relation universal? Geophys. J. Int. (2) 142, 193 198 Is the seisic oent frequency relation universal? C. Godano1 and F. Pingue2 1 Dipartiento di Scienze Abientali, Seconda Università di Napoli, via V ivaldi 43, 811 Caserta,

More information

C na (1) a=l. c = CO + Clm + CZ TWO-STAGE SAMPLE DESIGN WITH SMALL CLUSTERS. 1. Introduction

C na (1) a=l. c = CO + Clm + CZ TWO-STAGE SAMPLE DESIGN WITH SMALL CLUSTERS. 1. Introduction TWO-STGE SMPLE DESIGN WITH SMLL CLUSTERS Robert G. Clark and David G. Steel School of Matheatics and pplied Statistics, University of Wollongong, NSW 5 ustralia. (robert.clark@abs.gov.au) Key Words: saple

More information

Research in Area of Longevity of Sylphon Scraies

Research in Area of Longevity of Sylphon Scraies IOP Conference Series: Earth and Environental Science PAPER OPEN ACCESS Research in Area of Longevity of Sylphon Scraies To cite this article: Natalia Y Golovina and Svetlana Y Krivosheeva 2018 IOP Conf.

More information