Bias Correction in Estimation of the Population Correlation Coefficient

Similar documents
Comparison of Dual to Ratio-Cum-Product Estimators of Population Mean

Chapter 8: Statistical Analysis of Simulated Data

A New Family of Transformations for Lifetime Data

{ }{ ( )} (, ) = ( ) ( ) ( ) Chapter 14 Exercises in Sampling Theory. Exercise 1 (Simple random sampling): Solution:

(Monte Carlo) Resampling Technique in Validity Testing and Reliability Testing

Application of Calibration Approach for Regression Coefficient Estimation under Two-stage Sampling Design

Summary of the lecture in Biostatistics

Some Applications of the Resampling Methods in Computational Physics

Econometric Methods. Review of Estimation

UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS

Bootstrap Method for Testing of Equality of Several Coefficients of Variation

Simple Linear Regression

VOL. 3, NO. 11, November 2013 ISSN ARPN Journal of Science and Technology All rights reserved.

Analysis of Variance with Weibull Data

Comparing Different Estimators of three Parameters for Transmuted Weibull Distribution

Bayes Interval Estimation for binomial proportion and difference of two binomial proportions with Simulation Study

LECTURE - 4 SIMPLE RANDOM SAMPLING DR. SHALABH DEPARTMENT OF MATHEMATICS AND STATISTICS INDIAN INSTITUTE OF TECHNOLOGY KANPUR

Lecture 3. Sampling, sampling distributions, and parameter estimation

UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS

Multiple Linear Regression Analysis

Bayes Estimator for Exponential Distribution with Extension of Jeffery Prior Information

Mean is only appropriate for interval or ratio scales, not ordinal or nominal.

( ) = ( ) ( ) Chapter 13 Asymptotic Theory and Stochastic Regressors. Stochastic regressors model

Multivariate Transformation of Variables and Maximum Likelihood Estimation

Chapter 13, Part A Analysis of Variance and Experimental Design. Introduction to Analysis of Variance. Introduction to Analysis of Variance

ENGI 3423 Simple Linear Regression Page 12-01

Comparison of Parameters of Lognormal Distribution Based On the Classical and Posterior Estimates

Estimation of Stress- Strength Reliability model using finite mixture of exponential distributions

BAYESIAN INFERENCES FOR TWO PARAMETER WEIBULL DISTRIBUTION

Bayesian Inferences for Two Parameter Weibull Distribution Kipkoech W. Cheruiyot 1, Abel Ouko 2, Emily Kirimi 3

Estimation and Testing in Type-II Generalized Half Logistic Distribution

Confidence Interval Estimations of the Parameter for One Parameter Exponential Distribution

Chapter 11 Systematic Sampling

Chapter 8. Inferences about More Than Two Population Central Values

To use adaptive cluster sampling we must first make some definitions of the sampling universe:

: At least two means differ SST

Lecture 7. Confidence Intervals and Hypothesis Tests in the Simple CLR Model

STATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS. x, where. = y - ˆ " 1

Module 7: Probability and Statistics

Simulation Output Analysis

Class 13,14 June 17, 19, 2015

Simple Linear Regression

Special Instructions / Useful Data

Chapter 13 Student Lecture Notes 13-1

Estimation of Population Total using Local Polynomial Regression with Two Auxiliary Variables

Median as a Weighted Arithmetic Mean of All Sample Observations

Study of Correlation using Bayes Approach under bivariate Distributions

ADAPTIVE CLUSTER SAMPLING USING AUXILIARY VARIABLE

ρ < 1 be five real numbers. The

ENGI 4421 Joint Probability Distributions Page Joint Probability Distributions [Navidi sections 2.5 and 2.6; Devore sections

Ordinary Least Squares Regression. Simple Regression. Algebra and Assumptions.

ENGI 4421 Propagation of Error Page 8-01

A Combination of Adaptive and Line Intercept Sampling Applicable in Agricultural and Environmental Studies

Minimax Estimation of the Parameter of the Burr Type Xii Distribution

Goodness of Fit Test for The Skew-T Distribution

Statistics MINITAB - Lab 5

12.2 Estimating Model parameters Assumptions: ox and y are related according to the simple linear regression model

Chapter 3 Sampling For Proportions and Percentages

ON ESTIMATION OF POPULATION MEAN IN THE PRESENCE OF MEASUREMENT ERROR AND NON-RESPONSE

STA302/1001-Fall 2008 Midterm Test October 21, 2008

ECONOMETRIC THEORY. MODULE VIII Lecture - 26 Heteroskedasticity

Applied Mathematics and Computation

IFYMB002 Mathematics Business Appendix C Formula Booklet

Faculty Research Interest Seminar Department of Biostatistics, GSPH University of Pittsburgh. Gong Tang Feb. 18, 2005

Parameter Estimation in Generalized Linear Models through

Lecture Notes Types of economic variables

Module 7. Lecture 7: Statistical parameter estimation

TESTS BASED ON MAXIMUM LIKELIHOOD

Chapter Two. An Introduction to Regression ( )

A WEIGHTED LEAST SQUARES METHOD FOR ESTIMATING THE SUCCESS RATE IN CLUSTERED BINARY DATA

Chapter 11 The Analysis of Variance

Chapter -2 Simple Random Sampling

hp calculators HP 30S Statistics Averages and Standard Deviations Average and Standard Deviation Practice Finding Averages and Standard Deviations

STK4011 and STK9011 Autumn 2016

Simple Linear Regression and Correlation. Applied Statistics and Probability for Engineers. Chapter 11 Simple Linear Regression and Correlation

Sampling Theory MODULE X LECTURE - 35 TWO STAGE SAMPLING (SUB SAMPLING)

Chapter Business Statistics: A First Course Fifth Edition. Learning Objectives. Correlation vs. Regression. In this chapter, you learn:

Chapter 10 Two Stage Sampling (Subsampling)

b. There appears to be a positive relationship between X and Y; that is, as X increases, so does Y.

Permutation Tests for More Than Two Samples

Improving coverage probabilities of confidence intervals in random effects meta-analysis with publication bias

Chapter -2 Simple Random Sampling

Chapter 14 Logistic Regression Models

THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE

Correlation and Simple Linear Regression

CLASS NOTES. for. PBAF 528: Quantitative Methods II SPRING Instructor: Jean Swanson. Daniel J. Evans School of Public Affairs

THE ROYAL STATISTICAL SOCIETY GRADUATE DIPLOMA

Statistics. Correlational. Dr. Ayman Eldeib. Simple Linear Regression and Correlation. SBE 304: Linear Regression & Correlation 1/3/2018

Chapter 4 Multiple Random Variables

Estimation of the Loss and Risk Functions of Parameter of Maxwell Distribution

STA 105-M BASIC STATISTICS (This is a multiple choice paper.)

f f... f 1 n n (ii) Median : It is the value of the middle-most observation(s).

General Families of Estimators for Estimating Population Mean in Stratified Random Sampling under Non-Response

ESS Line Fitting

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

GOALS The Samples Why Sample the Population? What is a Probability Sample? Four Most Commonly Used Probability Sampling Methods

Probability and. Lecture 13: and Correlation

The Generalized Inverted Generalized Exponential Distribution with an Application to a Censored Data

ε. Therefore, the estimate

Horvitz-Thompson Variance Weights: Exact vs. Approximate

Transcription:

Kasetsart J. (Nat. Sc.) 47 : 453-459 (3) Bas Correcto Estmato of the opulato Correlato Coeffcet Juthaphor Ssomboothog ABSTRACT A estmator of the populato correlato coeffcet of two varables for a bvarate ormal dstrbuto was proposed ad evaluated usg comparsos wth the earso correlato coeffcet ad a estmator of Olk ad ratt, coducted usg a smulato study. It was foud that for a small sample sze of, the absolute bas of the proposed estmator was less tha those of the earso correlato coeffcet ad a estmator of Olk ad ratt. I addto, the mea square errors of those estmators seemed to have o dfferece each stuato for ths study. Keywords: earso correlato coeffcet, bvarate ormal dstrbuto, absolute bas, mea square error, estmator. Itroducto The earso correlato coeffcet s oe of the most frequetly used tools of researchers for correlato coeffcet vestgato as metoed by Rodgers ad Ncewader (988) ad Huso et al. (7). Ufortuately, Neter et al. (996) ad Zmmerma et al. (3) cosdered that t was a based estmator for the populato correlato coeffcet (ρ). Furthermore, the bas decreased whe the sample sze creased ad t was zero whe the populato correlato coeffcet was zero or oe. I addto, ths coformed to the research of Ssomboothog (a). A covetoal estmate of the earso correlato coeffcet s lkely to uderestmate the populato correlato coeffcet (Gorsuch ad Lehma, ; Adolph ad Hard, 7; Zmmerma et al., 3) because the dstrbuto of ths estmator s asymmetrcal (Fsher, 9). Therefore, research actvty to fd a correcto for bas the estmato of the correlato coeffcet has already bee udertake. Fsher (95) publshed a approxmately ubased estmator of the populato correlato coeffcet samples from a deftely large populato. Later, Olk ad ratt (958) developed the earso correlato coeffcet to decrease the amout of bas of ths estmator for two varables havg a bvarate ormal dstrbuto wth equal varaces. The Zmmerma et al. (3) demostrated that the bas of the earso correlato coeffcet was almost elmated by Fsher s estmator ad a related estmator proposed by Olk ad ratt (958). I the preset study, a estmator of the populato correlato coeffcet of two varables for a bvarate ormal dstrbuto was proposed ad the jackkfe method (Queoulle, 949; Tukey, 958) was appled for bas reducto Departmet of Statstcs, Faculty of Scece, Kasetsart Uversty, Bagkok 9, Thalad. E-mal: fscjps@ku.ac.th Receved date : 4//3 Accepted date : 8/5/3

454 Kasetsart J. (Nat. Sc.) 47(3) (Efro ad Tbshra, 993; Smth ad otus, 6; Ssomboothog, a, b). Furthermore, comparsos of the absolute bases ad mea square errors of three estmators the proposed estmator, the earso correlato coeffcet, ad a estmator of Olk ad ratt were performed by a smulato study. Materals ad Methods Ths study proposed a estmator of the populato correlato coeffcet ad appled the jackkfe method for bas reducto of the earso correlato coeffcet. I order to emprcally evaluate the valdty ad relablty of the proposed estmator, a smulato study was coducted for 5 stuatos. The, comparsos of the absolute bas ad mea square error of the proposed estmator ad two sample correlato coeffcets the earso correlato coeffcet ad the Olk ad ratt estmator were emprcally performed. Sample correlato coeffcet Let (x, y ),..., (x, y ) be a radom sample from a bvarate ormal dstrbuto wth meas μ, μ varaces σ, σ ad the populato correlato coeffcet ρ. It s well kow that the maxmum lkelhood estmator of ρ, deoted by ρ, s gve by Equato ρ x ad y ( x x)( y y) ( x x) ( y y) x y () (Neter et al., 996; Aderso, 3). Ths estmator s ofte called the earso correlato coeffcet. It s a based estmator of ρ (uless ρ or ), whch s usually small whe the sample sze s large (Neter et al., 996; Zmmerma et al., 3; Ssomboothog, a). Later, Olk ad ratt (958) recommeded the correcto estmator the form of O + ( 3) as a more early ubased estmator of ρ. Zmmerma et al. (3) usg smulato demostrated that the bas of ρ ˆO was less tha that of ρ for a small sample sze. roposed estmator Ths secto proposes a estmator of ρ ad apples the jackkfe method for bas reducto of ρ as follows: ) Suppose we have a radom sample from a bvarate ormal dstrbuto wth mea vector ad a varace covarace matrx Σ σ σ σ ρ σ σ ad t s σ σ gveby S ((x, y ), (x, y ),..., (x, y )). I addto, a estmator of ρ s show Equato. δ( S) x ad y ( x x)( y y) ( x x) ( y y) x y. ()

Kasetsart J. (Nat. Sc.) 47(3) 455 ) The th jackkfe sample, S (-), cossts of the dataset wth the th observato removed. ( S ( x, y ),( x, y ),..., ( x, y ),( x, y ), ( ) ( ) ( ) ( + ) ( + ) for,,,. 3) Let δ(s (-) ) be the th jackkfe replcato of δ(s) ad t s gve by Equato 3 ( ) ˆ δ S ρ x ad the form J by J ( ) ( ) j ( ) y ( ) j ( x x )( y y ) j ( ) j ( ) ( x x ) j ( ) j x j y j for,,,. ( yj y ( ) ) j j (3) 4) Calculato of the pseudo values J δ( S) ( ) δ( S( ) ) ( ). ( ) 5) The proposed estmator of ρ s gve J J ( ) ( ) ˆ ρ ( ) ( ) for ρ ad ρ (-) are gve by the format of Equatos ad 3, respectvely. RESULTS I order to emprcally evaluate the valdty ad relablty of the proposed estmator, a smulato study was coducted. I the study, two populatos of sze,, cotag ordered pars of x ad y, were each geerated accordg to a bvarate ormal dstrbuto wth μ, μ 4 wth a equal varace (σ 8,σ 8) for the frst case ad a uequal varace (σ,σ ) for the secod case. I addto, the correlato coeffcets (ρ) of the two varables, x ad y, were set at -, -.8,,,,.,,, thus creatg 38 populatos for ths smulato study. A small sample sze of ad large sample szes of 3, 5 ad 6 were take from each populato by usg smple radom samplg wth replacemet wth, repettos, thus creatg 5 stuatos for the smulato study. The, the absolute bas ad mea square error (MSE) comparsos of ρ, ρ ˆJ ad ρ ˆO were performed emprcally. The smulato results preseted Fgure ad Fgure reveal the absolute bases of ρ, ρ ˆJ ad ρ ˆO. Furthermore, they cofrm the bas reducto of ρ from the proposed estmator. For the uequal varace of two populatos, the absolute bas of ρ was less tha those of ρ ˆJ ad ρ ˆO for a small sample sze,, at all levels of the populato correlato coeffcet. I the case of a equal varace for the two populatos, the absolute bas of ρ was less tha those of ρ ˆJ ad ρ ˆO whe the populato correlato coeffcets fell betwee - ad.4. I addto, the absolute bas of ρ seemed to have o dfferece from ˆJ that of ρ ˆO whe the sample szes were ot less tha 3 at all levels of the populato correlato coeffcet for uequal ad equal varaces of the two populatos.

456 Kasetsart J. (Nat. Sc.) 47(3) Eve for the large sample szes of 3, 5 ad 6, the absolute bases of ρ ad ρ ˆJ ˆO were less tha that of ρ whe the populato correlato coeffcet dd ot approxmate zero ad there was a uequal varace for the two populatos, as there were equal varaces for the two populatos, the absolute bases of ρ ad ρ ˆJ ˆO were lkely to be lower tha that of ρ whe the populato correlato coeffcet was postve. I addto, the absolute bases of ρ, ρ ˆJ ad ρ ˆO Absolute bas Absolute bas Absolute bas Absolute bas.35.3.5..5..5..8.6.4...8.6.4..5.4.3.. Sample sze of.3 Sample sze of.5..5 - - - - - opulato correlato (ρ) opulato correlato (ρ) Sample sze of 3. Sample sze of 3.8.6 earso.4 Olk&ratt. Juthaphor - - - - - opulato correlato (ρ) opulato correlato (ρ) opulato correlato (ρ) opulato correlato (ρ) Sample sze of 6.5 Sample sze of 6.4.3 earso. Olk&ratt Juthaphor. opulato correlato (ρ) Absolute bas Absolute bas Absolute bas Absolute bas..5 Sample sze of 5.3 Sample sze of 5.5..5 - - - - - - - - - - earso Olk&ratt Juthaphor earso Olk&ratt Juthaphor..5 - - - - - - - - - - - - - - - - - - - - opulato correlato (ρ) earso Olk&ratt Juthaphor earso Olk&ratt Juthaphor earso Olk&ratt Juthaphor earso Olk&ratt Juthaphor Fgure Absolute bases of ρ ˆJ, ρ ad ρ ˆO whe σ ad σ. Fgure Absolute bases of ρ ˆJ, ρ ad ρ ˆO whe σ 8 ad σ 8.

Kasetsart J. (Nat. Sc.) 47(3) 457 seemed to decrease wheever the sample sze creased. Fgure 3 ad Fgure 4 dcate that the MSE of ρ seemed to have o dfferece from ˆJ those of ρ ad ρ ˆO each stuato for ths study. Furthermore, the MSEs of ρ, ρ ˆJ ad O seemed to decrease wheever the sample sze creased, regardless of the populato correlato coeffcet. Ths smulato study foud that the proposed estmator, ρ, almost completely ˆJ elmated the bas, ad the performace of ths estmator seemed to be better tha those of ρ ad ρ for the small sample sze of. ˆO Mea square error 4.8.6.4. Sample sze of earso Olk&ratt Juthaphor Mea square error 4.8.6.4. Sample sze of earso Olk&ratt Juthaphor - - - - - opulato correlato (ρ) - - - - - opulato correlato (ρ) Mea square error 4.8.6.4. Sample sze of 3 earso Olk&ratt Juthaphor Mea square error 4.8.6.4. Sample sze of 3 earso Olk&ratt Juthaphor - - - - - opulato correlato (ρ) - - - - - opulato correlato (ρ) Mea square error 4.8.6.4. Sample sze of 5 earso Olk&ratt Juthaphor Mea square error 4.8.6.4. Sample sze of 5 earso Olk&ratt Juthaphor - - - - - opulato correlato (ρ) - - - - - opulato correlato (ρ) Mea square error 4.8.6.4. Sample sze of 6 earso Olk&ratt Juthaphor Mea square error 4.8.6.4. Sample sze of 6 earso Olk&ratt Juthaphor - - - - - opulato correlato (ρ) - - - - - opulato correlato (ρ) Fgure 3 Mea square errors of ρ ˆJ, ρ ad ρ ˆO whe σ ad σ. Fgure 4 Mea square errors of ρ ˆJ, ρ ad ρ ˆO whe σ 8 ad σ 8.

458 Kasetsart J. (Nat. Sc.) 47(3) DISCUSSION The smulato results showed that ρ seemed to be a based estmator as metoed by Neter et al. (996) ad Zmmerma et al. (3). The proposed estmator, ρ ˆJ, was modfed from ρ ad the jackkfe method was appled for bas reducto. The results of ths smulato study showed that the bas of the proposed estmator was reduced to zero all stuatos. I addto, these results also showed that the varaces of two populatos do ot affect the bas reducto for all estmators as studed by Olk ad ratt (958). These fdgs ca be appled to research psychology, the behavoral sceces, ecology ad other felds. I addto, t s possble to use computer programmg to calculate ρ ˆJ wthout dffculty. Cocluso Ths paper proposed a estmator of the populato correlato coeffcet for a bvarate ormal dstrbuto. The proposed estmator provded a approxmately ubased estmator of the populato correlato coeffcet. The results of a smulato study dcated that the performace of ρ ˆJ seemed to be better tha those ρ ad O for a small sample sze,, regardless of the populato correlato coeffcets. I addto, the MSE of ρ ˆJ seemed to have o dfferece from those of ρ ad ρ each stuato for ths ˆO study. ACKNOWLEDGEMENTS The author would lke to thak the Departmet of Statstcs, Faculty of Scece, Kasetsart Uversty for the provso of ecessary facltes durg the research. LITERATURE CITED Adolph, S.C. ad J.S. Hard. 7. Estmatg pheotypc correlatos: Correctg for bas due to tradvdual varablty. Fuct. Ecol. (): 78 84. Aderso, T.W. 3. A Itroducto to Multvarate Statstcal Aalyss. 3rd ed. Wley. Hoboke, NJ, USA. 7 pp. Efro, B. ad R.J. Tbshra. 993. A Itroducto to the Bootstrap. Chapma & Hall/CRC, part of the Taylor ad Fracs group. Lodo, UK. 45 pp. Fsher, R.A. 95. Frequecy dstrbuto of the values of the correlato coeffcet samples from a deftely large populato. Bometrka (4): 57 5.. 9. O the probable error of a coeffcet of correlato deduced from a small sample. Metro : 3 3. Gorsuch, R.L. ad C.S. Lehma.. Correlato coeffcets: Mea bas ad cofdece terval dstortos. Joural of Methods ad Measuremet the Socal Sceces (): 5 65. Huso, L.W., Bostatstcs Group ad F.H. La- Roche. 7. erformace of some correlato coeffcets whe appled to zero-clustered data. J. Mod. Appl. Stat. Methods 6: 53 536. Neter, J., M.H. Kuter, C.J. Nachtshem ad W. Wasserma. 996. Appled Lear Statstcal Models. 4th ed.. Irw. Chcago, IL, USA.,43 pp. Olk, I. ad J.W. ratt. 958. Ubased estmato of certa correlato coeffcets. A. Math. Statst. 9:. Queoulle, M.H. 949. Approxmate test of correlato tme-seres. J. R. Statst. Soc. B : 68 84. Rodgers, J.L. ad W.A. Ncewader. 988. Thrtee ways to look at the correlato coeffcet. Am. Stat. 4(): 59 66.

Kasetsart J. (Nat. Sc.) 47(3) 459 Ssomboothog, J. a. Estmato of the correlato coeffcet for a bvarate ormal dstrbuto wth mssg data. Kasetsart J. (Nat. Sc.) 45(4): 736 74. Ssomboothog, J. b. Jackkfe maxmum lkelhood estmates for a bvarate ormal dstrbuto wth mssg data. Joural of Tha Statstcal Assocato 9(): 5 69. Smth, C.D. ad J.S. otus. 6. Jackkfe estmator of speces rchess wth S LUS. J. Stat. Softw. 5:. Tukey, J.W. 958. Bas ad cofdece otqute large samples. A. Math. Statst. 9: 64 63. Zmmerma, D.W., B.D. Zumbo ad R.H. Wllams. 3. Bas estmato ad hypothess testg of correlato. scológca 4: 33 58.