Estimation of sensitive quantitative characteristics in randomized response sampling
|
|
- Timothy Price
- 5 years ago
- Views:
Transcription
1 Estimation of sensitive quantitative characteristics in randomized response sampling Kuo-Chung Huang Department of Business Administration Chungyu Institute of Technology Keelung Taiwan 20 R.O.C. Chun-Hsiung Lan Mei-Pei Kuo Department of Business Administration Nanhua University Dalin, Chiayi Taiwan 622 R.O.C. Abstract This paper considers the problem of procuring honest responses for sensitive quantitative characteristics. An alternative survey technique is proposed, which enables us to estimate the population mean unbiasedly and to gauge how sensitive a survey topic is. An asymptotically unbiased estimator of sensitivity level is proposed, and conditions for which unbiased estimation for population variance being available is also studied. In addition, an efficiency comparison is worked out to examine the performance of the proposed procedure. It is found that higher estimation efficiency results from higher variation of randomization device. Keywords : Percent relative efficiency, privacy protection, sample size allocation, simple random sampling with replacement. kchuang.tw@gmail.com Journal of Statistics & Management Systems Vol. 9 (2006), No., pp c Taru Publications
2 28 K. C. HUANG, C. H. LAN AND M. P. KUO. Introduction Accurate information on the survey topics is extremely relevant for parameter estimations. However, it is difficult to obtain valid and reliable information in the area of sensitive topics. If a direct survey research is employed to assess a sensitive characteristic, respondents often refuse to take part, or reply untruthfully, especially when they have committed sensitive behavior. To improve respondent cooperation and to procure reliable data, Warner [5] first introduced the randomized response (RR) technique for qualitative characteristics. Subsequently, Greenberg et al. [8] suggested an extension of Warner s RR technique to quantitative characteristics. An excellent exposition of modifications on RR techniques and other related works could be referred to Chaudhuri and Mukerjee [6]. Some resent developments are Bhargava and Singh [], Chua and Tsui [7], Padmawar and Vijayan [2], Singh et al. [4], Chang and Huang [2], Chaudhuri [5], Singh et al. [3], Huang [0], and Chang et al. [3, 4], etc. In particular, to quantify the sensitivity level for certain items of inquiry in practice, Gupta et al. [9] first shown how an estimator may be developed on the quantitative characteristics. Consider a finite population in which every person has a positive value for the sensitive characteristic X. The problem of interest is to estimate the mean µ x, the variance σx 2, and the sensitivity level W of X from a with-replacement simple random sample of size n. In Gupta et al. [9] procedure, each sampled respondent is instructed to utilize a randomization device and generate a positive-valued random number S from a pre-assigned distribution with known mean µ S and known variance γs 2. Then he or she chooses one of the following options: (a) The respondent can report the correct response X, or (b) The respondent can report the scrambled response SX. The optional randomized response model considered is Z S Y X, where Y or 0 according as the response is scrambled or not. Here Y is a Bernoulli variate with E(Y) W, where W is the probability that a respondent will report the scrambled response rather than the actual response X. It is shown that the usual sample mean ˆµ xg Z is unbiased with variance given by Var( ˆµ xg ) n σ 2 Z n [σ 2 x + Wγ 2 S (σ 2 x + µ 2 x)]. (.) Denote by s 2 Z (n n ) (Z j Z) 2 the sample variance, an unbiased estimator of Var( ˆµ xg ) is given by Var( ˆ ˆµ xg ) n s 2 Z j. And, they
3 RANDOMIZED RESPONSE SAMPLING 29 suggested Ŵ G n n log(z j ) log Z j E[log(S)] and ˆσ 2 xg s2 Z Ŵ G γ 2 S ˆµ2 xg + Ŵ G γ 2 S, as an estimator of W and σ 2 x, respectively. Although it creates an appropriate environment to estimate some unknown population parameters, the Gupta et al. [9] development in finding an estimator for W remains biased because of the presence of log term. They did address estimation efficiency and biasedness in the W estimate using empirical estimators, but analytical variance expression of W remains unidentified. In addition, the estimator ˆσ xg 2 is merely a biased estimator of σx 2. In these regards, we intend to suggest an survey procedure that provides an unbiased estimator of σx 2 and an asymptotically unbiased estimator of W together with its variance. The proposed estimators with principal properties are given in the following section. In section 3, an efficiency comparison is carried out to study the performance of the proposed procedure. 2. The proposed procedure In the proposed procedure, two independent samples of size n i, i, 2, are drawn from the population using simple random sampling with replacement such that n + n 2 n, the total sample size required. In the i th sample, each respondent is instructed to use a randomization device and generate a random number, say S i, from some pre-assigned distributions such as Chi-square, uniform, or Weibull etc. The RR procedures are set up in such a way that the interviewer does not know what the respondent has selected. Then the respondent is requested to report one of the following two options: (a) The correct response X, or (b) The scrambled response S i X. It is supposed that S i is a positive-valued random variate with known mean µ Si θ i and known variance σs 2 i γi 2. The optional randomized response model for the ith sample, i, 2, is given by Z i ( Y)X + YS i X, where Y is a random variate, with expectation E(Y) W, defined as {, if the response is scrambled, Y 0, otherwise.
4 30 K. C. HUANG, C. H. LAN AND M. P. KUO Under the proposed procedure, the sample response Z i for the ith sample, i, 2, has expectation E(Z i ) ( W)µ x + Wθ i µ x µ x + W(θ i )µ x. (2.) If Z and Z 2 are the observed means for the two samples, the proposed estimators of µ x and W are respectively given by ˆµ x ( θ 2) Z ( θ ) Z 2 Z and Ŵ Z 2, θ θ 2 ( θ 2 ) Z ( θ ) Z 2 whence provided that θ θ 2. Theorem. The estimator ˆµ x is unbiased with variance given by [ Var( ˆµ x ) ( θ θ 2 ) 2σ Z 2 + ( θ ) 2σ 2 ] Z 2, (2.2) θ 2 n n 2 where σ 2 Z i σ 2 x + W(γ 2 i +θ2 i )σ 2 x +W[γ 2 i +( W)( θ i) 2 ]µ 2 x, i, 2.(2.3) Proof. Taking expectation for ˆµ x, the unbiasedness follows from using (2.). From the distribution of Zi 2, we have E(Z 2 i ) σ 2 x + µ 2 x + W(γ 2 i + θ 2 i )(σ 2 x + µ 2 x). (2.4) Using (2.), (2.4) and the fact that σz 2 i E(Zi 2) [E(Z i)] 2, we get (2.3). Expression (2.2) then follows from the independence of the two samples. Hence the proof. Theorem 2. An unbiased estimator of the variance of ˆµ x is given by [ ] Var( ˆ ˆµ x ) (θ θ 2 ) 2 ( θ 2 ) 2 s2 Z + ( θ ) 2 s2 Z 2, n n 2 where s 2 Z i (n i ) n i (Z i j Z i ) 2, i, 2. j Proof. It follows from that E(s 2 Z i ) σ 2 Z i, i, 2. For deriving the properties of the estimator Ŵ, let us define d Z Z 2 and d 2 ( θ 2 ) Z ( θ ) Z 2. Since E(d ) (θ θ 2 )Wµ x and E(d 2 ) (θ θ 2 )µ x, we then have Ŵ d /d 2 and W E(d )/E(d 2 ). Additionally, if we further denote by e [d E(d )]/ E(d ) and e 2 [d 2 E(d 2 )]/E(d 2 ), assuming that e 2 < so that ( + e 2 ) can be validly expanded as power series, it follows that E(e ) E(e 2 ) 0, E(e e 2 ) ( θ 2)σ 2 Z + ( θ ) 2 σ 2 Z 2 (θ θ 2 ) 2 Wµ 2 x,
5 RANDOMIZED RESPONSE SAMPLING 3 E(e 2 ) σz 2 + σz 2 2 (θ θ 2 ) 2 W 2 µ x 2, E(e 2 2 ) ( θ 2) 2 σz 2 + ( θ ) 2 σz 2 2 (θ θ 2 ) 2 µ x 2. (2.5) And the estimation error can be written in terms of e and e 2 as Ŵ W W(e e 2 ) + o(n ). (2.6) Then we have the following theorem. Theorem 3. The estimator Ŵ is asymptotically unbiased with variance given by Var(Ŵ) (θ θ 2 ) 2 µ 2 x { which can be estimated by ˆ Var(Ŵ) (θ θ 2 ) 2 ˆµ 2 x { [ + W(θ 2 )] 2 σ 2 Z n + [ + W(θ )] 2 σ 2 Z 2 n 2 [ + Ŵ(θ 2 )] 2 s2 Z n + [ + Ŵ(θ )] 2 s2 Z 2 n 2 } }, (2.7). (2.8) Proof. The asymptotically unbiasedness follows from E(e ) E(e ) 0. Squaring expression (2.6), omitting terms with power in e i s higher than the second and then taking expectation, we have E(Ŵ W) 2 W 2 E(e 2 2e e 2 + e 2 2 ). On substituting the expected values given in (2.5) and after some algebraic simplification, we get (2.7). Replacing µ x, W and σz 2 i in (2.7) by the corresponding sample analogue, (2.8) then follows. Hence the theorem. Next, let us consider the problem of unbiased estimation for the variance σ 2 x of the sensitive characteristic X. For the sake of notational convenience, let us denote a 2(θ 2 )γ 2 + (2 θ θ 2 )γ 2 2 +( 2θ θ 2 ) (θ θ 2 ), b (θ θ 2 )( γ 2 θ2 ) and c 2[(θ 2 )γ 2 (θ )γ 2 2 +(θ )(θ 2 )(θ θ 2 )]. An unbiased estimator of σ 2 x is given in the following theorem. Theorem 4. If (θ 2 )( 2θ +θ 2 )γ 2 + (θ ) 2 γ2 2 2θ (θ )(θ 2 ) (θ θ 2 ), an unbiased estimator of σx 2 is given by [( ) ] as 2 + bs2 2 + c n n i Z 2 j Z Z 2 ˆσ x 2 j (γ2 2 + θ2 2 γ2 θ2 )(θ, θ 2 ) whence provided that γ 2 + θ2 γ2 2 + θ2 2 and θ θ 2.
6 32 K. C. HUANG, C. H. LAN AND M. P. KUO Proof. Since the two samples are independent, expression (2.) leads E( Z Z 2 ) [ + (θ + θ 2 2)W + (θ )(θ 2 )W 2 ]µ 2 x. (2.9) From (2.4), we have ( E n n i Z 2 j j ) [ + W(γ 2 + θ2 )](σ 2 x + µ 2 x). (2.0) From using (2.9), (2.0) and the fact that E(si 2) σ i 2, i, 2, the unbiasedness of ˆσ x 2 then follows. Hence the theorem. We now move on to study the appropriate sample size allocations for various objectives in sampling surveys. Consider a linear combination of the variances of ˆµ x and Ŵ, given in (2.2) and (2.7) respectively, given by Var( ˆµ x, Ŵ) α Var( ˆµ x ) + α 2 Var(Ŵ) [ (θ + θ 2 ) 2 µ x 2 {α ( θ 2 ) 2 µ x 2 + α 2 [ + W(θ 2 )] 2 } σ Z 2 n ] + {α ( θ ) 2 µ 2 x + α 2 [ + W(θ )] 2 } σ 2 Z 2 n 2 where α and α 2 are non-negative coefficients chosen by the investigator. If the total sample size n ( n + n 2 ) is fixed, then through a simple application of the Cauchy-Schwarz inequality, the sample allocation for which Var( ˆµ x, Ŵ) attains its minimum is given by [ n {α ( θ 2 ) 2 µ x 2 + α 2 [ + W(θ 2 ) 2 ]}σ 2 ] /2 Z n 2 {α ( θ ) 2 µ x 2 + α 2 [ + W(θ ) 2 ]}σz 2, 2 and the minimum value of Var( ˆµ x, Ŵ) will be ( ) [{α ( θ 2 ) 2 µ x 2 + α 2 [ + W(θ 2 ) 2 ]} 2σ Z +{α ( θ ) 2 µ x 2 + α 2 [ + W(θ ) 2 ]} 2σ Z2 ] 2 Min Var( ˆµ x, Ŵ) n(θ θ 2 ) 2 µ x 2. It is remarkable that the actual values of a σ Z, σ Z2 and W are always unknown, information regarding them can be obtained from past experience or a pilot survey, which is helpful for practical applications (Murthy [], pp ). 3. Efficiency comparison In what follows, we study the performance of the proposed procedure compared with Gupta et al. [9] procedure. The percent relative efficiency of the proposed estimator ˆµ x with respect to Gupta et al. estimator
7 RANDOMIZED RESPONSE SAMPLING 33 ˆµ xg is defined as PRE (θ θ 2 ) 2 σ 2 Z ( θ 2 σ Z + θ σ Z2 ) 2 00, where σz 2 and σ Z 2 i are respectively given in (.) and (2.3). Since the above PRE expression is infested with too many parameters, here the comparison is performed on an empirical investigation using the same probability distribution in the competing randomizing devices. To examine how the variation of randomization devices will affect the efficiency, it is assumed that scrambled variables follow the distributions with mean µ and variance γ 2 satisfying γ 2 kµ. Without loss of generality, here we simply choose (µ S, γs 2) (, k), (θ, γ 2) (k, k2 ) and (θ 2, γ2 2 ) (0.5, 0.5k), where k 2, 3,..., 0. It is remarkable that the PRE value remains unchanged if the coefficient of variation CV x σ x µ x is fixed. The values of CV x are then chosen to be 0.2, 0.4, 0.6, 0.8 and. The percent relative efficiencies thus obtained are outlined in Table for different values of W. Table Percent relative efficiency of the two competing procedures CV x W k (Contd. Table )
8 34 K. C. HUANG, C. H. LAN AND M. P. KUO CV x W k From Table, it is seen that the proposed procedure performs better than Gupta et al. [9] procedure for most of the practical situations. Even though in some cases the proposed procedure is less efficient, it can be viewed as a tradeoff for being able to get an unbiased estimator of σx 2 and an asymptotically unbiased estimator of W. Note that the PRE value increases with k and/or W, whereas decreases with CV x, if other parameters are unchanged. References [] M. Bhargava and R. Singh (2000), A modified randomization device for Warner s model, Statistica, Vol. 60, pp [2] H. J. Chang and K. C. Huang (200), Estimation of proportion and sensitivity of a qualitative character, Metrika, Vol. 53, pp [3] H. J. Chang, C. L. Wang and K. C. Huang (2004), Using randomized response to estimate the proportion and truthful reporting probability in a dichotomous finite population, Journal of Applied Statistics, Vol. 3, pp [4] H. J. Chang, C. L. Wang and K. C. Huang (2004), On estimating the proportion of a qualitative sensitive character using randomized response sampling, Quality & Quantity, Vol. 38, pp [5] A. Chaudhuri (200), Using randomized response from a complex survey to estimate a sensitive proportion in a dichotomous finite population, Journal of Statistical Planning and Inference, Vol. 94, pp
9 RANDOMIZED RESPONSE SAMPLING 35 [6] A. Chaudhuri and R. Mukerjee (988), Randomized Response: Theory and Techniques, Marcel Dekker, New York. [7] T. C. Chua and A. K. Tsui (2000), Procuring honest responses indirectly, Journal of Statistical Planning and Inference, Vol. 90, pp [8] B. G. Greenberg, R. R. Kubler, J. R. Abernathy and D. G. Horvitz (97), Applications of the RR technique in obtaining quantitative data, Journal of the American Statistical Association, Vol. 66, pp [9] S. Gupta, B. Gupta and S. Singh (2002), Estimation of sensitivity level of personal interview survey questions, Journal of Statistical Planning and Inference, Vol. 00, pp [0] K. C. Huang (2004), A survey technique for estimating the proportion and sensitivity in a dichotomous finite population, Statistica Neerlandica, Vol. 58, pp [] M. N. Murthy (967), Sampling Theory and Methods, Statistical Publishing Society, Calcutta. [2] V. R. Padmawar and K. Vijayan (2000), Randomized response revisited, Journal of Statistical Planning and Inference, Vol. 90, pp [3] S. Singh, M. Mahmood and D. S. Tracy (200), Estimation of mean and variance of stigmatized quantitative variable using distinct units in randomized response sampling, Statistical Papers, Vol. 42, pp [4] S. Singh, R. Singh and N. S. Mangeao (2000), Some alternative strategies to Moor s model in randomized response sampling, Journal of Statistical Planning and Inference, Vol. 83, pp [5] S. L. Warner (965), Randomized response: a survey technique for eliminating evasive answer bias, Journal of the American Statistical Association, Vol. 60, pp Received April, 2005
On the Relative Efficiency of the Proposed Reparametized Randomized Response Model
On the Relative Efficiency of the Proposed Reparametized Randomized Response Model Abstract 1 Adepetun, A.O. and 2 Adebola, F.B. Department of Statistics, Federal University of Technology, PMB 704, Akure,
More informationImproved Estimators of Mean of Sensitive Variables using Optional RRT Models
Improved Estimators of Mean of Sensitive Variables using Optional RRT Models Sat Gupta Department of Mathematics and Statistics University of North Carolina Greensboro sngupta@uncg.edu University of South
More informationEstimation of Sensitive Characteristic using Two-Phase Sampling for Regression Type Estimator
Journal of Informatics and Mathematical Sciences Volume 4 (01), Number 3,. 311 317 RGN Publications htt://www.rgnublications.com Estimation of Sensitive Characteristic using Two-Phase Samling for Regression
More informationA Multi-proportion Randomized Response Model Using the Inverse Sampling
A Multi-proportion Randomized Response Model Using the Inverse Sampling Marcella Polisicchio and Francesco Porro Abstract This paper describes a new procedure to unbiasedly estimate the proportions of
More informationApplied Econometrics (QEM)
Applied Econometrics (QEM) The Simple Linear Regression Model based on Prinicples of Econometrics Jakub Mućk Department of Quantitative Economics Jakub Mućk Applied Econometrics (QEM) Meeting #2 The Simple
More informationREMAINDER LINEAR SYSTEMATIC SAMPLING
Sankhyā : The Indian Journal of Statistics 2000, Volume 62, Series B, Pt. 2, pp. 249 256 REMAINDER LINEAR SYSTEMATIC SAMPLING By HORNG-JINH CHANG and KUO-CHUNG HUANG Tamkang University, Taipei SUMMARY.
More informationA CHAIN RATIO EXPONENTIAL TYPE ESTIMATOR IN TWO- PHASE SAMPLING USING AUXILIARY INFORMATION
STATISTICA, anno LXXIII, n. 2, 2013 A CHAIN RATIO EXPONENTIAL TYPE ESTIMATOR IN TWO- PHASE SAMPLING USING AUXILIARY INFORMATION Rohini Yadav Department of Applied Mathematics, Indian School of Mines, Dhanbad,
More informationElements of statistics (MATH0487-1)
Elements of statistics (MATH0487-1) Prof. Dr. Dr. K. Van Steen University of Liège, Belgium November 12, 2012 Introduction to Statistics Basic Probability Revisited Sampling Exploratory Data Analysis -
More informationA new improved estimator of population mean in partial additive randomized response models
Hacettepe Journal of Mathematics and Statistics Volume 46 () (017), 35 338 A new improved estimator of population mean in partial additive randomized response models Nilgun Ozgul and Hulya Cingi Abstract
More informationEstimation of Parameters
CHAPTER Probability, Statistics, and Reliability for Engineers and Scientists FUNDAMENTALS OF STATISTICAL ANALYSIS Second Edition A. J. Clark School of Engineering Department of Civil and Environmental
More informationIEOR 165 Lecture 7 1 Bias-Variance Tradeoff
IEOR 165 Lecture 7 Bias-Variance Tradeoff 1 Bias-Variance Tradeoff Consider the case of parametric regression with β R, and suppose we would like to analyze the error of the estimate ˆβ in comparison to
More information1 of 7 7/16/2009 6:12 AM Virtual Laboratories > 7. Point Estimation > 1 2 3 4 5 6 1. Estimators The Basic Statistical Model As usual, our starting point is a random experiment with an underlying sample
More informationTheory of Statistics.
Theory of Statistics. Homework V February 5, 00. MT 8.7.c When σ is known, ˆµ = X is an unbiased estimator for µ. If you can show that its variance attains the Cramer-Rao lower bound, then no other unbiased
More informationChapter 2 The Simple Linear Regression Model: Specification and Estimation
Chapter The Simple Linear Regression Model: Specification and Estimation Page 1 Chapter Contents.1 An Economic Model. An Econometric Model.3 Estimating the Regression Parameters.4 Assessing the Least Squares
More informationThe Influence Function of the Correlation Indexes in a Two-by-Two Table *
Applied Mathematics 014 5 3411-340 Published Online December 014 in SciRes http://wwwscirporg/journal/am http://dxdoiorg/10436/am01451318 The Influence Function of the Correlation Indexes in a Two-by-Two
More informationExercises and Answers to Chapter 1
Exercises and Answers to Chapter The continuous type of random variable X has the following density function: a x, if < x < a, f (x), otherwise. Answer the following questions. () Find a. () Obtain mean
More informationMax. Likelihood Estimation. Outline. Econometrics II. Ricardo Mora. Notes. Notes
Maximum Likelihood Estimation Econometrics II Department of Economics Universidad Carlos III de Madrid Máster Universitario en Desarrollo y Crecimiento Económico Outline 1 3 4 General Approaches to Parameter
More informationECON 4160, Autumn term Lecture 1
ECON 4160, Autumn term 2017. Lecture 1 a) Maximum Likelihood based inference. b) The bivariate normal model Ragnar Nymoen University of Oslo 24 August 2017 1 / 54 Principles of inference I Ordinary least
More informationFinite Population Sampling and Inference
Finite Population Sampling and Inference A Prediction Approach RICHARD VALLIANT ALAN H. DORFMAN RICHARD M. ROYALL A Wiley-Interscience Publication JOHN WILEY & SONS, INC. New York Chichester Weinheim Brisbane
More informationEvaluating the Performance of Estimators (Section 7.3)
Evaluating the Performance of Estimators (Section 7.3) Example: Suppose we observe X 1,..., X n iid N(θ, σ 2 0 ), with σ2 0 known, and wish to estimate θ. Two possible estimators are: ˆθ = X sample mean
More informationIEOR E4703: Monte-Carlo Simulation
IEOR E4703: Monte-Carlo Simulation Output Analysis for Monte-Carlo Martin Haugh Department of Industrial Engineering and Operations Research Columbia University Email: martin.b.haugh@gmail.com Output Analysis
More informationCOMP2610/COMP Information Theory
COMP2610/COMP6261 - Information Theory Lecture 9: Probabilistic Inequalities Mark Reid and Aditya Menon Research School of Computer Science The Australian National University August 19th, 2014 Mark Reid
More informationStatistics for Business and Economics
Statistics for Business and Economics Chapter 6 Sampling and Sampling Distributions Ch. 6-1 6.1 Tools of Business Statistics n Descriptive statistics n Collecting, presenting, and describing data n Inferential
More informationESTIMATION OF A POPULATION MEAN OF A SENSITIVE VARIABLE IN STRATIFIED TWO-PHASE SAMPLING
Pak. J. Stati. 06 Vol. 3(5), 393-404 ESTIMATION OF A POPUATION MEAN OF A SENSITIVE VARIABE IN STRATIFIED TWO-PHASE SAMPING Nadia Muaq, Muammad Noor-ul-Amin and Muammad Hanif National College of Business
More informationRegression #3: Properties of OLS Estimator
Regression #3: Properties of OLS Estimator Econ 671 Purdue University Justin L. Tobias (Purdue) Regression #3 1 / 20 Introduction In this lecture, we establish some desirable properties associated with
More informationEffective estimation of population mean, ratio and product in two-phase sampling in presence of random non-response
Effective estimation of population mean, ratio and product in two-phase sampling in presence of random non-response G. N. Singh, A. K. Sharma, A.Bandyopadhyay and C.P aul Keywords Abstract This paper presents
More informationMLE and GMM. Li Zhao, SJTU. Spring, Li Zhao MLE and GMM 1 / 22
MLE and GMM Li Zhao, SJTU Spring, 2017 Li Zhao MLE and GMM 1 / 22 Outline 1 MLE 2 GMM 3 Binary Choice Models Li Zhao MLE and GMM 2 / 22 Maximum Likelihood Estimation - Introduction For a linear model y
More informationEfficient Exponential Ratio Estimator for Estimating the Population Mean in Simple Random Sampling
Efficient Exponential Ratio Estimator for Estimating the Population Mean in Simple Random Sampling Ekpenyong, Emmanuel John and Enang, Ekaette Inyang Abstract This paper proposes, with justification, two
More informationOn Efficiency of Midzuno-Sen Strategy under Two-phase Sampling
International Journal of Statistics and Analysis. ISSN 2248-9959 Volume 7, Number 1 (2017), pp. 19-26 Research India Publications http://www.ripublication.com On Efficiency of Midzuno-Sen Strategy under
More informationFall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.
1. Let P be a probability measure on a collection of sets A. (a) For each n N, let H n be a set in A such that H n H n+1. Show that P (H n ) monotonically converges to P ( k=1 H k) as n. (b) For each n
More informationESTIMATION OF FINITE POPULATION MEAN USING KNOWN CORRELATION COEFFICIENT BETWEEN AUXILIARY CHARACTERS
STATISTICA, anno LXV, n. 4, 005 ESTIMATION OF FINITE POPULATION MEAN USING KNOWN CORRELATION COEFFICIENT BETWEEN AUXILIAR CHARACTERS. INTRODUCTION Let U { U, U,..., U N } be a finite population of N units.
More informationEric Shou Stat 598B / CSE 598D METHODS FOR MICRODATA PROTECTION
Eric Shou Stat 598B / CSE 598D METHODS FOR MICRODATA PROTECTION INTRODUCTION Statistical disclosure control part of preparations for disseminating microdata. Data perturbation techniques: Methods assuring
More informationEcon 325: Introduction to Empirical Economics
Econ 325: Introduction to Empirical Economics Lecture 6 Sampling and Sampling Distributions Ch. 6-1 Populations and Samples A Population is the set of all items or individuals of interest Examples: All
More informationVariance Estimation Using Quartiles and their Functions of an Auxiliary Variable
International Journal of Statistics and Applications 01, (5): 67-7 DOI: 10.593/j.statistics.01005.04 Variance Estimation Using Quartiles and their Functions of an Auiliary Variable J. Subramani *, G. Kumarapandiyan
More informationPsychology 282 Lecture #3 Outline
Psychology 8 Lecture #3 Outline Simple Linear Regression (SLR) Given variables,. Sample of n observations. In study and use of correlation coefficients, and are interchangeable. In regression analysis,
More informationAlfredo A. Romero * College of William and Mary
A Note on the Use of in Model Selection Alfredo A. Romero * College of William and Mary College of William and Mary Department of Economics Working Paper Number 6 October 007 * Alfredo A. Romero is a Visiting
More informationMath 423/533: The Main Theoretical Topics
Math 423/533: The Main Theoretical Topics Notation sample size n, data index i number of predictors, p (p = 2 for simple linear regression) y i : response for individual i x i = (x i1,..., x ip ) (1 p)
More informationImproved ratio-type estimators using maximum and minimum values under simple random sampling scheme
Improved ratio-type estimators using maximum and minimum values under simple random sampling scheme Mursala Khan Saif Ullah Abdullah. Al-Hossain and Neelam Bashir Abstract This paper presents a class of
More informationMultiple Regression Analysis: Heteroskedasticity
Multiple Regression Analysis: Heteroskedasticity y = β 0 + β 1 x 1 + β x +... β k x k + u Read chapter 8. EE45 -Chaiyuth Punyasavatsut 1 topics 8.1 Heteroskedasticity and OLS 8. Robust estimation 8.3 Testing
More informationOn The Class of Double Sampling Ratio Estimators Using Auxiliary Information on an Attribute and an Auxiliary Variable
International Journal of Statistics and Systems ISSN 0973-675 Volume, Number (07), pp. 5-3 Research India Publications http://www.ripublication.com On The Class of Double Sampling Ratio Estimators Using
More informationTwo hours. To be supplied by the Examinations Office: Mathematical Formula Tables THE UNIVERSITY OF MANCHESTER. 21 June :45 11:45
Two hours MATH20802 To be supplied by the Examinations Office: Mathematical Formula Tables THE UNIVERSITY OF MANCHESTER STATISTICAL METHODS 21 June 2010 9:45 11:45 Answer any FOUR of the questions. University-approved
More informationWooldridge, Introductory Econometrics, 4th ed. Appendix C: Fundamentals of mathematical statistics
Wooldridge, Introductory Econometrics, 4th ed. Appendix C: Fundamentals of mathematical statistics A short review of the principles of mathematical statistics (or, what you should have learned in EC 151).
More informationF & B Approaches to a simple model
A6523 Signal Modeling, Statistical Inference and Data Mining in Astrophysics Spring 215 http://www.astro.cornell.edu/~cordes/a6523 Lecture 11 Applications: Model comparison Challenges in large-scale surveys
More informationwhere x and ȳ are the sample means of x 1,, x n
y y Animal Studies of Side Effects Simple Linear Regression Basic Ideas In simple linear regression there is an approximately linear relation between two variables say y = pressure in the pancreas x =
More informationVARIANCE ESTIMATION IN PRESENCE OF RANDOM NON-RESPONSE
Journal of Reliability and Statistical Studies; ISSN (Print): 0974-8024, (Online):2229-5666 Vol. 7, Issue 2 (2014): 65-70 VARIANCE ESTIMATION IN PRESENCE OF RANDOM NON-RESPONSE Sunil Kumar Alliance School
More informationGeneralized Multivariate Rank Type Test Statistics via Spatial U-Quantiles
Generalized Multivariate Rank Type Test Statistics via Spatial U-Quantiles Weihua Zhou 1 University of North Carolina at Charlotte and Robert Serfling 2 University of Texas at Dallas Final revision for
More informationSAMPLING BIOS 662. Michael G. Hudgens, Ph.D. mhudgens :55. BIOS Sampling
SAMPLIG BIOS 662 Michael G. Hudgens, Ph.D. mhudgens@bios.unc.edu http://www.bios.unc.edu/ mhudgens 2008-11-14 15:55 BIOS 662 1 Sampling Outline Preliminaries Simple random sampling Population mean Population
More informationIntroduction to Econometrics
Introduction to Econometrics Lecture 2 : Causal Inference and Random Control Trails(RCT) Zhaopeng Qu Business School,Nanjing University Sep 18th, 2017 Zhaopeng Qu (Nanjing University) Introduction to Econometrics
More informationEstimating Bounded Population Total Using Linear Regression in the Presence of Supporting Information
International Journal of Mathematics and Computational Science Vol. 4, No. 3, 2018, pp. 112-117 http://www.aiscience.org/journal/ijmcs ISSN: 2381-7011 (Print); ISSN: 2381-702X (Online) Estimating Bounded
More information10-704: Information Processing and Learning Fall Lecture 24: Dec 7
0-704: Information Processing and Learning Fall 206 Lecturer: Aarti Singh Lecture 24: Dec 7 Note: These notes are based on scribed notes from Spring5 offering of this course. LaTeX template courtesy of
More informationUnit 10: Planning Life Tests
Unit 10: Planning Life Tests Ramón V. León Notes largely based on Statistical Methods for Reliability Data by W.Q. Meeker and L. A. Escobar, Wiley, 1998 and on their class notes. 11/2/2004 Unit 10 - Stat
More informationEstimation for Mean and Standard Deviation of Normal Distribution under Type II Censoring
Communications for Statistical Applications and Methods 2014, Vol. 21, No. 6, 529 538 DOI: http://dx.doi.org/10.5351/csam.2014.21.6.529 Print ISSN 2287-7843 / Online ISSN 2383-4757 Estimation for Mean
More informationAsymptotic Normality under Two-Phase Sampling Designs
Asymptotic Normality under Two-Phase Sampling Designs Jiahua Chen and J. N. K. Rao University of Waterloo and University of Carleton Abstract Large sample properties of statistical inferences in the context
More informationThe Finite Sample Properties of the Least Squares Estimator / Basic Hypothesis Testing
1 The Finite Sample Properties of the Least Squares Estimator / Basic Hypothesis Testing Greene Ch 4, Kennedy Ch. R script mod1s3 To assess the quality and appropriateness of econometric estimators, we
More informationPackage Rsurrogate. October 20, 2016
Type Package Package Rsurrogate October 20, 2016 Title Robust Estimation of the Proportion of Treatment Effect Explained by Surrogate Marker Information Version 2.0 Date 2016-10-19 Author Layla Parast
More informationReview of Econometrics
Review of Econometrics Zheng Tian June 5th, 2017 1 The Essence of the OLS Estimation Multiple regression model involves the models as follows Y i = β 0 + β 1 X 1i + β 2 X 2i + + β k X ki + u i, i = 1,...,
More informationUniversity of Michigan School of Public Health
University of Michigan School of Public Health The University of Michigan Department of Biostatistics Working Paper Series Year 003 Paper Weighting Adustments for Unit Nonresponse with Multiple Outcome
More information557: MATHEMATICAL STATISTICS II BIAS AND VARIANCE
557: MATHEMATICAL STATISTICS II BIAS AND VARIANCE An estimator, T (X), of θ can be evaluated via its statistical properties. Typically, two aspects are considered: Expectation Variance either in terms
More informationMean estimation with calibration techniques in presence of missing data
Computational Statistics & Data Analysis 50 2006 3263 3277 www.elsevier.com/locate/csda Mean estimation with calibration techniues in presence of missing data M. Rueda a,, S. Martínez b, H. Martínez c,
More informationIMPROVED CLASSES OF ESTIMATORS FOR POPULATION MEAN IN PRESENCE OF NON-RESPONSE
Pak. J. Statist. 014 Vol. 30(1), 83-100 IMPROVED CLASSES OF ESTIMATORS FOR POPULATION MEAN IN PRESENCE OF NON-RESPONSE Saba Riaz 1, Giancarlo Diana and Javid Shabbir 1 1 Department of Statistics, Quaid-i-Azam
More informationReliability of inference (1 of 2 lectures)
Reliability of inference (1 of 2 lectures) Ragnar Nymoen University of Oslo 5 March 2013 1 / 19 This lecture (#13 and 14): I The optimality of the OLS estimators and tests depend on the assumptions of
More informationVariable inspection plans for continuous populations with unknown short tail distributions
Variable inspection plans for continuous populations with unknown short tail distributions Wolfgang Kössler Abstract The ordinary variable inspection plans are sensitive to deviations from the normality
More informationModel Assisted Survey Sampling
Carl-Erik Sarndal Jan Wretman Bengt Swensson Model Assisted Survey Sampling Springer Preface v PARTI Principles of Estimation for Finite Populations and Important Sampling Designs CHAPTER 1 Survey Sampling
More informationBusiness Statistics: A Decision-Making Approach 6 th Edition. Chapter Goals
Chapter 6 Student Lecture Notes 6-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter 6 Introduction to Sampling Distributions Chap 6-1 Chapter Goals To use information from the sample
More informationData Mining Chapter 4: Data Analysis and Uncertainty Fall 2011 Ming Li Department of Computer Science and Technology Nanjing University
Data Mining Chapter 4: Data Analysis and Uncertainty Fall 2011 Ming Li Department of Computer Science and Technology Nanjing University Why uncertainty? Why should data mining care about uncertainty? We
More informationSTATISTICS 4, S4 (4769) A2
(4769) A2 Objectives To provide students with the opportunity to explore ideas in more advanced statistics to a greater depth. Assessment Examination (72 marks) 1 hour 30 minutes There are four options
More informationStatistics and Econometrics I
Statistics and Econometrics I Point Estimation Shiu-Sheng Chen Department of Economics National Taiwan University September 13, 2016 Shiu-Sheng Chen (NTU Econ) Statistics and Econometrics I September 13,
More informationResearch Article Some Improved Multivariate-Ratio-Type Estimators Using Geometric and Harmonic Means in Stratified Random Sampling
International Scholarly Research Network ISRN Probability and Statistics Volume 01, Article ID 509186, 7 pages doi:10.540/01/509186 Research Article Some Improved Multivariate-Ratio-Type Estimators Using
More informationEstimation of Parameters and Variance
Estimation of Parameters and Variance Dr. A.C. Kulshreshtha U.N. Statistical Institute for Asia and the Pacific (SIAP) Second RAP Regional Workshop on Building Training Resources for Improving Agricultural
More informationIntroduction. The Linear Regression Model One popular model is the linear regression model. It writes as :
Introduction Definition From Wikipedia : The two main purposes of econometrics are to give empirical content to economic theory and to subject economic theory to potentially falsifying tests. Another popular
More informationStatistics: Learning models from data
DS-GA 1002 Lecture notes 5 October 19, 2015 Statistics: Learning models from data Learning models from data that are assumed to be generated probabilistically from a certain unknown distribution is a crucial
More informationA flexible two-step randomised response model for estimating the proportions of individuals with sensitive attributes
A flexible two-step randomised response model for estimating the proportions of individuals with sensitive attributes Anne-Françoise Donneau, Murielle Mauer Francisco Sartor and Adelin Albert Department
More informationSpring 2012 Math 541B Exam 1
Spring 2012 Math 541B Exam 1 1. A sample of size n is drawn without replacement from an urn containing N balls, m of which are red and N m are black; the balls are otherwise indistinguishable. Let X denote
More informationImproved General Class of Ratio Type Estimators
[Volume 5 issue 8 August 2017] Page No.1790-1796 ISSN :2320-7167 INTERNATIONAL JOURNAL OF MATHEMATICS AND COMPUTER RESEARCH Improved General Class of Ratio Type Estimators 1 Banti Kumar, 2 Manish Sharma,
More informationA General Overview of Parametric Estimation and Inference Techniques.
A General Overview of Parametric Estimation and Inference Techniques. Moulinath Banerjee University of Michigan September 11, 2012 The object of statistical inference is to glean information about an underlying
More informationDiscrepancy-Based Model Selection Criteria Using Cross Validation
33 Discrepancy-Based Model Selection Criteria Using Cross Validation Joseph E. Cavanaugh, Simon L. Davies, and Andrew A. Neath Department of Biostatistics, The University of Iowa Pfizer Global Research
More informationthe Presence of k Outliers
RESEARCH ARTICLE OPEN ACCESS On the Estimation of the Presence of k Outliers for Weibull Distribution in Amal S. Hassan 1, Elsayed A. Elsherpieny 2, and Rania M. Shalaby 3 1 (Department of Mathematical
More informationA Simulation Comparison Study for Estimating the Process Capability Index C pm with Asymmetric Tolerances
Available online at ijims.ms.tku.edu.tw/list.asp International Journal of Information and Management Sciences 20 (2009), 243-253 A Simulation Comparison Study for Estimating the Process Capability Index
More informationECE 275A Homework 7 Solutions
ECE 275A Homework 7 Solutions Solutions 1. For the same specification as in Homework Problem 6.11 we want to determine an estimator for θ using the Method of Moments (MOM). In general, the MOM estimator
More informationLecture 8: Information Theory and Statistics
Lecture 8: Information Theory and Statistics Part II: Hypothesis Testing and I-Hsiang Wang Department of Electrical Engineering National Taiwan University ihwang@ntu.edu.tw December 23, 2015 1 / 50 I-Hsiang
More informationThe Restricted Likelihood Ratio Test at the Boundary in Autoregressive Series
The Restricted Likelihood Ratio Test at the Boundary in Autoregressive Series Willa W. Chen Rohit S. Deo July 6, 009 Abstract. The restricted likelihood ratio test, RLRT, for the autoregressive coefficient
More informationLecture Topic 4: Chapter 7 Sampling and Sampling Distributions
Lecture Topic 4: Chapter 7 Sampling and Sampling Distributions Statistical Inference: The aim is to obtain information about a population from information contained in a sample. A population is the set
More informationRegression, Ridge Regression, Lasso
Regression, Ridge Regression, Lasso Fabio G. Cozman - fgcozman@usp.br October 2, 2018 A general definition Regression studies the relationship between a response variable Y and covariates X 1,..., X n.
More informationMachine Learning. Lecture 9: Learning Theory. Feng Li.
Machine Learning Lecture 9: Learning Theory Feng Li fli@sdu.edu.cn https://funglee.github.io School of Computer Science and Technology Shandong University Fall 2018 Why Learning Theory How can we tell
More informationSanjay Chaudhuri Department of Statistics and Applied Probability, National University of Singapore
AN EMPIRICAL LIKELIHOOD BASED ESTIMATOR FOR RESPONDENT DRIVEN SAMPLED DATA Sanjay Chaudhuri Department of Statistics and Applied Probability, National University of Singapore Mark Handcock, Department
More informationProbability Theory and Statistics. Peter Jochumzen
Probability Theory and Statistics Peter Jochumzen April 18, 2016 Contents 1 Probability Theory And Statistics 3 1.1 Experiment, Outcome and Event................................ 3 1.2 Probability............................................
More informationImproved Ratio Estimators of Variance Based on Robust Measures
Improved Ratio Estimators of Variance Based on Robust Measures Muhammad Abid 1, Shabbir Ahmed, Muhammad Tahir 3, Hafiz Zafar Nazir 4, Muhammad Riaz 5 1,3 Department of Statistics, Government College University,
More informationResearch Article A Generalized Class of Exponential Type Estimators for Population Mean under Systematic Sampling Using Two Auxiliary Variables
Probability and Statistics Volume 2016, Article ID 2374837, 6 pages http://dx.doi.org/10.1155/2016/2374837 Research Article A Generalized Class of Exponential Type Estimators for Population Mean under
More informationElementary Statistical Methods and Measurement Error
Elementary Statistical Methods and Measurement Error (To Appear in The American Statistician) Steve Vardeman, Joanne Wendelberger, Tom Burr, Mike Hamada, Marcus Jobe, Max Morris, Huaiqing Wu (Supported
More informationAlgebra of Random Variables: Optimal Average and Optimal Scaling Minimising
Review: Optimal Average/Scaling is equivalent to Minimise χ Two 1-parameter models: Estimating < > : Scaling a pattern: Two equivalent methods: Algebra of Random Variables: Optimal Average and Optimal
More informationEXPONENTIAL CHAIN DUAL TO RATIO AND REGRESSION TYPE ESTIMATORS OF POPULATION MEAN IN TWO-PHASE SAMPLING
STATISTICA, anno LXXV, n. 4, 015 EXPONENTIAL CHAIN DUAL TO RATIO AND REGRESSION TYPE ESTIMATORS OF POPULATION MEAN IN TWO-PHASE SAMPLING Garib Nath Singh Department of Applied mathematics, Indian School
More informationASYMPTOTIC NORMALITY UNDER TWO-PHASE SAMPLING DESIGNS
Statistica Sinica 17(2007), 1047-1064 ASYMPTOTIC NORMALITY UNDER TWO-PHASE SAMPLING DESIGNS Jiahua Chen and J. N. K. Rao University of British Columbia and Carleton University Abstract: Large sample properties
More informationMODIFIED SYSTEMATIC SAMPLING WITH MULTIPLE RANDOM STARTS
RESTAT Statistical Journal olume 6, Number, April 08, 87 MODIFIED SYSTEMATIC SAMPLING WITH MULTIPLE RANDOM STARTS Authors: Sat Gupta Department of Mathematics and Statistics, University of North Carolina,
More informationConservative variance estimation for sampling designs with zero pairwise inclusion probabilities
Conservative variance estimation for sampling designs with zero pairwise inclusion probabilities Peter M. Aronow and Cyrus Samii Forthcoming at Survey Methodology Abstract We consider conservative variance
More informationIntroductory Econometrics
Based on the textbook by Wooldridge: : A Modern Approach Robert M. Kunst robert.kunst@univie.ac.at University of Vienna and Institute for Advanced Studies Vienna November 23, 2013 Outline Introduction
More informationA Chain Regression Estimator in Two Phase Sampling Using Multi-auxiliary Information
BULLETI of the Malaysian Mathematical Sciences Society http://math.usm.my/bulletin Bull. Malays. Math. Sci. Soc. () 8(1) (005), 81 86 A Chain Regression Estimator in Two Phase Sampling Using Multi-auxiliary
More informationMLES & Multivariate Normal Theory
Merlise Clyde September 6, 2016 Outline Expectations of Quadratic Forms Distribution Linear Transformations Distribution of estimates under normality Properties of MLE s Recap Ŷ = ˆµ is an unbiased estimate
More informationEfficient and Unbiased Estimation Procedure of Population Mean in Two-Phase Sampling
Journal of Modern Applied Statistical Methods Volume 5 Issue Article --06 Efficient and Unbiased Estimation Procedure of Population Mean in Two-Phase Sampling Reba Maji Indian School of Mines, Dhanbad,
More informationSHRINKAGE ESTIMATOR AND TESTIMATORS FOR SHAPE PARAMETER OF CLASSICAL PARETO DISTRIBUTION
Journal of Scientific Research Vol. 55, 0 : 8-07 Banaras Hindu University, Varanasi ISSN : 0447-9483 SHRINKAGE ESTIMATOR AND TESTIMATORS FOR SHAPE PARAMETER OF CLASSICAL PARETO DISTRIBUTION Ganesh Singh,
More informationINTERPENETRATING SAMPLES
INTERPENETRATING SAMPLES Dr. A. C. Kulshreshtha UN Statistical Institute for Asia and Pacific Second RAP Regional Workshop on Building Training Resources for Improving Agricultural & Rural Statistics Sampling
More information