ILO-IPEC Interactive Sampling Tools No. 3. Selection of Primary Sampling Units (PSUs) by systematic PPS sampling with constraints

Similar documents
Systematic Review Round 7 Application Form

MRD273-REV METADATA. Acronyms are Used to Identify the Data Set or Information Holding: MRD273-REV

Taking into account sampling design in DAD. Population SAMPLING DESIGN AND DAD

New Implements for Crop Production in the Semi-Arid Tropics

Surveying Informal Enterprises: Applying Stratified Adaptive Cluster Sampling using CAPI with Implementation and Monitoring Tools

Lowercase letters on four lines a-z

ISO INTERNATIONAL STANDARD. Thermal bridges in building construction Linear thermal transmittance Simplified methods and default values

MEDROXYPROGESTERONE INJECTION

MRD 281. Official Name of the Data Set or Information Holding: Northeastern Ontario Rhyolite Database

AGGREGATE RESOURCES OF ONTARIO (ARO) METADATA

ISO INTERNATIONAL STANDARD

APPENDIX A SAMPLE DESIGN

Choose Carefully! An Assessment of Different Sample Designs on Estimates of Official Statistics

3/3/2015 PTS Application

MRD283-REV METADATA. Acronyms are Used to Identify the Data Set or Information Holding: MRD283-REV

XR Analog Clock - Manual Setting Model Troubleshooting Guide

This document is a preview generated by EVS

GRADUATE RECORD EXAMINATIONS. Math Review. Chapter 2: Algebra

GEOPHYSICAL DATA SET (GDS) 1084B METADATA

ISO INTERNATIONAL STANDARD. Geographic information Spatial referencing by coordinates

ISO INTERNATIONAL STANDARD. Geographic information Spatial referencing by coordinates Part 2: Extension for parametric values

STP-TS THERMOPHYSICAL PROPERTIES OF WORKING GASES USED IN WORKING GAS TURBINE APPLICATIONS

Introductory Statistics Neil A. Weiss Ninth Edition

ISO INTERNATIONAL STANDARD. Optics and photonics Spectral bands. Optique et photonique Bandes spectrales. First edition

C o f l F - 9m307-- SPREADSHEET APPLICATION TO CLASSIFY RADIOACTIVE MATERIAL FOR SHIPMENTa. A. N. Brown

Knott, M. May Future t r e n d s

ISO INTERNATIONAL STANDARD. Water quality Determination of dissolved bromate Method by liquid chromatography of ions

MRD 207 METADATA DETAIL PAGE

ISO INTERNATIONAL STANDARD

ArcGIS Pro: Essential Workflows STUDENT EDITION

ISO 385 INTERNATIONAL STANDARD. Laboratory glassware Burettes. Verrerie de laboratoire Burettes. First edition

Weekly Written Arithmetic Questions

ISO INTERNATIONAL STANDARD

ISO INTERNATIONAL STANDARD. Geographic information Metadata Part 2: Extensions for imagery and gridded data

MRD 229-Revised METADATA

ISO INTERNATIONAL STANDARD. Rolling bearings Linear motion rolling bearings Part 2: Static load ratings

Weighting of Results. Sample weights

Sales Analysis User Manual

ISO INTERNATIONAL STANDARD. Mechanical vibration and shock Coupling forces at the man-machine interface for hand-transmitted vibration

ISO INTERNATIONAL STANDARD. Plastics Determination of hardness Part 1: Ball indentation method

ISO 2575 INTERNATIONAL STANDARD. Road vehicles Symbols for controls, indicators and tell-tales

August 3,1999. Stiffness and Strength Properties for Basic Sandwich Material Core Types UCRL-JC B. Kim, R.M. Christensen.

Elementary Linear Algebra with Applications Bernard Kolman David Hill Ninth Edition

An Accuracy Assessment of the Global Employment Trends Unemployment Rate Forecasts

Conducting Fieldwork and Survey Design

Manual Railway Industry Substance List. Version: March 2011

MRD 228 METADATA. Official Name of the Data Set or Information Holding: Physiography of Southern Ontario

73/2011 Coll. ACT PART ONE LABOUR OFFICE OF THE CZECH REPUBLIC

ISO INTERNATIONAL STANDARD. Thermal insulation for building equipment and industrial installations Calculation rules

Thank you so much for your interest in The Measured Mom printables!

Tel: Fax: e.mail: Visit our Website at:

Turbines and turbine sets Measurement of emitted airborne noise Engineering/survey method

Instruction to search natural compounds on CH-NMR-NP

INTERNATIONAL STANDARD

Plasma Response Control Using Advanced Feedback Techniques

Figure Figure

Uganda - National Panel Survey

ISO INTERNATIONAL STANDARD. Sample preparation Dispersing procedures for powders in liquids

ISO INTERNATIONAL STANDARD. Particle size analysis Laser diffraction methods. Analyse granulométrique Méthodes par diffraction laser

This document is a preview generated by EVS

This is a repository copy of Aggregation of growing crystals in suspension: III. Accounting for adhesion and repulsion.

Data Comparisons Y-12 West Tower Data

Metallic materials Knoop hardness test. Part 1: Test method. Matériaux métalliques Essai de dureté Knoop Partie 1: Méthode d'essai

Mark Scheme (Results) January 2011

SA/SNZ TS ISO :2015

The Practice Book for Conceptual Physics. Paul G. Hewitt Eleventh Edition

AS Australian Standard

Worked Examples for Nominal Intercoder Reliability. by Deen G. Freelon October 30,

RESEARCH OPPORTUNITIES AT THE CNS (Y-12 AND PANTEX) NUCLEAR DETECTION AND SENSOR TESTING CENTERS (NDSTC)

Mark Scheme (Results) Summer International GCSE Mathematics (4MA0) Paper 4HR

COMBINING ENUMERATION AREA MAPS AND SATELITE IMAGES (LAND COVER) FOR THE DEVELOPMENT OF AREA FRAME (MULTIPLE FRAMES) IN AN AFRICAN COUNTRY:

ISO INTERNATIONAL STANDARD. Soil quality Determination of organochlorine pesticides and. capture detection

MiSeq System. Denature and Dilute Libraries Guide

POWER SYSTEM OPERATING PROCEDURE LOAD FORECASTING

ISO INTERNATIONAL STANDARD. Thermal performance of windows, doors and shutters Calculation of thermal transmittance Part 1: Simplified method

Nordic Council of Ministers Grant Programme for Nordic-Baltic Non-Governmental Organisations (NGO) Cooperation 2019 Estonia.

ISO/TR TECHNICAL REPORT. Nanotechnologies Methodology for the classification and categorization of nanomaterials

AC dipole based optics measurement and correction at RHIC

Part 7: Glossary Overview

INTERNATIONAL STANDARD. This is a preview of "ISO :2009". Click here to purchase the full version from the ANSI store.

Strata Scheme Size by State and Territory

POWER SYSTEM OPERATING PROCEDURE LOAD FORECASTING

ESSENTIAL SAMPLE. Mathematical Methods 1&2CAS MICHAEL EVANS KAY LIPSON DOUG WALLACE

Estimation Techniques in the German Labor Force Survey (LFS)

INTERNATIONAL STANDARD

Tutor Resources for the AMEP

Natura 2000 and spatial planning. Executive summary

Sýnishorn. Water quality Enumeration of Clostridium perfringens Method using membrane filtration

ISO INTERNATIONAL STANDARD. Reference radiation fields Simulated workplace neutron fields Part 1: Characteristics and methods of production

INTERNATIONAL STANDARD

Substance identification and how to report it in IUCLID 6

Chemical Risk Assessment Guide

Mark Scheme (Results) June Pearson Edexcel International GCSE Mathematics A (4MA0) Paper 3HR

NOTE BY THE TECHNICAL SECRETARIAT ESTABLISHMENT OF A DATABASE OF EXPERTISE ON PEACEFUL USES OF CHEMISTRY

Modelling and projecting the postponement of childbearing in low-fertility countries

Trish Price & Peter Price. 10 Minutes a Day. Level 1. Book 1: Addition Worksheets

This document is a preview generated by EVS

A Second Course in Statistics Regression Analysis William Mendenhall Terry Sincich Seventh Edition......

ISO 355 INTERNATIONAL STANDARD. Rolling bearings Tapered roller bearings Boundary dimensions and series designations

Mechanics of Materials

Transcription:

ILO-IPEC Interactive Sampling Tools No. 3 Selection of Primary Sampling Units (PSUs by systematic PPS sampling with constraints Version 1 August 2014 International Programme on the Elimination of Child Labour (IPEC Fundamental Principles and Rights at Work (FPRW Branch Governance and Tripartism Department

Copyright International Labour Organization 2014 First published 2014 Publications of the International Labour Office enjoy copyright under Protocol 2 of the Universal Copyright Convention. Nevertheless, short excerpts from them may be reproduced without authorization, on condition that the source is indicated. For rights of reproduction or translation, application should be made to ILO Publications (Rights and Permissions, International Labour Office, CH-1211 Geneva 22, Switzerland, or by email: pubdroit@ilo.org. The International Labour Office welcomes such applications. Libraries, institutions and other users registered with reproduction rights organizations may make copies in accordance with the licences issued to them for this purpose. Visit www.ifrro.org to find the reproduction rights organization in your country. ILO-IPEC ILO-IPEC Interactive Sampling Tools No. 3 - Selection of Primary Sampling Units (PSUs by systematic PPS sampling with constraints / International Labour Office, International Programme on the Elimination of Child Labour (IPEC - Geneva: ILO, 2014 ACKNOWLEDGEMENTS This publication was elaborated by Mr. Farhad Mehran, consultant, for ILO-IPEC and coordinated by Mr. Federico Blanco Allais from IPEC Geneva Office. Funding for this ILO publication was provided by the United States Department of Labor (Projects GLO/13/21/USA & GLO/10/55/USA. This publication does not necessarily reflect the views or policies of the United States Department of Labor, nor does mention of trade names, commercial products, or organizations imply endorsement by the United States Government. The designations employed in ILO publications, which are in conformity with United Nations practice, and the presentation of material therein do not imply the expression of any opinion whatsoever on the part of the International Labour Office concerning the legal status of any country, area or territory or of its authorities, or concerning the delimitation of its frontiers. The responsibility for opinions expressed in signed articles, studies and other contributions rests solely with their authors, and publication does not constitute an endorsement by the International Labour Office of the opinions expressed in them. Reference to names of firms and commercial products and processes does not imply their endorsement by the International Labour Office, and any failure to mention a particular firm, commercial product or process is not a sign of disapproval. ILO publications and electronic products can be obtained through major booksellers or ILO local offices in many countries, or direct from ILO Publications, International Labour Office, CH-1211 Geneva 22, Switzerland. Catalogues or lists of new publications are available free of charge from the above address, or by email: pubvente@ilo.org or visit our website: www.ilo.org/publns. Visit our website: www.ilo.org/ipec Available in electronic PDF format only. Photocomposed by ILO-IPEC Geneva. 2 ILO s International Programme on the Elimination of Child Labour (IPEC

1. Introduction This document describes the use of the template PSU Selection of the SIMPOC Interactive Sampling Tools. The template assists the user to select sample PSUs within a given stratum or domain by systematic sampling with probabilities proportional to size (pps with constraints. The constraints specify the treatment of very large and very small PSUs. As many PSUs can be inserted in the template as there are PSUs in the sampling frame. The template is divided into three parts: Input values, Output values and Intermediary calculations. The contents and use of each part is described in turn below. 2. Input values There are two types of input values: input parameters and input data as shown in the top and bottom panels below: INPUT VALUES Total sample size in stratum/domain (number of households 156 New random Total number of PSUs in stratum/domain in frame 134 Start Sample take per PSU (number of households 16 0.871336 Stratum or domain PSU code number Number of Measure of households PSU size h i Ni xi Stratum or domain 1 PSU 1 31 31 Stratum or domain 1 PSU 2 71 71 Stratum or domain 1 PSU 3 211 211 Stratum or domain 1 PSU 4 23 23 Stratum or domain 1 PSU 5 151 151 Stratum or domain 1 PSU 6 74 74 Stratum or domain 1 PSU 7 2651 2651 Stratum or domain 1 PSU 8 64 64 Stratum or domain 1 PSU 9 107 107 Stratum or domain 1 PSU 10 102 102 Stratum or domain 1 PSU 11 12 12 The input parameters are few and specified manually in the top rows of the template, while the input data are generally numerous and transferred from another file in the first four columns of the template. Input parameters Sample allocation is based on five input parameters: Selection of Primary Sampling Units (PSUs by Systematic PPS Sampling with Constraints 3

n h = Number of sample households to be selected for the specified domain or stratum h M h = Total number of PSU in the specified domain or stratum h in the sampling frame b = Sample take per PSU, i.e., number of households to be sampled per PSU in the specified domain or stratum h w max = Maximum weight any PSU should receive. The pps sampling scheme may result in very large weights for units of very small size. The specification of w max serves to limit the maximum weight for very small PSUs o = Random start for systematic pps sampling of PSU. Random number between 0 and 1 Input data The input data are in the form of four columns: Col A h = The domain or stratum name. In the present version, the template concerns a unique domain or stratum. The template is applied separately for each domain or stratum. In a later version of the template, it is envisaged to use the template simultaneously to all domains and strata Col B i = PSU code number. The list of all PSUs in the frame for the specified domain or stratum identified by their PSU code number Col C N i = Total number of households in the PSU i according to information in the sampling frame Col D x i = Measure of size to be used for systematic pps sampling. The measure of size may be the same as the number of households (Ni or it may be specified differently, for example, the number of children 5 to 11 years old according to the information in the sampling frame 4 ILO s International Programme on the Elimination of Child Labour (IPEC

3. Output values There are two sets of output values depending upon the method of selection for very large PSUs: Selection (1: Virtual division of very large PSUs Selection (2: Automatic selection of very large PSUs In each case, there are also two types of output values: output parameters and output data as shown in the top and bottom panels below: OUTPUTS VALUES OUTPUT VALUES Effective sample size in stratum or domain 160 160 Number of sample PSU to be selected 10 10 Selection (1: Virtual division of very large PSUs Selection proportion Random value Sample Effective sample size Selection (2: Automatic selection of very large PSUs Selection probability Random value Sample Effective sample size Pi -0.871336 si ni i -0.871336 si ni 2.3% -0.848 0 0 2.6% -0.845 0 0 5.3% -0.795 0 0 5.9% -0.786 0 0 15.7% -0.638 0 0 17.7% -0.609 0 0 1.7% -0.621 0 0 1.9% -0.590 0 0 11.3% -0.508 0 0 12.6% -0.464 0 0 5.5% -0.453 0 0 6.2% -0.402 0 0 197.8% 1.526 2 32 100.0% -0.598 1 16 4.8% 1.573 0 0 5.4% -0.652 0 0 8.0% 1.653 0 0 9.0% -0.742 0 0 7.6% 1.729 0 0 8.5% -0.827 0 0 0.9% 1.738 0 0 1.0% -0.837 0 0 The output parameters are shown in the top rows of the output values block of the template. The output data are given in the columns below the output parameters. Output parameters For each method of selection, there are two output parameters: (i The effective sample size in the specified domain or stratum h. The effective number of sample households may differ from the number of sample households n h specified as input parameter due to rounding or due to very small PSUs in which the number of households in the frame is lower than the specified sample take (ii m h = The number of sample PSU to be selected for the specified domain or stratum h Selection of Primary Sampling Units (PSUs by Systematic PPS Sampling with Constraints 5

In the case of selection method (2, an additional output parameter is calculated: (iii p min = The minimum probability of selection of any PSU. Output data For each method of selection, there are four sets of output data. In the case of method of selection (1, the output data are given in columns F to I: Col F p i = Selection proportion or more precisely the proportion of total measure of size in PSU i p i m h x i x i i p i is not a probability. Its value can be greater than one, or for that matter greater than two, three, or more depending on the size of the PSU. Col G e i = Random number for PSU i obtained systematically from the selection proportion p i and the random number of the preceding PSU i-1 e i p i e i 1 where e o is given by the random start specified in the input parameters e o randomstart(0,1 Col H s i = sample inclusion indicator specifying whether PSU i is included in the sample or not, and if included in how many virtual divisions s i Int(e i Int(e i 1 where Int(x is the integer function returning the largest integer less or equal to x Col I n i = effective sample size in PSU i. It is the minimum value between the corresponding sample take and the total number of households in the PSU n i min(b s i,n i where b is the sample take per PSU specified in the input parameters. 6 ILO s International Programme on the Elimination of Child Labour (IPEC

In the case of method of selection (2, similar output data are produced and given in columns J to M: Col J i = probability of selection of PSU i. The probability of selection is proportional to the size of the PSU, i min(1,k x i The proportionality factor k is determined as part of in the intermediary calculations. Col K e i = Random number for PSU i obtained systematically from the selection probability π i and the random number of the preceding PSU i-1 e i i e i 1 where e o is given by the same random start specified in the input parameters e o randomstart(0,1 Col L s i = sample inclusion indicator specifying whether PSU i is included in the sample or not, and if included in how many virtual divisions s i Int(e i Int(e i 1 where Int(x is the integer function returning the largest integer less or equal to x. In selection method (2, the sample selection indicator s i takes on only values 1 or 0, s i = 1 if PSU i is selected and s i = 0 if PSU i is not selected in the sample. Col M n i = effective sample size in PSU i. It is the minimum value between the corresponding sample take and the total number of households in the PSU n i min(b s i,n i where b is the sample take per PSU specified in the input parameters. Selection of Primary Sampling Units (PSUs by Systematic PPS Sampling with Constraints 7

4. Intermediary calculations The intermediary calculations are for obtaining the proportionality factor used for deriving the probabilities of selection of the PSUs. The steps are shown in the following panel. Iteration 1 k = 0.0007463 INTERMEDIARY CALCULATIONS Newton s approximation Iteration 2 k = 0.0008373 Iteration 3 k = 0.0008373 f (k f(k f (k f(k f (k f(k 31 2.3% 31 2.6% 31 2.6% 71 5.3% 71 5.9% 71 5.9% 211 15.7% 211 17.7% 211 17.7% 23 1.7% 23 1.9% 23 1.9% 151 11.3% 151 12.6% 151 12.6% 74 5.5% 74 6.2% 74 6.2% 0 100.0% 0 100.0% 0 100.0% 64 4.8% 64 5.4% 64 5.4% 107 8.0% 107 9.0% 107 9.0% 102 7.6% 102 8.5% 102 8.5% 12 0.9% 12 1.0% 12 1.0% The probability of selection of a PSU, i, is expressed as a non-linear function of the proportionality factor k, where i (k max[ p min,min(1,k x i ] p min 1 w max is a constant representing the minimum probability of selection so that the corresponding sampling weight does not exceed a maximum specified value W max, and x i is the measure of size of PSU i. The value 1 in the expression min(1,k x i is to ensure that the calculated probability does not exceed the value 1. Given that the sample design is with a fixed sample size, n h, in each stratum, the sum of the probabilities must be equal to n h : i i (k n h The determination of the proportionality factor k may thus be expressed as the solution of the non-linear function, 8 ILO s International Programme on the Elimination of Child Labour (IPEC

f (k (k n h 0 The solution may be obtained iteratively by Newton s method 1 i i k k f (k n h f '(k where f and f are the function and its derivative with respect to k, f (k max[ p min,min(1,k x i ] i f '(k ( x i 2 (1 sign([k x i p min ][1 k x i ] The calculations of f(k and f (k have been programmed in the columns O and T of the intermediary calculations of the Template for 3 iterations with starting value of k o k o n h x i i 1 http://en.wikipedia.org/wiki/newton's_method. Selection of Primary Sampling Units (PSUs by Systematic PPS Sampling with Constraints 9