Sampling approaches in animal health studies. Al Ain November 2017

Similar documents
Sampling. General introduction to sampling methods in epidemiology and some applications to food microbiology study October Hanoi

Sample size and Sampling strategy

CROSS SECTIONAL STUDY & SAMPLING METHOD

FCE 3900 EDUCATIONAL RESEARCH LECTURE 8 P O P U L A T I O N A N D S A M P L I N G T E C H N I Q U E

Part 7: Glossary Overview

Taking into account sampling design in DAD. Population SAMPLING DESIGN AND DAD

Lecturer: Dr. Adote Anum, Dept. of Psychology Contact Information:

CHOOSING THE RIGHT SAMPLING TECHNIQUE FOR YOUR RESEARCH. Awanis Ku Ishak, PhD SBM

Sampling. Sampling. Sampling. Sampling. Population. Sample. Sampling unit

Examine characteristics of a sample and make inferences about the population

Lecture 5: Sampling Methods

MN 400: Research Methods. CHAPTER 7 Sample Design

Module 4 Approaches to Sampling. Georgia Kayser, PhD The Water Institute

Role of GIS in Tracking and Controlling Spread of Disease

Formalizing the Concepts: Simple Random Sampling. Juan Muñoz Kristen Himelein March 2012

Introduction to risk assessment

Formalizing the Concepts: Simple Random Sampling. Juan Muñoz Kristen Himelein March 2013

Georgia Kayser, PhD. Module 4 Approaches to Sampling. Hello and Welcome to Monitoring Evaluation and Learning: Approaches to Sampling.

Stochastic calculus for summable processes 1

BIOSTATISTICS. Lecture 4 Sampling and Sampling Distribution. dr. Petr Nazarov

What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty.

Part IV Statistics in Epidemiology

Prepared by: DR. ROZIAH MOHD RASDI Faculty of Educational Studies Universiti Putra Malaysia

TECH 646 Analysis of Research in Industry and Technology

POPULATION AND SAMPLE

Sampling in Space and Time. Natural experiment? Analytical Surveys

Information and surveillance system of vesicular diseases in the Americas. Use of grid maps for monitoring, data collection and reporting

Sampling. Module II Chapter 3

Part A: Salmonella prevalence estimates. (Question N EFSA-Q ) Adopted by The Task Force on 28 March 2007

3/9/2015. Overview. Introduction - sampling. Introduction - sampling. Before Sampling

Design of 2 nd survey in Cambodia WHO workshop on Repeat prevalence surveys in Asia: design and analysis 8-11 Feb 2012 Cambodia

Fundamentals of Applied Sampling

Introduction to Survey Data Analysis

Figure Figure

Chapter 6 ESTIMATION OF PARAMETERS

DATA COLLECTION & SAMPLING

Confounding and effect modification: Mantel-Haenszel estimation, testing effect homogeneity. Dankmar Böhning

Sampling Theory in Statistics Explained - SSC CGL Tier II Notes in PDF

SAMPLING TECHNIQUES ASSOC. PROF. DR. MOHD ROSNI SULAIMAN FACULTY OF FOOD SCIENCE AND NUTRITION UNIVERSITI MALAYSIA SABAH

Sampling. Vorasith Sornsrivichai, M.D., FETP Cert. Epidemiology Unit, Faculty of Medicine, PSU

Lecture 5: Poisson and logistic regression

CHE Chemical Engineering Operations

Effect Modification and Interaction

emerge Network: CERC Survey Survey Sampling Data Preparation

SYA 3300 Research Methods and Lab Summer A, 2000

STAC51: Categorical data Analysis

Module 16. Sampling and Sampling Distributions: Random Sampling, Non Random Sampling

Application of Statistical Analysis in Population and Sampling Population

ANALYSIS OF SURVEY DATA USING SPSS

Q.1 Define Population Ans In statistical investigation the interest usually lies in the assessment of the general magnitude and the study of

PubH 7470: STATISTICS FOR TRANSLATIONAL & CLINICAL RESEARCH

Data Collection: What Is Sampling?

Draft Proof - Do not copy, post, or distribute

Lecture 2: Poisson and logistic regression

Session 2.1: Terminology, Concepts and Definitions

Lesson 6 Population & Sampling

The use of Geographic Information System (GIS) and non-gis methods to assess the external validity of samples postcollection

Sampling Methods and the Central Limit Theorem GOALS. Why Sample the Population? 9/25/17. Dr. Richard Jerz

Probability and Discrete Distributions

TECH 646 Analysis of Research in Industry and Technology

Research Methods in Environmental Science

Sampling and Estimation in Agricultural Surveys

SUPPLEMENTARY MATERIAL

Lectures of STA 231: Biostatistics

SAMPLING- Method of Psychology. By- Mrs Neelam Rathee, Dept of Psychology. PGGCG-11, Chandigarh.

Genetic Parameters for Stillbirth in the Netherlands

Vehicle Freq Rel. Freq Frequency distribution. Statistics

The World Bank Mongolia Livestock and Agricultural Marketing Project (P125964)

Day 8: Sampling. Daniel J. Mallinson. School of Public Affairs Penn State Harrisburg PADM-HADM 503

Population, Sample, and Sampling Techniques. Identify the Unit of Analysis. Unit of Analysis 4/9/2013. Dr. K. A. Korb UniJos

GIS and Health Geography. What is epidemiology?

A SAS MACRO FOR ESTIMATING SAMPLING ERRORS OF MEANS AND PROPORTIONS IN THE CONTEXT OF STRATIFIED MULTISTAGE CLUSTER SAMPLING

Now we will define some common sampling plans and discuss their strengths and limitations.

USING CLUSTERING SOFTWARE FOR EXPLORING SPATIAL AND TEMPORAL PATTERNS IN NON-COMMUNICABLE DISEASES

Animal Models. Sheep are scanned at maturity by ultrasound(us) to determine the amount of fat surrounding the muscle. A model (equation) might be

Chapter Six: Two Independent Samples Methods 1/51

Section 4. Test-Level Analyses

BIOL 51A - Biostatistics 1 1. Lecture 1: Intro to Biostatistics. Smoking: hazardous? FEV (l) Smoke

Developing continental maps of African animal trypanosomosis: the example of Ethiopia, Kenya and Uganda

Key elements An open-ended questionnaire can be used (see Quinn 2001).

An Introduction to Probability and Statistics

Predictive Thresholds for Plague in Kazakhstan

VCS MODULE VMD0018 METHODS TO DETERMINE STRATIFICATION

Notes 3: Statistical Inference: Sampling, Sampling Distributions Confidence Intervals, and Hypothesis Testing

What do plants compete for? What do animals compete for? What is a gamete and what do they carry? What is a gene?

TYPES OF SAMPLING TECHNIQUES IN PHYSICAL EDUCATION AND SPORTS

ECON1310 Quantitative Economic and Business Analysis A

BOOK REVIEW Sampling: Design and Analysis. Sharon L. Lohr. 2nd Edition, International Publication,

Sampling Weights. Pierre Foy

emerge Network: CERC Survey Survey Sampling Data Preparation

Side-by-Side Comparison of the Texas Educational Knowledge and Skills (TEKS) and Louisiana Grade Level Expectations (GLEs) SCIENCE: Grade 7

Known unknowns : using multiple imputation to fill in the blanks for missing data

Auditing and Investigations Overview

Auditing and Investigations Overview

SAMPLING II BIOS 662. Michael G. Hudgens, Ph.D. mhudgens :37. BIOS Sampling II

Analysis of Longitudinal Data. Patrick J. Heagerty PhD Department of Biostatistics University of Washington

Sampling Populations limited in the scope enumerate

Introduction to SEIR Models

III Introduction to Populations III Introduction to Populations A. Definitions A population is (Krebs 2001:116) a group of organisms same species

Explain the role of sampling in the research process Distinguish between probability and nonprobability sampling Understand the factors to consider

Transcription:

Sampling approaches in animal health studies Al Ain 13-14 November 2017

Definitions Populations and Samples From a statistical point of view: Population is a set of measurements, that may be finite or infinite, really existing or conceptual Sample is a sub-set of measurements taken from the population and used to estimate population parameters

Definitions Populations and Samples From an epidemiological point of view: Population is a set of individuals (or a set of aggregates of individuals) that may be classified according to one or more homogeneous criteria (residence, production attitude, genetic structure, etc.) Sample is a sub-set taken from the population and used to gather information on the population itself

Populations and samples STATISTIC. EPIDEMIOL. EXAMPLES Population is a set of measurements... that may be finite,... Meanings and examples Population is a set of individuals that may be classified according to homogeneous criteria Antibody titers against a certain disease in a cattle herd Animals in a herd infinite,... (it exists only conceptually) Italian cattle population

Populations and samples STATISTIC. EPIDEMIOL. EXAMPLES really existing... Meanings and examples The above mentioned cattle or conceptual The results of a series (potentially infinite) of administrations of a certain drug to animals of a defined species

Populations and samples Meanings and examples STATISTIC. EPIDEMIOL. EXAMPLES Sample is a subset of measurements taken from the population and used to estimate population parameters Sample is a sub-set taken from the population and used to gather information on the population itself 32 randomly selected animals 10 females and 10 males randomly selected Every tenth animal (arriving to an abattoir, a custom, etc.) Reproductive stock (***)

Phases of sample design Objectives of sampling Target population admission criteria stratification criteria Type of sampling Sample size Time of sampling

Phases of sample design Objectives of sampling Target population admission criteria stratification criteria Type of sampling Sample size Time of sampling

Objectives They should be: Explicit Clearly defined Actually achievable

Examples of objectives UNCORRECT To know which diseases exist in UAE To know whether or not «disease X» is present in camels of UAE CORRECT To know whether or not «disease X» is present in a camel population of UAE at a prevalence rate higher than a predefined threshold

Examples of objectives UNCORRECT To measure the occurrence of «disease X» in the population of camels of UAE To measure the prevalence of «disease X» in the population of camels of UAE CORRECT Considering an acceptable level of error (standard error), and an approximate value of expected prevalence, to measure the serological prevalence of «disease X» in the population of camels of UAE

Definition of objectives The definition of objectives in the case of a data collection on sample basis should be more precise than in the case of data collection on the whole population because the planning of qualitative and quantitative aspects of sampling is strictly dependent on chosen objectives

Definition of objectives Examples Sample size is different in the case we need to detect whether a disease is not present in a certain population from the case we need to measure the prevalence of that disease in the same population (continue)

Definition of objectives Examples Target population is different in the case we need to investigate the risk for the consumer due to a certain disease/pollutant from the case we need to investigate the presence of that disease or pollutant in an animal population (continue)

Definition of objectives Examples The sizes of samples collected to estimate the prevalence of a certain disease differ, depending on the expected prevalence of that disease and on the degree of precision requested to the estimate itself

Phases of sample design Objectives of sampling Target population admission criteria stratification criteria Type of sampling Sample size Time of sampling

Phases of sample design Objectives of sampling Target population admission criteria stratification criteria Type of sampling Sample size Time of sampling

Target population Admission criteria strictly depend on chosen objectives...i.e... Admission criteria are the criteria according to which each individual may or may not be included in the sample. They are a direct consequence of the objectives of sampling

Admission criteria Example #1 Objective To estimate the risk for the consumer get brucellosis drinking raw camel milk Criterion This objective needs to be split into three sub-objectives: 1. To estimate the average amount of raw camel milk in the diet 2. To estimate the prevalence of brucellosis in camel herds 3. To estimate the level of Brucella concentration in camel milk

Example #1 Objective 2. To estimate the prevalence of brucellosis in camel herds Objective 1. To estimate the average amount of raw camel milk in the diet Criterion Criterion The whole human population of the country, subdivided by age groups, gender and other possible relevant conditions (such as rural vs urban population, etc.) All adult female camels during the milking periods Admission criteria Objective 3. To estimate the level of Brucella concentration in camel milk Criterion Milk produced in infected herds

Objective To determine the probability that RVF infection is introduced into EAU through imported camels Example #2 Admission criteria Criterion Two objectives: 1. Probability that one or more RVF infected camels are introduced into EAU in the unit of time 2. Probability that a RVF infected camel will be not detected by the import controls

Admission criteria Example #2 Objective 1. Probability that one or more RVF infected camels are introduced into EAU in the unit of time Criterion All (not vaccinated against RVF) camels imported from RVF infected countries during the vector season

Admission criteria Example #2 Objective 2. Probability that a RVF infected camel will be not detected by the import controls Criterion All imported RVF infected camels subjected to import controls

Phases of sample design Objectives of sampling Target population admission criteria stratification criteria Type of sampling Sample size Time of sampling

Phases of sample design Objectives of sampling Target population admission criteria stratification criteria Type of sampling Sample size Time of sampling

Type of sampling Simple random sampling Systematic random sampling Stratified sampling Multistage sampling

Systematic random sampling It is obtained by putting in a urn a set of tags (one for each element of the population) with the ID numbers of each element (e.g. eartag number) and taking from the urn the desired number of tags. It may also be obtained using random number tables or using a computer Veterinarian Archive of herd files Urn

Systematic random sampling We need to extract a sample of n elements from a population of size N: - a sampling step is defined: k=n/n - the first animal to be sampled is randomly selected between 1 and k - and every k th animal starting from the first selected animal is introduced in the sample

Systematic random sampling Advantages We do not need to know each element of the population, the only needed piece of information is population size It is suitable, for example, in quality control or when population units manifest themselves in a time sequence (e.g. in a laboratory, slaughterhouse, custom office, etc.)

Stratified sampling Population is subdivided into strata (sub-groups) and a simple or systematic random sample is taken from each stratum Characters chosen as strata (age, sex, breed, geographical origin, herd size, etc.) must be relevant for the investigation

Stratified sampling Advantages It allows to perform a sampling that is representative of each stratum in the population, even in the case of very small subgroups It allows (in the case of non proportional stratified sampling) to draw also inferences on single strata of the population It reduces sampling variance (increase in precision of estimates)

Stratified sampling Disadvantages Effects of stratification features need to be known in advance

Stratified sampling Two types of Stratified sampling exist: Proportional s.s. Non-proportional s.s.

Stratified sampling Examples We need to sample 60 sheep from a population of 7800, subdivided in 4 flocks. Proportional stratified sampling: from each flock a sample is taken proportionally to the weight of that flock relative to total population

Examples Stratified sampling I flock 3000 60*3000:7800= 23 II flock 800 6 III flock 2500 19 IV flock 1500 12 TOTAL 7800 60

Stratified sampling Examples We need to sample 60 sheep from a population of 7800, subdivided in 4 flocks. Non-proportional stratified sampling: from each flock an equal sample is taken (e.g. to have a pre-defined and equal precision of the estimates for each stratum of the population)

Stratified sampling Examples I flock 3000 15 II flock 800 15 III flock 2500 15 IV flock 1500 15 TOTAL 7800 60

Multi-stage sampling It is a combination of the previous methods The sample is selected in two or more phases Example: a sample of dairy herds is randomly selected and then a sample of heifers is randomly selected in each selected herd

Phases of sample design Objectives of sampling Target population admission criteria stratification criteria Type of sampling Sample size Time of sampling

Phases of sample design Objectives of sampling Target population admission criteria stratification criteria Type of sampling Sample size Time of sampling

Sample size We need to consider two different types of variables to be studied: Dichotomous variables Presence/absence of a disease Presence/absence of residues or pollutants

Sample size Continuous variables Intake of mercury through the diet Concentration of antimicrobial drugs in food NOTE: in both cases, dichotomization may be performed based on defined thresholds, but it would produce a loss of information

Dichotomous variables

Sample size to detect a disease in a population with a threshold prevalence P POPULATION SIZE 0.50% 1.00% 2.00% 5.00% 10.00% 20 20 20 20 19 15 50 50 50 48 34 22 100 100 95 77 44 25 250 227 174 112 52 27 500 349 224 128 55 28 1000 450 258 138 57 28 2000 517 277 143 58 28 10000 580 294 147 58 28 INFINITE 598 298 148 58 28 Confidence level = 95%

Sample size to detect a disease in a population with a threshold prevalence P POPULATION SIZE 0.50% 1.00% 2.00% 5.00% 10.00% 20 20 20 20 20 18 50 50 50 50 41 29 100 100 99 90 59 35 250 244 210 149 75 40 500 420 300 183 82 42 1000 601 367 204 86 43 2000 736 409 215 88 43 10000 878 448 225 89 44 INFINITE 919 458 228 90 44 Confidence level = 99%

Finite populations: n 1 1 1 D D * N 1 2 D=number of animals infected in the population N=total number of animals in the population α=confidence level Infinite populations: n log log 1 1 p α=confidence level p=prevalence of infection

Calculation of the sample size For infinite populations in may be easily calculated using a spreadsheet like Excel: given the threshold prevalence p, for example 1%=0.01 and the confidence level, for example 95%=0.95 sample size may be calculated as =log(1-0.95)/log(1-0.01) and the result is 298

Sample size needed to estimate a prevalence of infection with confidence level = 95% Infinite population DESIRED LEVEL OF PRECISION EXPECTED PREVALENCE IN THE POPULATION 1,00% 2,00% 5,00% 8,00% 10,00% 5% 1825 456 73 10% 3457 864 138 54 35 15% 4898 1225 196 77 49 20% 6147 1537 246 96 61 40% 9220 2305 369 144 92 50% 9604 2401 384 150 96

Sample size needed to estimate a prevalence of infection with confidence level = 99% Infinite population DESIRED LEVEL OF PRECISION EXPECTED PREVALENCE IN THE POPULATION 1,00% 2,00% 5,00% 8,00% 10,00% 5% 3150 787 126 10% 5968 1492 239 93 60 15% 8454 2114 338 132 85 20% 10609 2652 424 166 106 40% 15914 3978 637 249 159 50% 16577 4144 663 259 166

Calculation of the sample size From: n z * p* 1 p 2 2 E p=expected prevalence E=accepted level of error za=1.96 for 95% c.l. za=2.575 for 99% c.l.

Calculation of the sample size For finite populations, calculate the sample size using the following formula: nf=(ni*n)/(ni+n) Where: nf = sample size for finite population ni = sample size for infinite population (from the table) N = size of the total population

Continuous variables

Calculation of the sample size Sample size for estimating mean value of a continuous variable Confidence Precision of estimation level ±10% ±20% ±30% ±50% ±100% 95% 384 96 43 15 4 99% 697 174 77 28 7 n 1, 96 E 2 Confidence level = 95% n 2, 64 E 2 Confidence level = 99% E=accepted level of error σ=standard deviation of variable s distribution

Phases of sample design Objectives of sampling Target population admission criteria stratification criteria Type of sampling Sample size Time of sampling

Phases of sample design Objectives of sampling Target population admission criteria stratification criteria Type of sampling Sample size Time of sampling

Time and duration of sampling They depend on: Objective of sampling Type of study - transversal - longitudinal Seasonality of the studied phenomenon, which depends on: - host biology - parasite biology - husbandry habits and traditions Logistic constraints

Final remarks In case of rare events sample surveys are not suitable Examples of rare events are: Exotic diseases or newly introduced ones Diseases in final phases of eradication programs, when prevalence approaches zero Diseases rare for their own nature (e.g. BSE) For this reason, early warning systems cannot be based on random sampling

Final remarks In all these cases, clinical ( passive ) surveillance is more effective Even if passive surveillance has a low sensitivity, this low sensitivity is compensated by the large portion of the population that it can check Therefore, in case of rare events, the most costeffective way to deal with problems is: Passive surveillance Risk-based surveillance which is a targeted (not random) active surveillance targetting those population strata with higher probability of being exposed and infected

Thank you for your attention!