Now we will define some common sampling plans and discuss their strengths and limitations.

Similar documents
CHOOSING THE RIGHT SAMPLING TECHNIQUE FOR YOUR RESEARCH. Awanis Ku Ishak, PhD SBM

Teaching Research Methods: Resources for HE Social Sciences Practitioners. Sampling

Module 9: Sampling IPDET. Sampling. Intro Concepts Types Confidence/ Precision? How Large? Intervention or Policy. Evaluation Questions

MN 400: Research Methods. CHAPTER 7 Sample Design

Sampling. Module II Chapter 3

Data Collection: What Is Sampling?

FCE 3900 EDUCATIONAL RESEARCH LECTURE 8 P O P U L A T I O N A N D S A M P L I N G T E C H N I Q U E

Georgia Kayser, PhD. Module 4 Approaches to Sampling. Hello and Welcome to Monitoring Evaluation and Learning: Approaches to Sampling.

Lesson 6 Population & Sampling

Prepared by: DR. ROZIAH MOHD RASDI Faculty of Educational Studies Universiti Putra Malaysia

Module 4 Approaches to Sampling. Georgia Kayser, PhD The Water Institute

Examine characteristics of a sample and make inferences about the population

SAMPLING- Method of Psychology. By- Mrs Neelam Rathee, Dept of Psychology. PGGCG-11, Chandigarh.

Statistics 301: Probability and Statistics Introduction to Statistics Module

Inferential Statistics. Chapter 5

Notes 3: Statistical Inference: Sampling, Sampling Distributions Confidence Intervals, and Hypothesis Testing

POPULATION AND SAMPLE

Sample size and Sampling strategy

Training and Technical Assistance Webinar Series Statistical Analysis for Criminal Justice Research

Business Statistics: A First Course

Topic 3 Populations and Samples

and the Sample Mean Random Sample

Why Sample? Selecting a sample is less time-consuming than selecting every item in the population (census).

Module 16. Sampling and Sampling Distributions: Random Sampling, Non Random Sampling

Statistics for Managers Using Microsoft Excel 5th Edition

Figure Figure

Fundamentals of Applied Sampling

Sampling. Benjamin Graham

Statistics 1L03 - Midterm #2 Review

Lecturer: Dr. Adote Anum, Dept. of Psychology Contact Information:

Lecture 5: Sampling Methods

Study on Method of Mass Communication Research 传播研究方法 (6) Dr. Yi Mou 牟怡

TYPES OF SAMPLING TECHNIQUES IN PHYSICAL EDUCATION AND SPORTS

PSY 250 8/29/2011. Sampling. Choosing your Sample. Sampling: Selecting Research Participants. Selecting a sample of participants from the population

Research Methods in Environmental Science

Chapter Goals. To introduce you to data collection

Lectures of STA 231: Biostatistics

Day 8: Sampling. Daniel J. Mallinson. School of Public Affairs Penn State Harrisburg PADM-HADM 503

Stochastic calculus for summable processes 1

Part I. Sampling design. Overview. INFOWO Lecture M6: Sampling design and Experiments. Outline. Sampling design Experiments.

Chapter 1 Statistical Inference

Vehicle Freq Rel. Freq Frequency distribution. Statistics

PSY 250. Sampling. Representative Sample. Representativeness 7/23/2015. Sampling: Selecting Research Participants

What is sampling? shortcut whole population small part Why sample? not enough; time, energy, money, labour/man power, equipment, access measure

Statistic: a that can be from a sample without making use of any unknown. In practice we will use to establish unknown parameters.

Probability and Inference. POLI 205 Doing Research in Politics. Populations and Samples. Probability. Fall 2015

TECH 646 Analysis of Research in Industry and Technology

Application of Statistical Analysis in Population and Sampling Population

What Is a Sampling Distribution? DISTINGUISH between a parameter and a statistic

Unit 3 Populations and Samples

SYA 3300 Research Methods and Lab Summer A, 2000

Taking into account sampling design in DAD. Population SAMPLING DESIGN AND DAD

3 PROBABILITY TOPICS

Sampling Populations limited in the scope enumerate

DATA COLLECTION & SAMPLING

Discrete Mathematics and Probability Theory Spring 2014 Anant Sahai Note 10

Jakarta, Indonesia,29 Sep-10 October 2014.

Sampling distributions and the Central Limit. Theorem. 17 October 2016

TECH 646 Analysis of Research in Industry and Technology

9/2/2010. Wildlife Management is a very quantitative field of study. throughout this course and throughout your career.

Answer keys for Assignment 10: Measurement of study variables (The correct answer is underlined in bold text)

Explain the role of sampling in the research process Distinguish between probability and nonprobability sampling Understand the factors to consider

3. When a researcher wants to identify particular types of cases for in-depth investigation; purpose less to generalize to larger population than to g

Sampling Techniques. Esra Akdeniz. February 9th, 2016

Weighting Missing Data Coding and Data Preparation Wrap-up Preview of Next Time. Data Management

Descriptive Statistics Methods of organizing and summarizing any data/information.

Q.1 Define Population Ans In statistical investigation the interest usually lies in the assessment of the general magnitude and the study of

Sampling. General introduction to sampling methods in epidemiology and some applications to food microbiology study October Hanoi

from

Math 31 Lesson Plan. Day 2: Sets; Binary Operations. Elizabeth Gillaspy. September 23, 2011

Sampling in Space and Time. Natural experiment? Analytical Surveys

Announcements. Lecture 5: Probability. Dangling threads from last week: Mean vs. median. Dangling threads from last week: Sampling bias

Part 3: Inferential Statistics

Conditional Probability Solutions STAT-UB.0103 Statistics for Business Control and Regression Models

Sampling. Sampling. Sampling. Sampling. Population. Sample. Sampling unit

The Central Limit Theorem

Probability and Statistics

AP Statistics Review Ch. 7

Error Analysis of Sampling Frame in Sample Survey*

Data Analysis and Statistical Methods Statistics 651

Session IV Instrumental Variables

SAMPLING. PURPOSE: The purpose of this assignment is to address questions related to designing a sampling plan.

CROSS SECTIONAL STUDY & SAMPLING METHOD

Section F Ratio and proportion

KDF2C QUANTITATIVE TECHNIQUES FOR BUSINESSDECISION. Unit : I - V

A General Overview of Parametric Estimation and Inference Techniques.

Population, Sample, and Sampling Techniques. Identify the Unit of Analysis. Unit of Analysis 4/9/2013. Dr. K. A. Korb UniJos

Survey Sample Methods

BECOMING FAMILIAR WITH SOCIAL NETWORKS

Business Statistics:

Unequal Probability Designs

Approach to Field Research Data Generation and Field Logistics Part 1. Road Map 8/26/2016

3.2 Probability Rules

SAMPLING III BIOS 662

CHAPTER 3 PROBABILITY: EVENTS AND PROBABILITIES

Independence 1 2 P(H) = 1 4. On the other hand = P(F ) =

Lecture 14: Secure Multiparty Computation

Calculus II. Calculus II tends to be a very difficult course for many students. There are many reasons for this.

Auditing and Investigations Overview

Auditing and Investigations Overview

Transcription:

Now we will define some common sampling plans and discuss their strengths and limitations. 1

For volunteer samples individuals are self selected. Participants decide to include themselves in the study. Common examples would be internet surveys, restaurant rating cards, call in surveys, etc. These types of samples are almost guaranteed to be biased and hence we cannot generalize the results to a larger group. One should realize that volunteer samples are used often in medical studies as individuals must agree to participate voluntarily and we will see that this is not necessarily problematic in an experiment designed to compare treatments. 2

Individuals in a convenience sample happen to be at the right place at the right time to suit the researcher s schedule. Depending on the questions under study, such a sample may or may not be representative of the larger population of interest. A convenience sample may be susceptible to bias because certain types of individuals are more likely to be selected than others and in extreme cases some individuals may have no chance of being selected. Suppose we use a student organization which is affiliated with a particular political party as our sample. If we wish to study heights and weights of college students, this convenience sample may still provide a relatively representative sample, even though students who are not members of this organization have no chance to be selected. However, if we wish to study opinions of college students on a social topic, you would likely not obtain a representative sample by selecting all students in a student organization that is affiliated with a particular political party. In this case, the fact that a particular party may have a particular stance on social topics could very well produce results which are not representative of the larger student population. 3

We define the sampling frame as the list of potential individuals to be sampled. Population = all individuals of interest. If the sampling frame is not the same as the population, there may be bias arising because of this discrepancy. It is always best to have the sampling frame match the population as closely as possible. However, it is often impossible to achieve a sampling frame which matches the population exactly. 4

In a systematic sample, every n th individual is selected from the sampling frame. In this method, we need to pick a random start point before starting the selection process (otherwise it isn t a random sample at all!!). For example, if we have the entire student roster in alphabetical order for a particular college, and we select every 50 th student from the list after randomizing a starting point, we have a systematic sample. Although these types of samples may not be subject to any clear bias, the fact that individuals with the same last name are not likely to be chosen together in a systematic sample whereas they would be equally like in a completely random sample may cause subtle sources of bias. 5

In a simple random sample, the process is conducted in such a way that every possible sample is equally likely to be chosen. In this case, it is also true that each individual is equally likely to be chosen. Like choosing names out of a hat. For example, if we have the entire student roster in alphabetical order for a particular college, and we select 500 students completely at random. If every student selected participates then we have a sample which is not subject to any bias and should succeed in being representative of the population of interest. On the other hand, if we send the selected students a survey which they then choose whether or not to complete and return. Then we have a simple random sample which is also subject to volunteer response which has some of the same concerns as a volunteer sample. Non response is a large concern with these types of surveys. Sometimes we might follow up with reminders to try to increase the response rate. 6

A simple random sample is the easiest way to base a selection on randomness. However, there are other, more sophisticated, sampling techniques that utilize randomness that are often preferable in real life circumstances due to the complexities of the problems under study and the logistics of carrying out the sampling process. Any plan that relies on random selection is called a probability sampling plan (or technique). Inferential methods require full understanding of the underlying probability of inclusion for each individual in the population (or more likely the sampling frame). 7

Weblink: {http://faculty.elgin.edu/dkernler/statistics/ch01/1 4.html} Now let s talk about two other common sampling plans. The question of which to use usually depends either on the goals of the study or the feasibility of the sampling procedure or both. For both of these methods, we utilize some naturally occurring grouping of the individuals in our population. The difference between these groupings is whether the individuals inside each group are naturally similar to each other or if the groups or more diverse. For stratified sampling it is either easier or is of interest to divide the population into groups called strata inside each strata, individuals are similar. For example, in a class of students, I could stratify by gender and take a simple random sample of 20 students from each gender. In that case, the population is the whole class, the strata are the two groups of students formed by female students and males students respectively. Individuals in the strata are similar in gender. In a case control study there is stratification of the population into cases and controls subjects are then sampled from these two strata. 8

Alternatively, for cluster sampling, the population is again divided up into groups now called clusters however individuals in clusters are diverse and more representative of the population as a whole. Here we randomly select which clusters to sample. In a simple cluster sample, every member of the cluster is chosen. For example, in a survey of college students, we might consider each course a cluster and randomly sample which courses to use in our study. This would be considerably easier than randomly selecting students and then needing to contact and survey each selected student individually. In a survey of people across the U.S., we might cluster people by county, zip code, or even a section of a street although as we will mention shortly, in reality, if our cluster is county or zip code, we still will need to select only a subset of the people in a cluster in most cases. 8

In this illustration, the population is divided naturally by color which represent our strata. Since there are twice as many red (in the middle) as blue or green (on top and bottom), in order to represent the population, we should sample twice as many red as blue or green. Which has been done in the illustrated sample. Even if the sampling is not proportional as long as the proportion is known the proper adjustments can be made if you know how (or know someone who knows how). Complex sampling plans are common in health surveys! One reason for stratification is the possibility that you wish to have a particular number from each strata (equal males and females or twice as many cases as controls). For example, if we take a simple random sample or cluster sample, we would not be guaranteed to get a particular number of individuals in each gender. 9

Here, we have individuals grouped into clusters where the members within each cluster are not necessarily similar although they may be either randomly or due to some particular effect of that cluster. In this case we randomly sample two clusters it is coincidental that in this particular case we still ended up with two reds and one each of blue and green. It could have ended up quite differently. 10

Often students find it difficult to distinguish between these two methods, since they both involve a natural grouping but if you think about Whether the individuals in the groups are similar on some variable of interest or if they are more diverse. Here, Strata contain similar individuals Clusters are more diverse but may be similar in location or some other variable that allows for clustering to occur And whether we randomly sample which groups to select. For strata we almost always sample from every strata For clusters we almost always randomly select only a few clusters to sample 11

Weblink {http://www.cdc.gov/nchs/tutorials/dietary/surveyorientation/surveydesign/info1.htm} Multi stage sampling plans are also common. In multi stage sampling, various sampling methods may be used at any stage resulting in complex probabilities of selecting certain individuals into the sample. Usually simple random sampling is employed repeatedly in choosing clusters to sample from at a particular stage or in choosing which individuals to sample from a particular cluster or strata. In the NHANES example simplistically first the counties are randomly selected then each selected county is broken into segments and a certain number of segments are chosen randomly then each selected segment is divided into households and a random sample of households are selected. Finally within each sampled household an individual is randomly chosen. In our course, the methods we use are traditionally applied for simple random samples. In the case of other sampling methods, adjustments to the standard methods are likely required. The example here is from the NHANES survey on the data information page linked here you can see the warnings about properly accounting for the sampling design. These concepts are definitely beyond the scope of this course but some of you may be working with data like this in your future and will need to learn more about complex sampling and 12

how to adjust for it. 12

We will discuss sample size in more detail in the unit on statistical inference. Here we simply mention a few points. First, our priority is to make sure the sample is representative of the population, by using some form of probability sampling plan Second, it should be reasonable that a larger representative sample will do a better job at estimating the quantities of interest than a smaller one. Since increasing the sample size will inevitably require greater resources, the question will be what is the smallest sample size we could use which will allow us to effectively answer our research question. In practice, we must determine the appropriate sample size very early, if not entirely before, our data collection process. However, the methods which will be used to analyze the data must be known before these calculations can be carried out. 13

In this section, we defined a few common sampling methods. Among these were the probability sampling plans which include: Simple random samples (SRS) Cluster samples Stratified samples Multi stage sampling And to a lesser extent, systematic samples Simple random samples if feasible to conduct will result in a sampling plan that is free from any bias We also discussed non probability sampling plans for which bias is likely to result including: Volunteer samples Convenience samples We defined the sampling frame which is all possible subjects with the possibility to be included in a particular study and discussed how this should match the population as closely as possible. And finally we briefly mentioned the issue of sample size. 14