Sampling Distributions

Similar documents
Probably About Probability p <.05. Probability. What Is Probability?

Unit 4 Probability. Dr Mahmoud Alhussami

Review of probabilities

Math 10 - Compilation of Sample Exam Questions + Answers

Physics 1140 Lecture 6: Gaussian Distributions

Sampling Distributions: Central Limit Theorem

Probability and Inference. POLI 205 Doing Research in Politics. Populations and Samples. Probability. Fall 2015

Edexcel past paper questions

Sample Exam #1 Probability and Statistics I

Inferential Statistics

Announcements. Lecture 5: Probability. Dangling threads from last week: Mean vs. median. Dangling threads from last week: Sampling bias

2 and F Distributions. Barrow, Statistics for Economics, Accounting and Business Studies, 4 th edition Pearson Education Limited 2006

Probability and Independence Terri Bittner, Ph.D.

Senior Math Circles November 19, 2008 Probability II

The Geometric Distribution

Probability & Statistics - FALL 2008 FINAL EXAM

MAT 271E Probability and Statistics

Chapter 23: Inferences About Means

Semester 2 Final Exam Review Guide for AMS I

14 - PROBABILITY Page 1 ( Answers at the end of all questions )

P [(E and F )] P [F ]

Probability Experiments, Trials, Outcomes, Sample Spaces Example 1 Example 2

Business Statistics: A Decision-Making Approach, 6e. Chapter Goals

A brief review of basics of probabilities

Course: ESO-209 Home Work: 1 Instructor: Debasis Kundu

Business Statistics. Lecture 3: Random Variables and the Normal Distribution

Math 105 Course Outline

4.4-Multiplication Rule: Basics

The probability of an event is viewed as a numerical measure of the chance that the event will occur.

Presentation on Theo e ry r y o f P r P o r bab a il i i l t i y

Tutorial 3: Random Processes

MAT Mathematics in Today's World

STA 247 Solutions to Assignment #1

University of California, Berkeley, Statistics 134: Concepts of Probability. Michael Lugo, Spring Exam 1

Solutionbank S1 Edexcel AS and A Level Modular Mathematics

Lecture Topic 4: Chapter 7 Sampling and Sampling Distributions

Probability and Statistics

Statistical Methods for the Social Sciences, Autumn 2012

Probability Rules. MATH 130, Elements of Statistics I. J. Robert Buchanan. Fall Department of Mathematics

MATH 3C: MIDTERM 1 REVIEW. 1. Counting

P (A B) P ((B C) A) P (B A) = P (B A) + P (C A) P (A) = P (B A) + P (C A) = Q(A) + Q(B).

Name: Exam 2 Solutions. March 13, 2017

Lecture Lecture 5

Econ 113. Lecture Module 2

Tutorial 2: Probability

13-5 Probabilities of Independent and Dependent Events

Probability, For the Enthusiastic Beginner (Exercises, Version 1, September 2016) David Morin,

Outline. Probability. Math 143. Department of Mathematics and Statistics Calvin College. Spring 2010

Mathematical Foundations of Computer Science Lecture Outline October 18, 2018

Statistics 100 Exam 2 March 8, 2017

t-test for b Copyright 2000 Tom Malloy. All rights reserved. Regression

k P (X = k)

COSC 341 Human Computer Interaction. Dr. Bowen Hui University of British Columbia Okanagan

Probability Year 9. Terminology

Introduction to Statistical Data Analysis Lecture 4: Sampling

Lecture 10: Probability distributions TUESDAY, FEBRUARY 19, 2019

Intermediate Math Circles November 8, 2017 Probability II

Statistics 1. Edexcel Notes S1. Mathematical Model. A mathematical model is a simplification of a real world problem.

2. A Basic Statistical Toolbox

Math 1313 Experiments, Events and Sample Spaces

Principles of Mathematics 12

SECOND UNIVERSITY EXAMINATION

Topic 3 Random variables, expectation, and variance, II

MATH 3510: PROBABILITY AND STATS July 1, 2011 FINAL EXAM

CS 361: Probability & Statistics

10/4/2013. Hypothesis Testing & z-test. Hypothesis Testing. Hypothesis Testing

Random processes. Lecture 17: Probability, Part 1. Probability. Law of large numbers

Probability Year 10. Terminology

CMPSCI 240: Reasoning about Uncertainty

Notes 3: Statistical Inference: Sampling, Sampling Distributions Confidence Intervals, and Hypothesis Testing

Random Variable. Discrete Random Variable. Continuous Random Variable. Discrete Random Variable. Discrete Probability Distribution

SS257a Midterm Exam Monday Oct 27 th 2008, 6:30-9:30 PM Talbot College 342 and 343. You may use simple, non-programmable scientific calculators.

4.2 Probability Models

Discussion 03 Solutions

PRACTICE PROBLEMS FOR EXAM 1

PHYS 275 Experiment 2 Of Dice and Distributions

Chapter 5 : Probability. Exercise Sheet. SHilal. 1 P a g e

Statistics and Quantitative Analysis U4320. Segment 5: Sampling and inference Prof. Sharyn O Halloran

Lecture 5: Introduction to Markov Chains

Chapter 7: Section 7-1 Probability Theory and Counting Principles

Edexcel GCE Statistics S1 Advanced/Advanced Subsidiary

Probability Long-Term Memory Review Review 1

SAMPLING DISTRIBUTIONS

Lecture 3. January 7, () Lecture 3 January 7, / 35

Problems from Probability and Statistical Inference (9th ed.) by Hogg, Tanis and Zimmerman.

COVENANT UNIVERSITY NIGERIA TUTORIAL KIT OMEGA SEMESTER PROGRAMME: ECONOMICS

Example. What is the sample space for flipping a fair coin? Rolling a 6-sided die? Find the event E where E = {x x has exactly one head}

Discrete and continuous

PRACTICE PROBLEMS FOR EXAM 2

MATH Solutions to Probability Exercises

36-309/749 Math Review 2014

Arkansas Tech University MATH 3513: Applied Statistics I Dr. Marcel B. Finan

Lecture 6 Probability

Computations - Show all your work. (30 pts)

MATH 151, FINAL EXAM Winter Quarter, 21 March, 2014

STAT200 Elementary Statistics for applications

ACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 2 MATH00040 SEMESTER / Probability

Week 04 Discussion. a) What is the probability that of those selected for the in-depth interview 4 liked the new flavor and 1 did not?

2.2 Grouping Data. Tom Lewis. Fall Term Tom Lewis () 2.2 Grouping Data Fall Term / 6

STA Module 4 Probability Concepts. Rev.F08 1

Transcription:

Sampling Error As you may remember from the first lecture, samples provide incomplete information about the population In particular, a statistic (e.g., M, s) computed on any particular sample drawn from a population will tend to differ from the population parameter (e.g., µ, σ) it represents. This discrepancy between a sample statistic (e.g., M) and its corresponding population parameter (e.g., μ) results from sampling error Recall that sampling error is the variability of a statistic from sample to sample due to chance

The Sampling Distribution of the Mean Sampling Distributions Due to sampling error, the statistics computed from sample to sample drawn from a fixed population vary. The sampling distribution of a statistic describes this variation For example, the sampling distribution of the mean describes how the mean (M) computed for a sample of size n varies from one sample to the next when the underlying population distribution remains the same.

Example 1: Six-Sided Dice Let s consider a fair 6-sided die The population of interest is the set of all rolls. Since there is no limit to the number of times you can roll a die, the size of the population N is potentially infinite A sample is a set of n rolls There are 6 possible discrete outcomes {1,2,3,4,5,6}, each of which is equally likely, therefore the mean population outcome is 3.5 Let s consider the mean outcome for a sample of n rolls

Example 1: Six-Sided Dice Theoretical (analytical) distribution for single roll of a fair 6-sided die μ = 3.5 σ = 1.71

Example 1: Six-Sided Dice Let s try it. Roll your die once and enter the resulting score using your clicker

Example 1: Six-Sided Dice Now, let s compute the sample mean (M) for a sample of size 2 1. Roll your die twice 2. Compute the mean of the two rolls and round to the nearest whole number 3. Enter the resulting score using your clicker

Example 1: Six-Sided Dice Now, let s compute the sample mean (M) for a sample of size 4 1. Roll your die four times 2. Compute the mean of the 4 rolls and round to the nearest whole number 3. Enter the resulting score using your clicker

Example 1: Six-Sided Dice Finally, let s compute the sample mean (M) for a sample of size 8 (This is the last one, I promise) 1. Roll your die eight times 2. Compute the mean of the 8 rolls and round to the nearest whole number 3. Enter the resulting score using your clicker

Example 1: Six-Sided Dice

Example 1: Six-Sided Dice

Example 1: Six-Sided Dice

Example 1: Six-Sided Dice

Example 1: Six-Sided Dice

Central Limit Theorem Thanks to the central limit theorem we can compute the sampling distribution of the mean without having to actually draw samples and compute sample means. Central limit theorem: Given a population with mean µ and standard deviation σ, the sampling distribution of the mean (i.e., the distribution of sample means) will itself have a mean of µ and a standard deviation (standard error) of / n Furthermore, whatever the distribution of the parent population, this sampling distribution will approach the normal distribution as the sample size (n) increases.

Central Limit Theorem Conceptually, we can break up the theorem into three parts: 1. The mean (µ M ) of a population of sample means (M) is equal to the mean (µ) of the underlying population of scores 2. The standard deviation (σ M ) of a population of sample means (called the standard error) becomes smaller as the size of the sample (n, the number of scores used to compute M) increases 3. Whatever the distribution of the underlying scores, the sampling distribution of the mean will approach the normal distribution as the sample size (n) increases

The Mean & Variance as Estimators Sampling Distributions The expected value of a statistic is its long-range average over repeated sampling Notated as E(). For example the expected value of s is denoted E(s) In statistics, bias is a property of a statistic whose expected value does not equal the parameter it represents M is an unbiased estimator of μ (i.e., E(M) = μ) s 2 is an unbiased estimator of σ 2 (i.e., E(s 2 ) = σ 2 ) In statistics, minimum variance (or efficiency) is a property of a statistic whose long-range variability is as small as possible M is a minimum variance estimator of μ s 2 is not a minimum variance estimator of σ 2

Standard Error Just as the standard deviation (σ)of a population of scores provides a measure of the average distance between an individual score (x) and the population mean (µ), the standard error (σ M ) provides a measure of the average distance between the sample mean (M) and the population mean (µ). M n

Example 2: Exam Scores Student Score 1 79 2 61 3 67 4 68 5 57 266 75 267 70 268 73 269 84 270 75

Example 2: Exam Scores Sample 1 M 74.2 Student Score 155 74 120 67 66 69 216 77 188 84 (M)

Sampling Distribution of the Mean Sampling Distributions Sample 1 M 74.2 Sample 2 M 73.4 Student Score 155 74 120 67 66 69 216 77 188 84 Student Score 237 68 192 76 101 78 109 69 180 76 (M)

Example 2: Exam Scores Sample 1 M 74.2 Sample 2 M 73.4 Student Score 155 74 120 67 66 69 216 77 188 84 Student Score 237 68 192 76 101 78 109 69 180 76 Sample 3 M 67.6 Student Score 221 65 85 70 223 71 48 63 40 69 (M)

Example 2: Exam Scores Sample 1 M 74.2 Student Score 155 74 120 67 66 69 216 77 188 84 Remember: 70.0, 7.0 250 samples (with N=5) x x Sample 2 M 73.4 Sample 3 M 67.6 Student Score 237 68 192 76 101 78 109 69 180 76 Student Score 221 65 85 70 223 71 48 63 40 69 M M 70. 0 7 3.13 5 (M)

p(m) Sampling Distributions Sample Size & Standard Error n 1 M 7 n 4 M 3. 5 4 n 16 M 1.75 16 (M)

The Sampling Distributions of s 2 and s Sampling Distributions Sample 1 M 74.2 Sample 2 M 73.4 Student Score 155 74 120 67 66 69 216 77 188 84 Student Score 237 68 192 76 101 78 109 69 180 76 s 2 45.70 s 6.76 s 2 20.80 s 4.56 7 n 5 Sample 3 M 67.6 Student Score 221 65 85 70 223 71 48 63 40 69 s 2 11.80 s 3.44

Standardizing the Sampling Distribution Sampling Distributions Because the distribution of sample means tends to be normal, the z-score value obtained for a sample mean can be used with the unit normal table to obtain probabilities. The procedures for computing z-scores and finding probabilities for sample means are essentially the same as we used for individual scores However, when computing z-scores for sample means, you must remember to consider the sample size (n) and use the standard error (σ M ) instead of the standard deviation (σ)

Standardizing the Sampling Distribution Sampling Distributions Remember, to compute z-scores for individual values: z x x To compute z-statistics for sample means: z M M M M n, because M n

Sample problems: For a population of exam scores with μ = 70 and σ = 10: What is the probability of obtaining M between 65 and 75 for a sample of 4 scores? What is the probability of obtaining M > 75 for a sample of 16 scores?

z 0.00 0.01 0.02 0.03 0.04 0.05 0.06 0.07 0.08 0.09 0.0 0.5000 0.4960 0.4920 0.4880 0.4840 0.4801 0.4761 0.4721 0.4681 0.4641 0.1 0.4602 0.4562 0.4522 0.4483 0.4443 0.4404 0.4364 0.4325 0.4286 0.4247 0.2 0.4207 0.4168 0.4129 0.4090 0.4052 0.4013 0.3974 0.3936 0.3897 0.3859 0.3 0.3821 0.3783 0.3745 0.3707 0.3669 0.3632 0.3594 0.3557 0.3520 0.3483 0.4 0.3446 0.3409 0.3372 0.3336 0.3300 0.3264 0.3228 0.3192 0.3156 0.3121 0.5 0.3085 0.3050 0.3015 0.2981 0.2946 0.2912 0.2877 0.2843 0.2810 0.2776 0.6 0.2743 0.2709 0.2676 0.2643 0.2611 0.2578 0.2546 0.2514 0.2483 0.2451 0.7 0.2420 0.2389 0.2358 0.2327 0.2296 0.2266 0.2236 0.2206 0.2177 0.2148 0.8 0.2119 0.2090 0.2061 0.2033 0.2005 0.1977 0.1949 0.1922 0.1894 0.1867 0.9 0.1841 0.1814 0.1788 0.1762 0.1736 0.1711 0.1685 0.1660 0.1635 0.1611 1.0 0.1587 0.1562 0.1539 0.1515 0.1492 0.1469 0.1446 0.1423 0.1401 0.1379 1.1 0.1357 0.1335 0.1314 0.1292 0.1271 0.1251 0.1230 0.1210 0.1190 0.1170 1.2 0.1151 0.1131 0.1112 0.1093 0.1075 0.1056 0.1038 0.1020 0.1003 0.0985 1.3 0.0968 0.0951 0.0934 0.0918 0.0901 0.0885 0.0869 0.0853 0.0838 0.0823 1.4 0.0808 0.0793 0.0778 0.0764 0.0749 0.0735 0.0721 0.0708 0.0694 0.0681 1.5 0.0668 0.0655 0.0643 0.0630 0.0618 0.0606 0.0594 0.0582 0.0571 0.0559 1.6 0.0548 0.0537 0.0526 0.0516 0.0505 0.0495 0.0485 0.0475 0.0465 0.0455 1.7 0.0446 0.0436 0.0427 0.0418 0.0409 0.0401 0.0392 0.0384 0.0375 0.0367 1.8 0.0359 0.0351 0.0344 0.0336 0.0329 0.0322 0.0314 0.0307 0.0301 0.0294 1.9 0.0287 0.0281 0.0274 0.0268 0.0262 0.0256 0.0250 0.0244 0.0239 0.0233 2.0 0.0228 0.0222 0.0217 0.0212 0.0207 0.0202 0.0197 0.0192 0.0188 0.0183 2.1 0.0179 0.0174 0.0170 0.0166 0.0162 0.0158 0.0154 0.0150 0.0146 0.0143 2.2 0.0139 0.0136 0.0132 0.0129 0.0125 0.0122 0.0119 0.0116 0.0113 0.0110 2.3 0.0107 0.0104 0.0102 0.0099 0.0096 0.0094 0.0091 0.0089 0.0087 0.0084 2.4 0.0082 0.0080 0.0078 0.0075 0.0073 0.0071 0.0069 0.0068 0.0066 0.0064 2.5 0.0062 0.0060 0.0059 0.0057 0.0055 0.0054 0.0052 0.0051 0.0049 0.0048 2.6 0.0047 0.0045 0.0044 0.0043 0.0041 0.0040 0.0039 0.0038 0.0037 0.0036 2.7 0.0035 0.0034 0.0033 0.0032 0.0031 0.0030 0.0029 0.0028 0.0027 0.0026 2.8 0.0026 0.0025 0.0024 0.0023 0.0023 0.0022 0.0021 0.0021 0.0020 0.0019 2.9 0.0019 0.0018 0.0018 0.0017 0.0016 0.0016 0.0015 0.0015 0.0014 0.0014 Upper-Tail Probabilities