Sampling Distributions. Ryan Miller

Similar documents
Industrial Electricity

Chapter 7: Sampling Distributions

Do students sleep the recommended 8 hours a night on average?

Chapter 7: Sampling Distributions

Sampling Distribution Models. Chapter 17

Math Lab 10: Differential Equations and Direction Fields Complete before class Wed. Feb. 28; Due noon Thu. Mar. 1 in class

Lecture 10A: Chapter 8, Section 1 Sampling Distributions: Proportions

Randomized Decision Trees

Opening. Monster Guard. Teacher s Guide

Lecture 2: Linear regression

THE SAMPLING DISTRIBUTION OF THE MEAN

Big-oh stuff. You should know this definition by heart and be able to give it,

Unit 22: Sampling Distributions

Matter, Force, Energy, Motion, and the Nature of Science (NOS)

STA 291 Lecture 16. Normal distributions: ( mean and SD ) use table or web page. The sampling distribution of and are both (approximately) normal

Successful completion of the core function transformations unit. Algebra manipulation skills with squares and square roots.

C. Graph the solution to possibilities for Sharmara s number and give the solution in interval notation.

Physics 351 Wednesday, January 10, 2018

Warm Up. Fourth Grade Released Test Question: 1) Which of the following has the greatest value? 2) Write the following numbers in expanded form: 25:

Statistics 511 Additional Materials

Physics 162b Quantum Mechanics

Solving with Absolute Value

Math Lecture 3 Notes

7.1: What is a Sampling Distribution?!?!

+ Check for Understanding

Review of the Normal Distribution

Chapter 12: Inference about One Population

Volume vs. Diameter. Teacher Lab Discussion. Overview. Picture, Data Table, and Graph

Week 5: Logistic Regression & Neural Networks

Grade 7/8 Math Circles February 24/25, 2015 Variables in Real Life: Kinematics Solutions

STA Module 8 The Sampling Distribution of the Sample Mean. Rev.F08 1

Steps to take to do the descriptive part of regression analysis:

Boyle s Law and Charles Law Activity

MAT Mathematics in Today's World

FoSSil Puzzler (1 Hour)

appstats8.notebook October 11, 2016

What Is a Sampling Distribution? DISTINGUISH between a parameter and a statistic

STA Why Sampling? Module 6 The Sampling Distributions. Module Objectives

4.2 The Normal Distribution. that is, a graph of the measurement looks like the familiar symmetrical, bell-shaped

Math Fundamentals for Statistics I (Math 52) Unit 7: Connections (Graphs, Equations and Inequalities)

Math 361. Day 3 Traffic Fatalities Inv. A Random Babies Inv. B

Acceleration 1-D Motion for Calculus Students (90 Minutes)

Astronomical Distances. Astronomical Distances 1/30

Algorithms and Programming I. Lecture#1 Spring 2015

Chapter 7: Sampling Distributions

Chapter 6. Net or Unbalanced Forces. Copyright 2011 NSTA. All rights reserved. For more information, go to

Solving Quadratic & Higher Degree Equations

Nondeterministic finite automata

Lecture 14 Singular Value Decomposition

23.3. Sampling Distributions. Engage Sampling Distributions. Learning Objective. Math Processes and Practices. Language Objective

Discrete Mathematics and Probability Theory Spring 2016 Rao and Walrand Note 14

Vectors. Vector Practice Problems: Odd-numbered problems from

ED 357/358 - FIELD EXPERIENCE - LD & EI LESSON DESIGN & DELIVERY LESSON PLAN #4

Solving Quadratic & Higher Degree Equations

Lesson 5: Measuring Variability for Symmetrical Distributions

Lesson Adaptation Activity: Developing and Using Models

Sampling Distributions

Math 440 Project Assignment

Lecture 11: Information theory THURSDAY, FEBRUARY 21, 2019

Multiple Representations: Equations to Tables and Graphs Transcript

1 Modular Arithmetic Grade Level/Prerequisites: Time: Materials: Preparation: Objectives: Navajo Nation Math Circle Connection

Business Statistics. Lecture 5: Confidence Intervals

Deductive and Inductive Logic

Chapter 1: Introduction. Material from Devore s book (Ed 8), and Cengagebrain.com

An Introduction to Proofs in Mathematics

Computational Cognitive Science

STAT/SOC/CSSS 221 Statistical Concepts and Methods for the Social Sciences. Random Variables

Student Outcomes. Lesson Notes. Classwork. Example 1 (5 minutes) Students apply knowledge of geometry to writing and solving linear equations.

Lecture Slides. Elementary Statistics Twelfth Edition. by Mario F. Triola. and the Triola Statistics Series. Section 3.1- #

Predicting the Treatment Status

Course Review. Kin 304W Week 14: April 9, 2013

LECTURE 15: SIMPLE LINEAR REGRESSION I

Week 11 Sample Means, CLT, Correlation

Student Activity: Finding Factors and Prime Factors

STA Module 4 Probability Concepts. Rev.F08 1

Morpheus: Neo, sooner or later you re going to realize, just as I did, that there s a difference between knowing the path, and walking the path.

Study and research skills 2009 Duncan Golicher. and Adrian Newton. Last draft 11/24/2008

Community in the Classroom Presentation Plan

1-1. Chapter 1. Sampling and Descriptive Statistics by The McGraw-Hill Companies, Inc. All rights reserved.

EXPLORING CHORDS. Q1. Draw and label a radius on the circle. How does a chord compare with a radius? What are their similarities and differences?

Prof. Bodrero s Guide to Derivatives of Trig Functions (Sec. 3.5) Name:

How to write maths (well)

Sampling. What is the purpose of sampling: Sampling Terms. Sampling and Sampling Distributions

UC Irvine FOCUS! 5 E Lesson Plan Title: Marble Isotope Lab Grade Level and Course: 8 th Grade Physical Science and 9-12 High School Chemistry

Why write proofs? Why not just test and repeat enough examples to confirm a theory?

This activity will help students to differentiate between living and non-living and to identify characteristics of living things.

Math (P)Review Part I:

Chapter 7: Sampling Distributions

Independence 1 2 P(H) = 1 4. On the other hand = P(F ) =

Physics 9A Syllabus. Desired Results

Confidence intervals

MATH 1150 Chapter 2 Notation and Terminology

Symbolic Logic Outline

Terminology for Statistical Data

Big Bang, Black Holes, No Math

PHYSICS LAB: CONSTANT MOTION

CSC321 Lecture 4 The Perceptron Algorithm

Example. 6.6 Fixed-Level Testing 6.7 Power ( )

Performance script for sixth graders By Thomas Kuo and Kimberly Kline LEAPS Fellows, University of California, Santa Barbara

Polynomials; Add/Subtract

Transcription:

Sampling Distributions Ryan Miller 1 / 18

Statistical Inference A major goal of statistics is inference, or using a sample to learn about a population. Today we will walk through the train-of-thought of how statisticians developed formal approaches to inference. In today s activity, the population will be the end of semester grades (percentages) of my previous Sta-209 students I won t give you the population, but I ll let you take as many random samples of size n = 10 as you want The goal will be to find a logically sound method for describing an aspect of the population in the more realistic setting where we only have one random sample 2 / 18

Distributions If we want to learn about a population, the most informative thing we could possibly ask for is the full population distribution ie: a histogram or dotplot of all the end of semester grades Instead, for various reasons, we typically focus on a single number that summarizes an aspect of the population distribution that we re most interested in Thinking about our population of interest, what summary measures might we want to know? 3 / 18

Estimation One goal of statistical inference is estimation: Suppose we re interested in the mean of the population (µ) How might we estimate µ from a random sample? How certain are we that this estimate will be close to µ? It is logical to estimate µ using x; but with only a single sample, the accuracy of our estimate is a bit of a mystery. However, if we repeatedly draw random samples, we can study how an estimate will behave! 4 / 18

Sampling Distribution Activity - Directions For this activity we will use a statistics program known as R, this is likely the only time we ll use R in this class, but it is software of choice for many statisticians. 1. Open RStudio and type: source( https://remiller1450.github.io/s209s19/funs.r ) 2. Enter sample_grades() to generate a random sample of 10 student s end of semester grades 3. Find the mean of your random sample and record it on the dotplot on the board 4. Repeat steps 1 and 2 until you ve recorded the means of six different random samples on the board This dotplot represents the distribution of different sample means that we could pontentially see when taking a random sample of size n = 10 from this population. With your group, discuss why it might be valuable to study this distribution (think about inference and estimation). 5 / 18

Sampling Distribution Activity - Some Questions 1. Based off the sampling distribution (the dotplot on the board), what do you think µ is? 2. Had you only collected a single random sample of size 10, what value do you think is most likely to be that sample s mean? 3. How much variability is there in the different sample means that we could possibly see? 4. Could we use this to provide an interval estimate of µ? 6 / 18

Sampling Distribution Activity - Answers 1. Assuming the samples are representative, µ is the center of the sampling distribution! This is because the sample statistic x is unbiased 2. We are most likely to see a sample mean at the very center of sampling distribution, so µ is the most likely mean of any particular sample 3. We can assess the variability of the possible sample means that we could see by looking at the standard deviation of the sampling distribution, this is called the standard error (SE) of the sample mean. 4. We could provide estimates of µ that look like x ± b SE. The 68-95-99 rule can help us choose b (at least for sampling distributions with a certain shape) 7 / 18

The Role of Sample Size The sampling distribution depends upon both: 1. The parameters of the population distribution 2. The size of the sample, and how it was collected We ll investigate the role of sample size using StatKey, a free online companion to the Lock5 textbook: StatKey Link We ll look at the NFL Contracts dataset that comes pre-loaded in StatKey StatKey allows us to quickly generate many random samples from a dataset 8 / 18

The Role of Sample Size Sampling distribution of x for 1000 samples of size n = 10 9 / 18

The Role of Sample Size Sampling distribution of x for 1000 samples of size n = 30 10 / 18

The Role of Sample Size Sampling distribution of x for 1000 samples of size n = 100 11 / 18

The Role of Sample Size Sampling distribution of x for 1000samples of size n = 1000 12 / 18

The Role of Sample Size Sampling distribution of x when the entire population is sampled 13 / 18

The Role of Sample Size As the size of our sample increases, the standard error, denoted SE, of our sample statistic decreases Standard error is the standard deviation of a sample statistic (ie: it describes variability in the sampling distribution) 14 / 18

Sampling Bias Quarterbacks represent 4.3% of NFL players but often to receive a disproportionate amount of attention and also tend to be paid higher salaries than other positions Suppose we sample in a way that makes QBs four times more likely to be sampled than other positions, how might this influence our samples? What if QBs were ten times more likely to be sampled? 15 / 18

Sampling Bias Histogram of Sample Means Frequency 0 50 100 150 Unbiased sampling QBs overrepresented (4x) 0 1 2 3 4 5 6 Sample Mean 16 / 18

Sampling Bias Histogram of Sample Means Frequency 0 50 100 150 Unbiased sampling QBs overrepresented (10x) 0 1 2 3 4 5 6 Sample Mean 17 / 18

Conclusion Right now you should... 1. Understand the relationships between the population distribution, the sample distribution, and the sampling distribution 2. Be comfortable with the terminology of parameters and statistics 3. Understand, when we only have one sample, the sample statistic is our best guess at the population parameter 4. Understand the impact of bias and sample size (variability) on the sampling distribution These notes cover Section 3.1 of the textbook, I encourage you to read through the section and its examples 18 / 18