Chapter 11 Sampling Distribution. Stat 115

Similar documents
Small-Sample CI s for Normal Pop. Variance

The Chi-Square Distributions

Continuous Random Variables

II. The Normal Distribution

IV. The Normal Distribution

The Chi-Square Distributions

EC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix)

Ch. 7. One sample hypothesis tests for µ and σ

Lecture 2: Review of Probability

Will Landau. Feb 28, 2013

Continuous random variables

Math438 Actuarial Probability

Distributions of Functions of Random Variables. 5.1 Functions of One Random Variable

IV. The Normal Distribution

Stat 704 Data Analysis I Probability Review

IAM 530 ELEMENTS OF PROBABILITY AND STATISTICS LECTURE 3-RANDOM VARIABLES

Preliminary Statistics. Lecture 3: Probability Models and Distributions

STAT Chapter 5 Continuous Distributions

STA 4322 Exam I Name: Introduction to Statistics Theory

Purposes of Data Analysis. Variables and Samples. Parameters and Statistics. Part 1: Probability Distributions

" M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2

Course: ESO-209 Home Work: 1 Instructor: Debasis Kundu

Sampling Distributions of Statistics Corresponds to Chapter 5 of Tamhane and Dunlop

Continuous r.v. s: cdf s, Expected Values

CONTINUOUS RANDOM VARIABLES

Learning Objectives for Stat 225

Arkansas Tech University MATH 3513: Applied Statistics I Dr. Marcel B. Finan

SOME SPECIFIC PROBABILITY DISTRIBUTIONS. 1 2πσ. 2 e 1 2 ( x µ

Review of Statistics 101

Random variables, Expectation, Mean and Variance. Slides are adapted from STAT414 course at PennState

STAT 414: Introduction to Probability Theory

Continuous Distributions

MATH Notebook 5 Fall 2018/2019

Course information: Instructor: Tim Hanson, Leconte 219C, phone Office hours: Tuesday/Thursday 11-12, Wednesday 10-12, and by appointment.

Review of Probability. CS1538: Introduction to Simulations

The Central Limit Theorem

EXAM. Exam #1. Math 3342 Summer II, July 21, 2000 ANSWERS

i=1 X i/n i=1 (X i X) 2 /(n 1). Find the constant c so that the statistic c(x X n+1 )/S has a t-distribution. If n = 8, determine k such that

Chapter 4: Continuous Probability Distributions

Random Variables and Their Distributions

Other Continuous Probability Distributions

Expectation, Variance and Standard Deviation for Continuous Random Variables Class 6, Jeremy Orloff and Jonathan Bloom

Chapter 23: Inferences About Means

Some worked exercises from Ch 4: Ex 10 p 151

Continuous Random Variables. and Probability Distributions. Continuous Random Variables and Probability Distributions ( ) ( ) Chapter 4 4.

MATH 117 Statistical Methods for Management I Chapter Three

Week 1 Quantitative Analysis of Financial Markets Distributions A

Quiz 2 covered materials in Chapter 5 Chapter 6 Chapter 7. Normal Probability Distribution. Continuous Probability. distribution 11/9/2010.

Math 3339 Homework 6 (Sections )

Probability and Distributions

CHAPTER 6 SOME CONTINUOUS PROBABILITY DISTRIBUTIONS. 6.2 Normal Distribution. 6.1 Continuous Uniform Distribution

Homework for 1/13 Due 1/22

Chapter 4. Probability Distributions Continuous

Continuous Random Variables. and Probability Distributions. Continuous Random Variables and Probability Distributions ( ) ( )

Probability Distributions

To find the median, find the 40 th quartile and the 70 th quartile (which are easily found at y=1 and y=2, respectively). Then we interpolate:

Mock Exam - 2 hours - use of basic (non-programmable) calculator is allowed - all exercises carry the same marks - exam is strictly individual

Institute of Actuaries of India

Contents. Acknowledgments. xix

Chapter 2. Random Variable. Define single random variables in terms of their PDF and CDF, and calculate moments such as the mean and variance.

MATH Solutions to Probability Exercises

Chapter 2: Fundamentals of Statistics Lecture 15: Models and statistics

STAT 430/510 Probability Lecture 12: Central Limit Theorem and Exponential Distribution

FREQUENCY DISTRIBUTIONS AND PERCENTILES

6.2 Normal Distribution. Ziad Zahreddine

MVE055/MSG Lecture 8

Discrete Probability distribution Discrete Probability distribution

ECON Fundamentals of Probability

Lecture Notes 5 Convergence and Limit Theorems. Convergence with Probability 1. Convergence in Mean Square. Convergence in Probability, WLLN

Using Tables and Graphing Calculators in Math 11

Solutionbank S1 Edexcel AS and A Level Modular Mathematics

Properties of Continuous Probability Distributions The graph of a continuous probability distribution is a curve. Probability is represented by area

3 Modeling Process Quality

Line of symmetry Total area enclosed is 1

STAT509: Continuous Random Variable

3 Continuous Random Variables

Theoretical Probability Models

Uniform Distribution. Uniform Distribution. Uniform Distribution. Graphs of Gamma Distributions. Gamma Distribution. Continuous Distributions CD - 1

Econ 325: Introduction to Empirical Economics

Practice Problems Section Problems

1 Probability Distributions

Preliminary Statistics Lecture 3: Probability Models and Distributions (Outline) prelimsoas.webs.com

p. 6-1 Continuous Random Variables p. 6-2

MATH4427 Notebook 4 Fall Semester 2017/2018

STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots. March 8, 2015

7 Random samples and sampling distributions

Random Variables. Random variables. A numerically valued map X of an outcome ω from a sample space Ω to the real line R

Test Problems for Probability Theory ,

Histograms, Central Tendency, and Variability

A Probability Primer. A random walk down a probabilistic path leading to some stochastic thoughts on chance events and uncertain outcomes.

Modules 1-2 are background; they are the same for regression analysis and time series.

Probability theory for Networks (Part 1) CS 249B: Science of Networks Week 02: Monday, 02/04/08 Daniel Bilar Wellesley College Spring 2008

STAT100 Elementary Statistics and Probability

Review of Statistics

STATISTICS 1 REVISION NOTES

Introduction and Overview STAT 421, SP Course Instructor

Asymptotic Statistics-VI. Changliang Zou

Chapter Learning Objectives. Probability Distributions and Probability Density Functions. Continuous Random Variables

Continuous Random Variables

STAT 418: Probability and Stochastic Processes

Transcription:

Chapter 11 Sampling Distribution Stat 115 1

Definition 11.1 : Random Sample (finite population) Suppose we select n distinct elements from a population consisting of N elements, using a particular probability sampling method. Let X 1 =measure taken from the 1 st element in the sample X 2 =measure taken from the 2 nd element in the sample X n =measure taken from the n th element in the sample Then, (X 1, X 2,,X n ) is called a random sample of size n from a finite population. 2

Definition 11.2 : Random Sample (infinite population) Let X 1 =measure taken from the 1 st element in the sample X 2 =measure taken from the 2 nd element in the sample X n =measure taken from the n th element in the sample Then (X 1, X 2,,X n ) is called a random sample of size n from an infinite population if the values of X 1, X 2,,X n are n independent observations generated from the same cumulative distribution function (cdf), F(.). This common cdf or its corresponding probability mass/density function, f(.), is called the parent population or the distribution of the population. 3

Definition 11.3: Statistic Suppose (X 1, X 2,,X n ) is a random sample. A statistic is a random variable that is a function of X 1, X 2,,X n. 4

Example 11.1: Statistic Suppose (X 1, X 2,,X n ) is a random sample. n X i a) i= 1 X = is a random variable that is a n function of X 1, X 2,,X n. Thus, X is a statistic. n ( X X ) 2 i 2 i= 1 b) S = n 1 is a random variable that is a function of X 1, X 2,,X n. Thus, S 2 is a statistic. 5

Definition 1.4: Sampling Distribution of a Statistic The sampling distribution of a statistic is its probability distribution. 6

Example 1: Sampling Distribution of the Mean Consider 4 observations making up the population values of a random variable X having the probability distribution f(x) = ¼, x = 0,1,2,3 Note that μ = E(X) = 3/2 and σ 2 = Var(X) = 5/4 7

Con t of Example Suppose we list all possible samples of size 2, with replacement, and for each sample compute for the value of the sample mean, : X No. Sample X No. Sample X 1 2 3 4 5 6 7 8 0,0 0,1 0,2 0,3 1,0 1,1 1,2 1,3 0.0 0.5 1.0 1.5 0.5 1.0 1.5 2.0 9 10 11 12 13 14 15 16 2,0 2,1 2,2 2,3 3,0 3,1 3,2 3,3 1.0 1.5 2.0 2.5 1.5 2.0 2.5 3.0 8

x = ) Con t of Example Find the sampling distribution of the mean. X P( = ) X x 9

Read Examples 11.2, 11.3, and 11.4 10

Definition 11.5: Standard error The standard deviation of a statistic is called its standard error. Standard error is a measure of reliability of the statistic. 11

Notes on Standard Error A small standard error indicates that the computed values of our statistic in the different samples generated are close to one another, so that even if we know that the value of a statistic varies from one sample to another, a small standard error gives us an assurance that at least the variation among their values is not too large. A small standard error means that the realized values of the statistic under repeated sampling are, on the average, very close to the average value of the statistic. 12

Theorem 11.1 If (X 1, X 2,,X n ) is a random sample from a finite population of size N and whose values are generated using simple random sampling without replacement then E( X )=µ and 2 N σ Var( X N n )=, where X i and = N 2 i= 1 σ n ( X μ) i N 2 N 1. i = 1 μ = N 13

Theorem 11.2 If (X 1, X 2,,X n ) is a random sample from an infinite population then E( X )=µ and 2 Var( )= σ, where μ = EX ( i ) and σ X n = EX ( i μ). 2 2 14

Using Example 1 Find the mean of and standard error of X X 15

Read Examples 11.6, 11.7 16

Some Notes on Theorems 11.1 and 11.2 Both theorems state that E( X ) = μ. This means that in both sampling schemes, the values of X in the different samples generated are all centered about μ, the parameter we are estimating. The standard error of is smaller for populations where σ 2 is small. X 17

Some Notes on Theorems 11.1 and 11.2 Theorems 11.1 and 11.2 provide us with mathematical evidence to our claim in Chapter 3 that our estimates under simple random sampling (with or without replacement) are more reliable when the elements in the population are homogeneous with respect to the characteristic under study. Increasing the sample size, n, in either sampling scheme will decrease the standard error of X 18

Some Notes on Theorems 11.1 and 11.2 This tells us that if we use either SRSWR or SRSWOR and we want the value of X from one sample to the other to be very near μ, then all we need to do is increase the sample size, n. 19

Homework DO Exercises for Section 11.1, p. 390 391, nos. 1 and 2 20

11.2 Central Limit Theorem Theorem 11.3 Central Limit Theorem If X is the mean of a random sample of size n from a large or infinite population with mean μ and variance σ 2, then the sampling distribution of X is approximately normally distributed with mean, E( X )=μ, and variance, Var( X )=σ 2 /n, when n is sufficiently large. 21

Notes: CLT CLT states that when the sample size is sufficiently large, we can use the normal distribution to approximate the sampling distribution of X. What is remarkable about the CLT is that it does not state any requirement about the distribution of the population, aside from having mean μ and variance σ 2. 22

Notes: CLT The normal approximation will hold for population distributions that are either discrete or continuous. The normal approximation will hold for population distributions that are either symmetric or skewed. We can use the approximation even for random samples from finite populations so long as N is very large. 23

Notes: CLT When do we consider the sample size, n, to be sufficiently large? In most situations, the normal approximation will be good if n 30. If the distribution of the population is not very different from the normal distribution, then the approximation will be good even if n < 30. In fact, if the population is normally distributed, then X will be normally distributed even for a sample of size 1. 24

Example: CLT An electrical firm manufactures electric light bulbs that have a length of life which is normally distributed with mean and standard deviation equal to 500 and 50 hours, respectively. Find the probability that a random sample of 15 bulbs will have an average life of less than 475 hours. 25

T distribution t distribution Very similar to the standard normal distribution The graph of the pdf of the t distribution is also a bell shaped curve that is symmetric about 0. Its tails will also approach the x axis without ever touching it 26

t distribution Difference with Standard normal distribution: t distribution has a larger variance than a standard normal distribution However, as the degrees of freedom increases, the variance of the t distribution goes to 1, so that when the degrees of freedom is at least 30, the t distribution is almost the same as the standard normal distribution. 27

dotted curve: v=25 v=5 black curve: standard normal v=2-3 -2-1 0 1 2 3 FIGURE 11.1. GRAPH OF THE T-DISTRIBUTION WITH VARYING DEGREES OF FREEDOM (V) AND THE STANDARD NORMAL DISTRIBUTION 28

t distribution If X and s 2 are the mean and variance, respectively, of a random sample of size n taken from a population which is normally distributed with mean μ and variance σ 2, then is a random variable having the t distribution with v = n 1 degrees of freedom Notation: T ~t α,v=n 1 29

t distribution Area under the curve Just like any continuous probability distribution, the probability that a random sample produces a t value falling between any two specified values is equal to the area under the curve of the t distribution between any two ordinates corresponding to the specified values. 30

t distribution Notation: t α is the t value leaving an area of α in the right tail of the t distribution. That is, T~t α,v then t α is such that P(T > t α,v ) = α = P(T < t α,v ) Since the t distribution is symmetric about zero, t 1 α,v = t α,v 31

FIGURE 11.2. GRAPHICAL REPRESENTATION OF THE 100(1-α) TH PERCENTILE, T α (V). 32

Examples: t distribution 1. Find the following values on the t table: a) t 0.025 when v = 14 b) t 0.99 when v = 10 2. Find k such that P(k < T < 2.807) = 0.945 when T ~ t α,23 3. A manufacturing firm claims that batteries used in their electronic games will last an average of 30 hours. To maintain this average, 16 batteries are tested each month. If the computed t value falls between t 0.025 and t 0.025, the firm is satisfied with its claim. What conclusion should the firm draw from a sample that has mean X = 27.5 hours and standard deviation s = 5 hours? Assume the distribution of battery lives to be approximately normal. 33

Read Example 11.10 34

Chi Square Distribution Just like the t distribution, the chi square distribution has a single parameter called the DEGREES OF FREEDOM. If X is a random variable that follows a chisquare distribution with v degrees of freedom, then we write X~χ 2 (v) where χ is the small Greek letter chi (read as ki). 35

Chi square Distribution The pdf of the chi square distribution is positive for positive real numbers only; elsewhere, its value is 0. Its mean is equal to its degrees of freedom. Its variance is twice its degrees of freedom. Thus, as the degrees of freedom increases, both the mean and variance will also increase. 36

v =2 v =5 v =10 v =15 FIGURE 11.3. GRAPH OF THE CHI-SQUARE DISTRIBUTION WITH VARYING DEGREES OF FREEDOM (V). 37

Chi square distribution Skewed to the right Its skewness is more pronounced for smaller degrees of freedom. As the degrees of freedom increases, its distribution becomes more symmetric. If X~ χ 2 (v), then χ 2 (v) satisfies the condition that the P(X χ 2 (v) ) = 1 α, or equivalently, P(X > χ 2 (v) ) = α. 38

FIGURE 11.4. GRAPHICAL REPRESENTATION OF THE 100(1-α) TH PERCENTILE, χ 2 α (V). 39

Example 11.11 Suppose 11.11: Suppose X ~ χ 2 α(v=10). WE CAN USE TABLE B.3, APPENDIX B, TO DETERMINE THE FOLLOWING: A) P(X > 18.307) B) P(X < 20.483) C) χ 2 0.01(V=10) 40

F Distribution The F distribution has 2 parameters: (i) the numerator degrees of freedom, v 1 ; and, (ii) the denominator degrees of freedom, v 2. This distribution is directly related to the chi square distribution. If there are two independent random variables, X and Y, such that X~χ 2 (v 1 ) and Y~χ 2 (v 2 ) then the random variable,, will follow an F distribution with v 1 and v 2 degrees of freedom. This is why we call v 1 and v 2 the numerator and denominator degrees of freedom, respectively. These were the original degrees of freedom of the χ 2 random variables in the F ratio. 41

F Distribution Just like the chi square distribution, the pdf of the F distribution is positive for positive real numbers only; elsewhere, its value is 0. The graph of the pdf is also skewed to the right. In general, distributions with higher degrees of freedom are less skewed. 42

FIGURE 11.5. GRAPHICAL REPRESENTATION OF THE 100(1-α) TH PERCENTILE, F α 43

F Distribution Table B.4, Appendix B, presents the 100(1 α) th percentile, of an F distribution with integralvalued degrees of freedom ranging from 1 to 30 for selected values of α. We will denote the 100(1 α) th percentile of an F distribution with v 1 and v 2 degrees of freedom by F α (v 1,v 2 ). If X~F(v 1,v 2 ) then F α (v 1,v 2 ) satisfies the condition that the P(X F α (v 1,v 2 ))=1 α, or equivalently, P(X>F α (v 1,v 2 ))=α. Figure 11.5 shows the graphical representation of F α (v 1,v 2 ). 44

11.3 Sampling from the Normal Distribution Suppose (X 1,X 2,,X n ) is a random sample satisfying the condition that X i ~Normal(µ, σ 2 ) n for i=1,2,,n. Define the statistics, X i ( X X ) 2 i = 1 S = as the sample mean and n 1 as the sample variance, where n is the sample size. The sampling distribution of X is also normal with mean µ and variance σ 2. n i X 2 = i = 1 n 45

TABLE 11.1. SAMPLING DISTRIBUTIONS OF STATISTICS BASED ON A RANDOM SAMPLE FROM A NORMAL DISTRIBUTION STATISTIC SAMPLING DISTRIBUTION PARAMETER/S Z = X μ σ n STANDARD NORMAL DISTRIBUTION MEAN=0 VARIANCE=1 T = X S μ n T-DISTRIBUTION DEGREES OF FREEDOM: V = N 1 X 2 = ( n 1) S 2 σ 2 CHI-SQUARE DISTRIBUTION DEGREES OF FREEDOM: V = N 1 46

Sampling Distributions of Statistics Based on Two Independent Random Samples from Normal Distributions THERE ARE INSTANCES WHEN WE WISH TO COMPARE THE BEHAVIOR OF TWO DIFFERENT POPULATIONS. SUPPOSE (X 1,X 2,,X n1 ) is a random sample of size n 1 satisfying the condition that X i ~Normal(µ X, σ X2 ) for i=1,2,,n 1. Suppose we take another random sample of size n 2, (Y 1,Y 2,,Y n2 ), whose selection is independent of the selection of the sample from the first population, and this time satisfying the condition that Y i ~Normal(µ Y, σ Y2 ) for i=1,2,,n 2. 47

Sampling Distributions of Statistics Based on Two Independent Random Samples from Normal Distributions i i = 1 X = Define the statistics, n as the sample n 1 1 2 mean, ( X i X ) 2 i = 1 S as the sample variance, X = n1 1 where n 1 is the sample size of the sample taken from the first normal population. n 2 Y i i = 1 Likewise, define Y = as the sample mean, n n 2 2 2 ( Yi Y ) as the 2 i = 1 S sample variance, where n Y = n 2 1 2 is the sample size of the sample taken from the second normal population. n 1 X 48

TABLE 11.2. SAMPLING DISTRIBUTION OF STATISTICS BASED ON TWO INDEPENDENT RANDOM SAMPLES FROM NORMAL DISTRIBUTIONS STATISTIC SAMPLING DISTRIBUTION PARAMETER/S ( X Y) ( μ X μy ) Z = 2 2 σ X σy + n n 1 2 STANDARD NORMAL DISTRIBUTION MEAN=0 VARIANCE =1 T = S p = ( X Y) ( μ μ ) S p X 1 1 + n n 1 2 ( n 1) S + ( n 1) S n + n 2 2 2 1 x 2 Y F = 1 2 S S 2 X 2 Y σ σ 2 X 2 Y Y T-DISTRIBUTION IF σ X2 =σ Y 2 F-DISTRIBUTION DEGREE S OF FREEDOM: V = N 1 +N 2-2 DEGREES OF FREEDOM: V 1 = N 1 1 V 2 = N 2-1 49