Lecture 8 Sampling Theory

Size: px

Start display at page:

Download "Lecture 8 Sampling Theory"

Shon Booker
5 years ago
Views:

1 Lecture 8 Sampling Theory Thais Paiva STA Summer 2013 Term II July 11, / 25 Thais Paiva STA Summer 2013 Term II Lecture 8, 07/11/2013

2 Lecture Plan 1 Sampling Distributions 2 Law of Large Numbers 3 Central Limit Theorem 2 / 25 Thais Paiva STA Summer 2013 Term II Lecture 8, 07/11/2013

3 Statistical Inference We want to study some quantities of interest (parameter) in a large population. Example: Obama s approval rating. But we cannot observe the whole population. What do we do? Design a study to sample individuals from the population. Example: eligible voters Study the quantity of interest on your sample Infer (conclude) about the unknown parameter. Example: 1 Determine a range that will include the parameter of interest: 0.45 < approval rating < Test a hypothesis: is the approval rating > 0.5? 3 / 25 Thais Paiva STA Summer 2013 Term II Lecture 8, 07/11/2013

4 Statistical Inference A statistic refers to a characteristic of the sample (e.g., sample mean, sample deviation, sample maximum) A parameter refers to a characteristic of the population (e.g., population mean, population standard deviation, population proportion that votes for republicans) Our goal is to use statistics to infer the parameter in the population (e.g., what is the relation between the sample mean and the population mean?) The sampling distribution is the bridge! 4 / 25 Thais Paiva STA Summer 2013 Term II Lecture 8, 07/11/2013

5 Example Population: STA 111 Heights Distribution of students height Density Height (in) Let s assume this is the true population with parameter µ = 68.4 σ 2 = 18.6 We wish to take a sample to estimate µ and σ 2. 5 / 25 Thais Paiva STA Summer 2013 Term II Lecture 8, 07/11/2013

6 Samples size = 4 Let s say we take a sample of size 4 and repeat it 5 times. For each sample, we calculate the sample mean x and the variance s 2. Sample # x 1 x 2 x 3 x 4 x s We see that the x s are pretty close around µ = There is quite some variability in s 2 across samples. 6 / 25 Thais Paiva STA Summer 2013 Term II Lecture 8, 07/11/2013

7 Sampling Distribution (n = 4) What if I carry on and repeat it 1000 times? Frequency Some x s are quite extreme! But most of them seem to hover around the population mean (red vertical line) Sample Mean (n=4) 7 / 25 Thais Paiva STA Summer 2013 Term II Lecture 8, 07/11/2013

8 Sampling Distribution What if we change the sample size? Frequency Sample Mean (n=4) Frequency Sample Mean (n=15) Frequency Sample Mean (n=50) Frequency Sample Mean (n=100) 8 / 25 Thais Paiva STA Summer 2013 Term II Lecture 8, 07/11/2013

9 Sampling Distribution The previous histograms are examples of Sampling Distributions Distributions of a statistic calculated from a random sample Each individual in the population is equally likely to be chosen every time we draw an observation A statistic is random because each sample is different: if the data have not been recorded yet, the statistic is simply a function of same random elements Viewing a statistic as a random variable, we can define its mean and variance. For example, E( X ) = µ X V ( X ) = σ 2 X (Tricky notation: Population mean µ of a statistic X!!) 9 / 25 Thais Paiva STA Summer 2013 Term II Lecture 8, 07/11/2013

10 Estimator We saw that The sampling distribution of x is centered around µ The variability of x becomes smaller with larger sample size If we use x to infer about µ, we call x an estimator of µ. There are many other potential estimators for µ. For example, if the underlying population is Normal, we can use the sample median. In the next lecture, we will discuss ways to evaluate and compare estimators. 10 / 25 Thais Paiva STA Summer 2013 Term II Lecture 8, 07/11/2013

11 Combination of Random Variables There are two important properties of random variables that are useful in studying estimators. If we let X and Y be two independent random variables, then E(X + Y ) = E(X ) + E(Y ) Var(X + Y ) = Var(X ) + Var(Y ) We will discuss these properties later in the class. 11 / 25 Thais Paiva STA Summer 2013 Term II Lecture 8, 07/11/2013

12 Mean and Variance of the Sample Mean Let X 1,..., X n be independent and identically distributed random variables. The above assumption says X 1,..., X n are randomly sampled from the same distribution (= random sample). Then ( ) X X n E( X ) = E n ( ) Var( X X X n ) = Var n = E(X 1) E(X n ) n = Var(X 1) Var(X n ) n 2 = nµ n = µ = nσ2 n 2 = σ2 n 12 / 25 Thais Paiva STA Summer 2013 Term II Lecture 8, 07/11/2013

13 Mean and Variance of the Sample Mean E( X ) = µ says If I repeatedly collect my sample, the overall average of X is µ, the true population mean In reality, we usually only collect the sample once This holds for any sample size! V ( X ) = σ2 n says The variability in X decreases as the sample size increases. Specifically, it goes down by a rate of 1/n The variability also depends on the underlying population s variability! 13 / 25 Thais Paiva STA Summer 2013 Term II Lecture 8, 07/11/2013

14 Mean and Variance of the Sample Mean However, E( X ) = µ by itself does not guarantee that X = µ! Luckily, V ( X ) = σ2 n says that the variance of X decreases toward zero as the sample size increases. So, when the sample is large, the uncertainty goes to zero, and therefore lim Var( X ) = 0 n lim X = µ n 14 / 25 Thais Paiva STA Summer 2013 Term II Lecture 8, 07/11/2013

15 Mean and Variance of the Sample Mean Recall our first example: Frequency Sample Mean (n=4) Frequency Sample Mean (n=15) Frequency Sample Mean (n=50) Frequency Sample Mean (n=100) 15 / 25 Thais Paiva STA Summer 2013 Term II Lecture 8, 07/11/2013

16 Law of Large Numbers: Interpretation Suppose you want to estimate µ on a specific population What you can do is to extract a sample from the population and estimate the sample mean x If the sample is big enough, x will be close to µ If you increase the sample size, x should get closer to µ The more you increase the sample size, the closer x to µ 16 / 25 Thais Paiva STA Summer 2013 Term II Lecture 8, 07/11/2013

17 Central Limit Theorem The Law of Large Numbers tells me how X behaves in terms of central tendency and variability. That is useful information, but it does not tell me its actual distribution! The Central Limit Theorem says: when n is large, X is approximately normally distributed ( σ X 2 ) N µ, n Important: the CLT holds regardless of the underlying distribution of X! No matter what the shape of the original distribution is, the sampling distribution of the mean approaches a normal distribution. 17 / 25 Thais Paiva STA Summer 2013 Term II Lecture 8, 07/11/2013

18 Central Limit Theorem Density X Here is a weird distribution with parameter By CLT, µ = 6.5 σ = 2.9 if n = 10: ( X N if n = 50: ( X N 6.5, , ) ) 18 / 25 Thais Paiva STA Summer 2013 Term II Lecture 8, 07/11/2013

19 Central Limit Theorem Sample Mean (n = 10) Sample Mean (n = 50) Density Density Amazing. 19 / 25 Thais Paiva STA Summer 2013 Term II Lecture 8, 07/11/2013

20 Using CLT: Height Example Assume the distribution of height in our class has a mean of 70 (inches) and a variance of 100 (inches 2 ). In my study, I will obtain measurements of 20 individuals. What is the probability that X will be between 65 and 75? By CLT, X N (µ, ), σ2 where µ = 70 and σ2 n n = = 5. ( ) P(65 < X < 75) = P < Z < 5 5 ( = P 5 < Z < ) 5 = / 25 Thais Paiva STA Summer 2013 Term II Lecture 8, 07/11/2013

21 Using CLT: Height example Note: in the previous example we calculated P(65 < X < 75) = NOT ( ) P(65 < X < 75) = P < Z < = P ( 0.5 < Z < 0.5) = 0.38 The first one is the sample average; the second one is just one actual height!!! 21 / 25 Thais Paiva STA Summer 2013 Term II Lecture 8, 07/11/2013

22 Using CLT: Sample Size Assume the distribution of height in our class has a mean of 70 (inches) and a variance of 100 (inches 2 ). In designing my study, what sample size should I use so that the probability that my sample average X is between 69 and 71 is equal to 90%? P(69 < X < 71) = P ( ) < Z < 100/n 100/n ( n = P ( P Z > ) n 10 < Z < = ) ( n = P Z < 10 Because Z is symmetric: n = 1.64 n = ) n = / 25 Thais Paiva STA Summer 2013 Term II Lecture 8, 07/11/2013

23 Sample Percentage The sample percentage is defined to be the ratio between the number of successes over the number of trials n i=1 P = X i n For example, the batting averages (P) are estimates of the unknown proportion of successful batting in the whole career (π) If we could observe the data for the whole career, then we would know the true value 23 / 25 Thais Paiva STA Summer 2013 Term II Lecture 8, 07/11/2013

24 Sample Percentage n i=1 P = X i n E(P) = E[X 1] E[X n ] n = π π n = π Var(P) = Var[X1]+...+Var[Xn] n 2 Law of Large Numbers: Central Limit Theory: = π(1 π)+...+π(1 π) n 2 lim P = π n [ P N π, ] π(1 π) n = π(1 π) n 24 / 25 Thais Paiva STA Summer 2013 Term II Lecture 8, 07/11/2013

25 Sample Percentage Suppose tossing a fair coin 1,000 times. What is the probability of observing heads less than half of the times? Fair coin means that π = 0.5 ( P P < 500 ) ( = P Z < 0.5 ) (1 0.5)/1000 = P(Z < 0) = 0.5 We could also try to work with Binomial distribution probabilities but n is very large here 25 / 25 Thais Paiva STA Summer 2013 Term II Lecture 8, 07/11/2013

The Central Limit Theorem

The Central Limit Theorem Patrick Breheny September 27 Patrick Breheny University of Iowa Biostatistical Methods I (BIOS 5710) 1 / 31 Kerrich s experiment Introduction 10,000 coin flips Expectation and