Confidence Intervals for Normal Data Spring 2018

Similar documents
Confidence Intervals for Normal Data Spring 2014

Confidence Intervals for the Mean of Non-normal Data Class 23, Jeremy Orloff and Jonathan Bloom

Frequentist Statistics and Hypothesis Testing Spring

This does not cover everything on the final. Look at the posted practice problems for other topics.

Null Hypothesis Significance Testing p-values, significance level, power, t-tests Spring 2017

18.05 Practice Final Exam

18.05 Final Exam. Good luck! Name. No calculators. Number of problems 16 concept questions, 16 problems, 21 pages

Class 26: review for final exam 18.05, Spring 2014

Advanced Herd Management Probabilities and distributions

Central Limit Theorem and the Law of Large Numbers Class 6, Jeremy Orloff and Jonathan Bloom

1 Statistical inference for a population mean

Null Hypothesis Significance Testing p-values, significance level, power, t-tests

Exam 2 Practice Questions, 18.05, Spring 2014

Post-exam 2 practice questions 18.05, Spring 2014

Conjugate Priors: Beta and Normal Spring 2018

Continuous Expectation and Variance, the Law of Large Numbers, and the Central Limit Theorem Spring 2014

The t-test Pivots Summary. Pivots and t-tests. Patrick Breheny. October 15. Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/18

Content by Week Week of October 14 27

Swarthmore Honors Exam 2012: Statistics

Compute f(x θ)f(θ) dθ

Conjugate Priors: Beta and Normal Spring 2018

Lecture 7: Confidence interval and Normal approximation

QUIZ 4 (CHAPTER 7) - SOLUTIONS MATH 119 SPRING 2013 KUNIYUKI 105 POINTS TOTAL, BUT 100 POINTS = 100%

Conjugate Priors: Beta and Normal Spring January 1, /15

Pivotal Quantities. Mathematics 47: Lecture 16. Dan Sloughter. Furman University. March 30, 2006

Gov 2000: 6. Hypothesis Testing

79 Wyner Math Academy I Spring 2016

Statistics. Statistics

Performance Evaluation and Comparison

Homework for 1/13 Due 1/22

STAT 135 Lab 5 Bootstrapping and Hypothesis Testing

Probabilities and distributions

Business Statistics. Lecture 5: Confidence Intervals

PubH 5450 Biostatistics I Prof. Carlin. Lecture 13

STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots. March 8, 2015

MVE055/MSG Lecture 8

Choosing Priors Probability Intervals Spring 2014

EC2001 Econometrics 1 Dr. Jose Olmo Room D309

Chapter 9: Hypothesis Testing Sections

Topic 6 - Confidence intervals based on a single sample

UC Berkeley Math 10B, Spring 2015: Midterm 2 Prof. Sturmfels, April 9, SOLUTIONS

Lecture 17: Small-Sample Inferences for Normal Populations. Confidence intervals for µ when σ is unknown

Interval estimation. October 3, Basic ideas CLT and CI CI for a population mean CI for a population proportion CI for a Normal mean

Chapter 8 - Statistical intervals for a single sample

Statistical Inference: Estimation and Confidence Intervals Hypothesis Testing

Stat 5101 Lecture Notes

Announcements Monday, September 18

Bayesian Inference for Normal Mean

Interpretation of results through confidence intervals

Inference for a Population Proportion

2.830J / 6.780J / ESD.63J Control of Manufacturing Processes (SMA 6303) Spring 2008

IV. The Normal Distribution

Quantitative Introduction ro Risk and Uncertainty in Business Module 5: Hypothesis Testing

ST495: Survival Analysis: Hypothesis testing and confidence intervals

Fundamental Tools - Probability Theory IV

Discrete Distributions

Review: General Approach to Hypothesis Testing. 1. Define the research question and formulate the appropriate null and alternative hypotheses.

Summary: the confidence interval for the mean (σ 2 known) with gaussian assumption

Confidence Intervals. Confidence interval for sample mean. Confidence interval for sample mean. Confidence interval for sample mean

STA Module 10 Comparing Two Proportions

" M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2

Spring 2012 Math 541B Exam 1

Studio 8: NHST: t-tests and Rejection Regions Spring 2014

Confidence intervals CE 311S

Section 7.1 How Likely are the Possible Values of a Statistic? The Sampling Distribution of the Proportion

Mathematical Statistics

PHP2510: Principles of Biostatistics & Data Analysis. Lecture X: Hypothesis testing. PHP 2510 Lec 10: Hypothesis testing 1

z and t tests for the mean of a normal distribution Confidence intervals for the mean Binomial tests

Are data normally normally distributed?

Hypothesis Testing. 1 Definitions of test statistics. CB: chapter 8; section 10.3

Continuous Data with Continuous Priors Class 14, Jeremy Orloff and Jonathan Bloom

Math 494: Mathematical Statistics

Introduction to Statistical Data Analysis Lecture 5: Confidence Intervals

Section 9.4. Notation. Requirements. Definition. Inferences About Two Means (Matched Pairs) Examples

CBA4 is live in practice mode this week exam mode from Saturday!

F79SM STATISTICAL METHODS

Space Telescope Science Institute statistics mini-course. October Inference I: Estimation, Confidence Intervals, and Tests of Hypotheses

ACMS Statistics for Life Sciences. Chapter 13: Sampling Distributions

The t-test: A z-score for a sample mean tells us where in the distribution the particular mean lies

Lecture Slides. Elementary Statistics Eleventh Edition. by Mario F. Triola. and the Triola Statistics Series 9.1-1

ISQS 5349 Final Exam, Spring 2017.

Chapter 5 Confidence Intervals

Stats Review Chapter 14. Mary Stangler Center for Academic Success Revised 8/16

INTERVAL ESTIMATION AND HYPOTHESES TESTING

Last few slides from last time

Chapter 8: Confidence Intervals

Review for Exam Spring 2018

Ch. 7 Statistical Intervals Based on a Single Sample

Physics 403. Segev BenZvi. Parameter Estimation, Correlations, and Error Bars. Department of Physics and Astronomy University of Rochester

STA 2201/442 Assignment 2

Harvard University. Rigorous Research in Engineering Education

Two-Sample Inferential Statistics

Review. December 4 th, Review

Comparing two independent samples

STATS 200: Introduction to Statistical Inference. Lecture 29: Course review

CONTINUOUS RANDOM VARIABLES

Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing

EC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix)

Introduction to Statistical Data Analysis Lecture 4: Sampling

II. The Normal Distribution

Transcription:

Confidence Intervals for Normal Data 18.05 Spring 2018

Agenda Exam on Monday April 30. Practice questions posted. Friday s class is for review (no studio) Today Review of critical values and quantiles. Computing z, t, χ 2 confidence intervals for normal data. Conceptual view of confidence intervals. Confidence intervals for polling (Bernoulli distributions). May 2, 2018 2 / 20

Review of critical values and quantiles Quantile: left tail P(X < q α ) = α Critical value: right tail P(X > c α ) = α Letters for critical values: z α for N(0, 1) t α for t(n) c α, x α all purpose P (Z q α ) α q α P (Z > z α ) z α α z q α and z α for the standard normal distribution. May 2, 2018 3 / 20

Concept question P (Z q α ) α q α P (Z > z α ) z α α z 1. z.025 = (a) -1.96 (b) -0.95 (c) 0.95 (d) 1.96 (e) 2.87 2. z.16 = (a) -1.33 (b) -0.99 (c) 0.99 (d) 1.33 (e) 3.52 Solution on next slide. May 2, 2018 4 / 20

Solution 1. answer: z.025 = 1.96. By definition P(Z > z.025 ) = 0.025. This is the same as P(Z z.025 ) = 0.975. Either from memory, a table or using the R function qnorm(.975) we get the result. 2.answer: z.16 = 0.99. We recall that P( Z < 1).68. Since half the leftover probability is in the right tail we have P(Z > 1) 0.16. Thus z.16 1, so z.16 1. May 2, 2018 5 / 20

Computing confidence intervals from normal data Suppose the data x 1,..., x n is drawn from N(µ, σ 2 ) Confidence level = 1 α z confidence interval for the mean (σ known) [ x z α/2 σ, x + z ] α/2 σ n n t confidence interval for the mean (σ unknown) [ x t α/2 s, x + t ] α/2 s n n χ 2 confidence interval for σ 2 [ n 1 c α/2 s 2, ] n 1 s 2 c 1 α/2 t and χ 2 have n 1 degrees of freedom. May 2, 2018 6 / 20

z rule of thumb Suppose x 1,..., x n N(µ, σ 2 ) with σ known. The rule-of-thumb 95% confidence interval for µ is: [ x 2 σ n, x + 2 σ n ] A more precise 95% confidence interval for µ is: [ x 1.96 σ n, x + 1.96 σ n ] May 2, 2018 7 / 20

Board question: computing confidence intervals The data 4, 1, 2, 3 is drawn from N(µ, σ 2 ) with µ unknown. 1 Find a 90% z confidence interval for µ, given that σ = 2. For the remaining parts, suppose σ is unknown. 2 Find a 90% t confidence interval for µ. 3 Find a 90% χ 2 confidence interval for σ 2. 4 Find a 90% χ 2 confidence interval for σ. 5 Given a normal sample with n = 100, x = 12, and s = 5, find the rule-of-thumb 95% confidence interval for µ. May 2, 2018 8 / 20

Solution x = 2.5, s 2 = 1.667, s = 1.29 σ/ n = 1, s/ n = 0.645. 1. z.05 = 1.644: z confidence interval is 2.5 ± 1.644 1 = [0.856, 4.144] 2. t.05 = 2.353 (3 degrees of freedom): t confidence interval is 2.5 ± 2.353 0.645 = [0.982, 4.018] 3. c 0.05 = 7.1814, c 0.95 = 0.352 (3 degrees of freedom): χ 2 confidence interval is [ ] 3 1.667 7.1814, 3 1.667 = [0.696, 14.207]. 0.352 4. Take the square root of the interval in 3. [0.834, 3.769]. 5. The rule of thumb is written for z, but with n = 100 the t(99) and standard normal distributions are very close, so we can assume that t.025 2. Thus the 95% confidence interval is 12 ± 2 5/10 = [11, 13]. May 2, 2018 9 / 20

Conceptual view of confidence intervals Computed from data interval statistic Estimates a parameter of interest interval estimate Width = measure of precision Confidence level = measure of performance Confidence intervals are a frequentist method. No need for a prior, only uses likelihood. Frequentists don t assign probabilities to hypotheses A 95% confidence interval of [1.2, 3.4] for µ doesn t mean that P(1.2 µ 3.4) = 0.95. We will compare with Bayesian probability intervals later. Applet: http://mathlets.org/mathlets/confidence-intervals/ May 2, 2018 10 / 20

Table discussion The quantities n, c = confidence, x, σ all appear in the z confidence interval for the mean. How does the width of a confidence interval for the mean change if: 1. we increase n and leave the others unchanged? 2. we increase c and leave the others unchanged? 3. we increase µ and leave the others unchanged? 4. we increase σ and leave the others unchanged? (A) it gets wider (B) it gets narrower (C) it stays the same. May 2, 2018 11 / 20

Answers 1. Narrower. More data decreases the variance of x 2. Wider. Greater confidence requires a bigger interval. 3. No change. Changing µ will tend to shift the location of the intervals. 4. Wider. Increasing σ will increase the uncertainty about µ. May 2, 2018 12 / 20

Intervals and pivoting x: sample mean (statistic) µ 0 : hypothesized mean (not known) Pivoting: x is in the interval µ 0 ± 2.3 µ 0 is in the interval x ± 2.3. 2 1 0 1 2 3 4 µ 0 ± 1 x ± 1 µ 0 ± 2.3 x ± 2.3 µ 0 x this interval does not contain x this interval does not contain µ 0 this interval contains x this interval contains µ 0 Algebra of pivoting: µ 0 2.3 < x < µ 0 + 2.3 x + 2.3 > µ 0 > x 2.3. May 2, 2018 13 / 20

Board question: confidence intervals, non-rejection regions Suppose x 1,..., x n N(µ, σ 2 ) with σ known. Consider two intervals: 1. The z confidence interval around x at confidence level 1 α. 2. The z non-rejection region for H 0 : µ = µ 0 at significance level α. Compute and sketch these intervals to show that: µ 0 is in the first interval x is in the second interval. May 2, 2018 14 / 20

Solution Confidence interval: σ x ± z α/2 n Non-rejection region: σ µ 0 ± z α/2 n Since the intervals are the same width they either both contain the other s center or neither one does. N(µ 0, σ 2 /n) x x 2 µ 0 z α/2 σ n µ 0 x 1 µ 0 + z α/2 σ n May 2, 2018 15 / 20

Polling: a binomial proportion confidence interval Data x 1,..., x n from a Bernoulli(θ) distribution with θ unknown. A conservative normal (1 α) confidence interval for θ is given by [ x z α/2 2 n, x + z ] α/2 2. n Proof uses the CLT and the observation σ = θ(1 θ) 1/2. Political polls often give a margin-of-error of ±1/ n. This rule-of-thumb corresponds to a 95% confidence interval: [ x 1, x + 1 ]. n n (The proof is in the class 21 notes.) Conversely, a margin of error of ±0.05 means 400 people were polled. There are many types of binomial proportion confidence intervals. http://en.wikipedia.org/wiki/binomial_proportion_confidence_interval May 2, 2018 16 / 20

Board question For a poll to find the proportion θ of people supporting X we know that a (1 α) confidence interval for θ is given by [ x z α/2 2 n, x + z ] α/2 2. n 1. How many people would you have to poll to have a margin of error of.01 with 95% confidence? (You can do this in your head.) 2. How many people would you have to poll to have a margin of error of.01 with 80% confidence. (You ll want R or other calculator here.) 3. If n = 900, compute the 95% and 80% confidence intervals for θ. answer: See next slide. May 2, 2018 17 / 20

answer: 1. Need 1/ n =.01 So n = 10000. 2. α =.2, so z α/2 = qnorm(.9) = 1.2816. So we need z α/2 2 n =.01. This gives n = 4106. 3. 95% interval: x ± 1 = x ± 1 n 30 = x ±.0333 80% interval: x ± z.1 1 2 n = x ± 1.2816 1 60 = x ±.021. May 2, 2018 18 / 20

Concept question: overnight polling During the presidential election season, pollsters often do overnight polls and report a margin of error of about ±5%. The number of people polled is in which of the following ranges? (a) 0 50 (b) 50 100 (c) 100 300 (d) 300 600 (e) 600 1000 Answer: 5% = 1/20. So 20 = n n = 400. May 2, 2018 19 / 20

National Council on Public Polls: Press Release, Sept 1992 The National Council on Public Polls expressed concern today about the current spate of overnight Presidential polls. [...] Overnight polls do a disservice to both the media and the research industry because of the considerable potential for the results to be misleading. The overnight interviewing period may well mean some methodological compromises, the most serious of which is.....what?...the inability to make callbacks, resulting in samples that do not adequately represent such groups as single member households, younger people, and others who are apt to be out on any given night. As overnight polls often result in findings that are less reliable than those from more carefully conducted polls, if the media reports them, it should be with great caution. http://www.ncpp.org/?q=node/42 May 2, 2018 20 / 20