Lecture 7: Confidence interval and Normal approximation

Size: px
Start display at page:

Download "Lecture 7: Confidence interval and Normal approximation"

Transcription

1 Lecture 7: Confidence interval and Normal approximation 26th of November 2015 Confidence interval 26th of November / 23

2 Random sample and uncertainty Example: we aim at estimating the average height of British men. We measure the height of 198 men. The sample mean of 198 men s heights is 1732mm, and the sample standard deviation is 68.8mm. What does this tell us about the average height of British men? We only measured 198 of the many millions of men in the country. Measuring the height of 198 other means would have lead to another estimation of the average height of British men Confidence interval 26th of November / 23

3 Imagine the height of British men follows a normal distribution N(µ, σ 2 ). Denote by X 1,..., X n the random measurements of the height of n = 198 mens. Our best guess for µ will of course be the sample mean X = 1 n (X X n ) What is the distribution of X? It is the sum of normally distributed random variables so it follows a normal distribution. Its mean is µ. Its variance is V ar( X) ) = (V 1 ar(x n 2 1 ) + + V ar(x n ) Therefore X N ) (µ, σ2 n. = σ2 n. Confidence interval 26th of November / 23

4 We can standardise X by writing Z = X µ σ/, where Z N(0, 1). n Using the standard normal tables we can show that P ( 1.96 Z 1.96) = 0.95 Therefore we have that ( P X 1.96 σ µ X σ ) = n n i.e, the probability is 0.95 that µ is in the range X 1.96 σ n to X σ n ; σ can be approximated using the sample standard deviation. We call this interval a 95% confidence interval for the unknown population mean. Confidence interval 26th of November / 23

5 General procedure for normal confidence interval Suppose X 1,..., X n are independent samples from a normal distribution with unknown mean µ, and known variance σ 2. Then a (symmetric) c% normal confidence interval for µ is the interval ( X z σ n, X + z σ n ), which we also write as X ± z σ n, z is the number such that (100 c)/2% of the probability in the standard normal distribution is above z. For a 95% confidence interval, we take z = 1.96 For a 99% confidence interval, we take z = 2.6 Confidence interval 26th of November / 23

6 Standard error A c% normal confidence interval for µ is the interval ( X z σ n, X + z σ n ). The quantity σ n, which determines the scale of the confidence interval, is called the Standard Error (SE) for the sample mean. Here we assume that σ is known. If it is not, we will estimate it from the data. Confidence interval 26th of November / 23

7 Interpreting the confidence interval What does confidence mean? The quantity µ is a fact, not a random quantity. Example: imagine we obtain the following 5% confidence interval: (1722mm, 1742mm) for the men height. We can not say: The probability is 0.95 that µ is between 1722mm and 1742mm. The randomness is in our estimate X. The probability statement is about the random interval. Confidence interval 26th of November / 23

8 Interpreting the confidence interval A (α 100)% confidence interval for a parameter µ, based on observations X = (X 1,..., X n ) is a pair of statistics A(X) and B(X), such that P (A(X) µ B(X)) = α. We can say that: (α 100)% of the time, the random interval generated according to this recipe will cover the true parameter. The quantity P (A(X) µ B(X)) is called the coverage probability for µ. A (α 100)% confidence interval is sometimes also called a confidence interval with confidence coefficient or confidence level α. Confidence interval 26th of November / 23

9 Example: blood sample Four measurements for 100 patients 95% confidence interval Blood Pressure Blood Pressure Trial Trial 90% confidence interval 68% confidence interval Blood Pressure Trial Trial Blood Pressure Confidence interval 26th of November / 23

10 The Normal Approximation So far, we have been assuming that our data are sampled from a population with a normal distribution. What justification do we have for this assumption? And what do we do if the data come from a different distribution? One of the great early discoveries of probability theory is that many different kinds of random variables come close to a normal distribution when you average enough of them. You have already seen examples of this phenomenon in the normal approximation to the binomial distribution and the Poisson. Confidence interval 26th of November / 23

11 The Normal Approximation Approximation Theorems in Probability Suppose X 1, X 2,..., X n are independent samples from a probability distribution with mean µ and variance σ 2. Then Law of Large Numbers (LLN): For n large, X will be close to µ. Central Limit Theorem (CLT): For n large, the error in the LLN is close to a normal distribution, with variance σ 2 /n. That is, using our standardisation procedure for the normal distribution, Z = X µ σ/ n (1) is close to having a standard normal distribution. Equivalently, X X n has approximately N(nµ, nσ 2 ) distribution. Confidence interval 26th of November / 23

12 In probability textbooks, you can find very precise statements about what it means for the distribution to be close. For our purposes, we will simply treat Z as being actually a standard normal random variable. However, we also need to know what it means for n to be large. For most distributions that you might encounter, 20 is usually plenty, while 2 or 3 are not enough. The key rules of thumb are that the approximation works best when the distribution of X 1 is reasonably symmetric: Not skewed in either direction. 2 has thin tails: Most of the probability is close to the mean, not many standard deviations away from the mean. More specific indications will be given in the following examples. Confidence interval 26th of November / 23

13 The Poisson distribution Suppose X i are drawn from a Poisson distribution with parameter µ. The variance is then also µ. We know that X X n has Poisson distribution with parameter nµ. The CLT tells us that for n large enough, the P o(nµ) distribution is very close to the N(nµ, nµ) distribution; or, in other words, P o(λ) is approximately the same as N(λ, λ) for λ large. How large should it be? Confidence interval 26th of November / 23

14 λ = 1 λ = 4 Density Z λ = 10 Density Density Z λ = 20 Density Z Note that when λ < 2.5 λ or, equivalently, λ < 6.2, the normal curve will have substantial probability below 0.5. Z Confidence interval 26th of November / 23

15 The Bernoulli distribution Bernoulli variables is the name for random variables that are 1 or 0, with probability p or 1 p respectively. Then B = X X n is the number of successes in n trials, with success probability p each time that is, a Bin(n, p) random variable. The CLT implies that Bin(n, p) N(np, np(1 p)) for large values of n. Note that B/n is the proportion of successes in n trials, and this has approximately N(p, p(1 p)/n) distribution. In other words, the observed proportion will be close to p, but will be off by a small multiple of the standard deviation, which shrinks as σ/ n, where σ = p(1 p). Confidence interval 26th of November / 23

16 The Bernoulli distribution How large does n need to be? As in the case of the Poisson distribution, a minimum requirement is that the mean be substantially larger than the standard deviation np np(1 p) so that n 1/p. The condition is symmetric, so we also need n 1/(1 p). This fits with our rule of thumb that n needs to be bigger when the distribution of X is skewed, which is the case when p is close to 0 or 1. Confidence interval 26th of November / 23

17 The Bernoulli distribution When p = 0.5 the normal approximation is quite good, even when n is only 10 when p = 0.1 we have a good normal approximation when n = 100, but not when n is 25. p = 0.5, n = 3 p = 0.5, n = 10 p = 0.5, n = 25 p = 0.5, n = p = 0.1, n = p = 0.1, n = p = 0.1, n = p = 0.1, n = Confidence interval 26th of November / 23

18 CLT for real data Example: household incomes in the state of California in 1999 Challenge: household income has a highly skewed distribution hence a poor candidate for applying the CLT. Histogram of California household income 1999 Averages of 5 California household incomes Averages of 50 California household incomes Density income in $thousands Averages of 2 California household incomes Density Density income in $thousands Averages of 10 California household incomes Density Density income in $thousands Averages of 100 California household incomes Density income in $thousands income in $thousands income in $thousands Confidence interval 26th of November / 23

19 Use of the CLT The CLT enables us to approximate probability distributions by a normal distribution. There are many implications of the Central Limit Theorem. Here we discuss one crucial application: The CLT allows us to compute normal confidence intervals to data that are not themselves normally distributed Confidence interval 26th of November / 23

20 Example: average incomes Suppose we take a random sample of 400 households in Oxford. We find that they have an average income of 36,200, with an SD of 26,400. What can we infer about the average income of all households in Oxford? Answer: Although the distribution of incomes is not normal, the average of 400 incomes is approximatively normally distributed. The SE for the mean is 26400/ 400 = 1320, so a 95% confidence interval for the average income in the population will be ± = ( 33560, 38840). A 99% confidence interval is ± = ( 32800, 39600). Confidence interval 26th of November / 23

21 Confidence intervals for probability of success Example: The Gallup organisation carried out a poll in October, 2005, of Americans attitudes about guns They surveyed 1,012 Americans, chosen at random. 30% said they personally owned a gun. If they d picked different people, purely by chance they would have gotten a somewhat different percentage. What does this survey tell us about the true proportion of Americans who own guns? Here, n = 1, 012 is large so, according to the CLT, the distribution of the proportion of Americans who own guns can be approximated by a N(p, p(1 p)/n) distribution. Confidence interval 26th of November / 23

22 We can compute a 95% confidence interval for the true proportion of Americans who own guns as 0.30 ± 1.96SE where SE can be approximated using the sample mean p 0.3: SE = p(1 p) n = So a 95% confidence interval for the true fraction of Americans who own guns is 0.30 ± = (0.271, 0.329). Loosely put, we can be 95% confident that the true proportion supporting EPP is between 27% and 33%. A 99% confidence interval comes from multiplying by 2.6 instead of 1.96: it goes from 26.3% to 33.7%. Confidence interval 26th of November / 23

23 Summary Definition of confidence interval for the mean when data are normally distributed Approximation theorems in probability the law of large numbers the central limit theorem (CLT) Indications about the conditions of the CLT for different probability distributions Applications of the CLT to compute confidence intervals when data are not normally distributed Reminder: the lecture notes contain more details and more examples; they are available on my website. Confidence interval 26th of November / 23

Fundamental Tools - Probability Theory IV

Fundamental Tools - Probability Theory IV Fundamental Tools - Probability Theory IV MSc Financial Mathematics The University of Warwick October 1, 2015 MSc Financial Mathematics Fundamental Tools - Probability Theory IV 1 / 14 Model-independent

More information

Chapter 15 Sampling Distribution Models

Chapter 15 Sampling Distribution Models Chapter 15 Sampling Distribution Models 1 15.1 Sampling Distribution of a Proportion 2 Sampling About Evolution According to a Gallup poll, 43% believe in evolution. Assume this is true of all Americans.

More information

Chapter 18: Sampling Distributions

Chapter 18: Sampling Distributions Chapter 18: Sampling Distributions All random variables have probability distributions, and as statistics are random variables, they too have distributions. The random phenomenon that produces the statistics

More information

Carolyn Anderson & YoungShil Paek (Slide contributors: Shuai Wang, Yi Zheng, Michael Culbertson, & Haiyan Li)

Carolyn Anderson & YoungShil Paek (Slide contributors: Shuai Wang, Yi Zheng, Michael Culbertson, & Haiyan Li) Carolyn Anderson & YoungShil Paek (Slide contributors: Shuai Wang, Yi Zheng, Michael Culbertson, & Haiyan Li) Department of Educational Psychology University of Illinois at Urbana-Champaign 1 Inferential

More information

Lecture 6: The Normal distribution

Lecture 6: The Normal distribution Lecture 6: The Normal distribution 18th of November 2015 Lecture 6: The Normal distribution 18th of November 2015 1 / 29 Continous data In previous lectures we have considered discrete datasets and discrete

More information

Lecture 27. DATA 8 Spring Sample Averages. Slides created by John DeNero and Ani Adhikari

Lecture 27. DATA 8 Spring Sample Averages. Slides created by John DeNero and Ani Adhikari DATA 8 Spring 2018 Lecture 27 Sample Averages Slides created by John DeNero (denero@berkeley.edu) and Ani Adhikari (adhikari@berkeley.edu) Announcements Questions for This Week How can we quantify natural

More information

The Central Limit Theorem

The Central Limit Theorem The Central Limit Theorem Patrick Breheny September 27 Patrick Breheny University of Iowa Biostatistical Methods I (BIOS 5710) 1 / 31 Kerrich s experiment Introduction 10,000 coin flips Expectation and

More information

ST 371 (IX): Theories of Sampling Distributions

ST 371 (IX): Theories of Sampling Distributions ST 371 (IX): Theories of Sampling Distributions 1 Sample, Population, Parameter and Statistic The major use of inferential statistics is to use information from a sample to infer characteristics about

More information

The Central Limit Theorem

The Central Limit Theorem The Central Limit Theorem Patrick Breheny March 1 Patrick Breheny STA 580: Biostatistics I 1/23 Kerrich s experiment A South African mathematician named John Kerrich was visiting Copenhagen in 1940 when

More information

Central Limit Theorem and the Law of Large Numbers Class 6, Jeremy Orloff and Jonathan Bloom

Central Limit Theorem and the Law of Large Numbers Class 6, Jeremy Orloff and Jonathan Bloom Central Limit Theorem and the Law of Large Numbers Class 6, 8.5 Jeremy Orloff and Jonathan Bloom Learning Goals. Understand the statement of the law of large numbers. 2. Understand the statement of the

More information

Discrete Distributions

Discrete Distributions Discrete Distributions STA 281 Fall 2011 1 Introduction Previously we defined a random variable to be an experiment with numerical outcomes. Often different random variables are related in that they have

More information

Introduction to Statistical Data Analysis Lecture 4: Sampling

Introduction to Statistical Data Analysis Lecture 4: Sampling Introduction to Statistical Data Analysis Lecture 4: Sampling James V. Lambers Department of Mathematics The University of Southern Mississippi James V. Lambers Statistical Data Analysis 1 / 30 Introduction

More information

CS 5014: Research Methods in Computer Science. Bernoulli Distribution. Binomial Distribution. Poisson Distribution. Clifford A. Shaffer.

CS 5014: Research Methods in Computer Science. Bernoulli Distribution. Binomial Distribution. Poisson Distribution. Clifford A. Shaffer. Department of Computer Science Virginia Tech Blacksburg, Virginia Copyright c 2015 by Clifford A. Shaffer Computer Science Title page Computer Science Clifford A. Shaffer Fall 2015 Clifford A. Shaffer

More information

Math/Stat 352 Lecture 10. Section 4.11 The Central Limit Theorem

Math/Stat 352 Lecture 10. Section 4.11 The Central Limit Theorem Math/Stat 352 Lecture 10 Section 4.11 The Central Limit Theorem 1 Summing random variables Summing random variables Summing random variables Generally summation changes the shape of the distribution: range

More information

ACMS Statistics for Life Sciences. Chapter 13: Sampling Distributions

ACMS Statistics for Life Sciences. Chapter 13: Sampling Distributions ACMS 20340 Statistics for Life Sciences Chapter 13: Sampling Distributions Sampling We use information from a sample to infer something about a population. When using random samples and randomized experiments,

More information

Math 10 - Compilation of Sample Exam Questions + Answers

Math 10 - Compilation of Sample Exam Questions + Answers Math 10 - Compilation of Sample Exam Questions + Sample Exam Question 1 We have a population of size N. Let p be the independent probability of a person in the population developing a disease. Answer the

More information

Business Statistics. Lecture 5: Confidence Intervals

Business Statistics. Lecture 5: Confidence Intervals Business Statistics Lecture 5: Confidence Intervals Goals for this Lecture Confidence intervals The t distribution 2 Welcome to Interval Estimation! Moments Mean 815.0340 Std Dev 0.8923 Std Error Mean

More information

ACM 116: Lecture 2. Agenda. Independence. Bayes rule. Discrete random variables Bernoulli distribution Binomial distribution

ACM 116: Lecture 2. Agenda. Independence. Bayes rule. Discrete random variables Bernoulli distribution Binomial distribution 1 ACM 116: Lecture 2 Agenda Independence Bayes rule Discrete random variables Bernoulli distribution Binomial distribution Continuous Random variables The Normal distribution Expected value of a random

More information

Confidence Intervals for Normal Data Spring 2014

Confidence Intervals for Normal Data Spring 2014 Confidence Intervals for Normal Data 18.05 Spring 2014 Agenda Today Review of critical values and quantiles. Computing z, t, χ 2 confidence intervals for normal data. Conceptual view of confidence intervals.

More information

Point Estimation and Confidence Interval

Point Estimation and Confidence Interval Chapter 8 Point Estimation and Confidence Interval 8.1 Point estimator The purpose of point estimation is to use a function of the sample data to estimate the unknown parameter. Definition 8.1 A parameter

More information

Special distributions

Special distributions Special distributions August 22, 2017 STAT 101 Class 4 Slide 1 Outline of Topics 1 Motivation 2 Bernoulli and binomial 3 Poisson 4 Uniform 5 Exponential 6 Normal STAT 101 Class 4 Slide 2 What distributions

More information

Statistics and Quantitative Analysis U4320. Segment 5: Sampling and inference Prof. Sharyn O Halloran

Statistics and Quantitative Analysis U4320. Segment 5: Sampling and inference Prof. Sharyn O Halloran Statistics and Quantitative Analysis U4320 Segment 5: Sampling and inference Prof. Sharyn O Halloran Sampling A. Basics 1. Ways to Describe Data Histograms Frequency Tables, etc. 2. Ways to Characterize

More information

Supporting Australian Mathematics Project. A guide for teachers Years 11 and 12. Probability and statistics: Module 25. Inference for means

Supporting Australian Mathematics Project. A guide for teachers Years 11 and 12. Probability and statistics: Module 25. Inference for means 1 Supporting Australian Mathematics Project 2 3 4 6 7 8 9 1 11 12 A guide for teachers Years 11 and 12 Probability and statistics: Module 2 Inference for means Inference for means A guide for teachers

More information

One-sample categorical data: approximate inference

One-sample categorical data: approximate inference One-sample categorical data: approximate inference Patrick Breheny October 6 Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/25 Introduction It is relatively easy to think about the distribution

More information

Lecture 8 Sampling Theory

Lecture 8 Sampling Theory Lecture 8 Sampling Theory Thais Paiva STA 111 - Summer 2013 Term II July 11, 2013 1 / 25 Thais Paiva STA 111 - Summer 2013 Term II Lecture 8, 07/11/2013 Lecture Plan 1 Sampling Distributions 2 Law of Large

More information

Sampling Distribution Models. Chapter 17

Sampling Distribution Models. Chapter 17 Sampling Distribution Models Chapter 17 Objectives: 1. Sampling Distribution Model 2. Sampling Variability (sampling error) 3. Sampling Distribution Model for a Proportion 4. Central Limit Theorem 5. Sampling

More information

Confidence Intervals for Population Mean

Confidence Intervals for Population Mean Confidence Intervals for Population Mean Reading: Sections 7.1, 7.2, 7.3 Learning Objectives: Students should be able to: Understand the meaning and purpose of confidence intervals Calculate a confidence

More information

Confidence Intervals for Normal Data Spring 2018

Confidence Intervals for Normal Data Spring 2018 Confidence Intervals for Normal Data 18.05 Spring 2018 Agenda Exam on Monday April 30. Practice questions posted. Friday s class is for review (no studio) Today Review of critical values and quantiles.

More information

Lecture 3: Statistical sampling uncertainty

Lecture 3: Statistical sampling uncertainty Lecture 3: Statistical sampling uncertainty c Christopher S. Bretherton Winter 2015 3.1 Central limit theorem (CLT) Let X 1,..., X N be a sequence of N independent identically-distributed (IID) random

More information

Wooldridge, Introductory Econometrics, 4th ed. Appendix C: Fundamentals of mathematical statistics

Wooldridge, Introductory Econometrics, 4th ed. Appendix C: Fundamentals of mathematical statistics Wooldridge, Introductory Econometrics, 4th ed. Appendix C: Fundamentals of mathematical statistics A short review of the principles of mathematical statistics (or, what you should have learned in EC 151).

More information

Are data normally normally distributed?

Are data normally normally distributed? Standard Normal Image source Are data normally normally distributed? Sample mean: 66.78 Sample standard deviation: 3.37 (66.78-1 x 3.37, 66.78 + 1 x 3.37) (66.78-2 x 3.37, 66.78 + 2 x 3.37) (66.78-3 x

More information

Chapter 7: Theoretical Probability Distributions Variable - Measured/Categorized characteristic

Chapter 7: Theoretical Probability Distributions Variable - Measured/Categorized characteristic BSTT523: Pagano & Gavreau, Chapter 7 1 Chapter 7: Theoretical Probability Distributions Variable - Measured/Categorized characteristic Random Variable (R.V.) X Assumes values (x) by chance Discrete R.V.

More information

Expectations. Definition Let X be a discrete rv with set of possible values D and pmf p(x). The expected value or mean value of X, denoted by E(X ) or

Expectations. Definition Let X be a discrete rv with set of possible values D and pmf p(x). The expected value or mean value of X, denoted by E(X ) or Expectations Expectations Definition Let X be a discrete rv with set of possible values D and pmf p(x). The expected value or mean value of X, denoted by E(X ) or µ X, is E(X ) = µ X = x D x p(x) Expectations

More information

Math 124: Modules Overall Goal. Point Estimations. Interval Estimation. Math 124: Modules Overall Goal.

Math 124: Modules Overall Goal. Point Estimations. Interval Estimation. Math 124: Modules Overall Goal. What we will do today s David Meredith Department of Mathematics San Francisco State University October 22, 2009 s 1 2 s 3 What is a? Decision support Political decisions s s Goal of statistics: optimize

More information

Theoretical Foundations

Theoretical Foundations Theoretical Foundations Sampling Distribution and Central Limit Theorem Monia Ranalli monia.ranalli@uniroma3.it Ranalli M. Theoretical Foundations - Sampling Distribution and Central Limit Theorem Lesson

More information

Continuous Random Variables. What continuous random variables are and how to use them. I can give a definition of a continuous random variable.

Continuous Random Variables. What continuous random variables are and how to use them. I can give a definition of a continuous random variable. Continuous Random Variables Today we are learning... What continuous random variables are and how to use them. I will know if I have been successful if... I can give a definition of a continuous random

More information

Management Programme. MS-08: Quantitative Analysis for Managerial Applications

Management Programme. MS-08: Quantitative Analysis for Managerial Applications MS-08 Management Programme ASSIGNMENT SECOND SEMESTER 2013 MS-08: Quantitative Analysis for Managerial Applications School of Management Studies INDIRA GANDHI NATIONAL OPEN UNIVERSITY MAIDAN GARHI, NEW

More information

Lecture 4: Two-point Sampling, Coupon Collector s problem

Lecture 4: Two-point Sampling, Coupon Collector s problem Randomized Algorithms Lecture 4: Two-point Sampling, Coupon Collector s problem Sotiris Nikoletseas Associate Professor CEID - ETY Course 2013-2014 Sotiris Nikoletseas, Associate Professor Randomized Algorithms

More information

PubH 5450 Biostatistics I Prof. Carlin. Lecture 13

PubH 5450 Biostatistics I Prof. Carlin. Lecture 13 PubH 5450 Biostatistics I Prof. Carlin Lecture 13 Outline Outline Sample Size Counts, Rates and Proportions Part I Sample Size Type I Error and Power Type I error rate: probability of rejecting the null

More information

ECO220Y Review and Introduction to Hypothesis Testing Readings: Chapter 12

ECO220Y Review and Introduction to Hypothesis Testing Readings: Chapter 12 ECO220Y Review and Introduction to Hypothesis Testing Readings: Chapter 12 Winter 2012 Lecture 13 (Winter 2011) Estimation Lecture 13 1 / 33 Review of Main Concepts Sampling Distribution of Sample Mean

More information

PHP2510: Principles of Biostatistics & Data Analysis. Lecture X: Hypothesis testing. PHP 2510 Lec 10: Hypothesis testing 1

PHP2510: Principles of Biostatistics & Data Analysis. Lecture X: Hypothesis testing. PHP 2510 Lec 10: Hypothesis testing 1 PHP2510: Principles of Biostatistics & Data Analysis Lecture X: Hypothesis testing PHP 2510 Lec 10: Hypothesis testing 1 In previous lectures we have encountered problems of estimating an unknown population

More information

The t-distribution. Patrick Breheny. October 13. z tests The χ 2 -distribution The t-distribution Summary

The t-distribution. Patrick Breheny. October 13. z tests The χ 2 -distribution The t-distribution Summary Patrick Breheny October 13 Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/25 Introduction Introduction What s wrong with z-tests? So far we ve (thoroughly!) discussed how to carry out hypothesis

More information

Statistical Methods for Astronomy

Statistical Methods for Astronomy Statistical Methods for Astronomy Probability (Lecture 1) Statistics (Lecture 2) Why do we need statistics? Useful Statistics Definitions Error Analysis Probability distributions Error Propagation Binomial

More information

Continuous Probability Distributions

Continuous Probability Distributions 1 Chapter 5 Continuous Probability Distributions 5.1 Probability density function Example 5.1.1. Revisit Example 3.1.1. 11 12 13 14 15 16 21 22 23 24 25 26 S = 31 32 33 34 35 36 41 42 43 44 45 46 (5.1.1)

More information

Test 3 SOLUTIONS. x P(x) xp(x)

Test 3 SOLUTIONS. x P(x) xp(x) 16 1. A couple of weeks ago in class, each of you took three quizzes where you randomly guessed the answers to each question. There were eight questions on each quiz, and four possible answers to each

More information

Statistics and parameters

Statistics and parameters Statistics and parameters Tables, histograms and other charts are used to summarize large amounts of data. Often, an even more extreme summary is desirable. Statistics and parameters are numbers that characterize

More information

MAS113 Introduction to Probability and Statistics

MAS113 Introduction to Probability and Statistics MAS113 Introduction to Probability and Statistics School of Mathematics and Statistics, University of Sheffield 2018 19 Identically distributed Suppose we have n random variables X 1, X 2,..., X n. Identically

More information

Central Theorems Chris Piech CS109, Stanford University

Central Theorems Chris Piech CS109, Stanford University Central Theorems Chris Piech CS109, Stanford University Silence!! And now a moment of silence......before we present......a beautiful result of probability theory! Four Prototypical Trajectories Central

More information

1 Binomial Probability [15 points]

1 Binomial Probability [15 points] Economics 250 Assignment 2 (Due November 13, 2017, in class) i) You should do the assignment on your own, Not group work! ii) Submit the completed work in class on the due date. iii) Remember to include

More information

MATH1231 Algebra, 2017 Chapter 9: Probability and Statistics

MATH1231 Algebra, 2017 Chapter 9: Probability and Statistics MATH1231 Algebra, 2017 Chapter 9: Probability and Statistics A/Prof. Daniel Chan School of Mathematics and Statistics University of New South Wales danielc@unsw.edu.au Daniel Chan (UNSW) MATH1231 Algebra

More information

CSE 103 Homework 8: Solutions November 30, var(x) = np(1 p) = P r( X ) 0.95 P r( X ) 0.

CSE 103 Homework 8: Solutions November 30, var(x) = np(1 p) = P r( X ) 0.95 P r( X ) 0. () () a. X is a binomial distribution with n = 000, p = /6 b. The expected value, variance, and standard deviation of X is: E(X) = np = 000 = 000 6 var(x) = np( p) = 000 5 6 666 stdev(x) = np( p) = 000

More information

To understand and analyze this test, we need to have the right model for the events. We need to identify an event and its probability.

To understand and analyze this test, we need to have the right model for the events. We need to identify an event and its probability. Probabilistic Models Example #1 A production lot of 10,000 parts is tested for defects. It is expected that a defective part occurs once in every 1,000 parts. A sample of 500 is tested, with 2 defective

More information

STAT 4385 Topic 01: Introduction & Review

STAT 4385 Topic 01: Introduction & Review STAT 4385 Topic 01: Introduction & Review Xiaogang Su, Ph.D. Department of Mathematical Science University of Texas at El Paso xsu@utep.edu Spring, 2016 Outline Welcome What is Regression Analysis? Basics

More information

Stat 101: Lecture 12. Summer 2006

Stat 101: Lecture 12. Summer 2006 Stat 101: Lecture 12 Summer 2006 Outline Answer Questions More on the CLT The Finite Population Correction Factor Confidence Intervals Problems More on the CLT Recall the Central Limit Theorem for averages:

More information

3.4. The Binomial Probability Distribution

3.4. The Binomial Probability Distribution 3.4. The Binomial Probability Distribution Objectives. Binomial experiment. Binomial random variable. Using binomial tables. Mean and variance of binomial distribution. 3.4.1. Four Conditions that determined

More information

TEACHER NOTES FOR YEAR 12 MATHEMATICAL METHODS

TEACHER NOTES FOR YEAR 12 MATHEMATICAL METHODS TEACHER NOTES FOR YEAR 12 MATHEMATICAL METHODS 21 November 2016 CHAPTER 1: FUNCTIONS A Exponential functions Topic 1 Unit 3 Sub-topic 1.3 Topic 1 B Logarithms Topic 4 Unit 4 C Logarithmic functions Sub-topics

More information

Lecture 01: Introduction

Lecture 01: Introduction Lecture 01: Introduction Dipankar Bandyopadhyay, Ph.D. BMTRY 711: Analysis of Categorical Data Spring 2011 Division of Biostatistics and Epidemiology Medical University of South Carolina Lecture 01: Introduction

More information

Binomial random variable

Binomial random variable Binomial random variable Toss a coin with prob p of Heads n times X: # Heads in n tosses X is a Binomial random variable with parameter n,p. X is Bin(n, p) An X that counts the number of successes in many

More information

Sampling Distributions of Statistics Corresponds to Chapter 5 of Tamhane and Dunlop

Sampling Distributions of Statistics Corresponds to Chapter 5 of Tamhane and Dunlop Sampling Distributions of Statistics Corresponds to Chapter 5 of Tamhane and Dunlop Slides prepared by Elizabeth Newton (MIT), with some slides by Jacqueline Telford (Johns Hopkins University) 1 Sampling

More information

Advanced Herd Management Probabilities and distributions

Advanced Herd Management Probabilities and distributions Advanced Herd Management Probabilities and distributions Anders Ringgaard Kristensen Slide 1 Outline Probabilities Conditional probabilities Bayes theorem Distributions Discrete Continuous Distribution

More information

Introduction and Overview STAT 421, SP Course Instructor

Introduction and Overview STAT 421, SP Course Instructor Introduction and Overview STAT 421, SP 212 Prof. Prem K. Goel Mon, Wed, Fri 3:3PM 4:48PM Postle Hall 118 Course Instructor Prof. Goel, Prem E mail: goel.1@osu.edu Office: CH 24C (Cockins Hall) Phone: 614

More information

13. Sampling distributions

13. Sampling distributions 13. Sampling distributions The Practice of Statistics in the Life Sciences Third Edition 2014 W. H. Freeman and Company Objectives (PSLS Chapter 13) Sampling distributions Parameter versus statistic Sampling

More information

Lecture 18: Central Limit Theorem. Lisa Yan August 6, 2018

Lecture 18: Central Limit Theorem. Lisa Yan August 6, 2018 Lecture 18: Central Limit Theorem Lisa Yan August 6, 2018 Announcements PS5 due today Pain poll PS6 out today Due next Monday 8/13 (1:30pm) (will not be accepted after Wed 8/15) Programming part: Java,

More information

Scientific Measurement

Scientific Measurement Scientific Measurement SPA-4103 Dr Alston J Misquitta Lecture 5 - The Binomial Probability Distribution Binomial Distribution Probability of k successes in n trials. Gaussian/Normal Distribution Poisson

More information

Probability. Hosung Sohn

Probability. Hosung Sohn Probability Hosung Sohn Department of Public Administration and International Affairs Maxwell School of Citizenship and Public Affairs Syracuse University Lecture Slide 4-3 (October 8, 2015) 1/ 43 Table

More information

STA1000F Summary. Mitch Myburgh MYBMIT001 May 28, Work Unit 1: Introducing Probability

STA1000F Summary. Mitch Myburgh MYBMIT001 May 28, Work Unit 1: Introducing Probability STA1000F Summary Mitch Myburgh MYBMIT001 May 28, 2015 1 Module 1: Probability 1.1 Work Unit 1: Introducing Probability 1.1.1 Definitions 1. Random Experiment: A procedure whose outcome (result) in a particular

More information

STA Why Sampling? Module 6 The Sampling Distributions. Module Objectives

STA Why Sampling? Module 6 The Sampling Distributions. Module Objectives STA 2023 Module 6 The Sampling Distributions Module Objectives In this module, we will learn the following: 1. Define sampling error and explain the need for sampling distributions. 2. Recognize that sampling

More information

Business Statistics. Lecture 9: Simple Regression

Business Statistics. Lecture 9: Simple Regression Business Statistics Lecture 9: Simple Regression 1 On to Model Building! Up to now, class was about descriptive and inferential statistics Numerical and graphical summaries of data Confidence intervals

More information

Probability theory and inference statistics! Dr. Paola Grosso! SNE research group!! (preferred!)!!

Probability theory and inference statistics! Dr. Paola Grosso! SNE research group!!  (preferred!)!! Probability theory and inference statistics Dr. Paola Grosso SNE research group p.grosso@uva.nl paola.grosso@os3.nl (preferred) Roadmap Lecture 1: Monday Sep. 22nd Collecting data Presenting data Descriptive

More information

Preliminary Statistics Lecture 5: Hypothesis Testing (Outline)

Preliminary Statistics Lecture 5: Hypothesis Testing (Outline) 1 School of Oriental and African Studies September 2015 Department of Economics Preliminary Statistics Lecture 5: Hypothesis Testing (Outline) Gujarati D. Basic Econometrics, Appendix A.8 Barrow M. Statistics

More information

Chapter 4 - Lecture 3 The Normal Distribution

Chapter 4 - Lecture 3 The Normal Distribution Chapter 4 - Lecture 3 The October 28th, 2009 Chapter 4 - Lecture 3 The Standard Chapter 4 - Lecture 3 The Standard Normal distribution is a statistical unicorn It is the most important distribution in

More information

MA 1125 Lecture 15 - The Standard Normal Distribution. Friday, October 6, Objectives: Introduce the standard normal distribution and table.

MA 1125 Lecture 15 - The Standard Normal Distribution. Friday, October 6, Objectives: Introduce the standard normal distribution and table. MA 1125 Lecture 15 - The Standard Normal Distribution Friday, October 6, 2017. Objectives: Introduce the standard normal distribution and table. 1. The Standard Normal Distribution We ve been looking at

More information

Example continued. Math 425 Intro to Probability Lecture 37. Example continued. Example

Example continued. Math 425 Intro to Probability Lecture 37. Example continued. Example continued : Coin tossing Math 425 Intro to Probability Lecture 37 Kenneth Harris kaharri@umich.edu Department of Mathematics University of Michigan April 8, 2009 Consider a Bernoulli trials process with

More information

Introduction to Statistical Analysis. Cancer Research UK 12 th of February 2018 D.-L. Couturier / M. Eldridge / M. Fernandes [Bioinformatics core]

Introduction to Statistical Analysis. Cancer Research UK 12 th of February 2018 D.-L. Couturier / M. Eldridge / M. Fernandes [Bioinformatics core] Introduction to Statistical Analysis Cancer Research UK 12 th of February 2018 D.-L. Couturier / M. Eldridge / M. Fernandes [Bioinformatics core] 2 Timeline 9:30 Morning I I 45mn Lecture: data type, summary

More information

Chapter 18. Sampling Distribution Models. Copyright 2010, 2007, 2004 Pearson Education, Inc.

Chapter 18. Sampling Distribution Models. Copyright 2010, 2007, 2004 Pearson Education, Inc. Chapter 18 Sampling Distribution Models Copyright 2010, 2007, 2004 Pearson Education, Inc. Normal Model When we talk about one data value and the Normal model we used the notation: N(μ, σ) Copyright 2010,

More information

CMPSCI 240: Reasoning Under Uncertainty

CMPSCI 240: Reasoning Under Uncertainty CMPSCI 240: Reasoning Under Uncertainty Lecture 7 Prof. Hanna Wallach wallach@cs.umass.edu February 14, 2012 Reminders Check the course website: http://www.cs.umass.edu/ ~wallach/courses/s12/cmpsci240/

More information

Probability Distributions Columns (a) through (d)

Probability Distributions Columns (a) through (d) Discrete Probability Distributions Columns (a) through (d) Probability Mass Distribution Description Notes Notation or Density Function --------------------(PMF or PDF)-------------------- (a) (b) (c)

More information

Occupy movement - Duke edition. Lecture 14: Large sample inference for proportions. Exploratory analysis. Another poll on the movement

Occupy movement - Duke edition. Lecture 14: Large sample inference for proportions. Exploratory analysis. Another poll on the movement Occupy movement - Duke edition Lecture 14: Large sample inference for proportions Statistics 101 Mine Çetinkaya-Rundel October 20, 2011 On Tuesday we asked you about how closely you re following the news

More information

Random Variable. Discrete Random Variable. Continuous Random Variable. Discrete Random Variable. Discrete Probability Distribution

Random Variable. Discrete Random Variable. Continuous Random Variable. Discrete Random Variable. Discrete Probability Distribution Random Variable Theoretical Probability Distribution Random Variable Discrete Probability Distributions A variable that assumes a numerical description for the outcome of a random eperiment (by chance).

More information

Limiting Distributions

Limiting Distributions Limiting Distributions We introduce the mode of convergence for a sequence of random variables, and discuss the convergence in probability and in distribution. The concept of convergence leads us to the

More information

p = q ˆ = 1 -ˆp = sample proportion of failures in a sample size of n x n Chapter 7 Estimates and Sample Sizes

p = q ˆ = 1 -ˆp = sample proportion of failures in a sample size of n x n Chapter 7 Estimates and Sample Sizes Chapter 7 Estimates and Sample Sizes 7-1 Overview 7-2 Estimating a Population Proportion 7-3 Estimating a Population Mean: σ Known 7-4 Estimating a Population Mean: σ Not Known 7-5 Estimating a Population

More information

Stats for Engineers: Lecture 4

Stats for Engineers: Lecture 4 Stats for Engineers: Lecture 4 Summary from last time Standard deviation σ measure spread of distribution μ Variance = (standard deviation) σ = var X = k μ P(X = k) k = k P X = k k μ σ σ k Discrete Random

More information

Histograms, Central Tendency, and Variability

Histograms, Central Tendency, and Variability The Economist, September 6, 214 1 Histograms, Central Tendency, and Variability Lecture 2 Reading: Sections 5 5.6 Includes ALL margin notes and boxes: For Example, Guided Example, Notation Alert, Just

More information

Week 1 Quantitative Analysis of Financial Markets Distributions A

Week 1 Quantitative Analysis of Financial Markets Distributions A Week 1 Quantitative Analysis of Financial Markets Distributions A Christopher Ting http://www.mysmu.edu/faculty/christophert/ Christopher Ting : christopherting@smu.edu.sg : 6828 0364 : LKCSB 5036 October

More information

AP Statistics Review Ch. 7

AP Statistics Review Ch. 7 AP Statistics Review Ch. 7 Name 1. Which of the following best describes what is meant by the term sampling variability? A. There are many different methods for selecting a sample. B. Two different samples

More information

Chapter 18. Sampling Distribution Models /51

Chapter 18. Sampling Distribution Models /51 Chapter 18 Sampling Distribution Models 1 /51 Homework p432 2, 4, 6, 8, 10, 16, 17, 20, 30, 36, 41 2 /51 3 /51 Objective Students calculate values of central 4 /51 The Central Limit Theorem for Sample

More information

Random Models. Tusheng Zhang. February 14, 2013

Random Models. Tusheng Zhang. February 14, 2013 Random Models Tusheng Zhang February 14, 013 1 Introduction In this module, we will introduce some random models which have many real life applications. The course consists of four parts. 1. A brief review

More information

1-1. Chapter 1. Sampling and Descriptive Statistics by The McGraw-Hill Companies, Inc. All rights reserved.

1-1. Chapter 1. Sampling and Descriptive Statistics by The McGraw-Hill Companies, Inc. All rights reserved. 1-1 Chapter 1 Sampling and Descriptive Statistics 1-2 Why Statistics? Deal with uncertainty in repeated scientific measurements Draw conclusions from data Design valid experiments and draw reliable conclusions

More information

Limiting Distributions

Limiting Distributions We introduce the mode of convergence for a sequence of random variables, and discuss the convergence in probability and in distribution. The concept of convergence leads us to the two fundamental results

More information

Stat Lecture 20. Last class we introduced the covariance and correlation between two jointly distributed random variables.

Stat Lecture 20. Last class we introduced the covariance and correlation between two jointly distributed random variables. Stat 260 - Lecture 20 Recap of Last Class Last class we introduced the covariance and correlation between two jointly distributed random variables. Today: We will introduce the idea of a statistic and

More information

IV. The Normal Distribution

IV. The Normal Distribution IV. The Normal Distribution The normal distribution (a.k.a., the Gaussian distribution or bell curve ) is the by far the best known random distribution. It s discovery has had such a far-reaching impact

More information

Chapter 5 Confidence Intervals

Chapter 5 Confidence Intervals Chapter 5 Confidence Intervals Confidence Intervals about a Population Mean, σ, Known Abbas Motamedi Tennessee Tech University A point estimate: a single number, calculated from a set of data, that is

More information

Chapter 9 Regression. 9.1 Simple linear regression Linear models Least squares Predictions and residuals.

Chapter 9 Regression. 9.1 Simple linear regression Linear models Least squares Predictions and residuals. 9.1 Simple linear regression 9.1.1 Linear models Response and eplanatory variables Chapter 9 Regression With bivariate data, it is often useful to predict the value of one variable (the response variable,

More information

IV. The Normal Distribution

IV. The Normal Distribution IV. The Normal Distribution The normal distribution (a.k.a., a the Gaussian distribution or bell curve ) is the by far the best known random distribution. It s discovery has had such a far-reaching impact

More information

STA 291 Lecture 16. Normal distributions: ( mean and SD ) use table or web page. The sampling distribution of and are both (approximately) normal

STA 291 Lecture 16. Normal distributions: ( mean and SD ) use table or web page. The sampling distribution of and are both (approximately) normal STA 291 Lecture 16 Normal distributions: ( mean and SD ) use table or web page. The sampling distribution of and are both (approximately) normal X STA 291 - Lecture 16 1 Sampling Distributions Sampling

More information

1. Poisson distribution is widely used in statistics for modeling rare events.

1. Poisson distribution is widely used in statistics for modeling rare events. Discrete probability distributions - Class 5 January 20, 2014 Debdeep Pati Poisson distribution 1. Poisson distribution is widely used in statistics for modeling rare events. 2. Ex. Infectious Disease

More information

1 Exercises for lecture 1

1 Exercises for lecture 1 1 Exercises for lecture 1 Exercise 1 a) Show that if F is symmetric with respect to µ, and E( X )

More information

Probabilities and distributions

Probabilities and distributions and s s Johan A. Elkink School of Politics & International Relations University College Dublin 10 October 2016 1 s 2 s 3 4 Outline s 1 2 s 3 4 : definition s Frequentist approach: a probability is the

More information

4. Discrete Probability Distributions. Introduction & Binomial Distribution

4. Discrete Probability Distributions. Introduction & Binomial Distribution 4. Discrete Probability Distributions Introduction & Binomial Distribution Aim & Objectives 1 Aims u Introduce discrete probability distributions v Binomial distribution v Poisson distribution 2 Objectives

More information

Continuous Probability Distributions

Continuous Probability Distributions 1 Chapter 5 Continuous Probability Distributions 5.1 Probability density function Example 5.1.1. Revisit Example 3.1.1. 11 12 13 14 15 16 21 22 23 24 25 26 S = 31 32 33 34 35 36 41 42 43 44 45 46 (5.1.1)

More information