Confidence Intervals 1

Size: px
Start display at page:

Download "Confidence Intervals 1"

Transcription

1 Confidence Intervals 1 November 1, HMS, 2017, v1.1

2 Chapter References Diez: Chapter 4.2 Navidi, Chapter 5.0, 5.1, (Self read, 5.2), 5.3, 5.4, 5.6, not 5.7, 5.8 Chapter References 2

3 Terminology Point Estimate A sample statistic used to estimate the value of a population parameter. 1. Provides a single value: Based on observations from 1 sample, there is no sampling distribution. 2. Information: Gives no information about how close the value is to the unknown population parameter. 3. Example: Sample mean x is the point estimate of the unknown population mean. Confidence Intervals 3

4 Terminology Confidence interval (interval estimate) A range of values defined by the confidence level within which the population parameter is estimated to fall. 1. Provides a range of values. 2. Information: Gives information about closeness to unknown population parameter 3. Example: Unknown population mean lies between 50 and 70 with 95% confidence Confidence Level The likelihood, expressed as a percentage or a probability, that a specified interval will contain the population parameter. Confidence Intervals 4

5 Interval Estimation A probability that the population parameter falls somewhere within the interval. µ x = X ± error Confidence Intervals 5

6 Error Bars We ve all seen errors bars: One standard deviation from the mean of nine independently evolved populations. Designing and engineering evolutionary robust genetic circuits Journal of Biological Engineering 2010, 4:12 Confidence Intervals 6

7 Error Bars Error Bar Type Description Formula Range Descriptive Distance between extremes Highest and of data lowest points Standard Deviation Descriptive Average deviation SD from the mean Standard Error Inferential Variability of the mean SD/ n Confidence Limit Inferential Range of values you can be 95% confidence contains the true mean To be determined Confidence Intervals 7

8 Confidence Limits Recall what the standard error means: The standard error gives the standard deviation of the distribution of the mean. We are 68% confident that the mean is within the limits ±SE. But we could do better, we could widen the limits to 95%. Confidence Intervals 8

9 Confidence Limits The standard error of the mean can be interpreted as: If we were to take another sample from the population and compute its mean, there is a 68% chance that the mean of the sample will lie within 1 standard error of the population mean. But 68% is not that big, better to use a wider range - a given range is referred to as the confidence level. Confidence Intervals 9

10 Confidence Limits Confidence Level: The likelihood, expressed as a percentage or a probability, that a specified interval will contain the population parameter. 95% confidence level there is a 0.95 probability that a specified interval does contain the population mean. In other words, there are 5 chances out of 100 (or 1 chance out of 20) that the interval does not contain the population mean. Two σ 99% confidence level there is 1 chance out of 100 that the interval does not contains the population mean. Three σ Confidence Intervals 10

11 Confidence Limits Confidence Intervals 11

12 Confidence Limits How far out is 95% of the area on a normal curve? This means there is a 2.5% area on both sides of the normal curve. Let α = 0.05 i.e 5% Confidence Intervals 12

13 Confidence Limits To find the z-value we look up 97.5 in the standard normal cumulative probability table and we get 1.96 Confidence Intervals 13

14 Confidence Limits z Value Percentage Area % % % % Confidence Intervals 14

15 Confidence Limits In other words, to 1.96 on the z scale represents 95% of the area: or (and this is the critical point) 95% is bounded by ±1.96 SE Confidence Intervals 15

16 Confidence Limits We can therefore state that for 95% of the time, the mean will be bounded by: µ ± 1.96 σ n However, we don t actually have the population mean, µ, or the population standard deviation, σ. Instead we use the sample mean and standard deviation as proxies: x ± 1.96 s n Hence the limits are actually only approximate. Moreover the approximation get worse as the sample size gets smaller. Confidence Intervals 16

17 Confidence Limits You ll also see the confidence level expressed as: z α/2 for 95% confidence, 1.96 z α/3 for 99.7% confidence, 3 Note: The 99.0% level is actually bounded by The z score changes rapidly at the limits of the normal curve. Confidence Intervals 17

18 Exercise Find the z limits for a 90% confidence level. Confidence Intervals 18

19 Example The mean birth weights for 200 babies is 3.28 Kg grams with a population standard deviation of 0.85 Kg. Compute the 95% confidence limits for the mean birth weight. x = 3.28 ± = 3.28 ± 0.12 Kg Confidence Intervals 19

20 Example The mean concentration for a sample of 100 insulin vials is 15 grams/vial with a population standard deviation of 3.4 grams. Compute the 90% confidence limits for the mean concentration of insulin. x = 15 ± = 15 ± 0.56 grams/vial Confidence Intervals 20

21 Summary Large sample confidence interval for a population mean: General Formula: x ± (z critical value) σ n Levels of confidence and corresponding z critical value: 99% % % Since n is large the unknown σ can be replaced by the sample value s: x ± (z critical value) s n Confidence Intervals 21

22 Problems in Paradise You may have noticed in the previous examples that the samples were relatively large. This was to ensure that the means and standard deviations were reasonable representatives of the population measures. In fact the problems stated that the standard deviation was in fact the population standard deviation. For small samples 30 we have to make a slight modification to the procedure. Confidence Intervals 22

23 Student s t Distribution For large samples we assume that the means are distributed as: X N(µ, σ 2 /n) and that the standardized distribution (X µ)/(σ/ n) has a normal distribution with mean 0 and variance 1. However, when the sample size is small, there is significant error in the estimate for the population standard deviation, σ, because we will often estimate the population standard deviation from the sample standard deviation, s. In 1908 Gossett proposed that the quantity (X µ)/(s/ n) was in fact distributed via a different distribution which he called the Student s t distribution. Confidence Intervals 23

24 Student s t Distribution Definition: Let X 1,... be a small sample (n 30) from a normal population with mean µ. Then the quantity: X µ s/ n has a Student s t distribution with n 1 degrees of freedom, denoted, t n 1 Note: The t distribution is a function of the sample size minus 1. Confidence Intervals 24

25 Confidence Limits For small sample sizes, use the t distribution instead of the standard normal distribution. The t distribution is a symmetrical distribution whose probability density function is defined by a single parameter known as the degrees of freedom (df). Example, if the sample size if 19, then the degrees of freedom will be 18. Confidence Intervals 25

26 Confidence Limits A larger portion of the probability area is in the tails compare to the standard normal distribution. This in turn means the confidence limits computed using the t distribution will be larger. Confidence Intervals 26

27 Confidence Limits for Small Samples The confidence limits for a small sample is given by: s X ± t n 1,α/2 n where n is the sample size, n 1 the degrees of freedom, α/2 the confidence level (eg 0.05/2 = 95%) Just as there are z tables there are also t tables Confidence Intervals 27

28 t Tables Confidence Intervals 28

29 t Tables Confidence Intervals 29

30 t Tables: One Tailed and Two Tailed Tables Confidence Intervals 30

31 t Tables: How to use the t Table Confidence Intervals 31

32 t Tables: Example Six vials of penicillin were randomly selected and the concentration of penicillin was determined in each vial in mg/ml to be: 8.6, 9.7, 13.4, 11.4, 10.2, 12.3 Find the 95% confidence limits for the true mean concentration of penicillin. Confidence Intervals 32

33 t Tables: Example Since we are dealing with less than a sample size of 30, we will use the t-statistic to determine the confidence limits. n = 6 x = Sample standard deviation = 1.77 X is the random variable that represents the mean. s X ± t n 1,α/2 n Confidence Intervals 33

34 t Tables: Example n 1 = 6 1 = 5 α/2 = 0.05/2 = Note: α is the area outside the critical bounds = 0.05 In the t table we will look for row 5, and the column marked Note that 0l025 is the areas of a single tail therefore we will use the single tailed table. Confidence Intervals 34

35 t Tables: Example Therefore: t 5,α/2=0.025 = µ = ± = ± 1.86 For the z-statistic the critical value would be 1.96, therefore the range has widened with the t-statistic. Confidence Intervals 35

36 Confidence Intervals for Difference between two Means Consider two samples from two different populations. What is the confidence limit for the difference in the two means? i.e µ X µ Y From previous lectures on combining means and variances we know that: X Y N(µ X µ Y, σ 2 X + σ 2 Y ) That is the difference is also normally distributed but with different mean and variance. Confidence Intervals 36

37 Confidence Intervals for Difference between two Means Consider a mean X and Y that have standard errors: σ X n1 and σ Y n2 where n 1 and n 2 are the sizes of the corresponding samples. Then the difference X Y : Mean = µ X µ Y and variance σ 2 X n1 + σ2 Y n2 Given the new variance and mean we can compute the 95% confidence limit as: σx 2 µ X µ Y ± σ2 Y n1 n2 Confidence Intervals 37

38 Confidence Intervals for Difference between two Means If the samples are small then the confidence limits for the difference in two means requires the use of the t distribution as before. The main complication is the calculation of the degrees of freedom but its difficult to do. I refer you to section 5.6 in Navidi for details. Confidence Intervals 38

39 Using Simulation to Estimate Confidence Limits All the methods so far assume the sample is obtained from a population that is normal or near normal. What happens if the population is not normal? Confidence Intervals 39

40 Using Simulation to Estimate Confidence Limits Consider a series of gene expression rates measures from seven cultures of E. coli. The data are as follows: 7.69, 4.97, 4.56, 6.49, 4.34, 6.24, 4.45 Using the following code a norm Q-Q or probability plot was made: import numpy as np import pylab import scipy.stats as stats measurements = [7.69, 4.97, 4.56, 6.49, 4.34, 6.24, 4.45 ] stats.probplot(measurements, dist="norm", plot=pylab) pylab.show() Confidence Intervals 40

41 Using Simulation to Estimate Confidence Limits The data does not appear to be normally distributed. Confidence Intervals 41

42 Using Simulation to Estimate Confidence Limits To find 95% confidence intervals for this data we must create synthetic data sets using a Bootstrap. We will create a new sample by drawing at random (and with replacement) values from the measured sample. For example the following could be a new synthetic sample: 6.49, 4.97, 4.34, 6.24, 7.69, 6.49, 4.34 Because of replacement it is possible we could pick the same value multiple times. We do this 100,000 times in order to create 100,000 synthetic data sets. We compute the mean for each synthetic data set to generate a population of means. We can use this population to work out the distribution and hence the confidence limits on the mean. Confidence Intervals 42

43 Using Simulation to Estimate Confidence Limits Sort the sample means from low to high Find the 2.5% and 97.5% percentiles. The interval will contain 95% of the data. These calculations can be easily done in Python. Confidence Intervals 43

44 Using Simulation to Estimate Confidence Limits import numpy as np import scipy.stats as stats import random measurements = [7.69, 4.97, 4.56, 6.49, 4.34, 6.24, 4.45 ] ensemble = [] for i in range (100000): sample = [] for j in range (7): sample.append (measurements [random.randint (0,6)]) ensemble.append (np.mean (sample)) np.sort (ensemble) p1 = np.percentile(ensemble, 2.5) p2 = np.percentile(ensemble, 97.5) print p1, p2 Confidence Intervals 44

45 Using Simulation to Estimate Confidence Limits Running the python script yields the values: That is the 95% confidence limits on the gene expression is: 4.72 to 6.46 Confidence Intervals 45

Review. One-way ANOVA, I. What s coming up. Multiple comparisons

Review. One-way ANOVA, I. What s coming up. Multiple comparisons Review One-way ANOVA, I 9.07 /15/00 Earlier in this class, we talked about twosample z- and t-tests for the difference between two conditions of an independent variable Does a trial drug work better than

More information

STAT 135 Lab 5 Bootstrapping and Hypothesis Testing

STAT 135 Lab 5 Bootstrapping and Hypothesis Testing STAT 135 Lab 5 Bootstrapping and Hypothesis Testing Rebecca Barter March 2, 2015 The Bootstrap Bootstrap Suppose that we are interested in estimating a parameter θ from some population with members x 1,...,

More information

Two-Sample Inferential Statistics

Two-Sample Inferential Statistics The t Test for Two Independent Samples 1 Two-Sample Inferential Statistics In an experiment there are two or more conditions One condition is often called the control condition in which the treatment is

More information

The t-statistic. Student s t Test

The t-statistic. Student s t Test The t-statistic 1 Student s t Test When the population standard deviation is not known, you cannot use a z score hypothesis test Use Student s t test instead Student s t, or t test is, conceptually, very

More information

Psychology 282 Lecture #4 Outline Inferences in SLR

Psychology 282 Lecture #4 Outline Inferences in SLR Psychology 282 Lecture #4 Outline Inferences in SLR Assumptions To this point we have not had to make any distributional assumptions. Principle of least squares requires no assumptions. Can use correlations

More information

Regression Analysis: Basic Concepts

Regression Analysis: Basic Concepts The simple linear model Regression Analysis: Basic Concepts Allin Cottrell Represents the dependent variable, y i, as a linear function of one independent variable, x i, subject to a random disturbance

More information

COSC 341 Human Computer Interaction. Dr. Bowen Hui University of British Columbia Okanagan

COSC 341 Human Computer Interaction. Dr. Bowen Hui University of British Columbia Okanagan COSC 341 Human Computer Interaction Dr. Bowen Hui University of British Columbia Okanagan 1 Last Topic Distribution of means When it is needed How to build one (from scratch) Determining the characteristics

More information

CS 5014: Research Methods in Computer Science. Bernoulli Distribution. Binomial Distribution. Poisson Distribution. Clifford A. Shaffer.

CS 5014: Research Methods in Computer Science. Bernoulli Distribution. Binomial Distribution. Poisson Distribution. Clifford A. Shaffer. Department of Computer Science Virginia Tech Blacksburg, Virginia Copyright c 2015 by Clifford A. Shaffer Computer Science Title page Computer Science Clifford A. Shaffer Fall 2015 Clifford A. Shaffer

More information

Questions 3.83, 6.11, 6.12, 6.17, 6.25, 6.29, 6.33, 6.35, 6.50, 6.51, 6.53, 6.55, 6.59, 6.60, 6.65, 6.69, 6.70, 6.77, 6.79, 6.89, 6.

Questions 3.83, 6.11, 6.12, 6.17, 6.25, 6.29, 6.33, 6.35, 6.50, 6.51, 6.53, 6.55, 6.59, 6.60, 6.65, 6.69, 6.70, 6.77, 6.79, 6.89, 6. Chapter 7 Reading 7.1, 7.2 Questions 3.83, 6.11, 6.12, 6.17, 6.25, 6.29, 6.33, 6.35, 6.50, 6.51, 6.53, 6.55, 6.59, 6.60, 6.65, 6.69, 6.70, 6.77, 6.79, 6.89, 6.112 Introduction In Chapter 5 and 6, we emphasized

More information

Business Statistics. Lecture 5: Confidence Intervals

Business Statistics. Lecture 5: Confidence Intervals Business Statistics Lecture 5: Confidence Intervals Goals for this Lecture Confidence intervals The t distribution 2 Welcome to Interval Estimation! Moments Mean 815.0340 Std Dev 0.8923 Std Error Mean

More information

Probability and Statistics Notes

Probability and Statistics Notes Probability and Statistics Notes Chapter Seven Jesse Crawford Department of Mathematics Tarleton State University Spring 2011 (Tarleton State University) Chapter Seven Notes Spring 2011 1 / 42 Outline

More information

ACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 2 MATH00040 SEMESTER / Probability

ACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 2 MATH00040 SEMESTER / Probability ACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 2 MATH00040 SEMESTER 2 2017/2018 DR. ANTHONY BROWN 5.1. Introduction to Probability. 5. Probability You are probably familiar with the elementary

More information

Distribution-Free Procedures (Devore Chapter Fifteen)

Distribution-Free Procedures (Devore Chapter Fifteen) Distribution-Free Procedures (Devore Chapter Fifteen) MATH-5-01: Probability and Statistics II Spring 018 Contents 1 Nonparametric Hypothesis Tests 1 1.1 The Wilcoxon Rank Sum Test........... 1 1. Normal

More information

Hypothesis Testing: Chi-Square Test 1

Hypothesis Testing: Chi-Square Test 1 Hypothesis Testing: Chi-Square Test 1 November 9, 2017 1 HMS, 2017, v1.0 Chapter References Diez: Chapter 6.3 Navidi, Chapter 6.10 Chapter References 2 Chi-square Distributions Let X 1, X 2,... X n be

More information

7 Estimation. 7.1 Population and Sample (P.91-92)

7 Estimation. 7.1 Population and Sample (P.91-92) 7 Estimation MATH1015 Biostatistics Week 7 7.1 Population and Sample (P.91-92) Suppose that we wish to study a particular health problem in Australia, for example, the average serum cholesterol level for

More information

Single Sample Means. SOCY601 Alan Neustadtl

Single Sample Means. SOCY601 Alan Neustadtl Single Sample Means SOCY601 Alan Neustadtl The Central Limit Theorem If we have a population measured by a variable with a mean µ and a standard deviation σ, and if all possible random samples of size

More information

IV. The Normal Distribution

IV. The Normal Distribution IV. The Normal Distribution The normal distribution (a.k.a., the Gaussian distribution or bell curve ) is the by far the best known random distribution. It s discovery has had such a far-reaching impact

More information

Sampling and estimation theories

Sampling and estimation theories Chapter 66 Sampling and estimation theories 66.1 Introduction The concepts of elementary sampling theory and estimation theories introduced in this chapter will provide the basis for a more detailed study

More information

16.400/453J Human Factors Engineering. Design of Experiments II

16.400/453J Human Factors Engineering. Design of Experiments II J Human Factors Engineering Design of Experiments II Review Experiment Design and Descriptive Statistics Research question, independent and dependent variables, histograms, box plots, etc. Inferential

More information

Statistical Inference for Means

Statistical Inference for Means Statistical Inference for Means Jamie Monogan University of Georgia February 18, 2011 Jamie Monogan (UGA) Statistical Inference for Means February 18, 2011 1 / 19 Objectives By the end of this meeting,

More information

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series Lecture Slides Elementary Statistics Tenth Edition and the Triola Statistics Series by Mario F. Triola Slide 1 Chapter 7 Estimates and Sample Sizes 7-1 Overview 7-2 Estimating a Population Proportion 7-3

More information

STAT 200 Chapter 1 Looking at Data - Distributions

STAT 200 Chapter 1 Looking at Data - Distributions STAT 200 Chapter 1 Looking at Data - Distributions What is Statistics? Statistics is a science that involves the design of studies, data collection, summarizing and analyzing the data, interpreting the

More information

Section 9.4. Notation. Requirements. Definition. Inferences About Two Means (Matched Pairs) Examples

Section 9.4. Notation. Requirements. Definition. Inferences About Two Means (Matched Pairs) Examples Objective Section 9.4 Inferences About Two Means (Matched Pairs) Compare of two matched-paired means using two samples from each population. Hypothesis Tests and Confidence Intervals of two dependent means

More information

This does not cover everything on the final. Look at the posted practice problems for other topics.

This does not cover everything on the final. Look at the posted practice problems for other topics. Class 7: Review Problems for Final Exam 8.5 Spring 7 This does not cover everything on the final. Look at the posted practice problems for other topics. To save time in class: set up, but do not carry

More information

The independent-means t-test:

The independent-means t-test: The independent-means t-test: Answers the question: is there a "real" difference between the two conditions in my experiment? Or is the difference due to chance? Previous lecture: (a) Dependent-means t-test:

More information

Confidence Intervals. - simply, an interval for which we have a certain confidence.

Confidence Intervals. - simply, an interval for which we have a certain confidence. Confidence Intervals I. What are confidence intervals? - simply, an interval for which we have a certain confidence. - for example, we are 90% certain that an interval contains the true value of something

More information

IV. The Normal Distribution

IV. The Normal Distribution IV. The Normal Distribution The normal distribution (a.k.a., a the Gaussian distribution or bell curve ) is the by far the best known random distribution. It s discovery has had such a far-reaching impact

More information

Confidence intervals

Confidence intervals Confidence intervals We now want to take what we ve learned about sampling distributions and standard errors and construct confidence intervals. What are confidence intervals? Simply an interval for which

More information

The t-distribution. Patrick Breheny. October 13. z tests The χ 2 -distribution The t-distribution Summary

The t-distribution. Patrick Breheny. October 13. z tests The χ 2 -distribution The t-distribution Summary Patrick Breheny October 13 Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/25 Introduction Introduction What s wrong with z-tests? So far we ve (thoroughly!) discussed how to carry out hypothesis

More information

(right tailed) or minus Z α. (left-tailed). For a two-tailed test the critical Z value is going to be.

(right tailed) or minus Z α. (left-tailed). For a two-tailed test the critical Z value is going to be. More Power Stuff What is the statistical power of a hypothesis test? Statistical power is the probability of rejecting the null conditional on the null being false. In mathematical terms it is ( reject

More information

An inferential procedure to use sample data to understand a population Procedures

An inferential procedure to use sample data to understand a population Procedures Hypothesis Test An inferential procedure to use sample data to understand a population Procedures Hypotheses, the alpha value, the critical region (z-scores), statistics, conclusion Two types of errors

More information

Business Statistics: A Decision-Making Approach, 6e. Chapter Goals

Business Statistics: A Decision-Making Approach, 6e. Chapter Goals Chapter 4 Student Lecture Notes 4-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter 4 Using Probability and Probability Distributions Fundamentals of Business Statistics Murali Shanker

More information

a table or a graph or an equation.

a table or a graph or an equation. Topic (8) POPULATION DISTRIBUTIONS 8-1 So far: Topic (8) POPULATION DISTRIBUTIONS We ve seen some ways to summarize a set of data, including numerical summaries. We ve heard a little about how to sample

More information

Multiple t Tests. Introduction to Analysis of Variance. Experiments with More than 2 Conditions

Multiple t Tests. Introduction to Analysis of Variance. Experiments with More than 2 Conditions Introduction to Analysis of Variance 1 Experiments with More than 2 Conditions Often the research that psychologists perform has more conditions than just the control and experimental conditions You might

More information

MATH Chapter 21 Notes Two Sample Problems

MATH Chapter 21 Notes Two Sample Problems MATH 1070 - Chapter 21 Notes Two Sample Problems Recall: So far, we have dealt with inference (confidence intervals and hypothesis testing) pertaining to: Single sample of data. A matched pairs design

More information

PSYC 331 STATISTICS FOR PSYCHOLOGISTS

PSYC 331 STATISTICS FOR PSYCHOLOGISTS PSYC 331 STATISTICS FOR PSYCHOLOGISTS Session 4 A PARAMETRIC STATISTICAL TEST FOR MORE THAN TWO POPULATIONS Lecturer: Dr. Paul Narh Doku, Dept of Psychology, UG Contact Information: pndoku@ug.edu.gh College

More information

Content by Week Week of October 14 27

Content by Week Week of October 14 27 Content by Week Week of October 14 27 Learning objectives By the end of this week, you should be able to: Understand the purpose and interpretation of confidence intervals for the mean, Calculate confidence

More information

Supporting Australian Mathematics Project. A guide for teachers Years 11 and 12. Probability and statistics: Module 25. Inference for means

Supporting Australian Mathematics Project. A guide for teachers Years 11 and 12. Probability and statistics: Module 25. Inference for means 1 Supporting Australian Mathematics Project 2 3 4 6 7 8 9 1 11 12 A guide for teachers Years 11 and 12 Probability and statistics: Module 2 Inference for means Inference for means A guide for teachers

More information

Last two weeks: Sample, population and sampling distributions finished with estimation & confidence intervals

Last two weeks: Sample, population and sampling distributions finished with estimation & confidence intervals Past weeks: Measures of central tendency (mean, mode, median) Measures of dispersion (standard deviation, variance, range, etc). Working with the normal curve Last two weeks: Sample, population and sampling

More information

Introduction to Statistical Data Analysis Lecture 5: Confidence Intervals

Introduction to Statistical Data Analysis Lecture 5: Confidence Intervals Introduction to Statistical Data Analysis Lecture 5: Confidence Intervals James V. Lambers Department of Mathematics The University of Southern Mississippi James V. Lambers Statistical Data Analysis 1

More information

Statistical Intervals (One sample) (Chs )

Statistical Intervals (One sample) (Chs ) 7 Statistical Intervals (One sample) (Chs 8.1-8.3) Confidence Intervals The CLT tells us that as the sample size n increases, the sample mean X is close to normally distributed with expected value µ and

More information

Section 9 2B:!! Using Confidence Intervals to Estimate the Difference ( µ 1 µ 2 ) in Two Population Means using Two Independent Samples.

Section 9 2B:!! Using Confidence Intervals to Estimate the Difference ( µ 1 µ 2 ) in Two Population Means using Two Independent Samples. Section 9 2B:!! Using Confidence Intervals to Estimate the Difference ( µ 1 µ 2 ) in Two Population Means using Two Independent Samples Requirements 1.A random sample of each population is taken. The sample

More information

Confidence intervals CE 311S

Confidence intervals CE 311S CE 311S PREVIEW OF STATISTICS The first part of the class was about probability. P(H) = 0.5 P(T) = 0.5 HTTHHTTTTHHTHTHH If we know how a random process works, what will we see in the field? Preview of

More information

Student s t-distribution. The t-distribution, t-tests, & Measures of Effect Size

Student s t-distribution. The t-distribution, t-tests, & Measures of Effect Size Student s t-distribution The t-distribution, t-tests, & Measures of Effect Size Sampling Distributions Redux Chapter 7 opens with a return to the concept of sampling distributions from chapter 4 Sampling

More information

Chapter 4. Displaying and Summarizing. Quantitative Data

Chapter 4. Displaying and Summarizing. Quantitative Data STAT 141 Introduction to Statistics Chapter 4 Displaying and Summarizing Quantitative Data Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter 2015 1 / 31 4.1 Histograms 1 We divide the range

More information

Samples and Populations Confidence Intervals Hypotheses One-sided vs. two-sided Statistical Significance Error Types. Statistiek I.

Samples and Populations Confidence Intervals Hypotheses One-sided vs. two-sided Statistical Significance Error Types. Statistiek I. Statistiek I Sampling John Nerbonne CLCG, Rijksuniversiteit Groningen http://www.let.rug.nl/nerbonne/teach/statistiek-i/ John Nerbonne 1/41 Overview 1 Samples and Populations 2 Confidence Intervals 3 Hypotheses

More information

Linear Regression. In this lecture we will study a particular type of regression model: the linear regression model

Linear Regression. In this lecture we will study a particular type of regression model: the linear regression model 1 Linear Regression 2 Linear Regression In this lecture we will study a particular type of regression model: the linear regression model We will first consider the case of the model with one predictor

More information

Statistics 1. Edexcel Notes S1. Mathematical Model. A mathematical model is a simplification of a real world problem.

Statistics 1. Edexcel Notes S1. Mathematical Model. A mathematical model is a simplification of a real world problem. Statistics 1 Mathematical Model A mathematical model is a simplification of a real world problem. 1. A real world problem is observed. 2. A mathematical model is thought up. 3. The model is used to make

More information

Statistics 224 Solution key to EXAM 2 FALL 2007 Friday 11/2/07 Professor Michael Iltis (Lecture 2)

Statistics 224 Solution key to EXAM 2 FALL 2007 Friday 11/2/07 Professor Michael Iltis (Lecture 2) NOTE : For the purpose of review, I have added some additional parts not found on the original exam. These parts are indicated with a ** beside them Statistics 224 Solution key to EXAM 2 FALL 2007 Friday

More information

Math 10 - Compilation of Sample Exam Questions + Answers

Math 10 - Compilation of Sample Exam Questions + Answers Math 10 - Compilation of Sample Exam Questions + Sample Exam Question 1 We have a population of size N. Let p be the independent probability of a person in the population developing a disease. Answer the

More information

Z score indicates how far a raw score deviates from the sample mean in SD units. score Mean % Lower Bound

Z score indicates how far a raw score deviates from the sample mean in SD units. score Mean % Lower Bound 1 EDUR 8131 Chat 3 Notes 2 Normal Distribution and Standard Scores Questions Standard Scores: Z score Z = (X M) / SD Z = deviation score divided by standard deviation Z score indicates how far a raw score

More information

Statistical Inference

Statistical Inference Chapter 14 Confidence Intervals: The Basic Statistical Inference Situation: We are interested in estimating some parameter (population mean, μ) that is unknown. We take a random sample from this population.

More information

PHP2510: Principles of Biostatistics & Data Analysis. Lecture X: Hypothesis testing. PHP 2510 Lec 10: Hypothesis testing 1

PHP2510: Principles of Biostatistics & Data Analysis. Lecture X: Hypothesis testing. PHP 2510 Lec 10: Hypothesis testing 1 PHP2510: Principles of Biostatistics & Data Analysis Lecture X: Hypothesis testing PHP 2510 Lec 10: Hypothesis testing 1 In previous lectures we have encountered problems of estimating an unknown population

More information

Last week: Sample, population and sampling distributions finished with estimation & confidence intervals

Last week: Sample, population and sampling distributions finished with estimation & confidence intervals Past weeks: Measures of central tendency (mean, mode, median) Measures of dispersion (standard deviation, variance, range, etc). Working with the normal curve Last week: Sample, population and sampling

More information

Survey of Smoking Behavior. Samples and Elements. Survey of Smoking Behavior. Samples and Elements

Survey of Smoking Behavior. Samples and Elements. Survey of Smoking Behavior. Samples and Elements s and Elements Units are Same as Elementary Units Frame Elements Analyzed as a binomial variable 9 Persons from, Frame Elements N =, N =, n = 9 Analyzed as a binomial variable HIV+ HIV- % population smokes

More information

Descriptive Statistics

Descriptive Statistics Descriptive Statistics Once an experiment is carried out and the results are measured, the researcher has to decide whether the results of the treatments are different. This would be easy if the results

More information

Multiple Regression Analysis

Multiple Regression Analysis Multiple Regression Analysis y = β 0 + β 1 x 1 + β 2 x 2 +... β k x k + u 2. Inference 0 Assumptions of the Classical Linear Model (CLM)! So far, we know: 1. The mean and variance of the OLS estimators

More information

Today - SPSS and standard error - End of Midterm 1 exam material - T-scores

Today - SPSS and standard error - End of Midterm 1 exam material - T-scores Today - SPSS and standard error - End of Midterm 1 exam material - T-scores Previously, on StatsClass: The standard error is a measure of the typical amount that that a sample mean will be off from the

More information

Chapter 4: An Introduction to Probability and Statistics

Chapter 4: An Introduction to Probability and Statistics Chapter 4: An Introduction to Probability and Statistics 4. Probability The simplest kinds of probabilities to understand are reflected in everyday ideas like these: (i) if you toss a coin, the probability

More information

CHAPTER 6 SOME CONTINUOUS PROBABILITY DISTRIBUTIONS. 6.2 Normal Distribution. 6.1 Continuous Uniform Distribution

CHAPTER 6 SOME CONTINUOUS PROBABILITY DISTRIBUTIONS. 6.2 Normal Distribution. 6.1 Continuous Uniform Distribution CHAPTER 6 SOME CONTINUOUS PROBABILITY DISTRIBUTIONS Recall that a continuous random variable X is a random variable that takes all values in an interval or a set of intervals. The distribution of a continuous

More information

Econ 300/QAC 201: Quantitative Methods in Economics/Applied Data Analysis. 12th Class 6/23/10

Econ 300/QAC 201: Quantitative Methods in Economics/Applied Data Analysis. 12th Class 6/23/10 Econ 300/QAC 201: Quantitative Methods in Economics/Applied Data Analysis 12th Class 6/23/10 In God we trust, all others must use data. --Edward Deming hand out review sheet, answer, point to old test,

More information

Survey of Smoking Behavior. Survey of Smoking Behavior. Survey of Smoking Behavior

Survey of Smoking Behavior. Survey of Smoking Behavior. Survey of Smoking Behavior Sample HH from Frame HH One-Stage Cluster Survey Population Frame Sample Elements N =, N =, n = population smokes Sample HH from Frame HH Elementary units are different from sampling units Sampled HH but

More information

Notes 3: Statistical Inference: Sampling, Sampling Distributions Confidence Intervals, and Hypothesis Testing

Notes 3: Statistical Inference: Sampling, Sampling Distributions Confidence Intervals, and Hypothesis Testing Notes 3: Statistical Inference: Sampling, Sampling Distributions Confidence Intervals, and Hypothesis Testing 1. Purpose of statistical inference Statistical inference provides a means of generalizing

More information

Inference for Single Proportions and Means T.Scofield

Inference for Single Proportions and Means T.Scofield Inference for Single Proportions and Means TScofield Confidence Intervals for Single Proportions and Means A CI gives upper and lower bounds between which we hope to capture the (fixed) population parameter

More information

Chapter 8 - Statistical intervals for a single sample

Chapter 8 - Statistical intervals for a single sample Chapter 8 - Statistical intervals for a single sample 8-1 Introduction In statistics, no quantity estimated from data is known for certain. All estimated quantities have probability distributions of their

More information

FREQUENCY DISTRIBUTIONS AND PERCENTILES

FREQUENCY DISTRIBUTIONS AND PERCENTILES FREQUENCY DISTRIBUTIONS AND PERCENTILES New Statistical Notation Frequency (f): the number of times a score occurs N: sample size Simple Frequency Distributions Raw Scores The scores that we have directly

More information

Correlation 1. December 4, HMS, 2017, v1.1

Correlation 1. December 4, HMS, 2017, v1.1 Correlation 1 December 4, 2017 1 HMS, 2017, v1.1 Chapter References Diez: Chapter 7 Navidi, Chapter 7 I don t expect you to learn the proofs what will follow. Chapter References 2 Correlation The sample

More information

Lecture 12: Small Sample Intervals Based on a Normal Population Distribution

Lecture 12: Small Sample Intervals Based on a Normal Population Distribution Lecture 12: Small Sample Intervals Based on a Normal Population MSU-STT-351-Sum-17B (P. Vellaisamy: MSU-STT-351-Sum-17B) Probability & Statistics for Engineers 1 / 24 In this lecture, we will discuss (i)

More information

Business Analytics and Data Mining Modeling Using R Prof. Gaurav Dixit Department of Management Studies Indian Institute of Technology, Roorkee

Business Analytics and Data Mining Modeling Using R Prof. Gaurav Dixit Department of Management Studies Indian Institute of Technology, Roorkee Business Analytics and Data Mining Modeling Using R Prof. Gaurav Dixit Department of Management Studies Indian Institute of Technology, Roorkee Lecture - 04 Basic Statistics Part-1 (Refer Slide Time: 00:33)

More information

Confidence Intervals with σ unknown

Confidence Intervals with σ unknown STAT 141 Confidence Intervals and Hypothesis Testing 10/26/04 Today (Chapter 7): CI with σ unknown, t-distribution CI for proportions Two sample CI with σ known or unknown Hypothesis Testing, z-test Confidence

More information

UNIVERSITY OF TORONTO MISSISSAUGA. SOC222 Measuring Society In-Class Test. November 11, 2011 Duration 11:15a.m. 13 :00p.m.

UNIVERSITY OF TORONTO MISSISSAUGA. SOC222 Measuring Society In-Class Test. November 11, 2011 Duration 11:15a.m. 13 :00p.m. UNIVERSITY OF TORONTO MISSISSAUGA SOC222 Measuring Society In-Class Test November 11, 2011 Duration 11:15a.m. 13 :00p.m. Location: DV2074 Aids Allowed You may be charged with an academic offence for possessing

More information

Note that we are looking at the true mean, μ, not y. The problem for us is that we need to find the endpoints of our interval (a, b).

Note that we are looking at the true mean, μ, not y. The problem for us is that we need to find the endpoints of our interval (a, b). Confidence Intervals 1) What are confidence intervals? Simply, an interval for which we have a certain confidence. For example, we are 90% certain that an interval contains the true value of something

More information

II. The Normal Distribution

II. The Normal Distribution II. The Normal Distribution The normal distribution (a.k.a., a the Gaussian distribution or bell curve ) is the by far the best known random distribution. It s discovery has had such a far-reaching impact

More information

Lecture 30. DATA 8 Summer Regression Inference

Lecture 30. DATA 8 Summer Regression Inference DATA 8 Summer 2018 Lecture 30 Regression Inference Slides created by John DeNero (denero@berkeley.edu) and Ani Adhikari (adhikari@berkeley.edu) Contributions by Fahad Kamran (fhdkmrn@berkeley.edu) and

More information

Chapter 2: Tools for Exploring Univariate Data

Chapter 2: Tools for Exploring Univariate Data Stats 11 (Fall 2004) Lecture Note Introduction to Statistical Methods for Business and Economics Instructor: Hongquan Xu Chapter 2: Tools for Exploring Univariate Data Section 2.1: Introduction What is

More information

Stat 427/527: Advanced Data Analysis I

Stat 427/527: Advanced Data Analysis I Stat 427/527: Advanced Data Analysis I Review of Chapters 1-4 Sep, 2017 1 / 18 Concepts you need to know/interpret Numerical summaries: measures of center (mean, median, mode) measures of spread (sample

More information

STATISTICS INDEX NUMBER

STATISTICS INDEX NUMBER NAME SCHOOL INDEX NUMBER DATE STATISTICS KCSE 1989 2012 Form 4 Mathematics Answer all the questions 1. 1989 Q12 P1 The table below shows the defective bolts from 40 samples No. of detective 0 1 2 3 4 5

More information

The Normal Distribution (Pt. 2)

The Normal Distribution (Pt. 2) Chapter 5 The Normal Distribution (Pt 2) 51 Finding Normal Percentiles Recall that the Nth percentile of a distribution is the value that marks off the bottom N% of the distribution For review, remember

More information

Lecture 11. Data Description Estimation

Lecture 11. Data Description Estimation Lecture 11 Data Description Estimation Measures of Central Tendency (continued, see last lecture) Sample mean, population mean Sample mean for frequency distributions The median The mode The midrange 3-22

More information

CIVL /8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8

CIVL /8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8 CIVL - 7904/8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8 Chi-square Test How to determine the interval from a continuous distribution I = Range 1 + 3.322(logN) I-> Range of the class interval

More information

Discrete Distributions: Poisson Distribution 1

Discrete Distributions: Poisson Distribution 1 Discrete Distributions: Poisson Distribution 1 November 6, 2017 1 HMS, 2017, v1.1 Chapter References Diez: Chapter 3.3, 3.4 (not 3.4.2), 3.5.2 Navidi, Chapter 4.1, 4.2, 4.3 Chapter References 2 Poisson

More information

11: Comparing Group Variances. Review of Variance

11: Comparing Group Variances. Review of Variance 11: Comparing Group Variances Review of Variance Parametric measures of variability are often based on sum of squares (SS) around e mean: (1) For e data set {3, 4, 5, 8}, = 5 and SS = (3 5) + (4 5) + (5

More information

Review of Multiple Regression

Review of Multiple Regression Ronald H. Heck 1 Let s begin with a little review of multiple regression this week. Linear models [e.g., correlation, t-tests, analysis of variance (ANOVA), multiple regression, path analysis, multivariate

More information

HYPOTHESIS TESTING. Hypothesis Testing

HYPOTHESIS TESTING. Hypothesis Testing MBA 605 Business Analytics Don Conant, PhD. HYPOTHESIS TESTING Hypothesis testing involves making inferences about the nature of the population on the basis of observations of a sample drawn from the population.

More information

MAT2377. Rafa l Kulik. Version 2015/November/23. Rafa l Kulik

MAT2377. Rafa l Kulik. Version 2015/November/23. Rafa l Kulik MAT2377 Rafa l Kulik Version 2015/November/23 Rafa l Kulik Rafa l Kulik 1 Rafa l Kulik 2 Rafa l Kulik 3 Rafa l Kulik 4 The Z-test Test on the mean of a normal distribution, σ known Suppose X 1,..., X n

More information

Using the z-table: Given an Area, Find z ID1050 Quantitative & Qualitative Reasoning

Using the z-table: Given an Area, Find z ID1050 Quantitative & Qualitative Reasoning Using the -Table: Given an, Find ID1050 Quantitative & Qualitative Reasoning between mean and beyond 0.0 0.000 0.500 0.1 0.040 0.460 0.2 0.079 0.421 0.3 0.118 0.382 0.4 0.155 0.345 0.5 0.192 0.309 0.6

More information

DART_LAB Tutorial Section 5: Adaptive Inflation

DART_LAB Tutorial Section 5: Adaptive Inflation DART_LAB Tutorial Section 5: Adaptive Inflation UCAR 14 The National Center for Atmospheric Research is sponsored by the National Science Foundation. Any opinions, findings and conclusions or recommendations

More information

Lecture on Null Hypothesis Testing & Temporal Correlation

Lecture on Null Hypothesis Testing & Temporal Correlation Lecture on Null Hypothesis Testing & Temporal Correlation CS 590.21 Analysis and Modeling of Brain Networks Department of Computer Science University of Crete Acknowledgement Resources used in the slides

More information

Fin285a:Computer Simulations and Risk Assessment Section 2.3.2:Hypothesis testing, and Confidence Intervals

Fin285a:Computer Simulations and Risk Assessment Section 2.3.2:Hypothesis testing, and Confidence Intervals Fin285a:Computer Simulations and Risk Assessment Section 2.3.2:Hypothesis testing, and Confidence Intervals Overview Hypothesis testing terms Testing a die Testing issues Estimating means Confidence intervals

More information

z and t tests for the mean of a normal distribution Confidence intervals for the mean Binomial tests

z and t tests for the mean of a normal distribution Confidence intervals for the mean Binomial tests z and t tests for the mean of a normal distribution Confidence intervals for the mean Binomial tests Chapters 3.5.1 3.5.2, 3.3.2 Prof. Tesler Math 283 Fall 2018 Prof. Tesler z and t tests for mean Math

More information

Part 3: Parametric Models

Part 3: Parametric Models Part 3: Parametric Models Matthew Sperrin and Juhyun Park August 19, 2008 1 Introduction There are three main objectives to this section: 1. To introduce the concepts of probability and random variables.

More information

Counting principles, including permutations and combinations.

Counting principles, including permutations and combinations. 1 Counting principles, including permutations and combinations. The binomial theorem: expansion of a + b n, n ε N. THE PRODUCT RULE If there are m different ways of performing an operation and for each

More information

Lecture 17: Small-Sample Inferences for Normal Populations. Confidence intervals for µ when σ is unknown

Lecture 17: Small-Sample Inferences for Normal Populations. Confidence intervals for µ when σ is unknown Lecture 17: Small-Sample Inferences for Normal Populations Confidence intervals for µ when σ is unknown If the population distribution is normal, then X µ σ/ n has a standard normal distribution. If σ

More information

The t-test: A z-score for a sample mean tells us where in the distribution the particular mean lies

The t-test: A z-score for a sample mean tells us where in the distribution the particular mean lies The t-test: So Far: Sampling distribution benefit is that even if the original population is not normal, a sampling distribution based on this population will be normal (for sample size > 30). Benefit

More information

POLI 443 Applied Political Research

POLI 443 Applied Political Research POLI 443 Applied Political Research Session 4 Tests of Hypotheses The Normal Curve Lecturer: Prof. A. Essuman-Johnson, Dept. of Political Science Contact Information: aessuman-johnson@ug.edu.gh College

More information

Chapter 5 Confidence Intervals

Chapter 5 Confidence Intervals Chapter 5 Confidence Intervals Confidence Intervals about a Population Mean, σ, Known Abbas Motamedi Tennessee Tech University A point estimate: a single number, calculated from a set of data, that is

More information

AMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015

AMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015 AMS7: WEEK 7. CLASS 1 More on Hypothesis Testing Monday May 11th, 2015 Testing a Claim about a Standard Deviation or a Variance We want to test claims about or 2 Example: Newborn babies from mothers taking

More information

y = a + bx 12.1: Inference for Linear Regression Review: General Form of Linear Regression Equation Review: Interpreting Computer Regression Output

y = a + bx 12.1: Inference for Linear Regression Review: General Form of Linear Regression Equation Review: Interpreting Computer Regression Output 12.1: Inference for Linear Regression Review: General Form of Linear Regression Equation y = a + bx y = dependent variable a = intercept b = slope x = independent variable Section 12.1 Inference for Linear

More information

10/4/2013. Hypothesis Testing & z-test. Hypothesis Testing. Hypothesis Testing

10/4/2013. Hypothesis Testing & z-test. Hypothesis Testing. Hypothesis Testing & z-test Lecture Set 11 We have a coin and are trying to determine if it is biased or unbiased What should we assume? Why? Flip coin n = 100 times E(Heads) = 50 Why? Assume we count 53 Heads... What could

More information

p = q ˆ = 1 -ˆp = sample proportion of failures in a sample size of n x n Chapter 7 Estimates and Sample Sizes

p = q ˆ = 1 -ˆp = sample proportion of failures in a sample size of n x n Chapter 7 Estimates and Sample Sizes Chapter 7 Estimates and Sample Sizes 7-1 Overview 7-2 Estimating a Population Proportion 7-3 Estimating a Population Mean: σ Known 7-4 Estimating a Population Mean: σ Not Known 7-5 Estimating a Population

More information