6. CONFIDENCE INTERVALS. Training is everything cauliflower is nothing but cabbage with a college education.

Size: px
Start display at page:

Download "6. CONFIDENCE INTERVALS. Training is everything cauliflower is nothing but cabbage with a college education."

Transcription

1 CIVL 3103 Approximation and Uncertainty J.W. Hurley, R.W. Meier 6. CONFIDENCE INTERVALS Training is everything cauliflower is nothing but cabbage with a college education. Mark Twain At the beginning of the course, we said that we use data obtained from samples to make inferences about the population being sampled. We called this inferential statistics. By applying the appropriate statistics to the sample data, we can infer what the population parameters look like. In this chapter, we ll examine some of those statistics. THE CENTRAL LIMIT THEOREM Suppose we have a population described by a random variable X with a mean µ and a standard deviation σ. We place no restrictions on the probability distribution of X. It may be normally distributed, uniformly distributed, exponentially distributed, it doesn t matter. Suppose we now take random samples from this population, each with a fixed and large sample size n. Each sample will have a sample mean X, and this X will not, in general, be equal to the population mean µ. After repeated samplings, we will have built a population of X s. The X s are themselves random variables and they have their own probability distribution! The probability distribution of a sample statistic is called a sampling distribution of that statistic. The distribution of the X s is a sampling distribution of the mean. The Central Limit Theorem says that, as long as n is reasonably large, σ X N µ, n Let s put that in words as long as n is reasonably large, the X s are normally distributed with a mean equal to the mean of the population being sampled and a variance equal to the variance of the population divided by the sample size. This is true regardless of the population distribution! So, how large is reasonably large? It depends. If the population is normally distributed, any sample size will do. If the probability distribution of the population is at least symmetrical (e.g., a uniform distribution) then n 15 is considered sufficient. If the probability distribution of the underlying population is asymmetrical (e.g., an exponential distribution) then n 30 is considered sufficiently large. THE STANDARD ERROR If σ n is the variance of the sampling distribution, then the standard deviation is σ n. This is commonly referred to as the standard error of the mean. As we ll see very shortly, you can use the standard error to estimate how closely your sample mean approximates the population mean. In other words, you can use the standard error to determine the amount by which your sample mean may be in error!

2 You are making ball bearings for use in an industrial sewing machine. Because of minor variations in the manufacturing process, the diameters always vary slightly from one bearing to the next. If the diameters stray too far from the target value, the sewing machine will not operate properly, so the ball bearings are sampled at regular intervals. If the machine is running properly, the ball bearings should have a mean value of in. with a standard deviation of in. If I collect a random sample of 50 ball bearings, what is the likelihood that the sample mean will be somewhere between 0.50 and inches? 55

3 CONFIDENCE INTERVALS ON THE MEAN In the preceding example, we used the population mean to deduce what the sample means should look like. When we use inferential statistics, we do exactly the opposite. Knowing (usually) one sample mean, we wish to infer the population mean. Suppose that in the previous example, the population standard deviation was known to be inches (i.e., we know the amount of variability from one ball bearing to the next) but the population mean is unknown (perhaps some machine parts have worn down and the mean diameter has changed). If we obtain a random sample of 100 ball bearings and their mean diameter is 0.50 inches, what can we infer about the mean diameter of the ball bearing population being produced today? If, according to the Central Limit Theorem, σ X N µ, n then we know, for example, that 95% of all sample means should fall within two standard errors of the population mean. Why? Well, 95% of the area under the normal probability distribution is located within two standard deviations of the mean and the standard deviation of this normal distribution is the standard error σ n Now, if we know that 95% of all sample means should fall within two standard errors of the population mean, we can also say that the population mean will fall with two standard errors of the sample mean in 95% of all samples. (You may have to think about this for a minute, but it s a valid assertion.) Thus, σ x ± n can be said to be a 95% confidence interval on the mean. If our 100-bearing sample has a mean of inches and, based on the known population variance, a standard error of σ = = n 100 then we are 95% confident that the interval [0.501, 0.508] contains the population mean. Thus, it is safe to assume that we are producing ball bearings that are out of spec! In some cases, we might desire a higher degree of assurance, while in others, we may be willing to accept less assurance. In general, the level of confidence is expressed as (1 α) 100%, where α represents the area under the curve that falls outside the confidence interval. If the total area outside the interval is α, then the upper tail area is α/ and the lower tail area is α/, as shown on the next page. 56

4 f(x) α/ α/ x Now we can write a generic equation for a (1 α) 100% confidence interval on the mean of a population when the standard deviation of the population is known: CONFIDENCE INTERVAL ON A MEAN (σ KNOWN) x σ ± zα n where zα is the critical point corresponding to a tail area of α a. Compute a 90% confidence interval for µ when σ = 3.0, x = 58.3, and n = 5. b. Compute a 99% confidence interval for µ when σ = 3.0, x = 58.3, and n = 100. c. How large must n be for the width of the 99% confidence interval to be less than 1.0? 57

5 STUDENT S t DISTRIBUTION In the real world, we hardly ever know the true value of the population variance, σ, which makes it hard to calculate a standard error. More than a century ago, a statistician named William Gosset studied this problem and came up with a solution. First, if a random variable X is normally distributed, the Central Limit Theorem tells us that the statistic x µ z = σ follows a standard normal distribution regardless of sample size. Now, if you don t know the population standard deviation, σ, maybe you could use the sample standard deviation, s, in its place. If the sample size is large, this is not a bad assumption, but as the sample shrinks, the sample standard deviation becomes an increasingly poor approximation of the population standard deviation. The end result is that a 95% confidence interval computed using s instead of σ may actually only contain the population mean 90% of the time, or 85% of the time, or even less. So William Gosset developed a new probability distribution, which he called the t distribution, to describe the probabilities associated with the statistic n µ t = x s n At the time, Gosset was an employee of the Guinness brewery in Ireland and Guinness policy prohibited employees from publishing their findings under their own name, so Gosset published his results under the pen name A. Student. As a result, his distribution is known as Student s t distribution. Student s t distribution is actually a family of probability distributions, each corresponding to a different sample size n. The distributions are all bell-shaped and symmetrical but, because s is an imperfect substitute for σ, the t statistic has more variability than the z statistic. In other words, compared to the z (standard normal) distribution the t distribution contains more area in the tails and less in the center. Now we can write a generic equation for a (1 α) 100% confidence interval on the mean of a population when the standard deviation of the population is unknown: CONFIDENCE INTERVAL ON A MEAN (σ UNKNOWN) x ± tα, n 1 s n where t α, n 1 is the critical point corresponding to a tail area of α 58

6 As n increases, the t distribution approaches the standard normal distribution. When n 30, the two distributions are virtually indistinguishable and you might as well use the standard normal distribution. What happens if the population is not normally distributed? As long as the sample size is reasonably large and the probability distribution underlying the population is reasonably symmetrical, the t distribution can be used to estimate the population mean when σ is unknown. An unconfined compression test performed on 15 concrete cylinders produced the following strength results (in psi): Find a 95% confidence interval for the true average strength of the concrete. 59

7 CONFIDENCE INTERVALS ON DIFFERENCES Often, we use confidence intervals to compare one population to another. If we draw random samples from two different populations, it would be pure serendipity if their means match exactly. But how far apart can they be for us to still consider the two populations to be the same? If random samples of size n 1 and n are drawn from two populations with known variances σ 1 and σ, respectively, then a 100(1 α)% confidence interval on the difference between the sample means, µ 1 µ is: CONFIDENCE INTERVAL ON A DIFFERENCE IN MEANS (σ 1 and σ KNOWN) ( ) 1 σ σ x1 x ± zα / + n n 1 where zα is the critical point corresponding to a tail area of α This relationship is exact if the two populations are normally distributed. Otherwise, the confidence interval is approximately valid for large sample sizes (n 1 30 and n 30). Aluminum spars from two different suppliers are used in manufacturing the wing of a commercial aircraft. You have been asked to determine if the latest shipments from each supplier are equally strong. From past experience, the standard deviations of the tensile strengths are known to be 1.5 kg/mm for Supplier 1 and 1.0 kg/mm for Supplier (who has tighter quality control). A sample of 1 spars from Supplier 1 has a mean tensile strength of 87.6 kg/mm and a sample of 10 spars from Supplier has a mean tensile strength of 7.5 kg/mm. If µ 1 and µ denote the true mean tensile strengths for the two shipments of spars, find a 90% confidence interval on the difference in mean strength, µ 1 µ. 60

8 What happens if you don t know the standard deviations a priori? If random samples of size n 1 and n are drawn from two normal populations with equal but unknown variances, a 100(1 a)% confidence interval on the difference between the sample means, µ 1 µ is: CONFIDENCE INTERVAL ON A DIFFERENCE IN MEANS (σ 1 = σ UNKNOWN) 1 1 x1 x ± tα /, n + n Sp + n n ( ) 1 1 where S p is a pooled estimator of the unknown standard deviation and is calculated as S p = ( 1) + ( 1) 1 1 n s n s n + n 1 But this can only be used if both populations are normally distributed. The drying time of pavement marking paint is of concern to transportation engineers. Of two such paints from a particular manufacturer, it is suspected that yellow paint dries faster than white paint. Sample measurements of the drying times of both paints (in minutes) are given below. White: 10, 13, 13, 1, 140, 110, 10, 107 Yellow: 16, 14, 116, 15, 109, 130, 15, 117, 19, 10 Find a 95% confidence interval on the difference in mean drying times, assuming that the drying times are normally distributed and the standard deviations of the drying times are equal. 61

9 Experimental situations frequently exist where there is only one set of individuals or objects, and two observations are made on each individual or object. If, for example, you wanted to test the accusation that Auburn s punter got such great distance because Auburn used helium-inflated footballs, you might have 30 punters each punt two footballs one inflated with helium and the other inflated with air and examine the differences in each punter s kicking distance. This testing arrangement eliminates the punters individual capabilities as a source of error in comparing the two footballs. The differences in kicking distance, d, can be treated as though they came from a single population with mean d and standard deviation s d. If the samples are from two normal populations and the data is paired, a 100(1 α)% confidence interval on the difference in means, µ = µ 1 µ is: CONFIDENCE INTERVAL ON A DIFFERENCE IN MEANS (PAIRED SAMPLES) d t d ± α, n 1 s n But this can only be used if both populations are normally distributed. The manager of a fleet of automobiles is testing two brands of radial tires. He assigns one tire of each brand at random to the two front wheels of eight different cars and runs the cars until the tires wear out. The tire lives (in miles) are shown below. Assuming that the tire lives for both brands are normally distributed, find a 99% confidence interval on the difference in mean life. Car Brand 1 36,95 45,300 36,40 3,100 37,10 48,360 38,00 33,500 Brand 34,318 4,80 35,500 31,950 38,015 47,800 37,810 33,15 6

10 CONFIDENCE INTERVALS ON THE VARIANCE If a random sample of size n is taken from a normally distributed population, a 100(1 α)% confidence interval on the variance of the population is CONFIDENCE INTERVAL ON A VARIANCE ( n 1) s ( n ) 1 s σ χ χ α/, n 1 1 α/, n 1 But this can only be used if the population is normally distributed. Here, χ and χ α /, n 1 1 α /,n 1 are the upper and lower critical points of the chi-square distribution with n-1 degrees of freedom. Because the χ distribution is asymmetrical, the upper and lower tails are not the same. The compressive strength of concrete is being tested by a civil engineer. He tests 1 specimens and obtains the following data: Find the 95% confidence interval on the population variance. 63

11 CONFIDENCE INTERVALS ON THE RATIO OF VARIANCES If random samples of size n 1 and n are drawn from two normal populations with unknown variances, a 100(1 a)% confidence interval on the ratio of the population variances is: CONFIDENCE INTERVAL ON RATIO OF VARIANCES 1 σ1 s1 F 1 α, n 1, n1 1 F α, n 1, n1 1 1 σ1 s1 s s But this can only be used if both populations are normally distributed. Here, n 1 and n 1 1 are the degrees of freedom of the F distribution, which has separate degrees of freedom for the numerator and the denominator. The F distribution is not symmetrical, but F 1 α, n 1, n1 1 = F 1 α, n 1, n1 1 The diameter of steel rods manufactured on two different extrusion machines is being investigated. Two random samples of sizes n 1 = 15 and n = 18 were selected from the two machines. The sample means and variances are x 1 = 8.73, s 1 = 0.35, x = 8.68, s = Construct a 95% confidence interval on the ratio of the population variances, σ1 σ. 64

Confidence Intervals CIVL 7012/8012

Confidence Intervals CIVL 7012/8012 Confidence Intervals CIVL 701/801 Sampling Distributions Because we typically can not evaluate an entire population to determine its parameters, we rely on estimators based on samples The estimators themselves

More information

Chapter 23. Inferences About Means. Monday, May 6, 13. Copyright 2009 Pearson Education, Inc.

Chapter 23. Inferences About Means. Monday, May 6, 13. Copyright 2009 Pearson Education, Inc. Chapter 23 Inferences About Means Sampling Distributions of Means Now that we know how to create confidence intervals and test hypotheses about proportions, we do the same for means. Just as we did before,

More information

χ L = χ R =

χ L = χ R = Chapter 7 3C: Examples of Constructing a Confidence Interval for the true value of the Population Standard Deviation σ for a Normal Population. Example 1 Construct a 95% confidence interval for the true

More information

Chapter 12: Inference about One Population

Chapter 12: Inference about One Population Chapter 1: Inference about One Population 1.1 Introduction In this chapter, we presented the statistical inference methods used when the problem objective is to describe a single population. Sections 1.

More information

1. AN INTRODUCTION TO DESCRIPTIVE STATISTICS. No great deed, private or public, has ever been undertaken in a bliss of certainty.

1. AN INTRODUCTION TO DESCRIPTIVE STATISTICS. No great deed, private or public, has ever been undertaken in a bliss of certainty. CIVL 3103 Approximation and Uncertainty J.W. Hurley, R.W. Meier 1. AN INTRODUCTION TO DESCRIPTIVE STATISTICS No great deed, private or public, has ever been undertaken in a bliss of certainty. - Leon Wieseltier

More information

Statistical Intervals (One sample) (Chs )

Statistical Intervals (One sample) (Chs ) 7 Statistical Intervals (One sample) (Chs 8.1-8.3) Confidence Intervals The CLT tells us that as the sample size n increases, the sample mean X is close to normally distributed with expected value µ and

More information

Confidence intervals

Confidence intervals Confidence intervals We now want to take what we ve learned about sampling distributions and standard errors and construct confidence intervals. What are confidence intervals? Simply an interval for which

More information

Ch. 7. One sample hypothesis tests for µ and σ

Ch. 7. One sample hypothesis tests for µ and σ Ch. 7. One sample hypothesis tests for µ and σ Prof. Tesler Math 18 Winter 2019 Prof. Tesler Ch. 7: One sample hypoth. tests for µ, σ Math 18 / Winter 2019 1 / 23 Introduction Data Consider the SAT math

More information

Open book and notes. 120 minutes. Covers Chapters 8 through 14 of Montgomery and Runger (fourth edition).

Open book and notes. 120 minutes. Covers Chapters 8 through 14 of Montgomery and Runger (fourth edition). IE 330 Seat # Open book and notes 10 minutes Covers Chapters 8 through 14 of Montgomery and Runger (fourth edition) Cover page and eight pages of exam No calculator ( points) I have, or will, complete

More information

Inferences Based on Two Samples

Inferences Based on Two Samples Chapter 6 Inferences Based on Two Samples Frequently we want to use statistical techniques to compare two populations. For example, one might wish to compare the proportions of families with incomes below

More information

Chapter 10: Inferences based on two samples

Chapter 10: Inferences based on two samples November 16 th, 2017 Overview Week 1 Week 2 Week 4 Week 7 Week 10 Week 12 Chapter 1: Descriptive statistics Chapter 6: Statistics and Sampling Distributions Chapter 7: Point Estimation Chapter 8: Confidence

More information

Physics 6720 Introduction to Statistics April 4, 2017

Physics 6720 Introduction to Statistics April 4, 2017 Physics 6720 Introduction to Statistics April 4, 2017 1 Statistics of Counting Often an experiment yields a result that can be classified according to a set of discrete events, giving rise to an integer

More information

Chapter 5: HYPOTHESIS TESTING

Chapter 5: HYPOTHESIS TESTING MATH411: Applied Statistics Dr. YU, Chi Wai Chapter 5: HYPOTHESIS TESTING 1 WHAT IS HYPOTHESIS TESTING? As its name indicates, it is about a test of hypothesis. To be more precise, we would first translate

More information

Chapter 23. Inference About Means

Chapter 23. Inference About Means Chapter 23 Inference About Means 1 /57 Homework p554 2, 4, 9, 10, 13, 15, 17, 33, 34 2 /57 Objective Students test null and alternate hypotheses about a population mean. 3 /57 Here We Go Again Now that

More information

Math Lecture 23 Notes

Math Lecture 23 Notes Math 1010 - Lecture 23 Notes Dylan Zwick Fall 2009 In today s lecture we ll expand upon the concept of radicals and radical expressions, and discuss how we can deal with equations involving these radical

More information

Statistics Boot Camp. Dr. Stephanie Lane Institute for Defense Analyses DATAWorks 2018

Statistics Boot Camp. Dr. Stephanie Lane Institute for Defense Analyses DATAWorks 2018 Statistics Boot Camp Dr. Stephanie Lane Institute for Defense Analyses DATAWorks 2018 March 21, 2018 Outline of boot camp Summarizing and simplifying data Point and interval estimation Foundations of statistical

More information

Slides for Data Mining by I. H. Witten and E. Frank

Slides for Data Mining by I. H. Witten and E. Frank Slides for Data Mining by I. H. Witten and E. Frank Predicting performance Assume the estimated error rate is 5%. How close is this to the true error rate? Depends on the amount of test data Prediction

More information

THE SAMPLING DISTRIBUTION OF THE MEAN

THE SAMPLING DISTRIBUTION OF THE MEAN THE SAMPLING DISTRIBUTION OF THE MEAN COGS 14B JANUARY 26, 2017 TODAY Sampling Distributions Sampling Distribution of the Mean Central Limit Theorem INFERENTIAL STATISTICS Inferential statistics: allows

More information

ESTIMATION BY CONFIDENCE INTERVALS

ESTIMATION BY CONFIDENCE INTERVALS ESTIMATION BY CONFIDENCE INTERVALS Introduction We are now in the knowledge that a population parameter can be estimated from sample data by calculating the corresponding point estimate. This chapter is

More information

Note that we are looking at the true mean, μ, not y. The problem for us is that we need to find the endpoints of our interval (a, b).

Note that we are looking at the true mean, μ, not y. The problem for us is that we need to find the endpoints of our interval (a, b). Confidence Intervals 1) What are confidence intervals? Simply, an interval for which we have a certain confidence. For example, we are 90% certain that an interval contains the true value of something

More information

The t-distribution. Patrick Breheny. October 13. z tests The χ 2 -distribution The t-distribution Summary

The t-distribution. Patrick Breheny. October 13. z tests The χ 2 -distribution The t-distribution Summary Patrick Breheny October 13 Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/25 Introduction Introduction What s wrong with z-tests? So far we ve (thoroughly!) discussed how to carry out hypothesis

More information

Math May 13, Final Exam

Math May 13, Final Exam Math 447 - May 13, 2013 - Final Exam Name: Read these instructions carefully: The points assigned are not meant to be a guide to the difficulty of the problems. If the question is multiple choice, there

More information

Everything is not normal

Everything is not normal Everything is not normal According to the dictionary, one thing is considered normal when it s in its natural state or conforms to standards set in advance. And this is its normal meaning. But, like many

More information

Hypothesis Testing: Chi-Square Test 1

Hypothesis Testing: Chi-Square Test 1 Hypothesis Testing: Chi-Square Test 1 November 9, 2017 1 HMS, 2017, v1.0 Chapter References Diez: Chapter 6.3 Navidi, Chapter 6.10 Chapter References 2 Chi-square Distributions Let X 1, X 2,... X n be

More information

4 Hypothesis testing. 4.1 Types of hypothesis and types of error 4 HYPOTHESIS TESTING 49

4 Hypothesis testing. 4.1 Types of hypothesis and types of error 4 HYPOTHESIS TESTING 49 4 HYPOTHESIS TESTING 49 4 Hypothesis testing In sections 2 and 3 we considered the problem of estimating a single parameter of interest, θ. In this section we consider the related problem of testing whether

More information

Chapter 9 Inferences from Two Samples

Chapter 9 Inferences from Two Samples Chapter 9 Inferences from Two Samples 9-1 Review and Preview 9-2 Two Proportions 9-3 Two Means: Independent Samples 9-4 Two Dependent Samples (Matched Pairs) 9-5 Two Variances or Standard Deviations Review

More information

Chapter 4: An Introduction to Probability and Statistics

Chapter 4: An Introduction to Probability and Statistics Chapter 4: An Introduction to Probability and Statistics 4. Probability The simplest kinds of probabilities to understand are reflected in everyday ideas like these: (i) if you toss a coin, the probability

More information

Note that we are looking at the true mean, μ, not y. The problem for us is that we need to find the endpoints of our interval (a, b).

Note that we are looking at the true mean, μ, not y. The problem for us is that we need to find the endpoints of our interval (a, b). Confidence Intervals 1) What are confidence intervals? Simply, an interval for which we have a certain confidence. For example, we are 90% certain that an interval contains the true value of something

More information

MAT 2377C FINAL EXAM PRACTICE

MAT 2377C FINAL EXAM PRACTICE Department of Mathematics and Statistics University of Ottawa MAT 2377C FINAL EXAM PRACTICE 10 December 2015 Professor: Rafal Kulik Time: 180 minutes Student Number: Family Name: First Name: This is a

More information

Chapter 6. Estimates and Sample Sizes

Chapter 6. Estimates and Sample Sizes Chapter 6 Estimates and Sample Sizes Lesson 6-1/6-, Part 1 Estimating a Population Proportion This chapter begins the beginning of inferential statistics. There are two major applications of inferential

More information

One-Sample and Two-Sample Means Tests

One-Sample and Two-Sample Means Tests One-Sample and Two-Sample Means Tests 1 Sample t Test The 1 sample t test allows us to determine whether the mean of a sample data set is different than a known value. Used when the population variance

More information

Two-sample inference: Continuous data

Two-sample inference: Continuous data Two-sample inference: Continuous data Patrick Breheny April 6 Patrick Breheny University of Iowa to Biostatistics (BIOS 4120) 1 / 36 Our next several lectures will deal with two-sample inference for continuous

More information

Chapter 8: Confidence Intervals

Chapter 8: Confidence Intervals Chapter 8: Confidence Intervals Introduction Suppose you are trying to determine the mean rent of a two-bedroom apartment in your town. You might look in the classified section of the newspaper, write

More information

2008 Winton. Statistical Testing of RNGs

2008 Winton. Statistical Testing of RNGs 1 Statistical Testing of RNGs Criteria for Randomness For a sequence of numbers to be considered a sequence of randomly acquired numbers, it must have two basic statistical properties: Uniformly distributed

More information

4.1 Hypothesis Testing

4.1 Hypothesis Testing 4.1 Hypothesis Testing z-test for a single value double-sided and single-sided z-test for one average z-test for two averages double-sided and single-sided t-test for one average the F-parameter and F-table

More information

The Components of a Statistical Hypothesis Testing Problem

The Components of a Statistical Hypothesis Testing Problem Statistical Inference: Recall from chapter 5 that statistical inference is the use of a subset of a population (the sample) to draw conclusions about the entire population. In chapter 5 we studied one

More information

Inference for the mean of a population. Testing hypotheses about a single mean (the one sample t-test). The sign test for matched pairs

Inference for the mean of a population. Testing hypotheses about a single mean (the one sample t-test). The sign test for matched pairs Stat 528 (Autumn 2008) Inference for the mean of a population (One sample t procedures) Reading: Section 7.1. Inference for the mean of a population. The t distribution for a normal population. Small sample

More information

1 MA421 Introduction. Ashis Gangopadhyay. Department of Mathematics and Statistics. Boston University. c Ashis Gangopadhyay

1 MA421 Introduction. Ashis Gangopadhyay. Department of Mathematics and Statistics. Boston University. c Ashis Gangopadhyay 1 MA421 Introduction Ashis Gangopadhyay Department of Mathematics and Statistics Boston University c Ashis Gangopadhyay 1.1 Introduction 1.1.1 Some key statistical concepts 1. Statistics: Art of data analysis,

More information

CH.8 Statistical Intervals for a Single Sample

CH.8 Statistical Intervals for a Single Sample CH.8 Statistical Intervals for a Single Sample Introduction Confidence interval on the mean of a normal distribution, variance known Confidence interval on the mean of a normal distribution, variance unknown

More information

CHAPTER SIX Statistical Estimation

CHAPTER SIX Statistical Estimation 75 CHAPTER SIX Statistical Estimation 6. Point Estimation The following table contains some of the well known population parameters and their point estimates based on a random sample. Table Population

More information

Confidence Intervals. - simply, an interval for which we have a certain confidence.

Confidence Intervals. - simply, an interval for which we have a certain confidence. Confidence Intervals I. What are confidence intervals? - simply, an interval for which we have a certain confidence. - for example, we are 90% certain that an interval contains the true value of something

More information

Unit 22: Sampling Distributions

Unit 22: Sampling Distributions Unit 22: Sampling Distributions Summary of Video If we know an entire population, then we can compute population parameters such as the population mean or standard deviation. However, we generally don

More information

Lecture 11. Data Description Estimation

Lecture 11. Data Description Estimation Lecture 11 Data Description Estimation Measures of Central Tendency (continued, see last lecture) Sample mean, population mean Sample mean for frequency distributions The median The mode The midrange 3-22

More information

Probability & Statistics

Probability & Statistics MECE 330 MECE 330 Measurements & Instrumentation Probability & tatistics Dr. Isaac Choutapalli Department of Mechanical Engineering University of Teas Pan American MECE 330 Introduction uppose we have

More information

Confidence intervals CE 311S

Confidence intervals CE 311S CE 311S PREVIEW OF STATISTICS The first part of the class was about probability. P(H) = 0.5 P(T) = 0.5 HTTHHTTTTHHTHTHH If we know how a random process works, what will we see in the field? Preview of

More information

Review: Statistical Model

Review: Statistical Model Review: Statistical Model { f θ :θ Ω} A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution that produced the data. The statistical model

More information

Confidence Intervals. Confidence interval for sample mean. Confidence interval for sample mean. Confidence interval for sample mean

Confidence Intervals. Confidence interval for sample mean. Confidence interval for sample mean. Confidence interval for sample mean Confidence Intervals Confidence interval for sample mean The CLT tells us: as the sample size n increases, the sample mean is approximately Normal with mean and standard deviation Thus, we have a standard

More information

L6: Regression II. JJ Chen. July 2, 2015

L6: Regression II. JJ Chen. July 2, 2015 L6: Regression II JJ Chen July 2, 2015 Today s Plan Review basic inference based on Sample average Difference in sample average Extrapolate the knowledge to sample regression coefficients Standard error,

More information

Discrete and continuous

Discrete and continuous Discrete and continuous A curve, or a function, or a range of values of a variable, is discrete if it has gaps in it - it jumps from one value to another. In practice in S2 discrete variables are variables

More information

their contents. If the sample mean is 15.2 oz. and the sample standard deviation is 0.50 oz., find the 95% confidence interval of the true mean.

their contents. If the sample mean is 15.2 oz. and the sample standard deviation is 0.50 oz., find the 95% confidence interval of the true mean. Math 1342 Exam 3-Review Chapters 7-9 HCCS **************************************************************************************** Name Date **********************************************************************************************

More information

INTRODUCTION TO ANALYSIS OF VARIANCE

INTRODUCTION TO ANALYSIS OF VARIANCE CHAPTER 22 INTRODUCTION TO ANALYSIS OF VARIANCE Chapter 18 on inferences about population means illustrated two hypothesis testing situations: for one population mean and for the difference between two

More information

Chapter 24. Comparing Means. Copyright 2010 Pearson Education, Inc.

Chapter 24. Comparing Means. Copyright 2010 Pearson Education, Inc. Chapter 24 Comparing Means Copyright 2010 Pearson Education, Inc. Plot the Data The natural display for comparing two groups is boxplots of the data for the two groups, placed side-by-side. For example:

More information

Introduction to Measurements of Physical Quantities

Introduction to Measurements of Physical Quantities 1 Goal Introduction to Measurements of Physical Quantities Content Discussion and Activities PHYS 104L The goal of this week s activities is to provide a foundational understanding regarding measurements

More information

Statistical Inference, Populations and Samples

Statistical Inference, Populations and Samples Chapter 3 Statistical Inference, Populations and Samples Contents 3.1 Introduction................................... 2 3.2 What is statistical inference?.......................... 2 3.2.1 Examples of

More information

Treatment of Error in Experimental Measurements

Treatment of Error in Experimental Measurements in Experimental Measurements All measurements contain error. An experiment is truly incomplete without an evaluation of the amount of error in the results. In this course, you will learn to use some common

More information

Chapter 23: Inferences About Means

Chapter 23: Inferences About Means Chapter 3: Inferences About Means Sample of Means: number of observations in one sample the population mean (theoretical mean) sample mean (observed mean) is the theoretical standard deviation of the population

More information

Chapter 9. Hypothesis testing. 9.1 Introduction

Chapter 9. Hypothesis testing. 9.1 Introduction Chapter 9 Hypothesis testing 9.1 Introduction Confidence intervals are one of the two most common types of statistical inference. Use them when our goal is to estimate a population parameter. The second

More information

Chapter 8. Inferences Based on a Two Samples Confidence Intervals and Tests of Hypothesis

Chapter 8. Inferences Based on a Two Samples Confidence Intervals and Tests of Hypothesis Chapter 8 Inferences Based on a Two Samples Confidence Intervals and Tests of Hypothesis Copyright 2018, 2014, and 2011 Pearson Education, Inc. Slide - 1 Content 1. Identifying the Target Parameter 2.

More information

41.2. Tests Concerning a Single Sample. Introduction. Prerequisites. Learning Outcomes

41.2. Tests Concerning a Single Sample. Introduction. Prerequisites. Learning Outcomes Tests Concerning a Single Sample 41.2 Introduction This Section introduces you to the basic ideas of hypothesis testing in a non-mathematical way by using a problem solving approach to highlight the concepts

More information

Lecture on Null Hypothesis Testing & Temporal Correlation

Lecture on Null Hypothesis Testing & Temporal Correlation Lecture on Null Hypothesis Testing & Temporal Correlation CS 590.21 Analysis and Modeling of Brain Networks Department of Computer Science University of Crete Acknowledgement Resources used in the slides

More information

Vectors. Vector Practice Problems: Odd-numbered problems from

Vectors. Vector Practice Problems: Odd-numbered problems from Vectors Vector Practice Problems: Odd-numbered problems from 3.1-3.21 After today, you should be able to: Understand vector notation Use basic trigonometry in order to find the x and y components of a

More information

The variable θ is called the parameter of the model, and the set Ω is called the parameter space.

The variable θ is called the parameter of the model, and the set Ω is called the parameter space. Lecture 8 What is a statistical model? A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution that produced the data. The variable θ is called

More information

Two-sample Categorical data: Testing

Two-sample Categorical data: Testing Two-sample Categorical data: Testing Patrick Breheny April 1 Patrick Breheny Introduction to Biostatistics (171:161) 1/28 Separate vs. paired samples Despite the fact that paired samples usually offer

More information

Describing distributions with numbers

Describing distributions with numbers Describing distributions with numbers A large number or numerical methods are available for describing quantitative data sets. Most of these methods measure one of two data characteristics: The central

More information

Study and research skills 2009 Duncan Golicher. and Adrian Newton. Last draft 11/24/2008

Study and research skills 2009 Duncan Golicher. and Adrian Newton. Last draft 11/24/2008 Study and research skills 2009. and Adrian Newton. Last draft 11/24/2008 Inference about the mean: What you will learn Why we need to draw inferences from samples The difference between a population and

More information

2011 Pearson Education, Inc

2011 Pearson Education, Inc Statistics for Business and Economics Chapter 7 Inferences Based on Two Samples: Confidence Intervals & Tests of Hypotheses Content 1. Identifying the Target Parameter 2. Comparing Two Population Means:

More information

Introduction This is a puzzle station lesson with three puzzles: Skydivers Problem, Cheryl s Birthday Problem and Fun Problems & Paradoxes

Introduction This is a puzzle station lesson with three puzzles: Skydivers Problem, Cheryl s Birthday Problem and Fun Problems & Paradoxes Introduction This is a puzzle station lesson with three puzzles: Skydivers Problem, Cheryl s Birthday Problem and Fun Problems & Paradoxes Resources Calculators, pens and paper is all that is needed as

More information

Purposes of Data Analysis. Variables and Samples. Parameters and Statistics. Part 1: Probability Distributions

Purposes of Data Analysis. Variables and Samples. Parameters and Statistics. Part 1: Probability Distributions Part 1: Probability Distributions Purposes of Data Analysis True Distributions or Relationships in the Earths System Probability Distribution Normal Distribution Student-t Distribution Chi Square Distribution

More information

hypotheses. P-value Test for a 2 Sample z-test (Large Independent Samples) n > 30 P-value Test for a 2 Sample t-test (Small Samples) n < 30 Identify α

hypotheses. P-value Test for a 2 Sample z-test (Large Independent Samples) n > 30 P-value Test for a 2 Sample t-test (Small Samples) n < 30 Identify α Chapter 8 Notes Section 8-1 Independent and Dependent Samples Independent samples have no relation to each other. An example would be comparing the costs of vacationing in Florida to the cost of vacationing

More information

i=1 X i/n i=1 (X i X) 2 /(n 1). Find the constant c so that the statistic c(x X n+1 )/S has a t-distribution. If n = 8, determine k such that

i=1 X i/n i=1 (X i X) 2 /(n 1). Find the constant c so that the statistic c(x X n+1 )/S has a t-distribution. If n = 8, determine k such that Math 47 Homework Assignment 4 Problem 411 Let X 1, X,, X n, X n+1 be a random sample of size n + 1, n > 1, from a distribution that is N(µ, σ ) Let X = n i=1 X i/n and S = n i=1 (X i X) /(n 1) Find the

More information

Advanced Experimental Design

Advanced Experimental Design Advanced Experimental Design Topic Four Hypothesis testing (z and t tests) & Power Agenda Hypothesis testing Sampling distributions/central limit theorem z test (σ known) One sample z & Confidence intervals

More information

Unit 5 Quadratic Expressions and Equations

Unit 5 Quadratic Expressions and Equations Unit 5 Quadratic Expressions and Equations Test Date: Name: By the end of this unit, you will be able to Add, subtract, and multiply polynomials Solve equations involving the products of monomials and

More information

Sampling distribution of t. 2. Sampling distribution of t. 3. Example: Gas mileage investigation. II. Inferential Statistics (8) t =

Sampling distribution of t. 2. Sampling distribution of t. 3. Example: Gas mileage investigation. II. Inferential Statistics (8) t = 2. The distribution of t values that would be obtained if a value of t were calculated for each sample mean for all possible random of a given size from a population _ t ratio: (X - µ hyp ) t s x The result

More information

Estimating a Population Mean

Estimating a Population Mean Estimating a Population Mean MATH 130, Elements of Statistics I J. Robert Buchanan Department of Mathematics Fall 2017 Objectives At the end of this lesson we will be able to: obtain a point estimate for

More information

Sampling (Statistics)

Sampling (Statistics) Systems & Biomedical Engineering Department SBE 304: Bio-Statistics Random Sampling and Sampling Distributions Dr. Ayman Eldeib Fall 2018 Sampling (Statistics) Sampling is that part of statistical practice

More information

Joint Probability Distributions and Random Samples (Devore Chapter Five)

Joint Probability Distributions and Random Samples (Devore Chapter Five) Joint Probability Distributions and Random Samples (Devore Chapter Five) 1016-345-01: Probability and Statistics for Engineers Spring 2013 Contents 1 Joint Probability Distributions 2 1.1 Two Discrete

More information

CLASS NOTES: BUSINESS CALCULUS

CLASS NOTES: BUSINESS CALCULUS CLASS NOTES: BUSINESS CALCULUS These notes can be thought of as the logical skeleton of my lectures, although they will generally contain a fuller exposition of concepts but fewer examples than my lectures.

More information

psychological statistics

psychological statistics psychological statistics B Sc. Counselling Psychology 011 Admission onwards III SEMESTER COMPLEMENTARY COURSE UNIVERSITY OF CALICUT SCHOOL OF DISTANCE EDUCATION CALICUT UNIVERSITY.P.O., MALAPPURAM, KERALA,

More information

Probability and Statistics

Probability and Statistics Probability and Statistics Kristel Van Steen, PhD 2 Montefiore Institute - Systems and Modeling GIGA - Bioinformatics ULg kristel.vansteen@ulg.ac.be CHAPTER 4: IT IS ALL ABOUT DATA 4a - 1 CHAPTER 4: IT

More information

Lecture 15: Inference Based on Two Samples

Lecture 15: Inference Based on Two Samples Lecture 15: Inference Based on Two Samples MSU-STT 351-Sum17B (P. Vellaisamy: STT 351-Sum17B) Probability & Statistics for Engineers 1 / 26 9.1 Z-tests and CI s for (µ 1 µ 2 ) The assumptions: (i) X =

More information

Hypothesis tests for two means

Hypothesis tests for two means Chapter 3 Hypothesis tests for two means 3.1 Introduction Last week you were introduced to the concept of hypothesis testing in statistics, and we considered hypothesis tests for the mean if we have a

More information

Hypothesis Tests and Estimation for Population Variances. Copyright 2014 Pearson Education, Inc.

Hypothesis Tests and Estimation for Population Variances. Copyright 2014 Pearson Education, Inc. Hypothesis Tests and Estimation for Population Variances 11-1 Learning Outcomes Outcome 1. Formulate and carry out hypothesis tests for a single population variance. Outcome 2. Develop and interpret confidence

More information

Semester , Example Exam 1

Semester , Example Exam 1 Semester 1 2017, Example Exam 1 1 of 10 Instructions The exam consists of 4 questions, 1-4. Each question has four items, a-d. Within each question: Item (a) carries a weight of 8 marks. Item (b) carries

More information

Chapter 6. Net or Unbalanced Forces. Copyright 2011 NSTA. All rights reserved. For more information, go to

Chapter 6. Net or Unbalanced Forces. Copyright 2011 NSTA. All rights reserved. For more information, go to Chapter 6 Net or Unbalanced Forces Changes in Motion and What Causes Them Teacher Guide to 6.1/6.2 Objectives: The students will be able to explain that the changes in motion referred to in Newton s first

More information

STATISTICS OF OBSERVATIONS & SAMPLING THEORY. Parent Distributions

STATISTICS OF OBSERVATIONS & SAMPLING THEORY. Parent Distributions ASTR 511/O Connell Lec 6 1 STATISTICS OF OBSERVATIONS & SAMPLING THEORY References: Bevington Data Reduction & Error Analysis for the Physical Sciences LLM: Appendix B Warning: the introductory literature

More information

Statistical Process Control

Statistical Process Control Chapter 3 Statistical Process Control 3.1 Introduction Operations managers are responsible for developing and maintaining the production processes that deliver quality products and services. Once the production

More information

Other Continuous Probability Distributions

Other Continuous Probability Distributions CHAPTER Probability, Statistics, and Reliability for Engineers and Scientists Second Edition PROBABILITY DISTRIBUTION FOR CONTINUOUS RANDOM VARIABLES A. J. Clar School of Engineering Department of Civil

More information

Lecture Slides. Elementary Statistics Eleventh Edition. by Mario F. Triola. and the Triola Statistics Series 9.1-1

Lecture Slides. Elementary Statistics Eleventh Edition. by Mario F. Triola. and the Triola Statistics Series 9.1-1 Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by Mario F. Triola Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. 9.1-1 Chapter 9 Inferences

More information

Two-sample inference: Continuous data

Two-sample inference: Continuous data Two-sample inference: Continuous data Patrick Breheny November 11 Patrick Breheny STA 580: Biostatistics I 1/32 Introduction Our next two lectures will deal with two-sample inference for continuous data

More information

2. A SMIDGEON ABOUT PROBABILITY AND EVENTS. Wisdom ofttimes consists of knowing what to do next. Herbert Hoover

2. A SMIDGEON ABOUT PROBABILITY AND EVENTS. Wisdom ofttimes consists of knowing what to do next. Herbert Hoover CIVL 303 pproximation and Uncertainty JW Hurley, RW Meier MIDGEON BOUT ROBBILITY ND EVENT Wisdom ofttimes consists of knowing what to do next Herbert Hoover DEFINITION Experiment any action or process

More information

Two-Sample Inferential Statistics

Two-Sample Inferential Statistics The t Test for Two Independent Samples 1 Two-Sample Inferential Statistics In an experiment there are two or more conditions One condition is often called the control condition in which the treatment is

More information

Introduction and Overview STAT 421, SP Course Instructor

Introduction and Overview STAT 421, SP Course Instructor Introduction and Overview STAT 421, SP 212 Prof. Prem K. Goel Mon, Wed, Fri 3:3PM 4:48PM Postle Hall 118 Course Instructor Prof. Goel, Prem E mail: goel.1@osu.edu Office: CH 24C (Cockins Hall) Phone: 614

More information

INTERVAL ESTIMATION AND HYPOTHESES TESTING

INTERVAL ESTIMATION AND HYPOTHESES TESTING INTERVAL ESTIMATION AND HYPOTHESES TESTING 1. IDEA An interval rather than a point estimate is often of interest. Confidence intervals are thus important in empirical work. To construct interval estimates,

More information

Physics 8 Wednesday, October 19, Troublesome questions for HW4 (5 or more people got 0 or 1 points on them): 1, 14, 15, 16, 17, 18, 19. Yikes!

Physics 8 Wednesday, October 19, Troublesome questions for HW4 (5 or more people got 0 or 1 points on them): 1, 14, 15, 16, 17, 18, 19. Yikes! Physics 8 Wednesday, October 19, 2011 Troublesome questions for HW4 (5 or more people got 0 or 1 points on them): 1, 14, 15, 16, 17, 18, 19. Yikes! Troublesome HW4 questions 1. Two objects of inertias

More information

Physics 403. Segev BenZvi. Parameter Estimation, Correlations, and Error Bars. Department of Physics and Astronomy University of Rochester

Physics 403. Segev BenZvi. Parameter Estimation, Correlations, and Error Bars. Department of Physics and Astronomy University of Rochester Physics 403 Parameter Estimation, Correlations, and Error Bars Segev BenZvi Department of Physics and Astronomy University of Rochester Table of Contents 1 Review of Last Class Best Estimates and Reliability

More information

Statistics for Business and Economics

Statistics for Business and Economics Statistics for Business and Economics Chapter 6 Sampling and Sampling Distributions Ch. 6-1 6.1 Tools of Business Statistics n Descriptive statistics n Collecting, presenting, and describing data n Inferential

More information

2. Tests in the Normal Model

2. Tests in the Normal Model 1 of 14 7/9/009 3:15 PM Virtual Laboratories > 9. Hy pothesis Testing > 1 3 4 5 6 7. Tests in the Normal Model Preliminaries The Normal Model Suppose that X = (X 1, X,..., X n ) is a random sample of size

More information

IENG581 Design and Analysis of Experiments INTRODUCTION

IENG581 Design and Analysis of Experiments INTRODUCTION Experimental Design IENG581 Design and Analysis of Experiments INTRODUCTION Experiments are performed by investigators in virtually all fields of inquiry, usually to discover something about a particular

More information

SOME SPECIFIC PROBABILITY DISTRIBUTIONS. 1 2πσ. 2 e 1 2 ( x µ

SOME SPECIFIC PROBABILITY DISTRIBUTIONS. 1 2πσ. 2 e 1 2 ( x µ SOME SPECIFIC PROBABILITY DISTRIBUTIONS. Normal random variables.. Probability Density Function. The random variable is said to be normally distributed with mean µ and variance abbreviated by x N[µ, ]

More information

Week 1 Quantitative Analysis of Financial Markets Distributions A

Week 1 Quantitative Analysis of Financial Markets Distributions A Week 1 Quantitative Analysis of Financial Markets Distributions A Christopher Ting http://www.mysmu.edu/faculty/christophert/ Christopher Ting : christopherting@smu.edu.sg : 6828 0364 : LKCSB 5036 October

More information