Chapter 23. Inference About Means
|
|
- Deirdre Carmel Gibson
- 5 years ago
- Views:
Transcription
1 Chapter 23 Inference About Means 1 /57
2 Homework p554 2, 4, 9, 10, 13, 15, 17, 33, 34 2 /57
3 Objective Students test null and alternate hypotheses about a population mean. 3 /57
4 Here We Go Again Now that we are familiar with creating confidence intervals and testing hypotheses about proportions, we should find doing the same for means to be very easy. It is still true that confidence intervals and hypothesis testing are based on the sampling distribution model. The only real change is that the Central Limit Theorem tells us that the sampling distribution model for means is Normal with mean µ and standard deviation SD(y ) = σ n 4 /57
5 All We Need is All we need to proceed is a random sample of quantitative data. Oh yeah, and the true population standard deviation, σ. Uh Oh, that s a problem Proportions have a link between the proportion value and the standard deviation of the sample proportion. This is not the case with means knowing the sample mean tells us nothing about SD(y ) 5 /57
6 What Shall We Do? We will use what we do have available: we will estimate the population parameter σ with the sample statistic s. Our resulting standard error is SE(y ) = s n We now have additional uncertainty, or variation, in our standard error from s, the sample standard deviation which most likely σ. We need to allow for the extra variation so that it does not mess up the margin of error and P-value, especially for a small sample. The additional uncertainty alters the shape of the sampling distribution. The distribution is still unimodal and symmetric, but is no longer Normal. At least, not the Normal with which we are familiar. So, what is the sampling model? 6 /57
7 Student s t Coming to the rescue was William S. Gosset, an employee of the Guinness Brewery in Dublin, Ireland, who worked out what the change to the sampling model was and how we might deal with the added variability. The sampling model that Gosset created he called Student s t. His employer would not allow employees to publish data from the brewery, so Gosset published his finding under the pseudonym Student. The Student s t-models are a whole family of related distributions that depend on a parameter known as degrees of freedom. We denote degrees of freedom as df (or ν (nu)), and the statistic as t df. When appropriate, the t statistic replaces the z statistic in calculations. 7 /57
8 Confidence Interval, Means A sampling distribution model for means t = y µ ( ) SE Y Look familiar? When the conditions are met, the standardized sample mean follows a Student s t-model with n 1 degrees of freedom. We estimate the standard error with SE(y ) = s n 8 /57
9 Confidence Interval, Means One-sample t procedures are to be used only when the population is sufficiently normal. It must be reasonable to assume that the population (sampling distribution) is sufficiently unimodal and symmetric in order to justify the use of t. The t statistic is strongly influenced by outliers. As a result, it is imperative that you check the data. If the population is not unimodal and symmetric, there are outliers and the sample size is small, the results will not be reliable. We have to find other statistical methods. The simplest way to verify the population is sufficiently normal is to trust your methodolgy, assume the sample is representative of the population and verify that the sample is sufficiently unimodal and symmetric by making a histogram of the sample. 9 /57
10 When to use t procedures I, and your calculator, have a simple rule for using t over z. If you do not know the population σ, use the t statistic. Howsomever, keep in mind that it is not always appropriate to use the t model. To ensure the sampling distribution warrants using the normal model we must verify the sample sizes are appropriate. If the sample size is less than 15, only use t procedures if the data are really close to Normal. It must be a reasonable assumption that the population distribution is normal. If the sample is between 15 and 40, use t if the data is unimodal and reasonably symmetric, verified by a histogram of the sample. If the sample size is at least 40, you may use t procedures, even if the original data is skewed. 10/57
11 Confidence Interval, Means When Gosset corrected the model for the extra uncertainty, the margin of error got bigger. Confidence intervals will be slightly wider and P-values a little larger for the t models than they were with the Normal model. Using the t-model, compensates for the extra variability introduced by estimating the population parameters with sample statistics. 11 /57
12 Confidence Interval, Means One-sample t-interval for the mean When the conditions are met, we can find the confidence interval for the population mean, µ exactly like we did for the proportions. The confidence interval is ( ) y ±t * SE X n 1 s y ±t * n 1 n where the standard error of the mean is SE (y ) = s n The critical value t*n-1 depends on the particular confidence level, C, that you specify and on the number of degrees of freedom, n 1. 12/57
13 Confidence Interval, Means Student s t-models are unimodal and symmetric, just like the Normal. But t-models with only a few degrees of freedom are lower, and have much fatter tails than the Normal. Like someone sat on the top of the Z curve and squashed it down. (Making the margin of error bigger.) The difference between the t-curve and the Z curve is the standard deviation. Both distributions have a mean of 0. The Z distribution has a variance of 1. ν The t distribution has a variance approximated by. ν 2 13/57
14 The t curve As the degrees of freedom increase, the t-curves look more and more like the Normal curve. Normal Distribu.on t- Distribu.on As the degrees of freedom increase, the t distribution gets closer to the normal distribution, since s gets closer to σ. In fact, the t-model with infinite degrees of freedom is exactly Normal. 14/57
15 Assumptions and Conditions Normal Population Assumption: We can never know if the population is truly Normal, but we can make a reasonable assumption about some populations. Nearly Normal Condition: The data come from a distribution that is unimodal and symmetric. Check by making a histogram or Normal probability plot (NPP) of the sample data. If the sample is representative, and the sample is sufficiently unimodal and symmetric, we conclude the population and sampling distribution are sufficiently unimodal and symmetric. 15/57
16 Assumptions and Conditions These should come as no surprise to you... Independence Assumption: Independence Condition. The data values should be independent. Randomization Condition: The data arise from a random sample or randomized experiment. 10% Condition: When a sample is drawn without replacement, the sample should be no more than 10% of the population. As there is no proportion, there is no np or nq. Sufficient sample size requires considering the sample sizes 15, and /57
17 Remember If we cannot reasonably assume the distribution of the population data is Normally distributed then we must check further. Nearly Normal Condition: The smaller the sample size (n < 15 or so), the more closely the population data should follow a Normal model. For moderate sample sizes (n between 15 and 40 or so), the t works well as long as the data are unimodal and reasonably symmetric. Check the sample distribution with a histogram. For larger sample sizes (n > 40), the t -statistic is acceptable to use unless the data are extremely skewed or has significant outliers. 17/57
18 Finding t-values By Hand The Student s t-model is different for to 22 each value of degrees of freedom. Because of this, Statistics books usually have one table of t-model critical values for selected confidence levels. Your book has a table in Appendix G, pa-104 Of course, we will use the calculator. 18/57
19 t - test Test Statistic = Observed Value (mean) - Expected Value sample standard deviation t = Sample mean - Population mean adjusted standard deviation t = X - µ s n, d.f. = n 1 19/57
20 t - distribution The t distribution has the following properties: The mean of the distribution is equal to 0. ν The variance is equal to, where df (v) is the degrees of freedom ν 2 and ν > 2. The variance is always greater than 1, although it is close to 1 when there are many degrees of freedom. With infinite degrees of freedom, the t distribution is the same as the standard normal (z) distribution. 20/57
21 Confidence Interval Calculating the confidence interval is exactly the same as with the z statistic, simply replace the z with t. X ± t α 2 s Or X ± t n 1 s n n 21/57
22 Example Find a 90% confidence interval for the population mean if a sample of 20 has mean of 1462 with a standard deviation 42. This is calculator practice, so we will forgo the assumptions and conditions, plus these numbers have no context. To find t* or t19 InvT(.95, 19) = s 90% t 19 n = Thus our interval would be 1462 ± 16.2 or (1445.8, ). to 18 22/57
23 TI-84 With data, enter data into a list. STAT TESTS 8:TInterval Inpt: Data Stats List: Freq: 1 C-Level: Calculate With statistics. STAT TESTS 8:TInterval Inpt: Data Stats x: sx:.4 n: C-Level: Calculate 23/57
24 TI-84 To find the critical value of t* Dist 2nd vars 4:invT( 1 - α, df) or (1 - α/2, df) To find the area to the below (left of) of a t-value 2nd Dist vars 6:tcdf( -10^99, t, df) To find the area to the between t-values Dist 2nd vars 6:tcdf( lowert, uppert, df) To find the area to the above (right of) a t-value 2nd Dist vars 6:tcdf( t, -10^99,df) 24/57
25 Computer Computer output for 1-sample t-test and t interval. two tailed test Mean and Standard 95% Confidence Test P-Value Deviation of the sample Interval Statistic SE of the sampling distribution mean/stdev Run the test and CI on your calculator. 25/57
26 Cautions About Interpreting Confidence Intervals Remember the appropriate interpretation of your confidence interval. What NOT to say: 90% of all the subject scored between 7 and 9 on the depression test. The confidence interval is about the population mean NOT the individual data values. We are 90% confident that a randomly selected subject will have a depression score between 7 and 9. Again, the confidence interval is about the mean not the individual values. 26/57
27 Cautions About Interpreting Confidence Intervals What NOT to say: The mean depression score is 8 90% of the time. The true population mean does not vary, the confidence interval and sample mean would be different from a different sample. 90% of all samples will have mean score between 7 and 9. The interval we calculate does not represent any other interval. It is only one of many possible intervals, and no more likely to be valid than any other interval. The probability of a mean score between 7 and 9 is 90%. The interval we calculate is not special. It is only one of many possible intervals, and no more likely to be valid than some other interval. 27/57
28 Cautions About Interpreting Confidence Intervals What you DO say: Conclusion Based on data from a sample of size, I am C% confident that the true value of the population mean is between and. C% Confidence With repeated sampling, C% of intervals that could be found from samples size n would contain the true value. 28/57
29 Make a Picture x 3 Draw a picher, draw a picher, draw a picher. Now you have another graph to draw. In addition to the t curve, you must draw a histogram of your sample data to demonstrate that the sample, and thus the sampling distribution is sufficiently unimodal and symmetric with no problematic outliers. You may also want to make a Normal probability plot to ensure that it is reasonably straight, indicating a symmetric distribution. 29/57
30 t - test Test Statistic = Observed Value (mean) - Expected Value sample standard error t = Sample mean - Population mean adjusted standard deviation t = X - µ, d.f. = n 1 s n 30/57
31 A Test for the Mean One-sample t-test for the mean The conditions for the one-sample t-test for the mean are the same as for the one-sample t-interval. We test the hypothesis H 0 : µ = µ 0 using the statistic: t = X - µ, d.f. = n 1 s n When the conditions are met and the null hypothesis is true, this statistic follows a Student s t model with n 1 df. We use that model to obtain a P- value. 31/57
32 t - test for a mean Do not forget to write a complete response P Define the parameter. (In this case the population mean.) H Formulate all hypotheses: H0, Ha. A Check the assumptions and conditions. N Determine the test statistic that will assess the evidence against the null hypothesis. (Now the t - test.) T Find critical values of the test statistic based on α. O Obtain the p-value for the statistic M Decide to Reject or Fail to reject H0. S Tell someone about your results in context. 32/57
33 Example A consumer, Skeptical Starks tested 18 bottles of sodypop and found a sample mean of 15.8 ounces with a standard deviation of 0.4 ounces. Skeptical is convinced the company is deliberately shorting the drinks that are labeled 16 ounces. Is Skeptical right? 18 bottles, x = s = 0.4 We do not know the standard deviation of the population of sodypop bottles so we will use the t-test for a mean volume of sodypop. The null hypothesis is that the mean volume for the sodypop is not less than 16 oz. The alternate hypothesis is the mean volume is less than 16 oz. H0: µ = 16 and Ha: µ < 16 33/57
34 Independence Assumption: Example 18 bottles, x = s = 0.4 Independence Assumption. We are not certain that the consumer did not get all of the bottles from one distributor so independence is questionable, but let us pretend. Randomization Condition: We can be relatively certain that the bottles were randomly chosen based on marketing protocols. 10% Condition: 18 bottles is certainly not more than 10% of the population of sodypop bottles. Normal Population Assumption: Nearly Normal Condition: It is a relatively safe bet that the amount of sodypop in 16 ounce bottles is normally distributed. To be confident we would do a bar chart of the volumes in our sample bottles. 34/57
35 TI bottles, x = s = 0.4 We have met the conditions for a t-test. Use the calculator to run the t test. STAT TESTS 2:T-Test Inpt: Data Stats µ0: 16 x: 15.8 Sx:.4 n: 18 p1: µ0 <µ0 >µ0 Calculate µ <16 t= p= x: 15.8 Sx:.4 n: 18 Note all the useful information the calculator so kindly provides for you. Treat it nicely by keeping it fully charged. 35/57
36 Example 18 bottles, x = s = 0.4 We are not given the level of significance so we default to.05. A one tailed test means all the critical area is below the mean. The critical t-statistic (t*, or tα/2) corresponding to α =.05, df = 17 2nd Dist vars 4:invT(.05, 17) = % 5% -3σ -2σ -1σ 0 1σ 2σ 3σ The critical value of the t-statistic corresponding to α =.05 and d.f. = 17 is /57
37 Example 18 bottles, x = s = 0.4 The critical value of the t-statistic corresponding to α =.05 and d.f. = 17 is t = X - µ s, d.f. = n % n % σ -2σ -1σ 0 1σ 2σ 3σ = = p(t< ) = p(x < 15.8) = ttest(stats, 16, 15.8,.4, <µ0) = /57
38 Example < and p-value =.017 <.05 so our t statistic falls within the rejection area. The probability of getting a value of 15.8 oz, if the null (16.0 oz) is true, is less than 5%, so our decision is to reject the null hypothesis. Since we reject H0, it is plausible to conclude the bottling company is deliberately short-changing the consumer. Skeptical Starks has a reason to by skeptical. 38/57
39 Intervals and Tests Confidence intervals and hypothesis tests are built from the same concepts. In point of fact, they two perspectives of the same question. The confidence interval provides the range of values that will cause us to fail to reject the null hypothesis. Values outside the confidence interval will lead us to reject the null hypothesis. 39/57
40 Intervals and Tests To sum up, a confidence interval with confidence C contains all of the plausible null hypothesis values that would fail to be rejected by a two-tail hypothesis text at alpha level 1 C. So a 95% confidence interval matches an α 0.05 level two-sided test. (.025 on each side of the interval) Confidence intervals are naturally two-sided, so they match exactly with twosided hypothesis tests. When the hypothesis is one sided, the corresponding alpha level is (1 C)/2. 40/57
41 Determining Sample Size To find the sample size needed for a particular confidence level with a particular margin of error (ME), solve this equation for n: ME = t * n 1 s n and CI = X ± t * n 1 s n The problem with using the equation above is that we don t know most of the values. Howsomever do not fret, we can overcome this: We can use s from a small pilot study to establish a sample size for a more comprehensive study. OR We can use a two step process by using z* in place of the necessary t value. Calculate a sample size then use that sample size as our degrees of freedom to find t* 41/57
42 Determining Sample Size Two steps to finding sample size for a t-test. We initially estimate an n by using z ME = z * s n Now dial it in even closer by using that first n to determine d.f. for t* The sample size found by using z* can be used to find t*. Calculate again, this time with the new n and the t* found in the table. s ME = t * n 1 n' 42/57
43 Suppose an initial convenience sample has a mean of 135 lbs and a standard deviation of 32 lbs. How large a sample would you need to estimate the population mean with 95% confidence and a margin of error of 5 pounds? First, an estimate with z. ME = z * s 5 = n n = i 32 2 n = n 5 Now we use t ME = t * n 1 s 5 = n n = i 32 2 n = n' 5 We would need a sample of 790 observations. 43/57
44 Sample Size Sample size calculations are never exact and rarely understood. The margin of error you find after collecting the data won t match exactly the one you used to find n. The sample size formula depends on quantities you do not have until you collect the data from a sample you do not have, but using it is an important first step. Before you collect data, it is probably a good idea to know that the sample size is large enough to give you a good chance of being able to use the statistical techniques you hope to use to answer the question. 44/57
45 Degrees of Freedom If only we knew the true population mean, µ, we would find the sample standard deviation as s = n i =1 (x i µ) 2 n But, we use x instead of µ, and that introduces a source of variability. n (x x ) 2 (x µ) 2 i i When we use rather than to calculate s, our standard i =1 n i =1 deviation estimate turns out to be too small. We compensate for the smaller sum by dividing by n 1 which is the degrees of freedom. We use this simply because it gives us a better estimate of the population standard deviation. 45/57
46 Standard Deviation That is why, when we do not know the mean of the population, s = n i =1 (x i x ) 2 n 1 That is the difference in what your calculator reports for the standard deviation between Sx and σx when you ask for the statistics of a list. 46/57
47 Caution Don t confuse proportions and means. Examine the Normal condition closely, if we ain t normal we are done: Beware of multimodality. The Nearly Normal Condition clearly fails if a histogram of the sample has two or more modes. Beware of skewed data. If the data are very skewed, try re-expressing the variable. If you have outliers, run the test both with and without the outliers, reporting the results for both. 47/57
48 Calculator Practice Let us say that in previous years the average temperature for this time of year is 67 F. Students are complaining that this year it is much warmer. To find out the students record the temps at noon for a random two week period. For this example we will assume σ = Test the student s conjecture at a significance level of.05 This is for the purpose of becoming familiar with the calculator. The data is most appropriate for a t test, but we will run both z and t. 48/57
49 Practice We know the population mean (67 ) but we do not know the population standard deviation so we should be running just a t-test. For educational purposes we will test using both a z-test and a t-test. We are measuring the mean temperature of this time of year. The hypotheses are: H0: µ 67 and H1: µ > 67 49/57
50 Assumptions Independence Assumption: Independence Assumption. Our temperature measurements are independent. Randomization Condition: The days were chosen at random. 10% Condition: 14 days is certainly not more than 10% of the days we could measure. Normal Population Assumption: Nearly Normal Condition: Fourteen days is a small sample, but temperatures for this time of year are distributed normally enough around the mean temperature. 50/57
51 Test Statistic Since this is for learning the calculator, we will be finding both z and t. Enter the following data into L1: To clear an earlier list: - stat - 4: 2nd L1 - Enter If we wish to find the critical z-values: 2nd Distr 3:InvNorm(.975) - Enter /57
52 z-test To find the z stat: Stat (Tests) - 1:Z-test - Enter Inpt: Data Stats µ0: 67 σ: 5 List: (2nd) L1 p <.05 Z-Test µ 67 z = p = Freq: 1 Reject H0 x = Sx = This is Ha µ: µ 0 < µ 0 > µ 0 0 Calculate n = 14 52/57
53 If we wish to find the critical t-value: 2nd Distr 3:InvT(.975) - df:13 - Enter /57
54 t-test To find the t stat: Stat (Tests) - 2:T-test - Enter T-Test Inpt: Data Stats µ0: 67 List: (2nd) L1 Freq: 1 p <.05 Reject H0 µ > 67 t = p = x = This is H1 µ: µ 0 < µ 0 > µ 0 Sx = Calculate n = 14 54/57
55 When You Have the Statistics If we know the sample statistics we can test the hypotheses using the sample mean and standard deviation. We will compare a sample mean to a known population mean using the hypotheses: H0: µ = 135 and H1: µ 135 Suppose the population has a standard deviation of 32 lbs and our sample had a standard deviation of 31.5 lbs. Would a sample of 38 items weighing in at 138 lbs be enough to reject H0? 55/57
56 z-test We do not have any data to enter, instead we have statistics. To find the z stat: Stat (Tests) - 1:Z-test - Enter Inpt: Data Stats µ0: 135 σ: 32 x: 138 n: 38 p >.05 Fail to reject H0 µ 135 z = p = x = 138 n = 38 This is H1 µ: µ 0 < µ 0 > µ 0 Calculate 56/57
57 t-test To find the t stat: Stat (Tests) - 2:Z-test - Enter To find Critical Value of test statistic 2nd Distr 3:InvT(.975) - df: 37 - Enter Inpt: Data Stats µ0: 135 µ 135 x: 138 t = sx: 31.5 p >.05 p = n: 38 Fail to x = 138 This is H1 µ: µ 0 < µ 0 > µ 0 reject H0 n = 38 Calculate 57/57
Chapter 23. Inferences About Means. Monday, May 6, 13. Copyright 2009 Pearson Education, Inc.
Chapter 23 Inferences About Means Sampling Distributions of Means Now that we know how to create confidence intervals and test hypotheses about proportions, we do the same for means. Just as we did before,
More informationChapter 24. Comparing Means
Chapter 4 Comparing Means!1 /34 Homework p579, 5, 7, 8, 10, 11, 17, 31, 3! /34 !3 /34 Objective Students test null and alternate hypothesis about two!4 /34 Plot the Data The intuitive display for comparing
More informationDifference between means - t-test /25
Difference between means - t-test 1 Discussion Question p492 Ex 9-4 p492 1-3, 6-8, 12 Assume all variances are not equal. Ignore the test for variance. 2 Students will perform hypothesis tests for two
More informationChapter 23: Inferences About Means
Chapter 3: Inferences About Means Sample of Means: number of observations in one sample the population mean (theoretical mean) sample mean (observed mean) is the theoretical standard deviation of the population
More informationCh. 7: Estimates and Sample Sizes
Ch. 7: Estimates and Sample Sizes Section Title Notes Pages Introduction to the Chapter 2 2 Estimating p in the Binomial Distribution 2 5 3 Estimating a Population Mean: Sigma Known 6 9 4 Estimating a
More informationChapter 8: Confidence Intervals
Chapter 8: Confidence Intervals Introduction Suppose you are trying to determine the mean rent of a two-bedroom apartment in your town. You might look in the classified section of the newspaper, write
More informationBusiness Statistics. Lecture 10: Course Review
Business Statistics Lecture 10: Course Review 1 Descriptive Statistics for Continuous Data Numerical Summaries Location: mean, median Spread or variability: variance, standard deviation, range, percentiles,
More informationChapter 18. Sampling Distribution Models /51
Chapter 18 Sampling Distribution Models 1 /51 Homework p432 2, 4, 6, 8, 10, 16, 17, 20, 30, 36, 41 2 /51 3 /51 Objective Students calculate values of central 4 /51 The Central Limit Theorem for Sample
More informationNote that we are looking at the true mean, μ, not y. The problem for us is that we need to find the endpoints of our interval (a, b).
Confidence Intervals 1) What are confidence intervals? Simply, an interval for which we have a certain confidence. For example, we are 90% certain that an interval contains the true value of something
More informationUsing Tables and Graphing Calculators in Math 11
Using Tables and Graphing Calculators in Math 11 Graphing calculators are not required for Math 11, but they are likely to be helpful, primarily because they allow you to avoid the use of tables in some
More informationProbability Distributions
CONDENSED LESSON 13.1 Probability Distributions In this lesson, you Sketch the graph of the probability distribution for a continuous random variable Find probabilities by finding or approximating areas
More informationCh. 7. One sample hypothesis tests for µ and σ
Ch. 7. One sample hypothesis tests for µ and σ Prof. Tesler Math 18 Winter 2019 Prof. Tesler Ch. 7: One sample hypoth. tests for µ, σ Math 18 / Winter 2019 1 / 23 Introduction Data Consider the SAT math
More informationConfidence intervals
Confidence intervals We now want to take what we ve learned about sampling distributions and standard errors and construct confidence intervals. What are confidence intervals? Simply an interval for which
More informationThe t-statistic. Student s t Test
The t-statistic 1 Student s t Test When the population standard deviation is not known, you cannot use a z score hypothesis test Use Student s t test instead Student s t, or t test is, conceptually, very
More informationSingle Sample Means. SOCY601 Alan Neustadtl
Single Sample Means SOCY601 Alan Neustadtl The Central Limit Theorem If we have a population measured by a variable with a mean µ and a standard deviation σ, and if all possible random samples of size
More informationQuestions 3.83, 6.11, 6.12, 6.17, 6.25, 6.29, 6.33, 6.35, 6.50, 6.51, 6.53, 6.55, 6.59, 6.60, 6.65, 6.69, 6.70, 6.77, 6.79, 6.89, 6.
Chapter 7 Reading 7.1, 7.2 Questions 3.83, 6.11, 6.12, 6.17, 6.25, 6.29, 6.33, 6.35, 6.50, 6.51, 6.53, 6.55, 6.59, 6.60, 6.65, 6.69, 6.70, 6.77, 6.79, 6.89, 6.112 Introduction In Chapter 5 and 6, we emphasized
More information9.5 t test: one μ, σ unknown
GOALS: 1. Recognize the assumptions for a 1 mean t test (srs, nd or large sample size, population stdev. NOT known). 2. Understand that the actual p value (area in the tail past the test statistic) is
More informationConfidence Intervals. - simply, an interval for which we have a certain confidence.
Confidence Intervals I. What are confidence intervals? - simply, an interval for which we have a certain confidence. - for example, we are 90% certain that an interval contains the true value of something
More informationNote that we are looking at the true mean, μ, not y. The problem for us is that we need to find the endpoints of our interval (a, b).
Confidence Intervals 1) What are confidence intervals? Simply, an interval for which we have a certain confidence. For example, we are 90% certain that an interval contains the true value of something
More informationStudent s t-distribution. The t-distribution, t-tests, & Measures of Effect Size
Student s t-distribution The t-distribution, t-tests, & Measures of Effect Size Sampling Distributions Redux Chapter 7 opens with a return to the concept of sampling distributions from chapter 4 Sampling
More informationStatistical inference provides methods for drawing conclusions about a population from sample data.
Introduction to inference Confidence Intervals Statistical inference provides methods for drawing conclusions about a population from sample data. 10.1 Estimating with confidence SAT σ = 100 n = 500 µ
More informationChapter 10 Regression Analysis
Chapter 10 Regression Analysis Goal: To become familiar with how to use Excel 2007/2010 for Correlation and Regression. Instructions: You will be using CORREL, FORECAST and Regression. CORREL and FORECAST
More informationStatistical Foundations:
Statistical Foundations: t distributions, t-tests tests Psychology 790 Lecture #12 10/03/2006 Today sclass The t-distribution t ib ti in its full glory. Why we use it for nearly everything. Confidence
More informationInferences for Regression
Inferences for Regression An Example: Body Fat and Waist Size Looking at the relationship between % body fat and waist size (in inches). Here is a scatterplot of our data set: Remembering Regression In
More informationHYPOTHESIS TESTING. Hypothesis Testing
MBA 605 Business Analytics Don Conant, PhD. HYPOTHESIS TESTING Hypothesis testing involves making inferences about the nature of the population on the basis of observations of a sample drawn from the population.
More informationLast two weeks: Sample, population and sampling distributions finished with estimation & confidence intervals
Past weeks: Measures of central tendency (mean, mode, median) Measures of dispersion (standard deviation, variance, range, etc). Working with the normal curve Last two weeks: Sample, population and sampling
More informationy = a + bx 12.1: Inference for Linear Regression Review: General Form of Linear Regression Equation Review: Interpreting Computer Regression Output
12.1: Inference for Linear Regression Review: General Form of Linear Regression Equation y = a + bx y = dependent variable a = intercept b = slope x = independent variable Section 12.1 Inference for Linear
More informationThe Chi-Square Distributions
MATH 03 The Chi-Square Distributions Dr. Neal, Spring 009 The chi-square distributions can be used in statistics to analyze the standard deviation of a normally distributed measurement and to test the
More informationhypotheses. P-value Test for a 2 Sample z-test (Large Independent Samples) n > 30 P-value Test for a 2 Sample t-test (Small Samples) n < 30 Identify α
Chapter 8 Notes Section 8-1 Independent and Dependent Samples Independent samples have no relation to each other. An example would be comparing the costs of vacationing in Florida to the cost of vacationing
More informationz and t tests for the mean of a normal distribution Confidence intervals for the mean Binomial tests
z and t tests for the mean of a normal distribution Confidence intervals for the mean Binomial tests Chapters 3.5.1 3.5.2, 3.3.2 Prof. Tesler Math 283 Fall 2018 Prof. Tesler z and t tests for mean Math
More informationConfidence Intervals with σ unknown
STAT 141 Confidence Intervals and Hypothesis Testing 10/26/04 Today (Chapter 7): CI with σ unknown, t-distribution CI for proportions Two sample CI with σ known or unknown Hypothesis Testing, z-test Confidence
More informationThe Chi-Square Distributions
MATH 183 The Chi-Square Distributions Dr. Neal, WKU The chi-square distributions can be used in statistics to analyze the standard deviation σ of a normally distributed measurement and to test the goodness
More informationThe Student s t Distribution
The Student s t Distribution What do we do if (a) we don t know σ and (b) n is small? If the population of interest is normally distributed, we can use the Student s t-distribution in place of the standard
More informationChapter 6. The Standard Deviation as a Ruler and the Normal Model 1 /67
Chapter 6 The Standard Deviation as a Ruler and the Normal Model 1 /67 Homework Read Chpt 6 Complete Reading Notes Do P129 1, 3, 5, 7, 15, 17, 23, 27, 29, 31, 37, 39, 43 2 /67 Objective Students calculate
More informationLecture 26: Chapter 10, Section 2 Inference for Quantitative Variable Confidence Interval with t
Lecture 26: Chapter 10, Section 2 Inference for Quantitative Variable Confidence Interval with t t Confidence Interval for Population Mean Comparing z and t Confidence Intervals When neither z nor t Applies
More informationLast week: Sample, population and sampling distributions finished with estimation & confidence intervals
Past weeks: Measures of central tendency (mean, mode, median) Measures of dispersion (standard deviation, variance, range, etc). Working with the normal curve Last week: Sample, population and sampling
More informationHarvard University. Rigorous Research in Engineering Education
Statistical Inference Kari Lock Harvard University Department of Statistics Rigorous Research in Engineering Education 12/3/09 Statistical Inference You have a sample and want to use the data collected
More informationDISTRIBUTIONS USED IN STATISTICAL WORK
DISTRIBUTIONS USED IN STATISTICAL WORK In one of the classic introductory statistics books used in Education and Psychology (Glass and Stanley, 1970, Prentice-Hall) there was an excellent chapter on different
More informationCh Inference for Linear Regression
Ch. 12-1 Inference for Linear Regression ACT = 6.71 + 5.17(GPA) For every increase of 1 in GPA, we predict the ACT score to increase by 5.17. population regression line β (true slope) μ y = α + βx mean
More informationappstats27.notebook April 06, 2017
Chapter 27 Objective Students will conduct inference on regression and analyze data to write a conclusion. Inferences for Regression An Example: Body Fat and Waist Size pg 634 Our chapter example revolves
More informationdetermine whether or not this relationship is.
Section 9-1 Correlation A correlation is a between two. The data can be represented by ordered pairs (x,y) where x is the (or ) variable and y is the (or ) variable. There are several types of correlations
More informationChapter 9 Inferences from Two Samples
Chapter 9 Inferences from Two Samples 9-1 Review and Preview 9-2 Two Proportions 9-3 Two Means: Independent Samples 9-4 Two Dependent Samples (Matched Pairs) 9-5 Two Variances or Standard Deviations Review
More informationSTAT Chapter 8: Hypothesis Tests
STAT 515 -- Chapter 8: Hypothesis Tests CIs are possibly the most useful forms of inference because they give a range of reasonable values for a parameter. But sometimes we want to know whether one particular
More informationInferences about Means
Inferences about Means Keith Thompson Department of Mathematics and Statistics Department of Oceanography February 23, 2012 ( ) February 23, 2012 1 / 58 Information on the Instructor Instructor Departments
More informationPopulation Variance. Concepts from previous lectures. HUMBEHV 3HB3 one-sample t-tests. Week 8
Concepts from previous lectures HUMBEHV 3HB3 one-sample t-tests Week 8 Prof. Patrick Bennett sampling distributions - sampling error - standard error of the mean - degrees-of-freedom Null and alternative/research
More informationCENTRAL LIMIT THEOREM (CLT)
CENTRAL LIMIT THEOREM (CLT) A sampling distribution is the probability distribution of the sample statistic that is formed when samples of size n are repeatedly taken from a population. If the sample statistic
More informationChapter 24. Comparing Means. Copyright 2010 Pearson Education, Inc.
Chapter 24 Comparing Means Copyright 2010 Pearson Education, Inc. Plot the Data The natural display for comparing two groups is boxplots of the data for the two groups, placed side-by-side. For example:
More informationSampling Distributions
Sampling Distributions Sampling Distribution of the Mean & Hypothesis Testing Remember sampling? Sampling Part 1 of definition Selecting a subset of the population to create a sample Generally random sampling
More informationChapter 27 Summary Inferences for Regression
Chapter 7 Summary Inferences for Regression What have we learned? We have now applied inference to regression models. Like in all inference situations, there are conditions that we must check. We can test
More information1 MA421 Introduction. Ashis Gangopadhyay. Department of Mathematics and Statistics. Boston University. c Ashis Gangopadhyay
1 MA421 Introduction Ashis Gangopadhyay Department of Mathematics and Statistics Boston University c Ashis Gangopadhyay 1.1 Introduction 1.1.1 Some key statistical concepts 1. Statistics: Art of data analysis,
More informationStatistical Inference. Section 9.1 Significance Tests: The Basics. Significance Test. The Reasoning of Significance Tests.
Section 9.1 Significance Tests: The Basics Significance Test A significance test is a formal procedure for comparing observed data with a claim (also called a hypothesis) whose truth we want to assess.
More informationTwo-Sample Inferential Statistics
The t Test for Two Independent Samples 1 Two-Sample Inferential Statistics In an experiment there are two or more conditions One condition is often called the control condition in which the treatment is
More informationInference for Single Proportions and Means T.Scofield
Inference for Single Proportions and Means TScofield Confidence Intervals for Single Proportions and Means A CI gives upper and lower bounds between which we hope to capture the (fixed) population parameter
More informationReview of Statistics 101
Review of Statistics 101 We review some important themes from the course 1. Introduction Statistics- Set of methods for collecting/analyzing data (the art and science of learning from data). Provides methods
More informationDo students sleep the recommended 8 hours a night on average?
BIEB100. Professor Rifkin. Notes on Section 2.2, lecture of 27 January 2014. Do students sleep the recommended 8 hours a night on average? We first set up our null and alternative hypotheses: H0: μ= 8
More informationMath 2000 Practice Final Exam: Homework problems to review. Problem numbers
Math 2000 Practice Final Exam: Homework problems to review Pages: Problem numbers 52 20 65 1 181 14 189 23, 30 245 56 256 13 280 4, 15 301 21 315 18 379 14 388 13 441 13 450 10 461 1 553 13, 16 561 13,
More information2008 Winton. Statistical Testing of RNGs
1 Statistical Testing of RNGs Criteria for Randomness For a sequence of numbers to be considered a sequence of randomly acquired numbers, it must have two basic statistical properties: Uniformly distributed
More informationHypothesis testing I. - In particular, we are talking about statistical hypotheses. [get everyone s finger length!] n =
Hypothesis testing I I. What is hypothesis testing? [Note we re temporarily bouncing around in the book a lot! Things will settle down again in a week or so] - Exactly what it says. We develop a hypothesis,
More informationPsychology 282 Lecture #4 Outline Inferences in SLR
Psychology 282 Lecture #4 Outline Inferences in SLR Assumptions To this point we have not had to make any distributional assumptions. Principle of least squares requires no assumptions. Can use correlations
More informationBusiness Statistics. Lecture 5: Confidence Intervals
Business Statistics Lecture 5: Confidence Intervals Goals for this Lecture Confidence intervals The t distribution 2 Welcome to Interval Estimation! Moments Mean 815.0340 Std Dev 0.8923 Std Error Mean
More informationThe t-test: A z-score for a sample mean tells us where in the distribution the particular mean lies
The t-test: So Far: Sampling distribution benefit is that even if the original population is not normal, a sampling distribution based on this population will be normal (for sample size > 30). Benefit
More informationStatistical Inference for Means
Statistical Inference for Means Jamie Monogan University of Georgia February 18, 2011 Jamie Monogan (UGA) Statistical Inference for Means February 18, 2011 1 / 19 Objectives By the end of this meeting,
More informationReview of the Normal Distribution
Sampling and s Normal Distribution Aims of Sampling Basic Principles of Probability Types of Random Samples s of the Mean Standard Error of the Mean The Central Limit Theorem Review of the Normal Distribution
More informationME3620. Theory of Engineering Experimentation. Spring Chapter IV. Decision Making for a Single Sample. Chapter IV
Theory of Engineering Experimentation Chapter IV. Decision Making for a Single Sample Chapter IV 1 4 1 Statistical Inference The field of statistical inference consists of those methods used to make decisions
More information20 Hypothesis Testing, Part I
20 Hypothesis Testing, Part I Bob has told Alice that the average hourly rate for a lawyer in Virginia is $200 with a standard deviation of $50, but Alice wants to test this claim. If Bob is right, she
More informationMultiple Regression Analysis
Multiple Regression Analysis y = β 0 + β 1 x 1 + β 2 x 2 +... β k x k + u 2. Inference 0 Assumptions of the Classical Linear Model (CLM)! So far, we know: 1. The mean and variance of the OLS estimators
More informationChapter 5 Confidence Intervals
Chapter 5 Confidence Intervals Confidence Intervals about a Population Mean, σ, Known Abbas Motamedi Tennessee Tech University A point estimate: a single number, calculated from a set of data, that is
More informationCh18 links / ch18 pdf links Ch18 image t-dist table
Ch18 links / ch18 pdf links Ch18 image t-dist table ch18 (inference about population mean) exercises: 18.3, 18.5, 18.7, 18.9, 18.15, 18.17, 18.19, 18.27 CHAPTER 18: Inference about a Population Mean The
More information7.2 One-Sample Correlation ( = a) Introduction. Correlation analysis measures the strength and direction of association between
7.2 One-Sample Correlation ( = a) Introduction Correlation analysis measures the strength and direction of association between variables. In this chapter we will test whether the population correlation
More information(i) The mean and mode both equal the median; that is, the average value and the most likely value are both in the middle of the distribution.
MATH 183 Normal Distributions Dr. Neal, WKU Measurements that are normally distributed can be described in terms of their mean µ and standard deviation!. These measurements should have the following properties:
More informationIntroduction to Business Statistics QM 220 Chapter 12
Department of Quantitative Methods & Information Systems Introduction to Business Statistics QM 220 Chapter 12 Dr. Mohammad Zainal 12.1 The F distribution We already covered this topic in Ch. 10 QM-220,
More informationCOSC 341 Human Computer Interaction. Dr. Bowen Hui University of British Columbia Okanagan
COSC 341 Human Computer Interaction Dr. Bowen Hui University of British Columbia Okanagan 1 Last Class Introduced hypothesis testing Core logic behind it Determining results significance in scenario when:
More informationLecture Slides. Elementary Statistics Eleventh Edition. by Mario F. Triola. and the Triola Statistics Series 9.1-1
Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by Mario F. Triola Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. 9.1-1 Chapter 9 Inferences
More informationInference for the mean of a population. Testing hypotheses about a single mean (the one sample t-test). The sign test for matched pairs
Stat 528 (Autumn 2008) Inference for the mean of a population (One sample t procedures) Reading: Section 7.1. Inference for the mean of a population. The t distribution for a normal population. Small sample
More informationPHP2510: Principles of Biostatistics & Data Analysis. Lecture X: Hypothesis testing. PHP 2510 Lec 10: Hypothesis testing 1
PHP2510: Principles of Biostatistics & Data Analysis Lecture X: Hypothesis testing PHP 2510 Lec 10: Hypothesis testing 1 In previous lectures we have encountered problems of estimating an unknown population
More informationSTAT 515 fa 2016 Lec Statistical inference - hypothesis testing
STAT 515 fa 2016 Lec 20-21 Statistical inference - hypothesis testing Karl B. Gregory Wednesday, Oct 12th Contents 1 Statistical inference 1 1.1 Forms of the null and alternate hypothesis for µ and p....................
More informationINTRODUCTION TO ANALYSIS OF VARIANCE
CHAPTER 22 INTRODUCTION TO ANALYSIS OF VARIANCE Chapter 18 on inferences about population means illustrated two hypothesis testing situations: for one population mean and for the difference between two
More informationLECTURE 12 CONFIDENCE INTERVAL AND HYPOTHESIS TESTING
LECTURE 1 CONFIDENCE INTERVAL AND HYPOTHESIS TESTING INTERVAL ESTIMATION Point estimation of : The inference is a guess of a single value as the value of. No accuracy associated with it. Interval estimation
More informationStat 529 (Winter 2011) Experimental Design for the Two-Sample Problem. Motivation: Designing a new silver coins experiment
Stat 529 (Winter 2011) Experimental Design for the Two-Sample Problem Reading: 2.4 2.6. Motivation: Designing a new silver coins experiment Sample size calculations Margin of error for the pooled two sample
More informationHypothesis Testing hypothesis testing approach formulation of the test statistic
Hypothesis Testing For the next few lectures, we re going to look at various test statistics that are formulated to allow us to test hypotheses in a variety of contexts: In all cases, the hypothesis testing
More informationCHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups 10. Comparing Two Means The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers Comparing Two Means Learning
More informationCHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups 10.2 Comparing Two Means The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers Comparing Two Means Learning
More informationT.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS
ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS In our work on hypothesis testing, we used the value of a sample statistic to challenge an accepted value of a population parameter. We focused only
More informationInference About Two Means: Independent Samples
Inference About Two Means: Independent Samples MATH 130, Elements of Statistics I J. Robert Buchanan Department of Mathematics Fall 2017 Motivation Suppose we wish to study the mean absorption in muscle
More informationChapter 20 Comparing Groups
Chapter 20 Comparing Groups Comparing Proportions Example Researchers want to test the effect of a new anti-anxiety medication. In clinical testing, 64 of 200 people taking the medicine reported symptoms
More informationOrdinary Least Squares Regression Explained: Vartanian
Ordinary Least Squares Regression Explained: Vartanian When to Use Ordinary Least Squares Regression Analysis A. Variable types. When you have an interval/ratio scale dependent variable.. When your independent
More informationInferences About the Difference Between Two Means
7 Inferences About the Difference Between Two Means Chapter Outline 7.1 New Concepts 7.1.1 Independent Versus Dependent Samples 7.1. Hypotheses 7. Inferences About Two Independent Means 7..1 Independent
More informationData analysis and Geostatistics - lecture VI
Data analysis and Geostatistics - lecture VI Statistical testing with population distributions Statistical testing - the steps 1. Define a hypothesis to test in statistics only a hypothesis rejection is
More informationExam 2 (KEY) July 20, 2009
STAT 2300 Business Statistics/Summer 2009, Section 002 Exam 2 (KEY) July 20, 2009 Name: USU A#: Score: /225 Directions: This exam consists of six (6) questions, assessing material learned within Modules
More information9-6. Testing the difference between proportions /20
9-6 Testing the difference between proportions 1 Homework Discussion Question p514 Ex 9-6 p514 2, 3, 4, 7, 9, 11 (use both the critical value and p-value for all problems. 2 Objective Perform hypothesis
More informationSlides for Data Mining by I. H. Witten and E. Frank
Slides for Data Mining by I. H. Witten and E. Frank Predicting performance Assume the estimated error rate is 5%. How close is this to the true error rate? Depends on the amount of test data Prediction
More informationWarm-up Using the given data Create a scatterplot Find the regression line
Time at the lunch table Caloric intake 21.4 472 30.8 498 37.7 335 32.8 423 39.5 437 22.8 508 34.1 431 33.9 479 43.8 454 42.4 450 43.1 410 29.2 504 31.3 437 28.6 489 32.9 436 30.6 480 35.1 439 33.0 444
More informationObjectives Simple linear regression. Statistical model for linear regression. Estimating the regression parameters
Objectives 10.1 Simple linear regression Statistical model for linear regression Estimating the regression parameters Confidence interval for regression parameters Significance test for the slope Confidence
More informationCHAPTER 7. Hypothesis Testing
CHAPTER 7 Hypothesis Testing A hypothesis is a statement about one or more populations, and usually deal with population parameters, such as means or standard deviations. A research hypothesis is a conjecture
More information9. Linear Regression and Correlation
9. Linear Regression and Correlation Data: y a quantitative response variable x a quantitative explanatory variable (Chap. 8: Recall that both variables were categorical) For example, y = annual income,
More informationSection 10.1 (Part 2 of 2) Significance Tests: Power of a Test
1 Section 10.1 (Part 2 of 2) Significance Tests: Power of a Test Learning Objectives After this section, you should be able to DESCRIBE the relationship between the significance level of a test, P(Type
More informationECO220Y Simple Regression: Testing the Slope
ECO220Y Simple Regression: Testing the Slope Readings: Chapter 18 (Sections 18.3-18.5) Winter 2012 Lecture 19 (Winter 2012) Simple Regression Lecture 19 1 / 32 Simple Regression Model y i = β 0 + β 1 x
More informationLecture 14. Analysis of Variance * Correlation and Regression. The McGraw-Hill Companies, Inc., 2000
Lecture 14 Analysis of Variance * Correlation and Regression Outline Analysis of Variance (ANOVA) 11-1 Introduction 11-2 Scatter Plots 11-3 Correlation 11-4 Regression Outline 11-5 Coefficient of Determination
More informationLecture 14. Outline. Outline. Analysis of Variance * Correlation and Regression Analysis of Variance (ANOVA)
Outline Lecture 14 Analysis of Variance * Correlation and Regression Analysis of Variance (ANOVA) 11-1 Introduction 11- Scatter Plots 11-3 Correlation 11-4 Regression Outline 11-5 Coefficient of Determination
More informationClass 24. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700
Class 4 Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science Copyright 013 by D.B. Rowe 1 Agenda: Recap Chapter 9. and 9.3 Lecture Chapter 10.1-10.3 Review Exam 6 Problem Solving
More information