Inference for the mean of a population. Testing hypotheses about a single mean (the one sample t-test). The sign test for matched pairs
- Francis Caldwell
- 5 years ago
Transcription
1 Stat 528 (Autumn 2008) Inference for the mean of a population (One sample t procedures)
Reading: Section 7.1.
- Inference for the mean of a population.
- The t distribution for a normal population.
- Small sample CI for µ in a normal population.
- Robustness of the t procedures.
- Testing hypotheses about a single mean (the one sample t-test).
- Methods for matched pairs: the paired t-test and the sign test for matched pairs.
- The power of the one sample t-test.
2 Inference for the mean of a population
So far we have based inference for the population mean on the Z statistic
$Z = \dfrac{\bar{X} - \mu}{\sigma/\sqrt{n}}$.
For large n, Z is approximately N(0,1).
Problem: in practice we do not know the population standard deviation, σ. Instead we use the sample standard deviation, s, as an estimate for σ.
3 The distribution of t for a normal population
Let $X_1, X_2, \ldots, X_n$ be an SRS from a normal population with population mean µ. Then the standardized variable
$t = \dfrac{\bar{X} - \mu}{s/\sqrt{n}}$
has a t distribution with n − 1 degrees of freedom (df).
- The impact of estimating σ is to add uncertainty about our standardization. Smaller n leads to fewer degrees of freedom and less certainty.
- We say that t has a $t_{n-1}$ distribution.
- The quantity $s/\sqrt{n}$ is the (estimated) standard error of the sample mean. It is denoted SE Mean in MINITAB.
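A minimal sketch of the standardization on this slide, using only the Python standard library. The data values and the hypothesized mean are made up for illustration, and the function name `one_sample_t` is not from the course notes.

```python
import math
import statistics

def one_sample_t(sample, mu):
    """Return (standard error of the mean, t statistic) for a hypothesized mean mu."""
    n = len(sample)
    xbar = statistics.mean(sample)
    s = statistics.stdev(sample)      # sample standard deviation (divisor n - 1)
    se = s / math.sqrt(n)             # the quantity MINITAB labels "SE Mean"
    return se, (xbar - mu) / se

sample = [1.0, 2.0, 3.0, 4.0, 5.0]    # hypothetical data, not from the slides
se, t = one_sample_t(sample, mu=2.0)
print(f"SE mean = {se:.4f}, t = {t:.4f}, df = {len(sample) - 1}")
```

Under the normal-population assumption, the printed t value would be compared with a t distribution on n − 1 = 4 degrees of freedom.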
4 Properties of the t distribution
[Figure: probability density plotted against value for the standard normal and for t with 5 df, 2 df, and 1 df.]
- The density curve is symmetric about zero and is bell-shaped like the normal distribution.
- The t distribution has heavier tails than the normal distribution (more spread out about zero).
- As the degrees of freedom increase, the tails become thinner and more of the density is concentrated in the center of the distribution.
- $t_\infty$ = standard normal distribution.
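The heavier tails can be checked numerically. The sketch below is an illustration, not part of the course notes: it integrates the t density with Simpson's rule (a choice made here to stay within the standard library) and compares the upper-tail probability beyond 2 for several degrees of freedom with the standard normal value. The cut-off 2 and the helper names are arbitrary.

```python
import math

def t_pdf(x, df):
    """Density of the t distribution with df degrees of freedom."""
    log_c = (math.lgamma((df + 1) / 2) - math.lgamma(df / 2)
             - 0.5 * math.log(df * math.pi))
    return math.exp(log_c) * (1 + x * x / df) ** (-(df + 1) / 2)

def t_upper_tail(x, df, steps=2000):
    """P(T > x) for x >= 0: Simpson's rule on [0, x], then symmetry about zero."""
    h = x / steps
    total = t_pdf(0.0, df) + t_pdf(x, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    return 0.5 - total * h / 3

def normal_upper_tail(x):
    """P(Z > x) for a standard normal Z."""
    return 0.5 * math.erfc(x / math.sqrt(2))

tails = {df: t_upper_tail(2.0, df) for df in (1, 2, 5, 30)}
for df, p in tails.items():
    print(f"t with {df:>2} df: P(T > 2) = {p:.4f}")
print(f"standard normal: P(Z > 2) = {normal_upper_tail(2.0):.4f}")
```

The tail probabilities shrink toward the normal value as the degrees of freedom grow, which is the pattern the slide describes.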
5 A small sample CI for µ (The normal population case)
For one random sample of normal data, a C = 100(1 − α)% level confidence interval for µ is given by
$\bar{x} \pm t_{n-1,\alpha/2}\, \dfrac{s}{\sqrt{n}}$,
where $t_{n-1,\alpha/2}$ is the critical value of the t distribution with n − 1 degrees of freedom. The $t_{n-1,\alpha/2}$ value is tabulated in Table D:
1. Look at the bottom of the table for the confidence level C of the two sided interval, OR
2. Look up α/2 as the upper tail probability p.
Recall that the CI for µ comes from a family of hypothesis tests about µ.
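As a sketch of the interval formula, the code below finds the critical value numerically instead of reading Table D. The numerical approach (Simpson's rule plus bisection), the helper names, and the sample values are all assumptions made for this illustration.

```python
import math
import statistics

def t_pdf(x, df):
    log_c = (math.lgamma((df + 1) / 2) - math.lgamma(df / 2)
             - 0.5 * math.log(df * math.pi))
    return math.exp(log_c) * (1 + x * x / df) ** (-(df + 1) / 2)

def t_upper_tail(x, df, steps=4000):
    """P(T > x) for x >= 0, via Simpson's rule on [0, x] and symmetry."""
    h = x / steps
    total = t_pdf(0.0, df) + t_pdf(x, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    return 0.5 - total * h / 3

def t_critical(upper_tail, df):
    """Bisection for t* such that P(T > t*) = upper_tail."""
    lo, hi = 0.0, 200.0
    for _ in range(60):
        mid = (lo + hi) / 2
        if t_upper_tail(mid, df) > upper_tail:
            lo = mid          # tail still too large: move right
        else:
            hi = mid
    return (lo + hi) / 2

def t_confidence_interval(sample, conf=0.95):
    """Return (lower, upper) for xbar +/- t* s/sqrt(n)."""
    n = len(sample)
    xbar = statistics.mean(sample)
    se = statistics.stdev(sample) / math.sqrt(n)
    t_star = t_critical((1 - conf) / 2, n - 1)
    return xbar - t_star * se, xbar + t_star * se

sample = [4.2, 5.1, 4.8, 5.6, 4.9, 5.3]   # hypothetical data
low, high = t_confidence_interval(sample, conf=0.95)
print(f"95% CI for the mean: ({low:.3f}, {high:.3f})")
```

The interval is only trustworthy under the slide's assumption of a random sample from a (near) normal population.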
6 Robustness of the t-procedures
What if the population is not normal? Can we still use the t distribution?
Practical guidelines from the textbook:
1. n < 15: Use t procedures if the data are close to normal. If the data are clearly non-normal or if outliers are present, do not use the t procedures.
2. n ≥ 15: Use t procedures except in the presence of strong skewness or outliers.
3. Roughly n ≥ 40: The t procedures are valid even for clearly skewed distributions.
Use plots of the data to help you decide!
7 Polymerization example
The article "Measuring and understanding the aging of kraft insulating paper in power transformers" contained observations on the degree of polymerization for paper specimens for which viscosity times concentration fell in a certain middle range (the data values are not preserved here).
Plots of the data show that a normality assumption for the data is reasonable. (Note that $\bar{x}$ = (value not preserved), s = 15.14, n = 17.)
- Form a 95% confidence interval for the true average degree of polymerization (as did the authors of the article).
- Does the interval suggest that 440 is a plausible value for the true average degree of polymerization? What about 450?
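Because the sample mean is missing from this transcription, only the margin of error can be worked out here. The calculation below is a sketch that uses s = 15.14 and n = 17 from the slide, together with the standard table value $t_{16,\,0.025} \approx 2.120$ (an assumption in the sense that it is read from a t table, not from the slide).

```latex
\[
\text{margin of error}
  = t_{16,\,0.025}\,\frac{s}{\sqrt{n}}
  \approx 2.120 \times \frac{15.14}{\sqrt{17}}
  \approx 2.120 \times 3.672
  \approx 7.78 .
\]
```

The 95% interval is therefore $\bar{x} \pm 7.78$, so a candidate value such as 440 or 450 is plausible exactly when it lies within about 7.78 of the sample mean.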
8 Testing hypotheses about a single mean: the one sample t test
Data: We assume $x_1, x_2, \ldots, x_n$ is a random sample from a normal population with mean µ.
We state our hypotheses:
$H_0: \mu = \mu_0$, for some constant value $\mu_0$
$H_a: \mu < \mu_0$, $\mu \neq \mu_0$, OR $\mu > \mu_0$
(remember to define what µ is, in words, for your problem).
We calculate the test statistic
$t = \dfrac{\bar{x} - \mu_0}{s/\sqrt{n}}$.
Under $H_0$, the test statistic follows a $t_{n-1}$ distribution.
Decision: Compare the observed t-statistic to the critical value found in Table D.
9 Drawing conclusions in the one-sample t-test
For a test of significance at the level α:
- If the observed t-statistic is in the tail, we reject $H_0$ (in favor of $H_a$).
- If the observed t-statistic is not in the tail, we do not reject $H_0$.
Alternatives and tails:
- For a two-tailed alternative, reject if $|t| \ge t_{\alpha/2}$.
- For an upper-tailed alternative, reject if $t \ge t_{\alpha}$.
- For a lower-tailed alternative, reject if $t \le -t_{\alpha}$.
As always, write your conclusion(s) in words. It is important to think about the assumptions that you made to carry out the t-test. Remember that some assumptions can be validated using plots of the data.
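The three rejection rules can be written as one small function. This is an illustrative sketch: the function name, the labels for the alternatives, and the example numbers are hypothetical, and the critical value is supplied by the caller (for example from Table D).

```python
def reject_h0(t, t_crit, alternative):
    """Apply the rejection rule for a one-sample t-test.

    t_crit is the positive critical value: t_{alpha/2} for a two-sided
    alternative, t_alpha for a one-sided alternative.
    """
    if alternative == "two-sided":
        return abs(t) >= t_crit
    if alternative == "greater":       # upper-tailed alternative
        return t >= t_crit
    if alternative == "less":          # lower-tailed alternative
        return t <= -t_crit
    raise ValueError("alternative must be 'two-sided', 'greater', or 'less'")

# Hypothetical observed statistic, with t_{16, 0.025} = 2.120 from a t table.
print(reject_h0(2.30, 2.120, "two-sided"))   # True: |t| is in the tail
print(reject_h0(-2.30, 2.120, "greater"))    # False: wrong tail for this alternative
```

The function only encodes the decision; the conclusion still has to be written in words and the assumptions checked.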
10 Example
The one-sample t statistic from a sample of n = 50 observations for the two-sided test of $H_0: \mu = 50$ versus $H_a: \mu \neq 50$ has the value t = 1.65.
- What are the degrees of freedom for the test statistic, t?
- Is the value t = 1.65 statistically significant at the 10% level? At the 5% level?
- Locate the two critical values t* from Table D that bracket t. What are the right-tail probabilities for these two values?
- How would you report the P-value for this test?
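For comparison with the Table D bracketing, the two-sided P-value can be approximated numerically. The sketch below uses t = 1.65 and df = 49 from the example; the numerical integration and the helper names are assumptions of this illustration, not the method used in the course.

```python
import math

def t_pdf(x, df):
    log_c = (math.lgamma((df + 1) / 2) - math.lgamma(df / 2)
             - 0.5 * math.log(df * math.pi))
    return math.exp(log_c) * (1 + x * x / df) ** (-(df + 1) / 2)

def t_upper_tail(x, df, steps=2000):
    """P(T > x) for x >= 0, via Simpson's rule on [0, x] and symmetry."""
    h = x / steps
    total = t_pdf(0.0, df) + t_pdf(x, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    return 0.5 - total * h / 3

n, t_obs = 50, 1.65
df = n - 1
p_two_sided = 2 * t_upper_tail(abs(t_obs), df)   # two-sided: double the tail area
print(f"df = {df}, two-sided P-value = {p_two_sided:.4f}")
print("significant at 10%:", p_two_sided < 0.10)
print("significant at 5%: ", p_two_sided < 0.05)
```

The result should agree with the table-based answer, which can only say that the P-value lies between twice the two bracketing right-tail probabilities.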
11 Matched pairs (revision and analysis)
Suppose we have two treatments. In the matched pairs design we try to gain precision in the response by matching pairs of similar individuals.
- Within each pair, we randomly assign one treatment to each subject (each subject receives only one treatment).
- Or an individual serves as his/her own partner: the individual receives both treatments.
Each pair of subjects (or each individual) forms its own block.
To analyze the results of this type of experiment, we compare the responses across the pairs (individuals). We usually take differences, and carry out the statistical inference using the paired t-test.
12 Football example
Two identical footballs, one air-filled and one helium-filled, were used outdoors on a windless day at The Ohio State University's athletic complex. The kicker was a novice punter and was not informed which football contained the helium. Each football was kicked 39 times. The kicker changed footballs after each kick so that his leg would play no favorites if he tired or improved with practice.
(Source: Lafferty, M. B. (1993), "OSU scientists get a kick out of sports controversy," The Columbus Dispatch (21 Nov 1993), B7.)
13 The data (all distances are in yards)
[Table with columns Trial, Air, Helium; the values are not preserved here.]
14 A scatterplot
[Figure not preserved.]
15 The paired t procedure: the setup
Suppose we have pairs of data values $(x_1, y_1), (x_2, y_2), \ldots, (x_n, y_n)$.
E.g., in our example the pairs of values are the (helium-filled, air-filled) distances for each kick.
Clearly the x and y values are not independent. Instead, we calculate the differences $d_i = y_i - x_i$, for each i = 1, ..., n.
We assume $d_1, d_2, \ldots, d_n$ is a random sample from a normal population with mean $\mu_d$ and stdev $\sigma_d$.
- $\mu_d$ is the population mean of the differences between the x and y values.
- $\sigma_d$ is the population stdev of the differences.
16 The paired t procedure
We want to test:
$H_0: \mu_d = \mu_0$, for some constant value $\mu_0$
$H_a: \mu_d < \mu_0$, $\mu_d \neq \mu_0$, OR $\mu_d > \mu_0$
We compute the test statistic
$t = \dfrac{\bar{d} - \mu_0}{s_d/\sqrt{n}}$,
where $\bar{d}$ is the sample average of the differences, and $s_d$ is the sample stdev of the differences.
Under $H_0$, the test statistic follows a $t_{n-1}$ distribution.
We make our decision in the same way that we did for the one-sample t-test:
- if the observed t-statistic is in the tail, we reject $H_0$;
- if the observed t-statistic is not in the tail, we do not reject $H_0$.
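A short sketch of the paired t computation: form the differences, then apply the one-sample formula to them. The paired values below are invented for illustration (the football data are not reproduced in this transcript), and `paired_t` is a hypothetical helper name.

```python
import math
import statistics

def paired_t(x, y, mu0=0.0):
    """Return (dbar, s_d, t, df) for the differences d_i = y_i - x_i."""
    d = [yi - xi for xi, yi in zip(x, y)]
    n = len(d)
    dbar = statistics.mean(d)
    s_d = statistics.stdev(d)                 # sample stdev of the differences
    t = (dbar - mu0) / (s_d / math.sqrt(n))
    return dbar, s_d, t, n - 1

x = [10, 12, 9, 11]     # hypothetical first measurements
y = [12, 13, 9, 14]     # hypothetical second measurements on the same units
dbar, s_d, t, df = paired_t(x, y)
print(f"dbar = {dbar:.3f}, s_d = {s_d:.3f}, t = {t:.3f}, df = {df}")
```

Note that the two columns are never treated as independent samples; all of the inference runs on the single column of differences.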
17 Identifying the hypotheses
There is a belief that, on average, a helium-filled ball travels further than an air-filled ball. State the appropriate $H_0$ and $H_a$. Be sure to identify the parameters appearing in the hypotheses.
18 Summary figures
[Figures not preserved.]
19 Performing the test
Carry out a test. Can you reject $H_0$ at the 5% significance level? At the 1% significance level? Write down your conclusion in words.
[MINITAB descriptive statistics for the variable Air-Helium, with columns N, N*, Mean, SE Mean, StDev, Minimum, Q1, Median, Q3, Maximum; the values are not preserved here.]
Provide a 90% confidence interval for the mean difference in the distances (air-filled minus helium-filled).
20 Inference for non-normal populations
If the data do not seem to be drawn from a normal population, then the t procedures may not be valid. Three possible strategies:
1. Learn about other probability distributions. For example, there are plenty of skewed distributions (e.g., exponential, gamma, Weibull). Use methods for these distributions instead of the methods for the normal distribution.
2. Transform your data to make them look as normal as possible (recall the ladder of power transformations). It can be hard to interpret the results when using a transformation.
3. Use distribution-free tests. These tests do not assume a particular distribution for the population. Often these tests are based on other parameters of the distribution, such as the median (rather than the mean). These tests can be less powerful in practice.
21 The sign test for matched pairs
Example of a distribution-free test. As before, consider pairs of data values: $(x_1, y_1), (x_2, y_2), \ldots, (x_n, y_n)$.
We will test
$H_0$: population median of differences = 0, versus
$H_a$: population median of differences ≠ 0.
Let $d_i = y_i - x_i$ (i = 1, ..., n) be the differences. Exclude the differences that are zero. Let X denote the count, out of the remaining m differences, that are positive.
Then under $H_0$, X is Binomial(m, 0.5). (If the median is zero, then a nonzero difference is equally likely to be above or below zero.)
If x is the observed X value, then the P-value is $2P(X \ge x)$ or $2P(X \le x)$, according to the tail in which x falls.
22 The sign test for matched pairs (cont.)
For the football example: Out of n = 39 differences, m = 37 differences are nonzero. Thus under $H_0$, X is Binomial(37, 0.5).
Out of the 37, we observe 17 that are above zero.
P-value = $2P(X \le 17)$ (numerical value not preserved). No evidence to reject $H_0$.
See the textbook for the one-sided test.
Note: If the population of differences is normally (or approximately normally) distributed, then this test will be less powerful at detecting differences than the paired t-test.
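The binomial tail probability for the football example can be computed exactly. This sketch uses m = 37 and x = 17 from the slide; the function name is hypothetical, and since the slide's own numerical P-value did not survive transcription, the printed figure is this sketch's computation, not a quotation.

```python
import math

def sign_test_p_value(m, x):
    """Two-sided sign test P-value when X ~ Binomial(m, 0.5) under H0.

    m: number of nonzero differences; x: number of positive differences.
    """
    lower = sum(math.comb(m, k) for k in range(0, x + 1)) / 2 ** m   # P(X <= x)
    upper = sum(math.comb(m, k) for k in range(x, m + 1)) / 2 ** m   # P(X >= x)
    return min(1.0, 2 * min(lower, upper))   # double the smaller tail, cap at 1

p_value = sign_test_p_value(37, 17)
print(f"two-sided sign test P-value = {p_value:.4f}")
```

A P-value this large is in line with the slide's conclusion that there is no evidence to reject the null hypothesis.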
23 The power of the one sample t-test
The power calculation for the one sample t-test is similar to the power calculation for the z-test. But the math is much harder! Instead we use MINITAB:
Stat > Power and Sample Size > 1-Sample t.
Under Options, select the Alternative Hypothesis and Significance Level.
Then enter any two of the following three items:
1. Sample sizes:
2. Differences:
3. Power values:
Enter the Standard deviation (the sample stdev in this case) and click OK.
24 A value for σ
There are four main ways to obtain a value for σ.
- Literature search. Use historical data from similar studies.
- Pilot study. Use the results of a pilot study. The estimate of σ will often need to be adjusted.
- Elicit σ. Two useful methods are the Range/4 method and the Range/6 method.
- Construct a value for σ. Some probability models yield a value for σ (e.g., for a Bernoulli RV, $\sigma = \sqrt{p(1-p)}$).
Be conservative. Use several methods and consider a slightly larger value of σ than these methods suggest.
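Two of these ideas reduce to one-line formulas. The sketch below assumes that the Range/4 and Range/6 methods mean dividing an elicited plausible range by 4 or 6; the slide only names the methods, so that reading, the function names, and the example range are assumptions.

```python
import math

def sd_from_range(low, high, divisor=4):
    """Rough sigma from an elicited range: (high - low) / divisor, divisor 4 or 6."""
    return (high - low) / divisor

def bernoulli_sd(p):
    """Standard deviation of a Bernoulli(p) random variable."""
    return math.sqrt(p * (1 - p))

# Hypothetical elicited range of 0 to 100 units.
print(sd_from_range(0, 100, divisor=4))   # Range/4 guess
print(sd_from_range(0, 100, divisor=6))   # Range/6 guess (smaller)
print(bernoulli_sd(0.5))                  # largest possible Bernoulli sd
```

Since Range/4 gives the larger of the two guesses, it is the more conservative choice in the sense of the slide's final advice.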
25 An agricultural field trial example
An agricultural field trial compares the yield of two varieties of tomatoes for commercial use. The researchers divide in half each of 10 small plots of land and plant each tomato variety on one half of each plot. After harvest, they compare the yields in pounds per plant at each location. The ten differences (Variety A − Variety B) give the following statistics: $\bar{x}$ = 0.46 and s = (value not preserved).
Is there convincing evidence that Variety A has the higher mean yield?
Let $\mu_d$ denote the population mean of the difference in the yields. We test: $H_0: \mu_d = 0$ versus $H_a: \mu_d > 0$.
The MINITAB output for the paired t test is:
One-Sample T, Test of mu = 0 vs > 0
[Output columns: N, Mean, StDev, SE Mean, 95% Lower Bound, T, P; the values are not preserved here.]
26 Agricultural trial (cont.)
The tomato experts who carried out the field trial suspect that the relative lack of significance is due to low power. They would like to detect a mean difference in yields of 0.6 pounds per plant at the 0.05 significance level. Based on the previous study, use 0.92 as an estimate of the population σ.
- What is the power of the test with n = 12 against the alternative of µ = 0.6?
- If the sample size is increased to n = 30 plots of land, what will be the power against the same alternative?
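The course uses MINITAB for this calculation. As a rough cross-check, power can also be estimated by simulation, which is the technique sketched below (it is not MINITAB's method). The sketch assumes normal differences, the one-sided alternative µ > 0 from the previous slide, σ = 0.92, and the standard table critical values $t_{11,\,0.05} = 1.796$ and $t_{29,\,0.05} = 1.699$; the function name, seed, and number of replications are arbitrary.

```python
import math
import random

def simulated_power(n, mu_alt, sigma, t_crit, reps=5000, seed=528):
    """Estimate the power of the upper-tailed one-sample t-test of H0: mu = 0."""
    rng = random.Random(seed)
    rejections = 0
    for _ in range(reps):
        sample = [rng.gauss(mu_alt, sigma) for _ in range(n)]
        xbar = sum(sample) / n
        s = math.sqrt(sum((v - xbar) ** 2 for v in sample) / (n - 1))
        t = xbar / (s / math.sqrt(n))      # test statistic with mu_0 = 0
        if t >= t_crit:                    # upper-tailed rejection rule
            rejections += 1
    return rejections / reps

power_12 = simulated_power(12, mu_alt=0.6, sigma=0.92, t_crit=1.796)
power_30 = simulated_power(30, mu_alt=0.6, sigma=0.92, t_crit=1.699)
print(f"estimated power, n = 12: {power_12:.3f}")
print(f"estimated power, n = 30: {power_30:.3f}")
```

The estimates carry simulation error of a percentage point or so, so they should be read as approximate companions to the MINITAB answers.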
More informationWarm-up Using the given data Create a scatterplot Find the regression line
Time at the lunch table Caloric intake 21.4 472 30.8 498 37.7 335 32.8 423 39.5 437 22.8 508 34.1 431 33.9 479 43.8 454 42.4 450 43.1 410 29.2 504 31.3 437 28.6 489 32.9 436 30.6 480 35.1 439 33.0 444
More informationLecture 26: Chapter 10, Section 2 Inference for Quantitative Variable Confidence Interval with t
Lecture 26: Chapter 10, Section 2 Inference for Quantitative Variable Confidence Interval with t t Confidence Interval for Population Mean Comparing z and t Confidence Intervals When neither z nor t Applies
More informationContinuous Improvement Toolkit. Probability Distributions. Continuous Improvement Toolkit.
Continuous Improvement Toolkit Probability Distributions The Continuous Improvement Map Managing Risk FMEA Understanding Performance** Check Sheets Data Collection PDPC RAID Log* Risk Analysis* Benchmarking***
More informationLecture Slides. Elementary Statistics. by Mario F. Triola. and the Triola Statistics Series
Lecture Slides Elementary Statistics Tenth Edition and the Triola Statistics Series by Mario F. Triola Slide 1 Chapter 9 Inferences from Two Samples 9-1 Overview 9-2 Inferences About Two Proportions 9-3
More informationDescribing distributions with numbers
Describing distributions with numbers A large number or numerical methods are available for describing quantitative data sets. Most of these methods measure one of two data characteristics: The central
More informationBasic Statistics. 1. Gross error analyst makes a gross mistake (misread balance or entered wrong value into calculation).
Basic Statistics There are three types of error: 1. Gross error analyst makes a gross mistake (misread balance or entered wrong value into calculation). 2. Systematic error - always too high or too low
More informationGEOMETRIC -discrete A discrete random variable R counts number of times needed before an event occurs
STATISTICS 4 Summary Notes. Geometric and Exponential Distributions GEOMETRIC -discrete A discrete random variable R counts number of times needed before an event occurs P(X = x) = ( p) x p x =,, 3,...
More informationCHAPTER 9, 10. Similar to a courtroom trial. In trying a person for a crime, the jury needs to decide between one of two possibilities:
CHAPTER 9, 10 Hypothesis Testing Similar to a courtroom trial. In trying a person for a crime, the jury needs to decide between one of two possibilities: The person is guilty. The person is innocent. To
More information401 Review. 6. Power analysis for one/two-sample hypothesis tests and for correlation analysis.
401 Review Major topics of the course 1. Univariate analysis 2. Bivariate analysis 3. Simple linear regression 4. Linear algebra 5. Multiple regression analysis Major analysis methods 1. Graphical analysis
More informationSTAT 328 (Statistical Packages)
Department of Statistics and Operations Research College of Science King Saud University Exercises STAT 328 (Statistical Packages) nashmiah r.alshammari ^-^ Excel and Minitab - 1 - Write the commands of
More informationThe Chi-Square Distributions
MATH 03 The Chi-Square Distributions Dr. Neal, Spring 009 The chi-square distributions can be used in statistics to analyze the standard deviation of a normally distributed measurement and to test the
More informationChapter 15: Nonparametric Statistics Section 15.1: An Overview of Nonparametric Statistics
Section 15.1: An Overview of Nonparametric Statistics Understand Difference between Parametric and Nonparametric Statistical Procedures Parametric statistical procedures inferential procedures that rely
More informationLecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2
Lecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2 Fall, 2013 Page 1 Random Variable and Probability Distribution Discrete random variable Y : Finite possible values {y
More informationAnalysis of Covariance. The following example illustrates a case where the covariate is affected by the treatments.
Analysis of Covariance In some experiments, the experimental units (subjects) are nonhomogeneous or there is variation in the experimental conditions that are not due to the treatments. For example, a
More informationBEGINNING BAYES IN R. Bayes with discrete models
BEGINNING BAYES IN R Bayes with discrete models Beginning Bayes in R Survey on eating out What is your favorite day for eating out? Construct a prior for p Define p: proportion of all students who answer
More informationChapter 9 Inferences from Two Samples
Chapter 9 Inferences from Two Samples 9-1 Review and Preview 9-2 Two Proportions 9-3 Two Means: Independent Samples 9-4 Two Dependent Samples (Matched Pairs) 9-5 Two Variances or Standard Deviations Review
More informationThe Chi-Square Distributions
MATH 183 The Chi-Square Distributions Dr. Neal, WKU The chi-square distributions can be used in statistics to analyze the standard deviation σ of a normally distributed measurement and to test the goodness
More informationRegression. Marc H. Mehlman University of New Haven
Regression Marc H. Mehlman marcmehlman@yahoo.com University of New Haven the statistician knows that in nature there never was a normal distribution, there never was a straight line, yet with normal and
More informationStatistics For Economics & Business
Statistics For Economics & Business Analysis of Variance In this chapter, you learn: Learning Objectives The basic concepts of experimental design How to use one-way analysis of variance to test for differences
More informationObjectives Simple linear regression. Statistical model for linear regression. Estimating the regression parameters
Objectives 10.1 Simple linear regression Statistical model for linear regression Estimating the regression parameters Confidence interval for regression parameters Significance test for the slope Confidence
More informationSection 10.1 (Part 2 of 2) Significance Tests: Power of a Test
1 Section 10.1 (Part 2 of 2) Significance Tests: Power of a Test Learning Objectives After this section, you should be able to DESCRIBE the relationship between the significance level of a test, P(Type
More informationChapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence Section 8.3 The Practice of Statistics, 4 th edition For AP* STARNES, YATES, MOORE The One-Sample z Interval for a Population Mean In Section 8.1, we estimated the
More informationLecture 17: Small-Sample Inferences for Normal Populations. Confidence intervals for µ when σ is unknown
Lecture 17: Small-Sample Inferences for Normal Populations Confidence intervals for µ when σ is unknown If the population distribution is normal, then X µ σ/ n has a standard normal distribution. If σ
More information