Questions 3.83, 6.11, 6.12, 6.17, 6.25, 6.29, 6.33, 6.35, 6.50, 6.51, 6.53, 6.55, 6.59, 6.60, 6.65, 6.69, 6.70, 6.77, 6.79, 6.89, 6.
|
|
- Christian Harper
- 5 years ago
- Views:
Transcription
1 Chapter 7 Reading 7.1, 7.2 Questions 3.83, 6.11, 6.12, 6.17, 6.25, 6.29, 6.33, 6.35, 6.50, 6.51, 6.53, 6.55, 6.59, 6.60, 6.65, 6.69, 6.70, 6.77, 6.79, 6.89, Introduction In Chapter 5 and 6, we emphasized sample distributions for a mean and assumed that the sample distribution was either Normal or could be approximated as Normal. We also generally assumed that we knew the population standard deviation. These assumptions allowed us to develop an understanding of a number of concepts such as the confidence interval, significance tests, P-values and Type I and Type II errors. These assumptions allowed us to compute a z-score and to establish critical values,. Confidence Intervals Hypothesis Testing P-Value is the probability, under the null hypothesis of having a z value more extreme then z score for the sample. A critical value can be used for a significance level α The presentation used generic terms such as the test statistic, critical values, and the standard deviation of the test statistic. These general terms anticipated the fact that most often, all we have is a sample! We don t know the standard deviation of a population. Nor can we assume that our test statistic is normally distributed. The z score/statistic represents a centering and rescaling of a random variable. It represents a deviation (from a mean) expressed in units of standard deviations. The t statistic can be thought of as an estimate for the z statistic that uses an estimate of the standard deviation. In Chapter 1, we computed/estimated the variance and standard deviation from a sample as: 1
2 When we know the standard deviation, σ, for the population, we computed the standard deviation for the sample mean as When we don t know the standard deviation, we estimate the standard deviation for the sample mean as is known as the standard error or standard error of the mean. The t statistic is: If (normally distributed with a mean μ and stand deviation σ) then, t will have a t-distribution with n-1 degrees of freedom. If we took repeated samples and recalculated t many times, the probability that the valued of t would be in some interval dt would be independent of μ and σ but would depend on the sample size n. For any particular sample size, there is a corresponding t distribution: t(1) is the t distribution based on a sample size of 2, t(2) is the t distribution based on a sample size of 3 and so on. The t distribution (for any sample size) looks very similar to the z distribution. It is a symmetric bell shaped curve. 2
3 x Figure 1: z density function (smooth) and t(5) density function (circles) The t distribution (for all degrees of freedom) is shorter (less probability in the middle) and so fatter at the tails (more probability in the tails) than the standard normal. Critical t * values can be computed for use in developing margins of error for confidence intervals or for one or two tailed hypothesis tests. The critical values identify points that partition X into events (subsets of the sample space) with specific probabilities. Inference for the mean (one sample) Whether we are using a z statistic (with a known σ) or a t statistic with an estimated we are always assuming that our population is normally distributed or that at least can be assumed to be approximately normally distributed. With this assumption, all of the previous formulas used when σ was known are repeated but with replaced with and z values replaced with corresponding t values. 3
4 Table 1Basic formulae used in hypothesis testing and parameter estimation Known σ and hence known C level confidence interval for the mean Estimated s and hence estimated Critical values of the t statistic, t * are based on a t distribution with k= n-1 degrees of freedom (t(k)). where P( z ) = C where P( t ) = C hypothesis test (two sided alternative) with a significance level α reject if reject if where P( z ) = α where P( t ) = α hypothesis test (one sided alternative) with a significance level α (the following considers just one of the one sided alternatives with a significance level α reject if where P(z ) = α P-Value where P(t reject if ) = α P(z is more extreme than ) P(t is more extreme than ) Table D Critical values,, used in the above formulas can be determined from Table A, a table of standard normal probabilities. The determination is tedious, particularly if you are using a confidence level or a significance level for a two tailed test. C or α must be divided by two and this probability needs to be isolated in the body of the table. The probability is seldom exactly represented leaving the user to consider interpolation between a couple of neighbouring values. What we really want in these situations is a table that allows us to lookup a specific value for C (like 95%) or a specific value for α (like 5%) and find the appropriate corresponding. Table D does this! Equivalent tables to table A, for determining critical values for would require a two page table for each possible degree of freedom. Table D contains the summary information from 37 different tables for the t distribution together with summary information from table A for the standard normal distribution. Each row of Table D displays critical values for (the last row displays critical values for.) The header row identifies an upper tail probability. The footer row identifies a confidence level C. A two tailed probability (α) can be computed as either two times the one tailed probability or as 1-C. The body of the table displays the corresponding critical value. The degrees of freedom (sample size minus 1, n-1) are used to identify each row. The degrees of freedom range from 1 to 30 and then jump to 40, 50, 60, 80, 100, and The last row displays the critical values determined from the standard normal. Inspection of the table reveals that as the sample size increases, the critical values from the corresponding t distribution get closer and closer to the critical values of the z distribution. (This makes intuitive sense in that the t distribution uses to estimate. As the sample size increases, this estimate gets better and better. 4
5 As an example of the use of Table D, consider a random sample of 25 SAT scores from a population of students. The sample mean is 450, the sample standard deviation, s, is 20 and the degrees of freedom, k is 24. The 95% confidence interval is: The interpretation is that if you claimed (in situations like this) that the population mean was between 442 and 458, you would be correct 95% of the time (or wrong 5% of the time.) Two Sample Tests The simplest two sample test is not really a two sample test as it involves matched pairs. This can arise in an experimental situation that involves selecting n pairs of similar experimental units, randomly assigning each unit in a matched pair to one of two treatments, and testing to see if there is a treatment 5
6 effect. In other situations, the response can be measured on the same individual before and after a treatment. In matched pairs, the focus is on the difference between the responses within a pair. It is a single sample (of size n for the n pairs) of differences. Any of the one sample inferences (Table 1) can be applied to the difference between pairs of a sample of matched pairs. The column you use will depend on whether you have a known or assumed standard deviation for the population or have estimated the standard deviation from the sample of differences. is replaced by, the average of the observed differences. The next level of complexity involves two samples that are assumed to come from a population or populations that are normally distributed (or approximately normal) and have a common standard deviation. As an example, a comparative experiment involving a control and a treatment would be considered, under the null hypothesis as two independent samples from a common population. Is there a sufficient difference between the sample means to reject the null hypothesis? If the common population standard deviation is σ, the two sample means are μ 1 and μ 2 and the samples are independent then the mean of the difference between the two means will be the difference between the means: and the variance of the difference will be the sum of the two variances: (Keep in mind that the above mean and variance represent a mean and variance of the difference of just two numbers, the means of the two samples. This difference, a single number, is a random variable and has a sampling distribution) If it is additionally assumed that the distributions of the sample means are normal or can be approximated as normal then: The left hand column of table 1 can now be used. is replaced by, the observed difference between the two sample means, and is replaced by., the population standard deviation for the difference, d. One can construct a confidence interval around the estimate, d, of the difference 6
7 between the two population averages. Or one can create a one-tailed or two tailed hypothesis test with some hypothesized value for the difference between the two population means. If σ is not known and must be estimated, we can use the two estimates, s 1 and s 2, and combine them to provide a pooled estimate of σ. Each estimate is combined in proportion to its degrees of freedom relative to the total degrees of freedom for the two samples. (The degrees of freedom reflect the number of independent observations that have been used to generate the estimate. An estimate of the sample variation uses n observations but also uses the estimated mean. The mean is said to use one degree of freedom. To put it another way, if I told you the mean for a sample of n points, and told you the values of n-1 points, you would be able to tell me the value of the n th point. The n th point, given the mean and the previous n-1 points, is, in a sense, superfluous. The effective number of points being used to estimate the variance is one less than the sample size.) Pooling estimates in proportion to their degrees of freedom places greater emphasis on those estimates that are based on larger samples. is the pooled estimate of the common variance, s 1 and s 2 are the estimated sample variances from the two samples. The best pooled estimated variance for each of the sample means uses this pooled estimate and each of the sample sizes: The estimated variance for the difference is the sum of these estimates (The t statistic, a test statistic, will have a t distribution with degrees of freedom equal to the sum of the degrees of freedom for the two samples.) Any of the one sample inferences (Table 1) can be applied to the difference between sample averages (with an assumed common standard deviation.) The column you use will depend on whether you have a known or assumed standard deviation for the population or have estimated the standard error of the difference (standard deviation) from the pooled estimates of the sample variances. All occurrences of are replaced by d. σ d and SE d are computed from the above equations. And n and the degrees of 7
8 freedom are the total number of observations in the two samples or the sum of the degrees of freedom from the two samples respectively. As an example, consider two schools that tested 25 and 36 randomly selected students. In the first school, the SAT scores had an average and standard deviation was 470 and 40 respectively. In the second school, the average and standard deviation was 455 and 42. Assuming that the distribution of SAT scores for both schools can be assumed to be approximated as Normal with a common standard deviation we can estimate the 95% confidence interval for the difference between the mean school scores. the critical value t * for the t statistic with 74 degrees of freedom for a confidence level of 95% is around The estimated difference between the two schools is 15 ± =15 ± A two tailed hypothesis test with an H 0 : μ = 0 would not be rejected at a significance level of.05. (The 95% confidence interval contains the point 0.) One can also use the formula and substitute d and SE d in place of and The null hypothesis would be rejected if As expected, we cannot conclude that there is a significant difference in the average SAT scores between the two schools. The final complication would be to consider two samples coming from two different populations with two different population variances. The computations are similar to those above. If we assume we know the two variances, and then the expected value (mean) of the difference between the sample means will be the difference of the expected values and the variance of the difference will be the sum of the variances of the two sample means. 8
9 the z score for the difference is: As before, we can proceed with the left column of Table 1 by substituting d for and for. A problem arises when we have to estimate the two population variations. We would like to use as a test statistic some standardize score for the difference between the two means. Recall that we developed the t statistic as if it was an approximation to the z statistic but with an estimate for the standard deviation: We can estimate means: as the square root of the sum of the estimated variances of the two sample If we do this, t is referred to as the two-sample t statistic. It does not have a t distribution! In fact, the actual distribution for the two-sample t statistic depends on the unknown variances and. However, the shape is still bell shaped and can be approximated by a t distribution. A good approximation is provided if, k for the t distribution is chosen to be one less than the smaller sample size. (Computer software can be used to estimate a better value for k. Since this value for k is being used to pick a t distribution curve to approximate a two-sample t distribution, the value of k may be a decimal number. The value of k is referred to as the degrees of freedom for the two sample t procedure but it is really just a value that allows you to pick the best approximating t distribution.) Once again, we can use Table 1 if we make all of the appropriate substitutions, including the estimated degrees of freedom. As an example, consider again the two schools that tested 25 and 36 randomly selected students. In the first school, the SAT scores had an average and standard deviation was 470 and 40 respectively. In the second school, the average and standard deviation was 455 and 42. Again, we assume that the distribution of SAT scores for both schools can be approximated as Normal but we don t assume that they necessarily have a common standard deviation. We can estimate the 95% confidence interval for the difference between the mean school scores first estimating the standard error of the difference: 9
10 and then computing a critical t * (24), The 95% confidence interval for the difference in mean SAT scores for the two schools is: 15 ± = 15 ± 21.9 This is just slightly more conservative (a broader confidence interval) than the one-sample t test based on a pooled estimate of a common variance. The two are very similar reflecting the similarity in the estimate of. Summary The focus of this chapter is on comparing the difference between the mean of two samples. In all situations, the data collected from the two samples is condensed into a single number (test statistic) that is a standardized value for the difference between the two populations. The test statistic will, under suitable assumptions, have a standard normal distribution, a t distribution, or a two-sample t distribution. The test statistic can be used to develop confidence intervals around the sample difference (the difference between the two sample means) or to compute P-values for hypothesis tests. The complications involve determining the standard deviation or standard error to be used in computing a standardized test statistic. Under the null hypothesis the test statistic can be computed as: Known σ z-distribution matched pairs Unknown σ t-distribution pooled two sample procedure, equal variances two sample procedure, unequal variance σ is the population standard deviation, is the average difference between matched pairs, n is the number of matched pairs, is the estimated standard deviation of the sample of differences between the matched pairs, d is the difference between the two sample means, n 1 is the sample size for population 1, n 2 is the sample size for population 2, df 1 is the degrees of freedom for the standard error 10
11 for sample 1, df 2 is the degrees of freedom for the standard error for sample 2, s 1 and s 2 are the estimated standard deviations from sample 1 and sample 2, and t 2 is a two-sample t statistic that has a distribution that is approximated by a t distribution with degrees of freedom equal to the smaller of df 1 or df 2. is the hypothesized difference between the two population means and is generally assumed to be zero. 11
Chapter 23. Inferences About Means. Monday, May 6, 13. Copyright 2009 Pearson Education, Inc.
Chapter 23 Inferences About Means Sampling Distributions of Means Now that we know how to create confidence intervals and test hypotheses about proportions, we do the same for means. Just as we did before,
More informationCh. 7. One sample hypothesis tests for µ and σ
Ch. 7. One sample hypothesis tests for µ and σ Prof. Tesler Math 18 Winter 2019 Prof. Tesler Ch. 7: One sample hypoth. tests for µ, σ Math 18 / Winter 2019 1 / 23 Introduction Data Consider the SAT math
More informationHYPOTHESIS TESTING. Hypothesis Testing
MBA 605 Business Analytics Don Conant, PhD. HYPOTHESIS TESTING Hypothesis testing involves making inferences about the nature of the population on the basis of observations of a sample drawn from the population.
More informationAMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015
AMS7: WEEK 7. CLASS 1 More on Hypothesis Testing Monday May 11th, 2015 Testing a Claim about a Standard Deviation or a Variance We want to test claims about or 2 Example: Newborn babies from mothers taking
More informationAdvanced Experimental Design
Advanced Experimental Design Topic Four Hypothesis testing (z and t tests) & Power Agenda Hypothesis testing Sampling distributions/central limit theorem z test (σ known) One sample z & Confidence intervals
More informationSampling distribution of t. 2. Sampling distribution of t. 3. Example: Gas mileage investigation. II. Inferential Statistics (8) t =
2. The distribution of t values that would be obtained if a value of t were calculated for each sample mean for all possible random of a given size from a population _ t ratio: (X - µ hyp ) t s x The result
More informationT.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS
ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS In our work on hypothesis testing, we used the value of a sample statistic to challenge an accepted value of a population parameter. We focused only
More informationStatistical Inference for Means
Statistical Inference for Means Jamie Monogan University of Georgia February 18, 2011 Jamie Monogan (UGA) Statistical Inference for Means February 18, 2011 1 / 19 Objectives By the end of this meeting,
More informationChapter 9 Inferences from Two Samples
Chapter 9 Inferences from Two Samples 9-1 Review and Preview 9-2 Two Proportions 9-3 Two Means: Independent Samples 9-4 Two Dependent Samples (Matched Pairs) 9-5 Two Variances or Standard Deviations Review
More informationDifference between means - t-test /25
Difference between means - t-test 1 Discussion Question p492 Ex 9-4 p492 1-3, 6-8, 12 Assume all variances are not equal. Ignore the test for variance. 2 Students will perform hypothesis tests for two
More informationCIVL /8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8
CIVL - 7904/8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8 Chi-square Test How to determine the interval from a continuous distribution I = Range 1 + 3.322(logN) I-> Range of the class interval
More informationAnalysis of Variance
Statistical Techniques II EXST7015 Analysis of Variance 15a_ANOVA_Introduction 1 Design The simplest model for Analysis of Variance (ANOVA) is the CRD, the Completely Randomized Design This model is also
More informationReview: General Approach to Hypothesis Testing. 1. Define the research question and formulate the appropriate null and alternative hypotheses.
1 Review: Let X 1, X,..., X n denote n independent random variables sampled from some distribution might not be normal!) with mean µ) and standard deviation σ). Then X µ σ n In other words, X is approximately
More informationCOSC 341 Human Computer Interaction. Dr. Bowen Hui University of British Columbia Okanagan
COSC 341 Human Computer Interaction Dr. Bowen Hui University of British Columbia Okanagan 1 Last Topic Distribution of means When it is needed How to build one (from scratch) Determining the characteristics
More informationTest 3 Practice Test A. NOTE: Ignore Q10 (not covered)
Test 3 Practice Test A NOTE: Ignore Q10 (not covered) MA 180/418 Midterm Test 3, Version A Fall 2010 Student Name (PRINT):............................................. Student Signature:...................................................
More information7.2 One-Sample Correlation ( = a) Introduction. Correlation analysis measures the strength and direction of association between
7.2 One-Sample Correlation ( = a) Introduction Correlation analysis measures the strength and direction of association between variables. In this chapter we will test whether the population correlation
More informationSingle Sample Means. SOCY601 Alan Neustadtl
Single Sample Means SOCY601 Alan Neustadtl The Central Limit Theorem If we have a population measured by a variable with a mean µ and a standard deviation σ, and if all possible random samples of size
More informationSTAT Chapter 8: Hypothesis Tests
STAT 515 -- Chapter 8: Hypothesis Tests CIs are possibly the most useful forms of inference because they give a range of reasonable values for a parameter. But sometimes we want to know whether one particular
More informationEXAM 3 Math 1342 Elementary Statistics 6-7
EXAM 3 Math 1342 Elementary Statistics 6-7 Name Date ********************************************************************************************************************************************** MULTIPLE
More informationChapter 23. Inference About Means
Chapter 23 Inference About Means 1 /57 Homework p554 2, 4, 9, 10, 13, 15, 17, 33, 34 2 /57 Objective Students test null and alternate hypotheses about a population mean. 3 /57 Here We Go Again Now that
More informationChapter 5 Confidence Intervals
Chapter 5 Confidence Intervals Confidence Intervals about a Population Mean, σ, Known Abbas Motamedi Tennessee Tech University A point estimate: a single number, calculated from a set of data, that is
More informationAn inferential procedure to use sample data to understand a population Procedures
Hypothesis Test An inferential procedure to use sample data to understand a population Procedures Hypotheses, the alpha value, the critical region (z-scores), statistics, conclusion Two types of errors
More informationINTERVAL ESTIMATION AND HYPOTHESES TESTING
INTERVAL ESTIMATION AND HYPOTHESES TESTING 1. IDEA An interval rather than a point estimate is often of interest. Confidence intervals are thus important in empirical work. To construct interval estimates,
More informationInference About Two Means: Independent Samples
Inference About Two Means: Independent Samples MATH 130, Elements of Statistics I J. Robert Buchanan Department of Mathematics Fall 2017 Motivation Suppose we wish to study the mean absorption in muscle
More informationSection 9.4. Notation. Requirements. Definition. Inferences About Two Means (Matched Pairs) Examples
Objective Section 9.4 Inferences About Two Means (Matched Pairs) Compare of two matched-paired means using two samples from each population. Hypothesis Tests and Confidence Intervals of two dependent means
More informationLecture 26: Chapter 10, Section 2 Inference for Quantitative Variable Confidence Interval with t
Lecture 26: Chapter 10, Section 2 Inference for Quantitative Variable Confidence Interval with t t Confidence Interval for Population Mean Comparing z and t Confidence Intervals When neither z nor t Applies
More informationElementary Statistics Triola, Elementary Statistics 11/e Unit 17 The Basics of Hypotheses Testing
(Section 8-2) Hypotheses testing is not all that different from confidence intervals, so let s do a quick review of the theory behind the latter. If it s our goal to estimate the mean of a population,
More informationTwo-sample inference: Continuous data
Two-sample inference: Continuous data Patrick Breheny April 6 Patrick Breheny University of Iowa to Biostatistics (BIOS 4120) 1 / 36 Our next several lectures will deal with two-sample inference for continuous
More informationContent by Week Week of October 14 27
Content by Week Week of October 14 27 Learning objectives By the end of this week, you should be able to: Understand the purpose and interpretation of confidence intervals for the mean, Calculate confidence
More informationComparing Means from Two-Sample
Comparing Means from Two-Sample Kwonsang Lee University of Pennsylvania kwonlee@wharton.upenn.edu April 3, 2015 Kwonsang Lee STAT111 April 3, 2015 1 / 22 Inference from One-Sample We have two options to
More informationChapter 24. Comparing Means
Chapter 4 Comparing Means!1 /34 Homework p579, 5, 7, 8, 10, 11, 17, 31, 3! /34 !3 /34 Objective Students test null and alternate hypothesis about two!4 /34 Plot the Data The intuitive display for comparing
More informationStatistical Inference: Estimation and Confidence Intervals Hypothesis Testing
Statistical Inference: Estimation and Confidence Intervals Hypothesis Testing 1 In most statistics problems, we assume that the data have been generated from some unknown probability distribution. We desire
More informationz and t tests for the mean of a normal distribution Confidence intervals for the mean Binomial tests
z and t tests for the mean of a normal distribution Confidence intervals for the mean Binomial tests Chapters 3.5.1 3.5.2, 3.3.2 Prof. Tesler Math 283 Fall 2018 Prof. Tesler z and t tests for mean Math
More informationInferences About Two Population Proportions
Inferences About Two Population Proportions MATH 130, Elements of Statistics I J. Robert Buchanan Department of Mathematics Fall 2018 Background Recall: for a single population the sampling proportion
More informationReview 6. n 1 = 85 n 2 = 75 x 1 = x 2 = s 1 = 38.7 s 2 = 39.2
Review 6 Use the traditional method to test the given hypothesis. Assume that the samples are independent and that they have been randomly selected ) A researcher finds that of,000 people who said that
More informationSTAT Chapter 9: Two-Sample Problems. Paired Differences (Section 9.3)
STAT 515 -- Chapter 9: Two-Sample Problems Paired Differences (Section 9.3) Examples of Paired Differences studies: Similar subjects are paired off and one of two treatments is given to each subject in
More informationProblem of the Day. 7.3 Hypothesis Testing for Mean (Small Samples n<30) Objective(s): Find critical values in a t-distribution
Problem of the Day Find the standardized test statistic (z) if the sample mean is 14, the standard deviation is 2, the sample size is 36, and the population mean is 15. 7.3 Hypothesis Testing for Mean
More informationReview of Statistics 101
Review of Statistics 101 We review some important themes from the course 1. Introduction Statistics- Set of methods for collecting/analyzing data (the art and science of learning from data). Provides methods
More informationVisual interpretation with normal approximation
Visual interpretation with normal approximation H 0 is true: H 1 is true: p =0.06 25 33 Reject H 0 α =0.05 (Type I error rate) Fail to reject H 0 β =0.6468 (Type II error rate) 30 Accept H 1 Visual interpretation
More informationInferential Statistics
Inferential Statistics Part 1 Sampling Distributions, Point Estimates & Confidence Intervals Inferential statistics are used to draw inferences (make conclusions/judgements) about a population from a sample.
More informationStatistical Inference. Why Use Statistical Inference. Point Estimates. Point Estimates. Greg C Elvers
Statistical Inference Greg C Elvers 1 Why Use Statistical Inference Whenever we collect data, we want our results to be true for the entire population and not just the sample that we used But our sample
More informationLECTURE 12 CONFIDENCE INTERVAL AND HYPOTHESIS TESTING
LECTURE 1 CONFIDENCE INTERVAL AND HYPOTHESIS TESTING INTERVAL ESTIMATION Point estimation of : The inference is a guess of a single value as the value of. No accuracy associated with it. Interval estimation
More information280 CHAPTER 9 TESTS OF HYPOTHESES FOR A SINGLE SAMPLE Tests of Statistical Hypotheses
280 CHAPTER 9 TESTS OF HYPOTHESES FOR A SINGLE SAMPLE 9-1.2 Tests of Statistical Hypotheses To illustrate the general concepts, consider the propellant burning rate problem introduced earlier. The null
More informationMathematical Notation Math Introduction to Applied Statistics
Mathematical Notation Math 113 - Introduction to Applied Statistics Name : Use Word or WordPerfect to recreate the following documents. Each article is worth 10 points and should be emailed to the instructor
More informationInferences for Regression
Inferences for Regression An Example: Body Fat and Waist Size Looking at the relationship between % body fat and waist size (in inches). Here is a scatterplot of our data set: Remembering Regression In
More informationCHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007)
FROM: PAGANO, R. R. (007) I. INTRODUCTION: DISTINCTION BETWEEN PARAMETRIC AND NON-PARAMETRIC TESTS Statistical inference tests are often classified as to whether they are parametric or nonparametric Parameter
More informationChapter 24. Comparing Means. Copyright 2010 Pearson Education, Inc.
Chapter 24 Comparing Means Copyright 2010 Pearson Education, Inc. Plot the Data The natural display for comparing two groups is boxplots of the data for the two groups, placed side-by-side. For example:
More informationPOLI 443 Applied Political Research
POLI 443 Applied Political Research Session 4 Tests of Hypotheses The Normal Curve Lecturer: Prof. A. Essuman-Johnson, Dept. of Political Science Contact Information: aessuman-johnson@ug.edu.gh College
More informationTwo-Sample Inferential Statistics
The t Test for Two Independent Samples 1 Two-Sample Inferential Statistics In an experiment there are two or more conditions One condition is often called the control condition in which the treatment is
More informationStudent s t-distribution. The t-distribution, t-tests, & Measures of Effect Size
Student s t-distribution The t-distribution, t-tests, & Measures of Effect Size Sampling Distributions Redux Chapter 7 opens with a return to the concept of sampling distributions from chapter 4 Sampling
More informationHow do we compare the relative performance among competing models?
How do we compare the relative performance among competing models? 1 Comparing Data Mining Methods Frequent problem: we want to know which of the two learning techniques is better How to reliably say Model
More informationMultiple Regression Analysis
Multiple Regression Analysis y = β 0 + β 1 x 1 + β 2 x 2 +... β k x k + u 2. Inference 0 Assumptions of the Classical Linear Model (CLM)! So far, we know: 1. The mean and variance of the OLS estimators
More informationChapter 7. Inference for Distributions. Introduction to the Practice of STATISTICS SEVENTH. Moore / McCabe / Craig. Lecture Presentation Slides
Chapter 7 Inference for Distributions Introduction to the Practice of STATISTICS SEVENTH EDITION Moore / McCabe / Craig Lecture Presentation Slides Chapter 7 Inference for Distributions 7.1 Inference for
More informationChap The McGraw-Hill Companies, Inc. All rights reserved.
11 pter11 Chap Analysis of Variance Overview of ANOVA Multiple Comparisons Tests for Homogeneity of Variances Two-Factor ANOVA Without Replication General Linear Model Experimental Design: An Overview
More informationhypotheses. P-value Test for a 2 Sample z-test (Large Independent Samples) n > 30 P-value Test for a 2 Sample t-test (Small Samples) n < 30 Identify α
Chapter 8 Notes Section 8-1 Independent and Dependent Samples Independent samples have no relation to each other. An example would be comparing the costs of vacationing in Florida to the cost of vacationing
More informationChapter 10. Regression. Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania
Chapter 10 Regression Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania Scatter Diagrams A graph in which pairs of points, (x, y), are
More informationTables Table A Table B Table C Table D Table E 675
BMTables.indd Page 675 11/15/11 4:25:16 PM user-s163 Tables Table A Standard Normal Probabilities Table B Random Digits Table C t Distribution Critical Values Table D Chi-square Distribution Critical Values
More informationHarvard University. Rigorous Research in Engineering Education
Statistical Inference Kari Lock Harvard University Department of Statistics Rigorous Research in Engineering Education 12/3/09 Statistical Inference You have a sample and want to use the data collected
More information16.400/453J Human Factors Engineering. Design of Experiments II
J Human Factors Engineering Design of Experiments II Review Experiment Design and Descriptive Statistics Research question, independent and dependent variables, histograms, box plots, etc. Inferential
More informationEvaluation. Andrea Passerini Machine Learning. Evaluation
Andrea Passerini passerini@disi.unitn.it Machine Learning Basic concepts requires to define performance measures to be optimized Performance of learning algorithms cannot be evaluated on entire domain
More informationLecture 10: Generalized likelihood ratio test
Stat 200: Introduction to Statistical Inference Autumn 2018/19 Lecture 10: Generalized likelihood ratio test Lecturer: Art B. Owen October 25 Disclaimer: These notes have not been subjected to the usual
More informationPHP2510: Principles of Biostatistics & Data Analysis. Lecture X: Hypothesis testing. PHP 2510 Lec 10: Hypothesis testing 1
PHP2510: Principles of Biostatistics & Data Analysis Lecture X: Hypothesis testing PHP 2510 Lec 10: Hypothesis testing 1 In previous lectures we have encountered problems of estimating an unknown population
More informationThe Components of a Statistical Hypothesis Testing Problem
Statistical Inference: Recall from chapter 5 that statistical inference is the use of a subset of a population (the sample) to draw conclusions about the entire population. In chapter 5 we studied one
More information16.3 One-Way ANOVA: The Procedure
16.3 One-Way ANOVA: The Procedure Tom Lewis Fall Term 2009 Tom Lewis () 16.3 One-Way ANOVA: The Procedure Fall Term 2009 1 / 10 Outline 1 The background 2 Computing formulas 3 The ANOVA Identity 4 Tom
More informationEvaluation requires to define performance measures to be optimized
Evaluation Basic concepts Evaluation requires to define performance measures to be optimized Performance of learning algorithms cannot be evaluated on entire domain (generalization error) approximation
More informationCHAPTER 10 HYPOTHESIS TESTING WITH TWO SAMPLES
CHAPTER 10 HYPOTHESIS TESTING WITH TWO SAMPLES In this chapter our hypothesis tests allow us to compare the means (or proportions) of two different populations using a sample from each population For example,
More informationStatistical Analysis for QBIC Genetics Adapted by Ellen G. Dow 2017
Statistical Analysis for QBIC Genetics Adapted by Ellen G. Dow 2017 I. χ 2 or chi-square test Objectives: Compare how close an experimentally derived value agrees with an expected value. One method to
More informationSTP 226 EXAMPLE EXAM #3 INSTRUCTOR:
STP 226 EXAMPLE EXAM #3 INSTRUCTOR: Honor Statement: I have neither given nor received information regarding this exam, and I will not do so until all exams have been graded and returned. Signed Date PRINTED
More informationObjectives Simple linear regression. Statistical model for linear regression. Estimating the regression parameters
Objectives 10.1 Simple linear regression Statistical model for linear regression Estimating the regression parameters Confidence interval for regression parameters Significance test for the slope Confidence
More informationInferential statistics
Inferential statistics Inference involves making a Generalization about a larger group of individuals on the basis of a subset or sample. Ahmed-Refat-ZU Null and alternative hypotheses In hypotheses testing,
More informationSTA Module 10 Comparing Two Proportions
STA 2023 Module 10 Comparing Two Proportions Learning Objectives Upon completing this module, you should be able to: 1. Perform large-sample inferences (hypothesis test and confidence intervals) to compare
More information11: Comparing Group Variances. Review of Variance
11: Comparing Group Variances Review of Variance Parametric measures of variability are often based on sum of squares (SS) around e mean: (1) For e data set {3, 4, 5, 8}, = 5 and SS = (3 5) + (4 5) + (5
More information4.1 Hypothesis Testing
4.1 Hypothesis Testing z-test for a single value double-sided and single-sided z-test for one average z-test for two averages double-sided and single-sided t-test for one average the F-parameter and F-table
More informationSimple Linear Regression
Simple Linear Regression ST 430/514 Recall: A regression model describes how a dependent variable (or response) Y is affected, on average, by one or more independent variables (or factors, or covariates)
More informationData Analysis and Statistical Methods Statistics 651
Data Analysis and Statistical Methods Statistics 65 http://www.stat.tamu.edu/~suhasini/teaching.html Suhasini Subba Rao Comparing populations Suppose I want to compare the heights of males and females
More information1 MA421 Introduction. Ashis Gangopadhyay. Department of Mathematics and Statistics. Boston University. c Ashis Gangopadhyay
1 MA421 Introduction Ashis Gangopadhyay Department of Mathematics and Statistics Boston University c Ashis Gangopadhyay 1.1 Introduction 1.1.1 Some key statistical concepts 1. Statistics: Art of data analysis,
More informationIntroduction to Business Statistics QM 220 Chapter 12
Department of Quantitative Methods & Information Systems Introduction to Business Statistics QM 220 Chapter 12 Dr. Mohammad Zainal 12.1 The F distribution We already covered this topic in Ch. 10 QM-220,
More informationLecture Slides. Elementary Statistics Eleventh Edition. by Mario F. Triola. and the Triola Statistics Series 9.1-1
Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by Mario F. Triola Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. 9.1-1 Chapter 9 Inferences
More informationSummary: the confidence interval for the mean (σ 2 known) with gaussian assumption
Summary: the confidence interval for the mean (σ known) with gaussian assumption on X Let X be a Gaussian r.v. with mean µ and variance σ. If X 1, X,..., X n is a random sample drawn from X then the confidence
More informationLecture 06. DSUR CH 05 Exploring Assumptions of parametric statistics Hypothesis Testing Power
Lecture 06 DSUR CH 05 Exploring Assumptions of parametric statistics Hypothesis Testing Power Introduction Assumptions When broken then we are not able to make inference or accurate descriptions about
More informationThe t-test: A z-score for a sample mean tells us where in the distribution the particular mean lies
The t-test: So Far: Sampling distribution benefit is that even if the original population is not normal, a sampling distribution based on this population will be normal (for sample size > 30). Benefit
More informationEC2001 Econometrics 1 Dr. Jose Olmo Room D309
EC2001 Econometrics 1 Dr. Jose Olmo Room D309 J.Olmo@City.ac.uk 1 Revision of Statistical Inference 1.1 Sample, observations, population A sample is a number of observations drawn from a population. Population:
More informationUnit 27 One-Way Analysis of Variance
Unit 27 One-Way Analysis of Variance Objectives: To perform the hypothesis test in a one-way analysis of variance for comparing more than two population means Recall that a two sample t test is applied
More informationBusiness Statistics. Lecture 10: Course Review
Business Statistics Lecture 10: Course Review 1 Descriptive Statistics for Continuous Data Numerical Summaries Location: mean, median Spread or variability: variance, standard deviation, range, percentiles,
More informationPolitical Science 236 Hypothesis Testing: Review and Bootstrapping
Political Science 236 Hypothesis Testing: Review and Bootstrapping Rocío Titiunik Fall 2007 1 Hypothesis Testing Definition 1.1 Hypothesis. A hypothesis is a statement about a population parameter The
More informationReview. One-way ANOVA, I. What s coming up. Multiple comparisons
Review One-way ANOVA, I 9.07 /15/00 Earlier in this class, we talked about twosample z- and t-tests for the difference between two conditions of an independent variable Does a trial drug work better than
More informationChapter 20 Comparing Groups
Chapter 20 Comparing Groups Comparing Proportions Example Researchers want to test the effect of a new anti-anxiety medication. In clinical testing, 64 of 200 people taking the medicine reported symptoms
More informationChapter 23: Inferences About Means
Chapter 3: Inferences About Means Sample of Means: number of observations in one sample the population mean (theoretical mean) sample mean (observed mean) is the theoretical standard deviation of the population
More informationProbability and Statistics
Probability and Statistics Kristel Van Steen, PhD 2 Montefiore Institute - Systems and Modeling GIGA - Bioinformatics ULg kristel.vansteen@ulg.ac.be CHAPTER 4: IT IS ALL ABOUT DATA 4a - 1 CHAPTER 4: IT
More informationCHAPTER 7. Hypothesis Testing
CHAPTER 7 Hypothesis Testing A hypothesis is a statement about one or more populations, and usually deal with population parameters, such as means or standard deviations. A research hypothesis is a conjecture
More informationNotes 3: Statistical Inference: Sampling, Sampling Distributions Confidence Intervals, and Hypothesis Testing
Notes 3: Statistical Inference: Sampling, Sampling Distributions Confidence Intervals, and Hypothesis Testing 1. Purpose of statistical inference Statistical inference provides a means of generalizing
More informationChapter 9. Inferences from Two Samples. Objective. Notation. Section 9.2. Definition. Notation. q = 1 p. Inferences About Two Proportions
Chapter 9 Inferences from Two Samples 9. Inferences About Two Proportions 9.3 Inferences About Two s (Independent) 9.4 Inferences About Two s (Matched Pairs) 9.5 Comparing Variation in Two Samples Objective
More informationQuantitative Methods for Economics, Finance and Management (A86050 F86050)
Quantitative Methods for Economics, Finance and Management (A86050 F86050) Matteo Manera matteo.manera@unimib.it Marzio Galeotti marzio.galeotti@unimi.it 1 This material is taken and adapted from Guy Judge
More informationy = a + bx 12.1: Inference for Linear Regression Review: General Form of Linear Regression Equation Review: Interpreting Computer Regression Output
12.1: Inference for Linear Regression Review: General Form of Linear Regression Equation y = a + bx y = dependent variable a = intercept b = slope x = independent variable Section 12.1 Inference for Linear
More informationExam 2 (KEY) July 20, 2009
STAT 2300 Business Statistics/Summer 2009, Section 002 Exam 2 (KEY) July 20, 2009 Name: USU A#: Score: /225 Directions: This exam consists of six (6) questions, assessing material learned within Modules
More informationStatistical inference (estimation, hypothesis tests, confidence intervals) Oct 2018
Statistical inference (estimation, hypothesis tests, confidence intervals) Oct 2018 Sampling A trait is measured on each member of a population. f(y) = propn of individuals in the popn with measurement
More informationOHSU OGI Class ECE-580-DOE :Design of Experiments Steve Brainerd
Why We Use Analysis of Variance to Compare Group Means and How it Works The question of how to compare the population means of more than two groups is an important one to researchers. Let us suppose that
More informationLine of symmetry Total area enclosed is 1
The Normal distribution The Normal distribution is a continuous theoretical probability distribution and, probably, the most important distribution in Statistics. Its name is justified by the fact that
More informationCHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups 10.1 Comparing Two Proportions The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers Comparing Two Proportions
More informationA3. Statistical Inference Hypothesis Testing for General Population Parameters
Appendix / A3. Statistical Inference / General Parameters- A3. Statistical Inference Hypothesis Testing for General Population Parameters POPULATION H 0 : θ = θ 0 θ is a generic parameter of interest (e.g.,
More information