11: Comparing Group Variances. Review of Variance
|
|
- Clinton Adams
- 5 years ago
- Views:
Transcription
1 11: Comparing Group Variances Review of Variance Parametric measures of variability are often based on sum of squares (SS) around e mean: (1) For e data set {3, 4, 5, 8}, = 5 and SS = (3 5) + (4 5) + (5 5) + (8 5) = = 14. The variance is e mean sum or squares ( mean square ). The symbol denotes e population variance (parameter) and s denotes e sample variance. The population variance is seldom known, so we calculate e sample variance: () For {3, 4, 5, 8}, s = 14 / 3 = This is an unbiased estimate of. The standard deviation is e square root of e variance ( root mean square ): (3) For {3, 4, 5, 8}, s = (4.667) =.16. This is an unbiased estimate of. Interpretation of e standard deviation is tricky. One ing to keep in mind is at big standard deviations indicate big spreads and small standard deviations indicate small spreads. For example, if one group has a standard deviation of 15 and an oer has a standard deviation of, ere is much greater variability in e first group. There are also rules for interpreting standard deviations. For Normally distributed data, 68% of values lie in ±, 95% lie in ±, and nearly all values lie in ± 3. Most data are not Normally distributed, in which case we can use Chebychev s rule which states at at least 75% of e values lie wiin ±. Illustrative example (data set = agebycen.sav for center 1). Ages (years) of patients at a given center are: 60, 66, 65, 55, 6, 70, 51, 7, 58, 61, 71, 41, 70, 57, 55, 63, 64, 76, 74, 54, 58, 73 The mean = The sum of squares SS = ( ) + ( ) + +( ) = The sample variance s = / ( 1) = 75.1 The standard deviation s = From Chebychev s rule we predict at at least 75% of e ages lies in e interval 6.5 ± ()(8.7) = 6.5 ± 17.4, or between 45.1 and Page 11.1 (C:\data\StatPrimer\variance.wpd Print date: 8/14/06 )
2 Graphical Representations of Spread The data in e prior illustrative example (n = ) are displayed as a stemplot: (years) We can see from is plot at data spread from 41 to 76 around a center of about 60-someing. There is a low outlier, and e distribution have a negative skew. You can also display e data wi a dot plot or mean ± standard deviation plot as follows: Be aware at error bars in statistical charts may represent standard deviations, standard errors, or margins of error, which are quite different ings. A boxplot is also nice:the plot below shows e hinge-spread (smaller vertical arrow) and a whiskers-spread (larger vertical arrow) of e data. Chapter 3 provides instruction on construction and interpretation of boxplots. Page 11. (C:\data\StatPrimer\variance.wpd Print date: 8/14/06 )
3 Confidence Limits for a Variance 95% confidence interval for e population variance. The sample variance s² is an unbiased estimator of ². The 95% confidence intervals for (for Normal populations) is given by: (4) where SS represents e sum of square (formula 1), ² n-1,.975 is e 97.5 percentile on a chi-square distribution wi n 1 degrees of freedom, and ² is e.5 percentile on a chi-square distribution wi n 1 degrees of freedom. n-1,.05 Going from e variance to e sum of squares. If e sums of squares is not available, but e variance (or standard deviation) and sample size is, e sum of squares is (5) For e illustrative data, s = and n = 4. Therefore, SS = (4 1)(4.667) = Illustrative example. Consider e small data set on page 1 of is chapter in which SS = 14 and n = 4. For 95% confidence use ² 3,.975 = , ² 3,.05 = Thus, a 95% confidence interval for is = (1.50, 64.87). 95% confidence interval for e population standard deviation. The 95% confidence limits for population standard deviation is derived by taking e square root of e confidence limits for e variance. For e illustrative example, e 95% confidence interval for = ( 1.50, 64.87) = (1., 8.05). Page 11.3 (C:\data\StatPrimer\variance.wpd Print date: 8/14/06 )
4 Testing Variances for a Significant Difference When we have two independent samples, we might ask if e variances of e two populations differ. Consider e following fictitious data: Sample 1: {3, 4, 5, 8} Sample : {4, 6, 7, 9} 1 The first sample has s = and e second has s = Is it possible e observed difference reflects random variation and e variances in e population are e same? We test null hypoesis H : ² = ². 0 1 We ask What is e probability of taking samples from two populations wi identical variances while observing * sample variances as different as s 1 and s? If is probability is low (say, less an.06 ), we will reject H 0 and conclude e two samples came from populations wi unequal variances. If is probability is not too low, we will say ere is insufficient evidence to reject e null hypoesis. The most common procedure for e test is e F ratio test, which has is test statistic: (6) Notice at e larger variance is placed in e numerator of e statistic, and e smaller variance is placed in e denominator. The statistic is associated wi numerator and denominator degrees of freedom. The numerator degrees of freedom is df 1 = n1 1, where group 1 is e group wi e larger variance. The denominator degrees of freedom is df = n - 1. It is important to keep ese degrees of freedom in e correct numerator-denominator order. 1 Illustrative example (fictitious data). Group 1 is {3, 4, 5, 8} and group is {4, 6, 7, 9}; s = and s = We calculate F = s / s = / = 1.08 wi df = 4 1 = 3 and df = 4 1 = 3. stat stat We ask wheer e F statistic sufficiently far from 1.0 to reject H? To answer is question, we convert e F to a p value rough use of e appropriate F distribution. The test is one-tailed focusing on e upper extent of e F df1, df distribution. * Not a typo. Surely god loves = 0.06 as much as = Page 11.4 (C:\data\StatPrimer\variance.wpd Print date: 8/14/06 )
5 The F Distribution The F distribution is a family of distributions initially described by Fisher and Snedecor. Each member of e F distribution is identified by two parameters: df 1 (numerator degrees of freedom) and df (denominator degrees of freedom). F distributions are positively skewed, wi e extent of skewness determined by e distributions degrees of freedoms. df1,df,q 1 Notation: Let F denote e q percentile an F distribution wi df and df degrees of freedom. (The q percentile is greater an or equal to q% of e distribution.) As always, e area under e curve represents probability, and e total area under e curve sums to 1. The area under e curve to e right of e point F df1,df,q is p = 1 q: Our F table in is organized wi numerator degrees of freedom (df 1) shown in e first row of e table and denominator degrees of freedom (df ) shown in e first column. Only 95 percentile points are provided. Here s e first ten lines in e table: Numerator Degrees of Freedom * * D e n o m i n a t Page 11.5 (C:\data\StatPrimer\variance.wpd Print date: 8/14/06 )
6 Example: The 95 percentile on F 1,9, is 5.1 (italicize). Suppose we calculate F stat = 6.01 wi df 1 = 1 and df = 9. Since is F stat is situated to e right of e 95 percentile point, we know e P-value is less an More precise P-values can be derived wi e proper statistical utility (e.g., StaTable, WinPepi). For example, we can use StaTable to determine at an F of 6.01 wi df = 1 and df = 9 is equivalent to P = stat 1 Illustrative example. Recall at wi our illustrative data calculated F stat = 1.08 wi df 1 = 4 1 = 3 and df = 4 1 = 3. Using e F table we find at e 95 percentile on F 3,3 is 9.8. Therefore, P > Using a software utility determine P = Thus, H 0 is retained. [Keep in mind at a non-significant test does not provide evidence of equal variances, especially when samples are small. It merely says e evidence is insufficient to reject H 0: equal variances] Page 11.6 (C:\data\StatPrimer\variance.wpd Print date: 8/14/06 )
7 Pooling Variances It is common in statistical practice to pool (average) group variances to come up wi a more reliable estimate of variability. This should be done only if data suggest (e.g., wi a negative F ratio test). The pooled estimate of variance (s² p) is: 1 (7) where df = df 1 + df. Illustrative example. Group 1 is {3, 4, 5, 8} and group is {4, 6, 7, 9}. We ve already calculated s 1 = and s = Note df 1 = 4-1 = 3, df = 4-1 = 3, and df = = 6. Thus:. The pooled estimate of e variance can be used to derive e pooled standard error of e mean difference: (8) Use is standard error to test H 0: 1 = when and calculate a confidence interval for 1 when population variances are equal. The equal variance test statistic is: (9) The equal variance 95% confidence interval for 1 is: (10) Illustrative example. For e illustrative data = 1.5, and t stat = 1.00 wi df = = 6. A t stat wi 6 degrees of freedom (p =.36). The 95% confidence 1 = ( ) ± (.48)(1.5) = 1.5 ± 3.7 = ( 5. to.). Page 11.7 (C:\data\StatPrimer\variance.wpd Print date: 8/14/06 )
8 When Variances Should Not be Pooled If EDA and F testing suggest variances differ, sample variance should not be pooled and e equal variance t test should be avoided. Inference can still be pursed while assuming ² 1 ² using e Behrens-Fisher procedures (also called e unequal variance t procedure). The t statistic and confidence interval formulas for is procedure are e same as at for e equal variance procedures. What differs, however, is e standard error, which is: (11) There are two ways to calculate e df for is statistic. SPSS uses is formula: When working by hand, use e smaller of df1 or df. A 95% confidence interval for 1 wiout assuming equal variance is. Illustrative example (Familial blood glucose levels hypoetical data). Blood glucose levels are determined for twenty-five (n = 5) 5-year olds whose faers have type II diabetes ( cases ). That cases have mean fasting blood 1 glucose levels ( ) of mg/dl wi a standard deviation (s 1) of 9.6 mg/dl. A group of control group of 0 children (n = 0) from e same census track whose faers have no history of type II diabetes demonstrate = 99.7 mg/dl and s = 5. mg/dl. We want to test e means for inequality but suspect population variances differ significantly. In testing, H 0: ² 1 = ², we calculate F stat = 9.6² / 5.² = 3.41 wi df 1 = 4 and df = 19 (p =.0084), concluding variances differ significantly. An unequal t procedure will be pursued. In applying an unequal variance t test, =.45, = The df" is a tedious to calculate. Wiout software, we use e smaller of n1 1 or n 1, which in is case is 19. (The actual df'' = 38). This will provide conservative inference for e two populations. Using a t distribution wi df = 19, we derive.00 < p <.01. The 95% confidence interval for e data= ( ) ± (.09)(.45) = 7.6 ± 4.7 = (.9, 1.3). Page 11.8 (C:\data\StatPrimer\variance.wpd Print date: 8/14/06 )
13: Additional ANOVA Topics. Post hoc Comparisons
13: Additional ANOVA Topics Post hoc Comparisons ANOVA Assumptions Assessing Group Variances When Distributional Assumptions are Severely Violated Post hoc Comparisons In the prior chapter we used ANOVA
More informationChapter 23: Inferences About Means
Chapter 3: Inferences About Means Sample of Means: number of observations in one sample the population mean (theoretical mean) sample mean (observed mean) is the theoretical standard deviation of the population
More informationTables Table A Table B Table C Table D Table E 675
BMTables.indd Page 675 11/15/11 4:25:16 PM user-s163 Tables Table A Standard Normal Probabilities Table B Random Digits Table C t Distribution Critical Values Table D Chi-square Distribution Critical Values
More informationQuestions 3.83, 6.11, 6.12, 6.17, 6.25, 6.29, 6.33, 6.35, 6.50, 6.51, 6.53, 6.55, 6.59, 6.60, 6.65, 6.69, 6.70, 6.77, 6.79, 6.89, 6.
Chapter 7 Reading 7.1, 7.2 Questions 3.83, 6.11, 6.12, 6.17, 6.25, 6.29, 6.33, 6.35, 6.50, 6.51, 6.53, 6.55, 6.59, 6.60, 6.65, 6.69, 6.70, 6.77, 6.79, 6.89, 6.112 Introduction In Chapter 5 and 6, we emphasized
More informationThe t-test: A z-score for a sample mean tells us where in the distribution the particular mean lies
The t-test: So Far: Sampling distribution benefit is that even if the original population is not normal, a sampling distribution based on this population will be normal (for sample size > 30). Benefit
More informationSTATISTICS 141 Final Review
STATISTICS 141 Final Review Bin Zou bzou@ualberta.ca Department of Mathematical & Statistical Sciences University of Alberta Winter 2015 Bin Zou (bzou@ualberta.ca) STAT 141 Final Review Winter 2015 1 /
More informationClassroom Activity 7 Math 113 Name : 10 pts Intro to Applied Stats
Classroom Activity 7 Math 113 Name : 10 pts Intro to Applied Stats Materials Needed: Bags of popcorn, watch with second hand or microwave with digital timer. Instructions: Follow the instructions on the
More informationSection 9.4. Notation. Requirements. Definition. Inferences About Two Means (Matched Pairs) Examples
Objective Section 9.4 Inferences About Two Means (Matched Pairs) Compare of two matched-paired means using two samples from each population. Hypothesis Tests and Confidence Intervals of two dependent means
More informationPurposes of Data Analysis. Variables and Samples. Parameters and Statistics. Part 1: Probability Distributions
Part 1: Probability Distributions Purposes of Data Analysis True Distributions or Relationships in the Earths System Probability Distribution Normal Distribution Student-t Distribution Chi Square Distribution
More information0 0'0 2S ~~ Employment category
Analyze Phase 331 60000 50000 40000 30000 20000 10000 O~----,------.------,------,,------,------.------,----- N = 227 136 27 41 32 5 ' V~ 00 0' 00 00 i-.~ fl' ~G ~~ ~O~ ()0 -S 0 -S ~~ 0 ~~ 0 ~G d> ~0~
More informationSociology 6Z03 Review II
Sociology 6Z03 Review II John Fox McMaster University Fall 2016 John Fox (McMaster University) Sociology 6Z03 Review II Fall 2016 1 / 35 Outline: Review II Probability Part I Sampling Distributions Probability
More information2011 Pearson Education, Inc
Statistics for Business and Economics Chapter 2 Methods for Describing Sets of Data Summary of Central Tendency Measures Measure Formula Description Mean x i / n Balance Point Median ( n +1) Middle Value
More informationMathematical Notation Math Introduction to Applied Statistics
Mathematical Notation Math 113 - Introduction to Applied Statistics Name : Use Word or WordPerfect to recreate the following documents. Each article is worth 10 points and should be emailed to the instructor
More informationGlossary for the Triola Statistics Series
Glossary for the Triola Statistics Series Absolute deviation The measure of variation equal to the sum of the deviations of each value from the mean, divided by the number of values Acceptance sampling
More informationNotes for Week 13 Analysis of Variance (ANOVA) continued WEEK 13 page 1
Notes for Wee 13 Analysis of Variance (ANOVA) continued WEEK 13 page 1 Exam 3 is on Friday May 1. A part of one of the exam problems is on Predictiontervals : When randomly sampling from a normal population
More informationAMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015
AMS7: WEEK 7. CLASS 1 More on Hypothesis Testing Monday May 11th, 2015 Testing a Claim about a Standard Deviation or a Variance We want to test claims about or 2 Example: Newborn babies from mothers taking
More informationFundamentals to Biostatistics. Prof. Chandan Chakraborty Associate Professor School of Medical Science & Technology IIT Kharagpur
Fundamentals to Biostatistics Prof. Chandan Chakraborty Associate Professor School of Medical Science & Technology IIT Kharagpur Statistics collection, analysis, interpretation of data development of new
More informationLecture 26: Chapter 10, Section 2 Inference for Quantitative Variable Confidence Interval with t
Lecture 26: Chapter 10, Section 2 Inference for Quantitative Variable Confidence Interval with t t Confidence Interval for Population Mean Comparing z and t Confidence Intervals When neither z nor t Applies
More informationBusiness Statistics. Lecture 10: Course Review
Business Statistics Lecture 10: Course Review 1 Descriptive Statistics for Continuous Data Numerical Summaries Location: mean, median Spread or variability: variance, standard deviation, range, percentiles,
More informationSTP 420 INTRODUCTION TO APPLIED STATISTICS NOTES
INTRODUCTION TO APPLIED STATISTICS NOTES PART - DATA CHAPTER LOOKING AT DATA - DISTRIBUTIONS Individuals objects described by a set of data (people, animals, things) - all the data for one individual make
More information9 One-Way Analysis of Variance
9 One-Way Analysis of Variance SW Chapter 11 - all sections except 6. The one-way analysis of variance (ANOVA) is a generalization of the two sample t test to k 2 groups. Assume that the populations of
More informationStudent s t-distribution. The t-distribution, t-tests, & Measures of Effect Size
Student s t-distribution The t-distribution, t-tests, & Measures of Effect Size Sampling Distributions Redux Chapter 7 opens with a return to the concept of sampling distributions from chapter 4 Sampling
More informationCHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007)
FROM: PAGANO, R. R. (007) I. INTRODUCTION: DISTINCTION BETWEEN PARAMETRIC AND NON-PARAMETRIC TESTS Statistical inference tests are often classified as to whether they are parametric or nonparametric Parameter
More informationDover- Sherborn High School Mathematics Curriculum Probability and Statistics
Mathematics Curriculum A. DESCRIPTION This is a full year courses designed to introduce students to the basic elements of statistics and probability. Emphasis is placed on understanding terminology and
More informationCHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups 10. Comparing Two Means The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers Comparing Two Means Learning
More informationCHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups 10.2 Comparing Two Means The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers Comparing Two Means Learning
More information10/31/2012. One-Way ANOVA F-test
PSY 511: Advanced Statistics for Psychological and Behavioral Research 1 1. Situation/hypotheses 2. Test statistic 3.Distribution 4. Assumptions One-Way ANOVA F-test One factor J>2 independent samples
More informationAn inferential procedure to use sample data to understand a population Procedures
Hypothesis Test An inferential procedure to use sample data to understand a population Procedures Hypotheses, the alpha value, the critical region (z-scores), statistics, conclusion Two types of errors
More informationWhat is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty.
What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty. Statistics is a field of study concerned with the data collection,
More informationData Analysis and Statistical Methods Statistics 651
Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching.html Suhasini Subba Rao Motivations for the ANOVA We defined the F-distribution, this is mainly used in
More informationLecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2
Lecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2 Fall, 2013 Page 1 Random Variable and Probability Distribution Discrete random variable Y : Finite possible values {y
More informationT.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS
ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS In our work on hypothesis testing, we used the value of a sample statistic to challenge an accepted value of a population parameter. We focused only
More informationStatistical Analysis of Chemical Data Chapter 4
Statistical Analysis of Chemical Data Chapter 4 Random errors arise from limitations on our ability to make physical measurements and on natural fluctuations Random errors arise from limitations on our
More informationReview of Statistics 101
Review of Statistics 101 We review some important themes from the course 1. Introduction Statistics- Set of methods for collecting/analyzing data (the art and science of learning from data). Provides methods
More informationEstimating a Population Mean. Section 7-3
Estimating a Population Mean Section 7-3 The Mean Age Suppose that we wish to estimate the mean age of residents of Metropolis. Furthermore, suppose we know that the ages of Metropolis residents are normally
More informationUnit 27 One-Way Analysis of Variance
Unit 27 One-Way Analysis of Variance Objectives: To perform the hypothesis test in a one-way analysis of variance for comparing more than two population means Recall that a two sample t test is applied
More information1 Introduction to Minitab
1 Introduction to Minitab Minitab is a statistical analysis software package. The software is freely available to all students and is downloadable through the Technology Tab at my.calpoly.edu. When you
More informationSTAT 200 Chapter 1 Looking at Data - Distributions
STAT 200 Chapter 1 Looking at Data - Distributions What is Statistics? Statistics is a science that involves the design of studies, data collection, summarizing and analyzing the data, interpreting the
More informationDETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics
DETAILED CONTENTS About the Author Preface to the Instructor To the Student How to Use SPSS With This Book PART I INTRODUCTION AND DESCRIPTIVE STATISTICS 1. Introduction to Statistics 1.1 Descriptive and
More informationAdvanced Experimental Design
Advanced Experimental Design Topic Four Hypothesis testing (z and t tests) & Power Agenda Hypothesis testing Sampling distributions/central limit theorem z test (σ known) One sample z & Confidence intervals
More informationTABLES AND FORMULAS FOR MOORE Basic Practice of Statistics
TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics Exploring Data: Distributions Look for overall pattern (shape, center, spread) and deviations (outliers). Mean (use a calculator): x = x 1 + x
More informationTwo sample hypothesis testing
Statistics February 26, 2014 Debdeep Pati Two sample hypothesis testing 1. Suppose we want to study the relationship between use of oral contraceptives (OC) and level of blood pressure (BP) in women. 2.
More informationUsing SPSS for One Way Analysis of Variance
Using SPSS for One Way Analysis of Variance This tutorial will show you how to use SPSS version 12 to perform a one-way, between- subjects analysis of variance and related post-hoc tests. This tutorial
More informationLecture 6: Chapter 4, Section 2 Quantitative Variables (Displays, Begin Summaries)
Lecture 6: Chapter 4, Section 2 Quantitative Variables (Displays, Begin Summaries) Summarize with Shape, Center, Spread Displays: Stemplots, Histograms Five Number Summary, Outliers, Boxplots Cengage Learning
More informationUnit 9: Inferences for Proportions and Count Data
Unit 9: Inferences for Proportions and Count Data Statistics 571: Statistical Methods Ramón V. León 12/15/2008 Unit 9 - Stat 571 - Ramón V. León 1 Large Sample Confidence Interval for Proportion ( pˆ p)
More information14: Correlation. Introduction Scatter Plot The Correlational Coefficient Hypothesis Test Assumptions An Additional Example
14: Correlation Introduction Scatter Plot The Correlational Coefficient Hypothesis Test Assumptions An Additional Example Introduction Correlation quantifies the extent to which two quantitative variables,
More informationTopic 22 Analysis of Variance
Topic 22 Analysis of Variance Comparing Multiple Populations 1 / 14 Outline Overview One Way Analysis of Variance Sample Means Sums of Squares The F Statistic Confidence Intervals 2 / 14 Overview Two-sample
More information+ Specify 1 tail / 2 tail
Week 2: Null hypothesis Aeroplane seat designer wonders how wide to make the plane seats. He assumes population average hip size μ = 43.2cm Sample size n = 50 Question : Is the assumption μ = 43.2cm reasonable?
More informationChapter 13 Correlation
Chapter Correlation Page. Pearson correlation coefficient -. Inferential tests on correlation coefficients -9. Correlational assumptions -. on-parametric measures of correlation -5 5. correlational example
More informationThe t-statistic. Student s t Test
The t-statistic 1 Student s t Test When the population standard deviation is not known, you cannot use a z score hypothesis test Use Student s t test instead Student s t, or t test is, conceptually, very
More informationStat 427/527: Advanced Data Analysis I
Stat 427/527: Advanced Data Analysis I Review of Chapters 1-4 Sep, 2017 1 / 18 Concepts you need to know/interpret Numerical summaries: measures of center (mean, median, mode) measures of spread (sample
More informationChapter 5. Understanding and Comparing. Distributions
STAT 141 Introduction to Statistics Chapter 5 Understanding and Comparing Distributions Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter 2015 1 / 27 Boxplots How to create a boxplot? Assume
More informationSTAT 501 Assignment 2 NAME Spring Chapter 5, and Sections in Johnson & Wichern.
STAT 01 Assignment NAME Spring 00 Reading Assignment: Written Assignment: Chapter, and Sections 6.1-6.3 in Johnson & Wichern. Due Monday, February 1, in class. You should be able to do the first four problems
More informationUnit 9: Inferences for Proportions and Count Data
Unit 9: Inferences for Proportions and Count Data Statistics 571: Statistical Methods Ramón V. León 1/15/008 Unit 9 - Stat 571 - Ramón V. León 1 Large Sample Confidence Interval for Proportion ( pˆ p)
More informationSTA 101 Final Review
STA 101 Final Review Statistics 101 Thomas Leininger June 24, 2013 Announcements All work (besides projects) should be returned to you and should be entered on Sakai. Office Hour: 2 3pm today (Old Chem
More informationINTRODUCTION TO ANALYSIS OF VARIANCE
CHAPTER 22 INTRODUCTION TO ANALYSIS OF VARIANCE Chapter 18 on inferences about population means illustrated two hypothesis testing situations: for one population mean and for the difference between two
More informationChapter2 Description of samples and populations. 2.1 Introduction.
Chapter2 Description of samples and populations. 2.1 Introduction. Statistics=science of analyzing data. Information collected (data) is gathered in terms of variables (characteristics of a subject that
More informationPLSC PRACTICE TEST ONE
PLSC 724 - PRACTICE TEST ONE 1. Discuss briefly the relationship between the shape of the normal curve and the variance. 2. What is the relationship between a statistic and a parameter? 3. How is the α
More informationChapter 2: Tools for Exploring Univariate Data
Stats 11 (Fall 2004) Lecture Note Introduction to Statistical Methods for Business and Economics Instructor: Hongquan Xu Chapter 2: Tools for Exploring Univariate Data Section 2.1: Introduction What is
More informationMATH Chapter 21 Notes Two Sample Problems
MATH 1070 - Chapter 21 Notes Two Sample Problems Recall: So far, we have dealt with inference (confidence intervals and hypothesis testing) pertaining to: Single sample of data. A matched pairs design
More information1 Introduction to One-way ANOVA
Review Source: Chapter 10 - Analysis of Variance (ANOVA). Example Data Source: Example problem 10.1 (dataset: exp10-1.mtw) Link to Data: http://www.auburn.edu/~carpedm/courses/stat3610/textbookdata/minitab/
More informationMBA 605, Business Analytics Donald D. Conant, Ph.D. Master of Business Administration
t-distribution Summary MBA 605, Business Analytics Donald D. Conant, Ph.D. Types of t-tests There are several types of t-test. In this course we discuss three. The single-sample t-test The two-sample t-test
More informationIndependent Samples ANOVA
Independent Samples ANOVA In this example students were randomly assigned to one of three mnemonics (techniques for improving memory) rehearsal (the control group; simply repeat the words), visual imagery
More informationTwo-Sample Inferential Statistics
The t Test for Two Independent Samples 1 Two-Sample Inferential Statistics In an experiment there are two or more conditions One condition is often called the control condition in which the treatment is
More informationPrentice Hall Stats: Modeling the World 2004 (Bock) Correlated to: National Advanced Placement (AP) Statistics Course Outline (Grades 9-12)
National Advanced Placement (AP) Statistics Course Outline (Grades 9-12) Following is an outline of the major topics covered by the AP Statistics Examination. The ordering here is intended to define the
More informationSampling Distributions: Central Limit Theorem
Review for Exam 2 Sampling Distributions: Central Limit Theorem Conceptually, we can break up the theorem into three parts: 1. The mean (µ M ) of a population of sample means (M) is equal to the mean (µ)
More informationSampling distribution of t. 2. Sampling distribution of t. 3. Example: Gas mileage investigation. II. Inferential Statistics (8) t =
2. The distribution of t values that would be obtained if a value of t were calculated for each sample mean for all possible random of a given size from a population _ t ratio: (X - µ hyp ) t s x The result
More informationy ˆ i = ˆ " T u i ( i th fitted value or i th fit)
1 2 INFERENCE FOR MULTIPLE LINEAR REGRESSION Recall Terminology: p predictors x 1, x 2,, x p Some might be indicator variables for categorical variables) k-1 non-constant terms u 1, u 2,, u k-1 Each u
More informationChapter 4. Displaying and Summarizing. Quantitative Data
STAT 141 Introduction to Statistics Chapter 4 Displaying and Summarizing Quantitative Data Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter 2015 1 / 31 4.1 Histograms 1 We divide the range
More information13: Additional ANOVA Topics
13: Additional ANOVA Topics Post hoc comparisons Least squared difference The multiple comparisons problem Bonferroni ANOVA assumptions Assessing equal variance When assumptions are severely violated Kruskal-Wallis
More informationChapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence Section 8.3 The Practice of Statistics, 4 th edition For AP* STARNES, YATES, MOORE The One-Sample z Interval for a Population Mean In Section 8.1, we estimated the
More informationSection 9.5. Testing the Difference Between Two Variances. Bluman, Chapter 9 1
Section 9.5 Testing the Difference Between Two Variances Bluman, Chapter 9 1 This the last day the class meets before spring break starts. Please make sure to be present for the test or make appropriate
More informationInference for the Regression Coefficient
Inference for the Regression Coefficient Recall, b 0 and b 1 are the estimates of the slope β 1 and intercept β 0 of population regression line. We can shows that b 0 and b 1 are the unbiased estimates
More informationElementary Statistics
Elementary Statistics Q: What is data? Q: What does the data look like? Q: What conclusions can we draw from the data? Q: Where is the middle of the data? Q: Why is the spread of the data important? Q:
More informationGlossary. The ISI glossary of statistical terms provides definitions in a number of different languages:
Glossary The ISI glossary of statistical terms provides definitions in a number of different languages: http://isi.cbs.nl/glossary/index.htm Adjusted r 2 Adjusted R squared measures the proportion of the
More informationInference for Distributions Inference for the Mean of a Population
Inference for Distributions Inference for the Mean of a Population PBS Chapter 7.1 009 W.H Freeman and Company Objectives (PBS Chapter 7.1) Inference for the mean of a population The t distributions The
More informationStatistical Inference: Estimation and Confidence Intervals Hypothesis Testing
Statistical Inference: Estimation and Confidence Intervals Hypothesis Testing 1 In most statistics problems, we assume that the data have been generated from some unknown probability distribution. We desire
More informationStatistics Handbook. All statistical tables were computed by the author.
Statistics Handbook Contents Page Wilcoxon rank-sum test (Mann-Whitney equivalent) Wilcoxon matched-pairs test 3 Normal Distribution 4 Z-test Related samples t-test 5 Unrelated samples t-test 6 Variance
More informationInference for Regression Inference about the Regression Model and Using the Regression Line
Inference for Regression Inference about the Regression Model and Using the Regression Line PBS Chapter 10.1 and 10.2 2009 W.H. Freeman and Company Objectives (PBS Chapter 10.1 and 10.2) Inference about
More information7.2 One-Sample Correlation ( = a) Introduction. Correlation analysis measures the strength and direction of association between
7.2 One-Sample Correlation ( = a) Introduction Correlation analysis measures the strength and direction of association between variables. In this chapter we will test whether the population correlation
More informationWe need to define some concepts that are used in experiments.
Chapter 0 Analysis of Variance (a.k.a. Designing and Analysing Experiments) Section 0. Introduction In Chapter we mentioned some different ways in which we could get data: Surveys, Observational Studies,
More informationUNIT 12 ~ More About Regression
***SECTION 15.1*** The Regression Model When a scatterplot shows a relationship between a variable x and a y, we can use the fitted to the data to predict y for a given value of x. Now we want to do tests
More informationCh Inference for Linear Regression
Ch. 12-1 Inference for Linear Regression ACT = 6.71 + 5.17(GPA) For every increase of 1 in GPA, we predict the ACT score to increase by 5.17. population regression line β (true slope) μ y = α + βx mean
More informationThe Difference in Proportions Test
Overview The Difference in Proportions Test Dr Tom Ilvento Department of Food and Resource Economics A Difference of Proportions test is based on large sample only Same strategy as for the mean We calculate
More informationIntroduction to Econometrics. Review of Probability & Statistics
1 Introduction to Econometrics Review of Probability & Statistics Peerapat Wongchaiwat, Ph.D. wongchaiwat@hotmail.com Introduction 2 What is Econometrics? Econometrics consists of the application of mathematical
More information15: Regression. Introduction
15: Regression Introduction Regression Model Inference About the Slope Introduction As with correlation, regression is used to analyze the relation between two continuous (scale) variables. However, regression
More informationHypothesis testing: Steps
Review for Exam 2 Hypothesis testing: Steps Repeated-Measures ANOVA 1. Determine appropriate test and hypotheses 2. Use distribution table to find critical statistic value(s) representing rejection region
More informationChapter 7. Inference for Distributions. Introduction to the Practice of STATISTICS SEVENTH. Moore / McCabe / Craig. Lecture Presentation Slides
Chapter 7 Inference for Distributions Introduction to the Practice of STATISTICS SEVENTH EDITION Moore / McCabe / Craig Lecture Presentation Slides Chapter 7 Inference for Distributions 7.1 Inference for
More information" M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2
Notation and Equations for Final Exam Symbol Definition X The variable we measure in a scientific study n The size of the sample N The size of the population M The mean of the sample µ The mean of the
More informationThe entire data set consists of n = 32 widgets, 8 of which were made from each of q = 4 different materials.
One-Way ANOVA Summary The One-Way ANOVA procedure is designed to construct a statistical model describing the impact of a single categorical factor X on a dependent variable Y. Tests are run to determine
More informationCh. 7: Estimates and Sample Sizes
Ch. 7: Estimates and Sample Sizes Section Title Notes Pages Introduction to the Chapter 2 2 Estimating p in the Binomial Distribution 2 5 3 Estimating a Population Mean: Sigma Known 6 9 4 Estimating a
More informationLecture 3B: Chapter 4, Section 2 Quantitative Variables (Displays, Begin Summaries)
Lecture 3B: Chapter 4, Section 2 Quantitative Variables (Displays, Begin Summaries) Summarize with Shape, Center, Spread Displays: Stemplots, Histograms Five Number Summary, Outliers, Boxplots Mean vs.
More informationReview: General Approach to Hypothesis Testing. 1. Define the research question and formulate the appropriate null and alternative hypotheses.
1 Review: Let X 1, X,..., X n denote n independent random variables sampled from some distribution might not be normal!) with mean µ) and standard deviation σ). Then X µ σ n In other words, X is approximately
More information2 and F Distributions. Barrow, Statistics for Economics, Accounting and Business Studies, 4 th edition Pearson Education Limited 2006
and F Distributions Lecture 9 Distribution The distribution is used to: construct confidence intervals for a variance compare a set of actual frequencies with expected frequencies test for association
More informationInferences About the Difference Between Two Means
7 Inferences About the Difference Between Two Means Chapter Outline 7.1 New Concepts 7.1.1 Independent Versus Dependent Samples 7.1. Hypotheses 7. Inferences About Two Independent Means 7..1 Independent
More informationConfidence Intervals, Testing and ANOVA Summary
Confidence Intervals, Testing and ANOVA Summary 1 One Sample Tests 1.1 One Sample z test: Mean (σ known) Let X 1,, X n a r.s. from N(µ, σ) or n > 30. Let The test statistic is H 0 : µ = µ 0. z = x µ 0
More informationSTAT 328 (Statistical Packages)
Department of Statistics and Operations Research College of Science King Saud University Exercises STAT 328 (Statistical Packages) nashmiah r.alshammari ^-^ Excel and Minitab - 1 - Write the commands of
More information4.1 Hypothesis Testing
4.1 Hypothesis Testing z-test for a single value double-sided and single-sided z-test for one average z-test for two averages double-sided and single-sided t-test for one average the F-parameter and F-table
More informationChapter 24. Comparing Means. Copyright 2010 Pearson Education, Inc.
Chapter 24 Comparing Means Copyright 2010 Pearson Education, Inc. Plot the Data The natural display for comparing two groups is boxplots of the data for the two groups, placed side-by-side. For example:
More informationHypothesis testing: Steps
Review for Exam 2 Hypothesis testing: Steps Exam 2 Review 1. Determine appropriate test and hypotheses 2. Use distribution table to find critical statistic value(s) representing rejection region 3. Compute
More information