HYPOTHESIS TESTING: SINGLE MEAN, NORMAL DISTRIBUTION (Z-TEST)

Size: px
Start display at page:

Download "HYPOTHESIS TESTING: SINGLE MEAN, NORMAL DISTRIBUTION (Z-TEST)"

Transcription

1 HYPOTHESIS TESTING: SINGLE MEAN, NORMAL DISTRIBUTION (Z-TEST) In Binomial Hypothesis Testing researchers generally ignore the actual numbers that are obtained on their measure. The Binomial Test for whether UofW students have above average IQs in the last chapter, for example, simply compared the observed IQ for each of the 9 students to 100 and used the binomial outcome of above 100 or not above 100 as the data to be analyzed. The actual IQs of the 9 students were not analyzed further. A superior statistical test would use the numerical IQs of the 9 students to test whether the IQs of UofW students are above average; that is, to test whether = 100. One obvious possibility would be to use (i.e., the sample mean) to test whether it is likely that the observed sample came from a population with mean of 100. In order to carry out such a test, however, it would be necessary to have a probability distribution for, so that we could decide whether the observed outcome was unlikely enough for us to reject the hypothesis that = 100. Recall that for the binomial test it was possible to determine the probability associated with different numbers of successes using the binomial theorem, the binomial table, or the normal approximation. THE CENTRAL LIMIT THEOREM Fortunately much is known about the probability distribution of means drawn from a sample. Specifically, the Central Limit Theorem (CLT) states that for many (all possible) samples of size n drawn from a population, the probability distribution of the means: (a) will tend towards a normal distribution, (b) will have a mean equal to (i.e., equal to the population mean from which the samples were drawn, and (c) will have a standard deviation equal to /n (i.e., equal to the standard deviation of the original population divided by the square root of n, the sample size). Let s start with a small population of just three scores (i.e., x = 1, 2, and 3) from which we select all possible samples of two observations (i.e., n = 2). There are 3 x 3 = 9 possible samples when we sample with replacement; the samples are: 11, 12, 13, 21, 22, 23, 31, 32, 33. The 9 sample means are: = 1.0, 1.5, 2.0, 1.5, 2.0, 2.5, 2.0, 2.5, and 3.0. Now let us see how the population statistics compare to one another. According to the CLT, = x, and = x /n. The relevant calculations are shown below; note that because we are calculation variances for the population, we use formulas for population statistics. If we did calculate using sample formulas, we would divide SS by n and not n - 1. Population of xs Population of s x p(x) p() x 2.0 = 1x x = 1x x x.8165 = (1-2)x =(1-2)x.1111+(1.5-2)x =.8165/2

2 Consider now an initial population with = 100 and = 15 from which we select n = 9 observations and calculate. That gives us one mean and one standard deviation. Now we repeatedly select samples of 9 observations and calculate for each sample. We repeat this until we have a very large number of samples, each with an. What would the distribution of be like? The CLT states that the means will have a = = x and = 15/3 = 5.0 = x. We can use SPSS to demonstrate this more concretely and to confirm the validity of the CLT. SPSS was used to generate 50,000 samples of 9 observations and calculate. The first 10 samples and s are shown below. X1 X2 X3 X4 X5 X6 X7 X8 X9 Mean () Imagine another 49,990 rows like these, giving a column of 50,000 sample means. What would the probability distribution of those means look like? According to the CLT, it will be approximately normal, with a mean of 100 and a standard deviation of 5. The following histogram shows the frequency distribution (i.e., probability distribution) of the 50,000 means. Note that the mean of the 50,000 means is very close to 100, as predicted by the CLT, and that the standard deviation of the 50,000 means is very close to 5.0, again as predicted by the CLT. Also observe that the distribution is quite normal in shape. The dark line along the borders of the distribution is in fact a fit of the normal curve to the observed data. It is a very good fit. If sample means are normally distributed and we know the mean and standard deviation of the s, then it is possible to use the table for the normal distribution (i.e., the distribution of z) to calculate the probability that a certain range of sample means would occur given the hypothesized value for was correct. This is just what is needed to test hypotheses about a population mean. But in using the normal distribution to test hypotheses about a mean, it is important to remember that it is the probability distribution of the sample means that would be used. Probability distributions for sample statistics are also known as sampling distributions.

3 USING Z TO TEST HYPOTHESES ABOUT A POPULATION MEAN Now that we know what the probability distribution (or sampling distribution) for sample means is like, we can use that information to determine when we should reject or not reject some hypothesis about a population mean. The procedure can be illustrated using the question of UofW students s IQs. We start with our null and alternative hypotheses. The null hypothesis is that student IQs have a population mean of 100; that is, UofW students do not differ from the general population. If our research hypothesis is that UofW students have a higher IQ than the general population, then our alternative hypothesis would be that shown below (i.e., population mean is greater than 100). Ho: = 100 Ha: > 100 If the Ho is true, then it is possible to determine the sampling distribution of using the CLT. The population of individual Xs from which our samples are drawn has X = 100 (if H0 true) and X = 15 (because that is the published SD for an IQ test). I have put X as a subscript to make clear that these are the mean and standard deviation of the individual Xs in the population from which we are sampling. According to the CLT, our sample means for n = 9 will be normally distributed, with = X = 100 and = X /n = 15/3 = 5. Given this information, there are several equivalent approaches that can be used to test the null hypothesis. As illustrated in the figure to the right, the Rejection Region is in the upper end of the z distribution; this is because the alternative hypothesis states the researchers expect > 100. The shaded area is the probability that we reject H0 if H0 is true, that is, the probability of a Type I error. This probability, denoted, is generally set at.05, although other values are possible. If we look for.05 in column C of our table for the normal distribution, we will find that it falls exactly half-way between z = 1.64 and z = Hence, we use z = 1.645, as shown in the above figure. Now we can calculate how large our sample must be before the H0 will be rejected. The value is given by: Critical = x (15/9) = x 5 = The H0 will be rejected and the Ha accepted for sample means greater than or equal to and the H0 will not be rejected for sample means less than The 9 scores are shown below, along with calculations for the mean. S X X = = 980/9 =

4 The observed value for the sample mean is greater than the critical value (i.e., the value separating the Do not reject and Reject regions), so the H0 is rejected and Ha is accepted. We conclude that the IQ of UofW students is greater than 100. There is a second approach to testing this hypothesis, equivalent to the first but somewhat easier to implement in practice. Instead of calculating a critical value for the Mean, it is possible to use z = as the critical value, calculate a z score for our observed mean, and compare the two to determine whether z Observed is greater than z Critical = z Observed = ( ) / (15 / 9) = 8.89 / 5 = 1.78 Since 1.78 is greater than the critical value of 1.645, the H0 is rejected and the Ha accepted, identical to our previous conclusion, since the two approaches are logically equivalent. A third approach would be to find C for z = 1.78 and see if p(z1.78) is less than alpha of.05. Finding z in our table of the normal distribution gives a value of C =.0375, which is less than.05, our critical value for alpha. In summary, the three ways to determine whether to reject H0 or not given =.05 are: 1. z Critical = z Observed = 1.78 > Reject Ho, Accept Ha 2. z Observed = 1.78 p(z1.78) =.0375 < Reject Ho, Accept Ha 3. Critical = x 5 = Observed = > Reject Ho, Accept Ha FORMULA FOR Z-TEST H0: = 0 Ha: > 0 OR Ha: < 0 OR Ha: 0 z Observed = ( - 0 ) / /n Note that must be known to use this z-test PROBLEM # 10 Directional Ho: = 50 Ha: > 50 Sampling Distribution of, = 50, = 10/10 = = 56.7 z Observed = ( ) / 10/10 = 6.7/3.162 = =.05 z Critical = z Observed = 2.12>1.645 Reject Ho, Accept Ha 2. p(z >= 2.12) = <.05 Reject Ho, Accept Ha 3. Critical = x = Observed = 56.7>55.20 Reject Ho, Accept Ha Nondirectional Ho: = 50 Ha: 50 z Observed = /2 =.025 z Critical = 1.96 z Observed = 2.12 > 1.96 Reject Ho, Accept Ha 2. p(z 2.12 or z -2.12) = 2 x.0170 =.0340 <.05 Reject Ho, Accept Ha 3. +/ x = > Reject Ho, Accept Ha

5 HYPOTHESIS TESTING PROBLEMS SINGLE-SAMPLE, NORMAL DISTRIBUTION 1. The word "the" appears an average of 15.6 times per paragraph in the writings of Shakespeare, with a standard deviation of 4.3. A sample of 32 paragraphs from an unidentified manuscript contained an average of 14.3 "the's." Can we conclude with alpha =.06 that the unidentified manuscript contains fewer "the's" than works by Shakespeare? 2. Fiction writers spend an average of 6.2 hours per day working, with a standard deviation of 2.5. A sample of 35 poets found that they spent an average of 7.1 hours per day. Can we conclude with alpha =.06 that poets work longer hours than fiction writers? 3. In a certain sampling distribution of the mean of X based on samples of size 44 each, a sample mean of 438 is known to correspond to a z value of If the standard error of the mean is 9, what are the population mean and standard deviation for X? [tricky question] 4. Standard diet programs are known to result in a mean weight loss over six weeks of 10.8 pounds with standard deviation of 3.4 pounds. The designers of a new program report that 3 people had lost an average of 13.9 pounds after six weeks. Would you recommend the new program over standard diet programs? 5. The mean weight of parcels sent by Courier Express is 12.4 kg with a standard deviation of 4.8 kg. A special rate is given for customers sending over 500 kg. What is the probability that a company sending 40 parcels qualifies for the special rate? [tricky question] 6. The amount of money spent on Hallowe'en for costumes is a normally distributed variable with a mean of and a standard deviation of 3.7. What is the probability that the mean value of 80 costumes is more dollars? 7. A researcher observes that the mean number of trials that unreinforced rats require to learn a maze is 7.5 with standard deviation of A sample of 4 rats rewarded with food pellets learned in an average of 3.2 trials. Does reinforcement improve learning? 8. Archeologists excavating at a Viking site in Newfoundland have determined that the average age of artifacts is normally distributed with a mean age of 1000 years and a SD of 50 years. A new site is located that is thought to be older than those previously studied. A random sample of 24 artifacts had a mean age of 1024 years. What conclusions appear warranted about the new site? 9. The average score (MU) on The Achievement Motivation Test for the general population is 95 with a standard deviation of 8. Social psychologists have developed a program designed to raise achievement motivation. A sample of 38 students participated in a pilot study and obtained a mean achievement motivation score of 98.4 after 7 weeks. Do these findings support the conclusion that the program is effective at alpha =.02? 10. The average for the MMPI-2 Psychopathic Deviant Scale is 50, with a standard deviation of 10. Scores for 10 inmates averaged What conclusions are warranted about the PD scores of inmates?

6 SINGLE-SAMPLE, NORMAL DISTRIBUTION - SOLUTIONS 1. Ho: = 15.6 Ha: < 15.6 z = ( ) / (4.3/32) = -1.3/.760 = a. =.06, z = , < , Rej Ho & Acc Ha b. z = -1.71, p(z-1.71) =.0436 <.06, Rej Ho c. M = x.760 = , 14.3<14.418, Rej Ho Yes, manuscript contains fewer "the's" 2. Ho: = 6.2 Ha: > 6.2 z = ( ) / (2.5/35) = +.90 /.423 = 2.13 a. =.06, z = , 2.13>1.555, Rej Ho & Acc Ha b. z = 2.13, p(z2.13) =.0166 <.06, Rej Ho c. M = x.423 = 6.858, 7.1>6.858, Rej Ho Yes, poets work longer hours than fiction writers 3. n = 44, M = 438, z 438 = -1.75, s M = = (438 - ) / 9 = x9 = = /44 = 9x44 = Ho: = 10.8 Ha: > 10.8 z = ( ) / (3.4/3) = 3.1/1.963 = a. =.05, z = , 1.579<1.645, Do Not Rej Ho b. p(z1.58) =.0571 >.05, Do Not Rej Ho c. M = x1.963 = , 13.9<14.029, DNR Ho Insufficient evidence to conclude new program more effective 5. = 12.4, = 4.8, M = 500/40 = 12.5 z = ( )/(4.8/40) =.1/.759 =.13 p(z.13) =.4483 Tricky part is to appreciate that 500 = X 6. = 15.34, = 3.7 z = ( ) / (3.7/80) =.91/.4137 = 2.20 p(z2.20) = Ho: = 1000 Ha: > 1000 z = ( ) / (50/24) = 24/ = 2.35 a. =.05, z = 1.645, 2.35 > 1.645, Rej Ho & Acc Ha b. p(z2.35) =.0094 <.05, Rej Ho c. M = x = , 1024> , Rej Ho The new site appears to be older than other sites 9. Ho: = 95 Ha: > 95 z = ( ) / (8/38) = 3.4 / = 2.62 a. =.02, z = 2.05, 2.62 > 2.05, Rej Ho & Acc Ha b. p(z2.62) =.0044 <.02, Rej Ho c. M = x = 97.66, 98.4 > 97.66, Rej Ho Yes, the program appears effective at increasing motivation 10. Directional (One-tailed) Ho: = 50 Ha: > 50 If assume inmates higher on Psychopathic Deviance z = ( ) / (10/10) = 6.7 / = 2.12 a. =.05, z = 1.645, 2.12>1.645, Rej Ho & Acc Ha b. p(z2.12) =.0170 <.05, Rej Ho c. M = x3.162 = 55.20, 56.7 > 55.20, Rej Ho Inmates have higher PD scores than general population OR Nondirectional (Two-tailed) Ho: = 50 Ha: =/ 50 Assume inmates could be higher or lower on PD z = ( ) / (10/10) = 6.7 / = 2.12 Same as above a. =.05, /2 =.025, z ±1.96, 2.12>+1.96, Rej Ho & Acc Ha b. p(z<-2.12 or z>+2.12) = =.034 <.05, Rej Ho c. M = 50 ± 1.96x3.162 = , , 56.7 > , Rej Ho 7. Ho: = 7.5 Ha: < 7.5 z = ( ) / (2.25/4) = -4.3 / = a. =.05, z = , < , Rej Ho & Acc Ha b. p(z-3.82) <.0001, Rej Ho c. M = x = 5.649, 3.2<5.649, Rej Ho Reinforcement does appear to improve learning

T-TEST FOR HYPOTHESIS ABOUT

T-TEST FOR HYPOTHESIS ABOUT T-TEST FOR HYPOTHESIS ABOUT Previously we tested the hypothesis that a sample comes from a population with a specified using the normal distribution and a z-test. But the z-test required the population

More information

PSY 305. Module 3. Page Title. Introduction to Hypothesis Testing Z-tests. Five steps in hypothesis testing

PSY 305. Module 3. Page Title. Introduction to Hypothesis Testing Z-tests. Five steps in hypothesis testing Page Title PSY 305 Module 3 Introduction to Hypothesis Testing Z-tests Five steps in hypothesis testing State the research and null hypothesis Determine characteristics of comparison distribution Five

More information

The t-test: A z-score for a sample mean tells us where in the distribution the particular mean lies

The t-test: A z-score for a sample mean tells us where in the distribution the particular mean lies The t-test: So Far: Sampling distribution benefit is that even if the original population is not normal, a sampling distribution based on this population will be normal (for sample size > 30). Benefit

More information

PSY 216. Assignment 9 Answers. Under what circumstances is a t statistic used instead of a z-score for a hypothesis test

PSY 216. Assignment 9 Answers. Under what circumstances is a t statistic used instead of a z-score for a hypothesis test PSY 216 Assignment 9 Answers 1. Problem 1 from the text Under what circumstances is a t statistic used instead of a z-score for a hypothesis test The t statistic should be used when the population standard

More information

HYPOTHESIS TESTING. Hypothesis Testing

HYPOTHESIS TESTING. Hypothesis Testing MBA 605 Business Analytics Don Conant, PhD. HYPOTHESIS TESTING Hypothesis testing involves making inferences about the nature of the population on the basis of observations of a sample drawn from the population.

More information

1 Descriptive statistics. 2 Scores and probability distributions. 3 Hypothesis testing and one-sample t-test. 4 More on t-tests

1 Descriptive statistics. 2 Scores and probability distributions. 3 Hypothesis testing and one-sample t-test. 4 More on t-tests Overall Overview INFOWO Statistics lecture S3: Hypothesis testing Peter de Waal Department of Information and Computing Sciences Faculty of Science, Universiteit Utrecht 1 Descriptive statistics 2 Scores

More information

16.400/453J Human Factors Engineering. Design of Experiments II

16.400/453J Human Factors Engineering. Design of Experiments II J Human Factors Engineering Design of Experiments II Review Experiment Design and Descriptive Statistics Research question, independent and dependent variables, histograms, box plots, etc. Inferential

More information

AMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015

AMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015 AMS7: WEEK 7. CLASS 1 More on Hypothesis Testing Monday May 11th, 2015 Testing a Claim about a Standard Deviation or a Variance We want to test claims about or 2 Example: Newborn babies from mothers taking

More information

Sample Size and Power I: Binary Outcomes. James Ware, PhD Harvard School of Public Health Boston, MA

Sample Size and Power I: Binary Outcomes. James Ware, PhD Harvard School of Public Health Boston, MA Sample Size and Power I: Binary Outcomes James Ware, PhD Harvard School of Public Health Boston, MA Sample Size and Power Principles: Sample size calculations are an essential part of study design Consider

More information

Chapter 9 Inferences from Two Samples

Chapter 9 Inferences from Two Samples Chapter 9 Inferences from Two Samples 9-1 Review and Preview 9-2 Two Proportions 9-3 Two Means: Independent Samples 9-4 Two Dependent Samples (Matched Pairs) 9-5 Two Variances or Standard Deviations Review

More information

Sampling Distributions: Central Limit Theorem

Sampling Distributions: Central Limit Theorem Review for Exam 2 Sampling Distributions: Central Limit Theorem Conceptually, we can break up the theorem into three parts: 1. The mean (µ M ) of a population of sample means (M) is equal to the mean (µ)

More information

CENTRAL LIMIT THEOREM (CLT)

CENTRAL LIMIT THEOREM (CLT) CENTRAL LIMIT THEOREM (CLT) A sampling distribution is the probability distribution of the sample statistic that is formed when samples of size n are repeatedly taken from a population. If the sample statistic

More information

Class 24. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Class 24. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700 Class 4 Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science Copyright 013 by D.B. Rowe 1 Agenda: Recap Chapter 9. and 9.3 Lecture Chapter 10.1-10.3 Review Exam 6 Problem Solving

More information

Chapter 2 Descriptive Statistics

Chapter 2 Descriptive Statistics Chapter 2 Descriptive Statistics The Mean "When she told me I was average, she was just being mean". The mean is probably the most often used parameter or statistic used to describe the central tendency

More information

Introduction to Business Statistics QM 220 Chapter 12

Introduction to Business Statistics QM 220 Chapter 12 Department of Quantitative Methods & Information Systems Introduction to Business Statistics QM 220 Chapter 12 Dr. Mohammad Zainal 12.1 The F distribution We already covered this topic in Ch. 10 QM-220,

More information

An inferential procedure to use sample data to understand a population Procedures

An inferential procedure to use sample data to understand a population Procedures Hypothesis Test An inferential procedure to use sample data to understand a population Procedures Hypotheses, the alpha value, the critical region (z-scores), statistics, conclusion Two types of errors

More information

Statistical Inference. Why Use Statistical Inference. Point Estimates. Point Estimates. Greg C Elvers

Statistical Inference. Why Use Statistical Inference. Point Estimates. Point Estimates. Greg C Elvers Statistical Inference Greg C Elvers 1 Why Use Statistical Inference Whenever we collect data, we want our results to be true for the entire population and not just the sample that we used But our sample

More information

MORE ON SIMPLE REGRESSION: OVERVIEW

MORE ON SIMPLE REGRESSION: OVERVIEW FI=NOT0106 NOTICE. Unless otherwise indicated, all materials on this page and linked pages at the blue.temple.edu address and at the astro.temple.edu address are the sole property of Ralph B. Taylor and

More information

COSC 341 Human Computer Interaction. Dr. Bowen Hui University of British Columbia Okanagan

COSC 341 Human Computer Interaction. Dr. Bowen Hui University of British Columbia Okanagan COSC 341 Human Computer Interaction Dr. Bowen Hui University of British Columbia Okanagan 1 Last Class Introduced hypothesis testing Core logic behind it Determining results significance in scenario when:

More information

PHP2510: Principles of Biostatistics & Data Analysis. Lecture X: Hypothesis testing. PHP 2510 Lec 10: Hypothesis testing 1

PHP2510: Principles of Biostatistics & Data Analysis. Lecture X: Hypothesis testing. PHP 2510 Lec 10: Hypothesis testing 1 PHP2510: Principles of Biostatistics & Data Analysis Lecture X: Hypothesis testing PHP 2510 Lec 10: Hypothesis testing 1 In previous lectures we have encountered problems of estimating an unknown population

More information

The t-statistic. Student s t Test

The t-statistic. Student s t Test The t-statistic 1 Student s t Test When the population standard deviation is not known, you cannot use a z score hypothesis test Use Student s t test instead Student s t, or t test is, conceptually, very

More information

The Central Limit Theorem

The Central Limit Theorem - The Central Limit Theorem Definition Sampling Distribution of the Mean the probability distribution of sample means, with all samples having the same sample size n. (In general, the sampling distribution

More information

Last week: Sample, population and sampling distributions finished with estimation & confidence intervals

Last week: Sample, population and sampling distributions finished with estimation & confidence intervals Past weeks: Measures of central tendency (mean, mode, median) Measures of dispersion (standard deviation, variance, range, etc). Working with the normal curve Last week: Sample, population and sampling

More information

Two-Sample Inferential Statistics

Two-Sample Inferential Statistics The t Test for Two Independent Samples 1 Two-Sample Inferential Statistics In an experiment there are two or more conditions One condition is often called the control condition in which the treatment is

More information

How do we compare the relative performance among competing models?

How do we compare the relative performance among competing models? How do we compare the relative performance among competing models? 1 Comparing Data Mining Methods Frequent problem: we want to know which of the two learning techniques is better How to reliably say Model

More information

PhysicsAndMathsTutor.com

PhysicsAndMathsTutor.com 1. An effect of a certain disease is that a small number of the red blood cells are deformed. Emily has this disease and the deformed blood cells occur randomly at a rate of 2.5 per ml of her blood. Following

More information

Advanced Experimental Design

Advanced Experimental Design Advanced Experimental Design Topic Four Hypothesis testing (z and t tests) & Power Agenda Hypothesis testing Sampling distributions/central limit theorem z test (σ known) One sample z & Confidence intervals

More information

Last two weeks: Sample, population and sampling distributions finished with estimation & confidence intervals

Last two weeks: Sample, population and sampling distributions finished with estimation & confidence intervals Past weeks: Measures of central tendency (mean, mode, median) Measures of dispersion (standard deviation, variance, range, etc). Working with the normal curve Last two weeks: Sample, population and sampling

More information

Lab #12: Exam 3 Review Key

Lab #12: Exam 3 Review Key Psychological Statistics Practice Lab#1 Dr. M. Plonsky Page 1 of 7 Lab #1: Exam 3 Review Key 1) a. Probability - Refers to the likelihood that an event will occur. Ranges from 0 to 1. b. Sampling Distribution

More information

CBA4 is live in practice mode this week exam mode from Saturday!

CBA4 is live in practice mode this week exam mode from Saturday! Announcements CBA4 is live in practice mode this week exam mode from Saturday! Material covered: Confidence intervals (both cases) 1 sample hypothesis tests (both cases) Hypothesis tests for 2 means as

More information

" M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2

 M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2 Notation and Equations for Final Exam Symbol Definition X The variable we measure in a scientific study n The size of the sample N The size of the population M The mean of the sample µ The mean of the

More information

(a) The density histogram above right represents a particular sample of n = 40 practice shots. Answer each of the following. Show all work.

(a) The density histogram above right represents a particular sample of n = 40 practice shots. Answer each of the following. Show all work. . Target Practice. An archer is practicing hitting the bull s-eye of the target shown below left. For any point on the target, define the continuous random variable D = (signed) radial distance to the

More information

Sampling Distributions

Sampling Distributions Sampling Distributions Sampling Distribution of the Mean & Hypothesis Testing Remember sampling? Sampling Part 1 of definition Selecting a subset of the population to create a sample Generally random sampling

More information

Introduction to Analysis of Variance. Chapter 11

Introduction to Analysis of Variance. Chapter 11 Introduction to Analysis of Variance Chapter 11 Review t-tests Single-sample t-test Independent samples t-test Related or paired-samples t-test s m M t ) ( 1 1 ) ( m m s M M t M D D D s M t n s s M 1 )

More information

Elementary Statistics Triola, Elementary Statistics 11/e Unit 17 The Basics of Hypotheses Testing

Elementary Statistics Triola, Elementary Statistics 11/e Unit 17 The Basics of Hypotheses Testing (Section 8-2) Hypotheses testing is not all that different from confidence intervals, so let s do a quick review of the theory behind the latter. If it s our goal to estimate the mean of a population,

More information

t-test for b Copyright 2000 Tom Malloy. All rights reserved. Regression

t-test for b Copyright 2000 Tom Malloy. All rights reserved. Regression t-test for b Copyright 2000 Tom Malloy. All rights reserved. Regression Recall, back some time ago, we used a descriptive statistic which allowed us to draw the best fit line through a scatter plot. We

More information

SAMPLE QUESTIONS. Research Methods II - HCS 6313

SAMPLE QUESTIONS. Research Methods II - HCS 6313 SAMPLE QUESTIONS Research Methods II - HCS 6313 This is a (small) set of sample questions. Please, note that the exam comprises more questions that this sample. Social Security Number: NAME: IMPORTANT

More information

Econ 325: Introduction to Empirical Economics

Econ 325: Introduction to Empirical Economics Econ 325: Introduction to Empirical Economics Chapter 9 Hypothesis Testing: Single Population Ch. 9-1 9.1 What is a Hypothesis? A hypothesis is a claim (assumption) about a population parameter: population

More information

Nonparametric tests. Mark Muldoon School of Mathematics, University of Manchester. Mark Muldoon, November 8, 2005 Nonparametric tests - p.

Nonparametric tests. Mark Muldoon School of Mathematics, University of Manchester. Mark Muldoon, November 8, 2005 Nonparametric tests - p. Nonparametric s Mark Muldoon School of Mathematics, University of Manchester Mark Muldoon, November 8, 2005 Nonparametric s - p. 1/31 Overview The sign, motivation The Mann-Whitney Larger Larger, in pictures

More information

Independent Samples ANOVA

Independent Samples ANOVA Independent Samples ANOVA In this example students were randomly assigned to one of three mnemonics (techniques for improving memory) rehearsal (the control group; simply repeat the words), visual imagery

More information

Estimating the accuracy of a hypothesis Setting. Assume a binary classification setting

Estimating the accuracy of a hypothesis Setting. Assume a binary classification setting Estimating the accuracy of a hypothesis Setting Assume a binary classification setting Assume input/output pairs (x, y) are sampled from an unknown probability distribution D = p(x, y) Train a binary classifier

More information

Unit 27 One-Way Analysis of Variance

Unit 27 One-Way Analysis of Variance Unit 27 One-Way Analysis of Variance Objectives: To perform the hypothesis test in a one-way analysis of variance for comparing more than two population means Recall that a two sample t test is applied

More information

Inference for Distributions Inference for the Mean of a Population

Inference for Distributions Inference for the Mean of a Population Inference for Distributions Inference for the Mean of a Population PBS Chapter 7.1 009 W.H Freeman and Company Objectives (PBS Chapter 7.1) Inference for the mean of a population The t distributions The

More information

Hypothesis Testing and Confidence Intervals (Part 2): Cohen s d, Logic of Testing, and Confidence Intervals

Hypothesis Testing and Confidence Intervals (Part 2): Cohen s d, Logic of Testing, and Confidence Intervals Hypothesis Testing and Confidence Intervals (Part 2): Cohen s d, Logic of Testing, and Confidence Intervals Lecture 9 Justin Kern April 9, 2018 Measuring Effect Size: Cohen s d Simply finding whether a

More information

Math 101: Elementary Statistics Tests of Hypothesis

Math 101: Elementary Statistics Tests of Hypothesis Tests of Hypothesis Department of Mathematics and Computer Science University of the Philippines Baguio November 15, 2018 Basic Concepts of Statistical Hypothesis Testing A statistical hypothesis is an

More information

Math 2000 Practice Final Exam: Homework problems to review. Problem numbers

Math 2000 Practice Final Exam: Homework problems to review. Problem numbers Math 2000 Practice Final Exam: Homework problems to review Pages: Problem numbers 52 20 65 1 181 14 189 23, 30 245 56 256 13 280 4, 15 301 21 315 18 379 14 388 13 441 13 450 10 461 1 553 13, 16 561 13,

More information

CS 361: Probability & Statistics

CS 361: Probability & Statistics February 19, 2018 CS 361: Probability & Statistics Random variables Markov s inequality This theorem says that for any random variable X and any value a, we have A random variable is unlikely to have an

More information

Comparing Means from Two-Sample

Comparing Means from Two-Sample Comparing Means from Two-Sample Kwonsang Lee University of Pennsylvania kwonlee@wharton.upenn.edu April 3, 2015 Kwonsang Lee STAT111 April 3, 2015 1 / 22 Inference from One-Sample We have two options to

More information

Statistics Primer. ORC Staff: Jayme Palka Peter Boedeker Marcus Fagan Trey Dejong

Statistics Primer. ORC Staff: Jayme Palka Peter Boedeker Marcus Fagan Trey Dejong Statistics Primer ORC Staff: Jayme Palka Peter Boedeker Marcus Fagan Trey Dejong 1 Quick Overview of Statistics 2 Descriptive vs. Inferential Statistics Descriptive Statistics: summarize and describe data

More information

Single Sample Means. SOCY601 Alan Neustadtl

Single Sample Means. SOCY601 Alan Neustadtl Single Sample Means SOCY601 Alan Neustadtl The Central Limit Theorem If we have a population measured by a variable with a mean µ and a standard deviation σ, and if all possible random samples of size

More information

The One-Way Repeated-Measures ANOVA. (For Within-Subjects Designs)

The One-Way Repeated-Measures ANOVA. (For Within-Subjects Designs) The One-Way Repeated-Measures ANOVA (For Within-Subjects Designs) Logic of the Repeated-Measures ANOVA The repeated-measures ANOVA extends the analysis of variance to research situations using repeated-measures

More information

SL - Binomial Questions

SL - Binomial Questions IB Questionbank Maths SL SL - Binomial Questions 262 min 244 marks 1. A random variable X is distributed normally with mean 450 and standard deviation 20. Find P(X 475). Given that P(X > a) = 0.27, find

More information

Difference in two or more average scores in different groups

Difference in two or more average scores in different groups ANOVAs Analysis of Variance (ANOVA) Difference in two or more average scores in different groups Each participant tested once Same outcome tested in each group Simplest is one-way ANOVA (one variable as

More information

Student s t-distribution. The t-distribution, t-tests, & Measures of Effect Size

Student s t-distribution. The t-distribution, t-tests, & Measures of Effect Size Student s t-distribution The t-distribution, t-tests, & Measures of Effect Size Sampling Distributions Redux Chapter 7 opens with a return to the concept of sampling distributions from chapter 4 Sampling

More information

In a one-way ANOVA, the total sums of squares among observations is partitioned into two components: Sums of squares represent:

In a one-way ANOVA, the total sums of squares among observations is partitioned into two components: Sums of squares represent: Activity #10: AxS ANOVA (Repeated subjects design) Resources: optimism.sav So far in MATH 300 and 301, we have studied the following hypothesis testing procedures: 1) Binomial test, sign-test, Fisher s

More information

Simple Linear Regression: One Quantitative IV

Simple Linear Regression: One Quantitative IV Simple Linear Regression: One Quantitative IV Linear regression is frequently used to explain variation observed in a dependent variable (DV) with theoretically linked independent variables (IV). For example,

More information

Analysis of Variance (ANOVA)

Analysis of Variance (ANOVA) Analysis of Variance (ANOVA) Two types of ANOVA tests: Independent measures and Repeated measures Comparing 2 means: X 1 = 20 t - test X 2 = 30 How can we Compare 3 means?: X 1 = 20 X 2 = 30 X 3 = 35 ANOVA

More information

Non-parametric Statistics

Non-parametric Statistics 45 Contents Non-parametric Statistics 45.1 Non-parametric Tests for a Single Sample 45. Non-parametric Tests for Two Samples 4 Learning outcomes You will learn about some significance tests which may be

More information

Mark Scheme (Results) Summer 2007

Mark Scheme (Results) Summer 2007 Mark Scheme (Results) Summer 007 GCE GCE Mathematics Statistics S3 (6691) Edexcel Limited. Registered in England and Wales No. 4496750 Registered Office: One90 High Holborn, London WC1V 7BH June 007 6691

More information

PSYC 331 STATISTICS FOR PSYCHOLOGISTS

PSYC 331 STATISTICS FOR PSYCHOLOGISTS PSYC 331 STATISTICS FOR PSYCHOLOGISTS Session 4 A PARAMETRIC STATISTICAL TEST FOR MORE THAN TWO POPULATIONS Lecturer: Dr. Paul Narh Doku, Dept of Psychology, UG Contact Information: pndoku@ug.edu.gh College

More information

Descriptive Statistics-I. Dr Mahmoud Alhussami

Descriptive Statistics-I. Dr Mahmoud Alhussami Descriptive Statistics-I Dr Mahmoud Alhussami Biostatistics What is the biostatistics? A branch of applied math. that deals with collecting, organizing and interpreting data using well-defined procedures.

More information

Population Variance. Concepts from previous lectures. HUMBEHV 3HB3 one-sample t-tests. Week 8

Population Variance. Concepts from previous lectures. HUMBEHV 3HB3 one-sample t-tests. Week 8 Concepts from previous lectures HUMBEHV 3HB3 one-sample t-tests Week 8 Prof. Patrick Bennett sampling distributions - sampling error - standard error of the mean - degrees-of-freedom Null and alternative/research

More information

Study Ch. 9.3, #47 53 (45 51), 55 61, (55 59)

Study Ch. 9.3, #47 53 (45 51), 55 61, (55 59) GOALS: 1. Understand that 2 approaches of hypothesis testing exist: classical or critical value, and p value. We will use the p value approach. 2. Understand the critical value for the classical approach

More information

LECTURE 12 CONFIDENCE INTERVAL AND HYPOTHESIS TESTING

LECTURE 12 CONFIDENCE INTERVAL AND HYPOTHESIS TESTING LECTURE 1 CONFIDENCE INTERVAL AND HYPOTHESIS TESTING INTERVAL ESTIMATION Point estimation of : The inference is a guess of a single value as the value of. No accuracy associated with it. Interval estimation

More information

Multiple Regression. More Hypothesis Testing. More Hypothesis Testing The big question: What we really want to know: What we actually know: We know:

Multiple Regression. More Hypothesis Testing. More Hypothesis Testing The big question: What we really want to know: What we actually know: We know: Multiple Regression Ψ320 Ainsworth More Hypothesis Testing What we really want to know: Is the relationship in the population we have selected between X & Y strong enough that we can use the relationship

More information

Lecture 06. DSUR CH 05 Exploring Assumptions of parametric statistics Hypothesis Testing Power

Lecture 06. DSUR CH 05 Exploring Assumptions of parametric statistics Hypothesis Testing Power Lecture 06 DSUR CH 05 Exploring Assumptions of parametric statistics Hypothesis Testing Power Introduction Assumptions When broken then we are not able to make inference or accurate descriptions about

More information

Lecture 9 Two-Sample Test. Fall 2013 Prof. Yao Xie, H. Milton Stewart School of Industrial Systems & Engineering Georgia Tech

Lecture 9 Two-Sample Test. Fall 2013 Prof. Yao Xie, H. Milton Stewart School of Industrial Systems & Engineering Georgia Tech Lecture 9 Two-Sample Test Fall 2013 Prof. Yao Xie, yao.xie@isye.gatech.edu H. Milton Stewart School of Industrial Systems & Engineering Georgia Tech Computer exam 1 18 Histogram 14 Frequency 9 5 0 75 83.33333333

More information

COSC 341 Human Computer Interaction. Dr. Bowen Hui University of British Columbia Okanagan

COSC 341 Human Computer Interaction. Dr. Bowen Hui University of British Columbia Okanagan COSC 341 Human Computer Interaction Dr. Bowen Hui University of British Columbia Okanagan 1 Last Topic Distribution of means When it is needed How to build one (from scratch) Determining the characteristics

More information

Statistics for Managers Using Microsoft Excel/SPSS Chapter 8 Fundamentals of Hypothesis Testing: One-Sample Tests

Statistics for Managers Using Microsoft Excel/SPSS Chapter 8 Fundamentals of Hypothesis Testing: One-Sample Tests Statistics for Managers Using Microsoft Excel/SPSS Chapter 8 Fundamentals of Hypothesis Testing: One-Sample Tests 1999 Prentice-Hall, Inc. Chap. 8-1 Chapter Topics Hypothesis Testing Methodology Z Test

More information

CIVL /8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8

CIVL /8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8 CIVL - 7904/8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8 Chi-square Test How to determine the interval from a continuous distribution I = Range 1 + 3.322(logN) I-> Range of the class interval

More information

10/31/2012. One-Way ANOVA F-test

10/31/2012. One-Way ANOVA F-test PSY 511: Advanced Statistics for Psychological and Behavioral Research 1 1. Situation/hypotheses 2. Test statistic 3.Distribution 4. Assumptions One-Way ANOVA F-test One factor J>2 independent samples

More information

Samples and Populations Confidence Intervals Hypotheses One-sided vs. two-sided Statistical Significance Error Types. Statistiek I.

Samples and Populations Confidence Intervals Hypotheses One-sided vs. two-sided Statistical Significance Error Types. Statistiek I. Statistiek I Sampling John Nerbonne CLCG, Rijksuniversiteit Groningen http://www.let.rug.nl/nerbonne/teach/statistiek-i/ John Nerbonne 1/41 Overview 1 Samples and Populations 2 Confidence Intervals 3 Hypotheses

More information

One-Way ANOVA. Some examples of when ANOVA would be appropriate include:

One-Way ANOVA. Some examples of when ANOVA would be appropriate include: One-Way ANOVA 1. Purpose Analysis of variance (ANOVA) is used when one wishes to determine whether two or more groups (e.g., classes A, B, and C) differ on some outcome of interest (e.g., an achievement

More information

Time: 1 hour 30 minutes

Time: 1 hour 30 minutes Paper Reference(s) 6684/01 Edexcel GCE Statistics S2 Gold Level G3 Time: 1 hour 30 minutes Materials required for examination papers Mathematical Formulae (Green) Items included with question Nil Candidates

More information

Factorial Independent Samples ANOVA

Factorial Independent Samples ANOVA Factorial Independent Samples ANOVA Liljenquist, Zhong and Galinsky (2010) found that people were more charitable when they were in a clean smelling room than in a neutral smelling room. Based on that

More information

June 006 6691 Statistics S3 Mark Scheme Mark Scheme (Results) Summer 007 GCE GCE Mathematics Statistics S3 (6691) Edexcel Limited. Registered in England and Wales No. 4496750 Registered Office: One90

More information

POLI 443 Applied Political Research

POLI 443 Applied Political Research POLI 443 Applied Political Research Session 4 Tests of Hypotheses The Normal Curve Lecturer: Prof. A. Essuman-Johnson, Dept. of Political Science Contact Information: aessuman-johnson@ug.edu.gh College

More information

Hypothesis Testing. Hypothesis: conjecture, proposition or statement based on published literature, data, or a theory that may or may not be true

Hypothesis Testing. Hypothesis: conjecture, proposition or statement based on published literature, data, or a theory that may or may not be true Hypothesis esting Hypothesis: conjecture, proposition or statement based on published literature, data, or a theory that may or may not be true Statistical Hypothesis: conjecture about a population parameter

More information

LECTURE 5. Introduction to Econometrics. Hypothesis testing

LECTURE 5. Introduction to Econometrics. Hypothesis testing LECTURE 5 Introduction to Econometrics Hypothesis testing October 18, 2016 1 / 26 ON TODAY S LECTURE We are going to discuss how hypotheses about coefficients can be tested in regression models We will

More information

Review of Multiple Regression

Review of Multiple Regression Ronald H. Heck 1 Let s begin with a little review of multiple regression this week. Linear models [e.g., correlation, t-tests, analysis of variance (ANOVA), multiple regression, path analysis, multivariate

More information

This gives us an upper and lower bound that capture our population mean.

This gives us an upper and lower bound that capture our population mean. Confidence Intervals Critical Values Practice Problems 1 Estimation 1.1 Confidence Intervals Definition 1.1 Margin of error. The margin of error of a distribution is the amount of error we predict when

More information

Universität Potsdam Institut für Informatik Lehrstuhl Maschinelles Lernen. Hypothesis testing. Anna Wegloop Niels Landwehr/Tobias Scheffer

Universität Potsdam Institut für Informatik Lehrstuhl Maschinelles Lernen. Hypothesis testing. Anna Wegloop Niels Landwehr/Tobias Scheffer Universität Potsdam Institut für Informatik Lehrstuhl Maschinelles Lernen Hypothesis testing Anna Wegloop iels Landwehr/Tobias Scheffer Why do a statistical test? input computer model output Outlook ull-hypothesis

More information

Hypothesis Testing. ) the hypothesis that suggests no change from previous experience

Hypothesis Testing. ) the hypothesis that suggests no change from previous experience Hypothesis Testing Definitions Hypothesis a claim about something Null hypothesis ( H 0 ) the hypothesis that suggests no change from previous experience Alternative hypothesis ( H 1 ) the hypothesis that

More information

Statistics and Sampling distributions

Statistics and Sampling distributions Statistics and Sampling distributions a statistic is a numerical summary of sample data. It is a rv. The distribution of a statistic is called its sampling distribution. The rv s X 1, X 2,, X n are said

More information

Chapter 6 Continuous Probability Distributions

Chapter 6 Continuous Probability Distributions Continuous Probability Distributions Learning Objectives 1. Understand the difference between how probabilities are computed for discrete and continuous random variables. 2. Know how to compute probability

More information

P ( z 0.75)

P ( z 0.75) BUS 172 Section 5, Spring 2013 Assignment # 6 Deadline: Your assignment must be submitted/ emailed on or before 1:00 PM, 14 th March, 2013. Late submission will be penalized by 10% for every day after

More information

Chapter # classifications of unlikely, likely, or very likely to describe possible buying of a product?

Chapter # classifications of unlikely, likely, or very likely to describe possible buying of a product? A. Attribute data B. Numerical data C. Quantitative data D. Sample data E. Qualitative data F. Statistic G. Parameter Chapter #1 Match the following descriptions with the best term or classification given

More information

Simple Linear Regression: One Qualitative IV

Simple Linear Regression: One Qualitative IV Simple Linear Regression: One Qualitative IV 1. Purpose As noted before regression is used both to explain and predict variation in DVs, and adding to the equation categorical variables extends regression

More information

Lecture 6: The Normal distribution

Lecture 6: The Normal distribution Lecture 6: The Normal distribution 18th of November 2015 Lecture 6: The Normal distribution 18th of November 2015 1 / 29 Continous data In previous lectures we have considered discrete datasets and discrete

More information

Hypothesis tests

Hypothesis tests 6.1 6.4 Hypothesis tests Prof. Tesler Math 186 February 26, 2014 Prof. Tesler 6.1 6.4 Hypothesis tests Math 186 / February 26, 2014 1 / 41 6.1 6.2 Intro to hypothesis tests and decision rules Hypothesis

More information

79 Wyner Math Academy I Spring 2016

79 Wyner Math Academy I Spring 2016 79 Wyner Math Academy I Spring 2016 CHAPTER NINE: HYPOTHESIS TESTING Review May 11 Test May 17 Research requires an understanding of underlying mathematical distributions as well as of the research methods

More information

A-Level Statistics SS04 Final Mark scheme

A-Level Statistics SS04 Final Mark scheme A-Level Statistics SS04 Final Mark scheme 6380 June 2017 Version/Stage: v1.0 Mark schemes are prepared by the Lead Assessment Writer and considered, together with the relevant questions, by a panel of

More information

Hypothesis testing: Steps

Hypothesis testing: Steps Review for Exam 2 Hypothesis testing: Steps Repeated-Measures ANOVA 1. Determine appropriate test and hypotheses 2. Use distribution table to find critical statistic value(s) representing rejection region

More information

Study Guide #3: OneWay ANALYSIS OF VARIANCE (ANOVA)

Study Guide #3: OneWay ANALYSIS OF VARIANCE (ANOVA) Study Guide #3: OneWay ANALYSIS OF VARIANCE (ANOVA) About the ANOVA Test In educational research, we are most often involved finding out whether there are differences between groups. For example, is there

More information

6 Single Sample Methods for a Location Parameter

6 Single Sample Methods for a Location Parameter 6 Single Sample Methods for a Location Parameter If there are serious departures from parametric test assumptions (e.g., normality or symmetry), nonparametric tests on a measure of central tendency (usually

More information

Ch. 7: Estimates and Sample Sizes

Ch. 7: Estimates and Sample Sizes Ch. 7: Estimates and Sample Sizes Section Title Notes Pages Introduction to the Chapter 2 2 Estimating p in the Binomial Distribution 2 5 3 Estimating a Population Mean: Sigma Known 6 9 4 Estimating a

More information

Math 2200 Fall 2014, Exam 3 You may use any calculator. You may use a 4 6 inch notecard as a cheat sheet.

Math 2200 Fall 2014, Exam 3 You may use any calculator. You may use a 4 6 inch notecard as a cheat sheet. 1 Math 2200 Fall 2014, Exam 3 You may use any calculator. You may use a 4 6 inch notecard as a cheat sheet. Warning to the Reader! If you are a student for whom this document is a historical artifact,

More information

Making Inferences About Parameters

Making Inferences About Parameters Making Inferences About Parameters Parametric statistical inference may take the form of: 1. Estimation: on the basis of sample data we estimate the value of some parameter of the population from which

More information

Two Sample Problems. Two sample problems

Two Sample Problems. Two sample problems Two Sample Problems Two sample problems The goal of inference is to compare the responses in two groups. Each group is a sample from a different population. The responses in each group are independent

More information

Inferences About the Difference Between Two Means

Inferences About the Difference Between Two Means 7 Inferences About the Difference Between Two Means Chapter Outline 7.1 New Concepts 7.1.1 Independent Versus Dependent Samples 7.1. Hypotheses 7. Inferences About Two Independent Means 7..1 Independent

More information