Elementary Statistics and Inference

Similar documents
Statistical Inference for Means

Lecture Slides. Elementary Statistics Eleventh Edition. by Mario F. Triola. and the Triola Statistics Series 9.1-1

UNIVERSITY OF TORONTO. Faculty of Arts and Science APRIL - MAY 2005 EXAMINATIONS STA 248 H1S. Duration - 3 hours. Aids Allowed: Calculator

Lecture 30. DATA 8 Summer Regression Inference

# of 6s # of times Test the null hypthesis that the dice are fair at α =.01 significance

Chapter 22. Comparing Two Proportions. Bin Zou STAT 141 University of Alberta Winter / 15

Statistics and Quantitative Analysis U4320. Segment 5: Sampling and inference Prof. Sharyn O Halloran

Chapter 5 : Probability. Exercise Sheet. SHilal. 1 P a g e

CHAPTER 5 Probabilistic Features of the Distributions of Certain Sample Statistics

Chapter 9. Inferences from Two Samples. Objective. Notation. Section 9.2. Definition. Notation. q = 1 p. Inferences About Two Proportions

MAT 2379, Introduction to Biostatistics, Sample Calculator Questions 1. MAT 2379, Introduction to Biostatistics

Name Class Date. Residuals and Linear Regression Going Deeper

AMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015

TEST 1 M3070 Fall 2003

Testing a Claim about the Difference in 2 Population Means Independent Samples. (there is no difference in Population Means µ 1 µ 2 = 0) against

CBA4 is live in practice mode this week exam mode from Saturday!

Analysis of Variance. Contents. 1 Analysis of Variance. 1.1 Review. Anthony Tanbakuchi Department of Mathematics Pima Community College

Inferences Based on Two Samples

Homework 9 (due November 24, 2009)

Statistics Primer. ORC Staff: Jayme Palka Peter Boedeker Marcus Fagan Trey Dejong

PubH 7470: STATISTICS FOR TRANSLATIONAL & CLINICAL RESEARCH

Nicole Dalzell. July 2, 2014

Simple Linear Regression: One Qualitative IV

Test 3 SOLUTIONS. x P(x) xp(x)

Inferential statistics

Chapter 3. Comparing two populations

Lecture 10/Chapter 8 Bell-Shaped Curves & Other Shapes. From a Histogram to a Frequency Curve Standard Score Using Normal Table Empirical Rule

STAT Chapter 9: Two-Sample Problems. Paired Differences (Section 9.3)

Lecture 14: Introduction to Poisson Regression

Modelling counts. Lecture 14: Introduction to Poisson Regression. Overview

16.400/453J Human Factors Engineering. Design of Experiments II

Paired Samples. Lecture 37 Sections 11.1, 11.2, Robb T. Koether. Hampden-Sydney College. Mon, Apr 2, 2012

where Female = 0 for males, = 1 for females Age is measured in years (22, 23, ) GPA is measured in units on a four-point scale (0, 1.22, 3.45, etc.

Statistics for IT Managers

Class 24. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Normal Curve in standard form: Answer each of the following questions

LECTURE 12 CONFIDENCE INTERVAL AND HYPOTHESIS TESTING

Psych Jan. 5, 2005

Statistics and Quantitative Analysis U4320

Francine s bone density is 1.45 standard deviations below the mean hip bone density for 25-year-old women of 956 grams/cm 2.

Lecturer: Dr. Adote Anum, Dept. of Psychology Contact Information:

Lecture Slides. Elementary Statistics. by Mario F. Triola. and the Triola Statistics Series

An Analysis of College Algebra Exam Scores December 14, James D Jones Math Section 01

Introduction to Estimation. Martina Litschmannová K210

Lecture Slides. Section 13-1 Overview. Elementary Statistics Tenth Edition. Chapter 13 Nonparametric Statistics. by Mario F.

Sampling Distribution of a Sample Proportion

Inference for Distributions Inference for the Mean of a Population

Math 2000 Practice Final Exam: Homework problems to review. Problem numbers

STAT 201 Assignment 6

T test for two Independent Samples. Raja, BSc.N, DCHN, RN Nursing Instructor Acknowledgement: Ms. Saima Hirani June 07, 2016

4. Nonlinear regression functions

An inferential procedure to use sample data to understand a population Procedures

One-Way ANOVA. Some examples of when ANOVA would be appropriate include:

Chapter 12 - Lecture 2 Inferences about regression coefficient

Hypotheses Testing. 1-Single Mean

Data Analysis and Statistical Methods Statistics 651

Chapter 9. Hypothesis testing. 9.1 Introduction

Stat 101: Lecture 6. Summer 2006

Psych 230. Psychological Measurement and Statistics

Chapter 10: STATISTICAL INFERENCE FOR TWO SAMPLES. Part 1: Hypothesis tests on a µ 1 µ 2 for independent groups

(right tailed) or minus Z α. (left-tailed). For a two-tailed test the critical Z value is going to be.

Module 03 Lecture 14 Inferential Statistics ANOVA and TOI

Confidence Intervals for Comparing Means

Chapter 6 The Normal Distribution

OCR Maths S1. Topic Questions from Papers. Representation of Data

COSC 341 Human Computer Interaction. Dr. Bowen Hui University of British Columbia Okanagan

Analysing data: regression and correlation S6 and S7

Math 101: Elementary Statistics Tests of Hypothesis

University of Jordan Fall 2009/2010 Department of Mathematics

6 THE NORMAL DISTRIBUTION

One-sample categorical data: approximate inference

Sampling Distributions

Elementary Statistics

Year 10 Higher Easter 2017 Revision

The One-Way Repeated-Measures ANOVA. (For Within-Subjects Designs)

Review. Midterm Exam. Midterm Review. May 6th, 2015 AMS-UCSC. Spring Session 1 (Midterm Review) AMS-5 May 6th, / 24

Essential Question: What are the standard intervals for a normal distribution? How are these intervals used to solve problems?

Determine whether the value given below is from a discrete or continuous data set.

Stat 135 Fall 2013 FINAL EXAM December 18, 2013

Lesson 19: Understanding Variability When Estimating a Population Proportion

Objectives Simple linear regression. Statistical model for linear regression. Estimating the regression parameters

Announcements. Unit 3: Foundations for inference Lecture 3: Decision errors, significance levels, sample size, and power.

The Components of a Statistical Hypothesis Testing Problem

Chapter 22. Comparing Two Proportions 1 /29

Poisson regression: Further topics

MATH4427 Notebook 5 Fall Semester 2017/2018

Announcements. Lecture 5: Probability. Dangling threads from last week: Mean vs. median. Dangling threads from last week: Sampling bias

CHAPTER 10. Regression and Correlation

Sampling Distribution of a Sample Proportion

Answer Key. 9.1 Scatter Plots and Linear Correlation. Chapter 9 Regression and Correlation. CK-12 Advanced Probability and Statistics Concepts 1

5-1. For which functions in Problem 4-3 does the Central Limit Theorem hold / fail?

Chapter 22. Comparing Two Proportions 1 /30

Statistics and Sampling distributions

Statistical Analysis for QBIC Genetics Adapted by Ellen G. Dow 2017

Midterm 1 and 2 results

Data Analysis and Statistical Methods Statistics 651

Probability and Probability Distributions. Dr. Mohammed Alahmed

Exercise 1. Exercise 2. Lesson 2 Theoretical Foundations Probabilities Solutions You ip a coin three times.

A-Level Statistics SS04 Final Mark scheme

Lecture 28 Chi-Square Analysis

Transcription:

Elementary tatistics and Inference :5 or 7P:5 Lecture 36 Elementary tatistics and Inference :5 or 7P:5 Chapter 7 3

Chapter 7 More Tests for verages ) The tandard Error for a Difference etween Two verages This chapter discussed the process for comparing the differences between two means. For example, suppose you want to compare the differences between boys and girls, or differences between students who were instructed with method versus students instructed with method. 4 Chapter 7 More Tests for verages (cont.) uppose you have two boxes: (Population) ox μ verage D 6 (Population) ox μ verage 9 D 4 ample ample n 4 n E(avg) μ E(avg) n 6 3 4 E(avg) 9 μ E(avg) n 4 4 5 Chapter 7 More Tests for verages (cont.) ee page 5-53. E ( X X ) E E ( X X ) μ μ 9 ( X X ) 9 6 5 5 X X 6 4 4 36 6 4 The histogram of differences in averages (sampling distribution of differences in averages) is approximately normal in shape with mean and E as: Mean μ μ differences in means of the populations E square root of the sum of the error variances of populations and 6

Chapter 7 More Tests for verages (cont.) Example: (page 5) One hundred draws are made from a box (population) are made with replacement from box C, shown below. Independently, draws are made with replacement from box D. Find the expected value and standard error for the difference between the number of s in box C and the number of 4s drawn from box D. 7 Chapter 7 More Tests for verages (cont.) C D 3 4 avg μc D C E(count) n avg 5 E(count) n D 5 avg μ D D D E(count) n avg 5 E(count) n D 5 8 Chapter 7 More Tests for verages (cont.) E(Difference) E(count C) - E(count D) 5-5 E(Difference) 5 5 5 5 7.7 E( Diff) 7.7 7 Differences The probability histogram for differences in sample means would be approximately normal with mean equal to difference in means of the populations (boxes) and E equal to the square root of the sum of the squares of the E for each box. 9 3

Chapter 7 More Tests for verages (cont.) Exercise et (p. 53) #, 3 #. ox has an average of and a D of. ox has an average of 5 and a D of 8. Now 5 draws are made at random with replacement from box, and independently 36 draws are made at random with replacement from box. Find the expected value and standard error for the difference between the average of the draws from box and the average of the draws from box. Chapter 7 More Tests for verages (cont.) Population (ox) μ Population (ox) μ 5 8 n 5 n 36 E(Differences in Means) - 5 5 E(Differences in Means) E 34 5 36 ( Differences in Means) 4 9 3 3.65 3. 6 Chapter 7 More Tests for verages (cont.) The sampling distribution of differences in averages would be approximately normal with mean 5, and E 3.6. X X μ μ X X 4

Chapter 7 More Tests for verages (cont.). Comparing Two ample verages (Means) ince the theoretical probability histogram (sampling distribution) of differences in averages is approximately normal, we can use the Z-test to test the hypothesis that independent samples were taken from two populations that have the same mean (i.e., H : μ μ ). 3 Chapter 7 More Tests for verages (cont.) Population μ?? Population μ?? ample (n ) ample (n ) X 36.7 X 3.4 3. H : μ μ H : μ μ > 34.9 4 Chapter 7 More Tests for verages (cont.) The sampling distribution of differences is approximately normal. E(diff) 3. 34.9 E(diff) E(diff).46.96.8 H : μ μ 6.3 4.3 X X Z score - Z D of score Z mean of score ( X X ) ( ) ( X X ) ( μ μ) 6.3 4.3.46 E(diff) 5 5

Chapter 7 More Tests for verages (cont.) The p-value associated with Z4.3 is very small..% 99.9983 99.9983% ).7%.85% Z -4.3 4.3 The likelihood of obtaining a difference in means of 6.3 from two independent samples from the populations is much less than % of the time (i.e.,.85%), if the hypothesis were true as a result, we reject the hypothesis and accept the alternative that the mean of population is significantly larger than population. 6 Chapter 7 More Tests for verages (cont.) Example: large university takes a simple random sample of male students, and another simple random sample of females. The sample sizes are and 3. s is turned out, 7 of the sample men used a personal computer on a regular basis, compared to 3 of the women: 53.5% versus 44.%. Is the difference between the percentages real, or a chance variation? 7 Chapter 7 More Tests for verages (cont.) Males Females n n 3 7 3 p.535 p.44 3 E( avg) p E( avg) p E(avg) p ( p ) n est. E(Diff in %) E(Diff in %) p % p % 53.5 44. 9.5% p ( p ) p ( p ) E(avg) (.535)(.465) (.44)(.56) est. E(Diff in %) 3 est. E(Diff in %) 4.54% p ( p ) n 8 6

Chapter 7 More Tests for verages (cont.) H : Male% - Female % H : Male% - Femle % > ( φ φ ) M F est. E(Diff in %) 4.54%.79%.9 score - mean of scores Diff in % - Expected Diff in % Z D of scores est. E of diff in % ( p % p %) () 9.5 Z.9 p ( ) ( ) 4.54 p p p Male% - Female % Z 9 Chapter 7 More Tests for verages (cont.) The chance of obtaining a Z-score of.9 or 96.43 greater is about.79% - so the p-value.79%. ecause this is less than 5%, we reject hypothesis that the percent of males and females with computers is equal, more males had computers than females. Exercise et (p. 56) #7, 8 7

Chapter 7 More Tests for verages (cont.) C. Experiments The methods use to compare differences in two independent sample means can be applied to experiments. For example, at the start of our course we discussed the alk vaccine trials to see if the vaccine was effective in reducing the incidence of polio. Example 4: There are subjects in a small clinical trial on vitamin C. Half the subjects are assigned at random to treatment (, mg of vitamin C daily) and half to control (, mg of placebo). Over the period of the experiment, the treatment group averaged.3 colds, and the D was 3.. The controls did a little worse: they averaged.6 colds and the D was.9. Is the difference in averages statistically significant? Chapter 7 More Tests for verages (cont.) Treatment (n) Vitamin C Treatment (n) Placebo X.3 colds 3. H : μ μ H : μ μ < ( X Z X ) ( μ μ ) (.3.6) 3..9.3.3 Z.77.96.84.45 X.6 colds.9 ee page 58-59 3 Chapter 7 More Tests for verages (cont.) E(diff in means).45 5.6% 4% 4% -.3 -.77 (-.7) H H.7 Differencein means X X Z The p-value is about 4% - since it is greater than 5%, retain H. 4 8

Chapter 7 More Tests for verages (cont.) Note: Read the caution section about randomized experiments on pages 58-5. 5 Chapter 7 More Tests for verages (cont.) There is a box of tickets. Each ticket has two numbers: one shows what the response would be to treatment ; the other, to treatment ; only one of the numbers can be observed. ome tickets are drawn at random without replacement from the box; in this sample, the responses to treatment are observed. Then, a second sample is drawn at random without replacement from the remaining tickets. In the second sample, the responses to treatment are observed. The E for the difference between the two sample averages can be conservatively estimated as follows: i. compute the Es for the averages as if drawing with replacement; ii. combine the Es as if the samples were independent. 6 Chapter 7 More Tests for verages (cont.) Exercise et C (p. 5) #, 3 #. Is Wheaties a power breakfast? To find out, a study is done in a large elementary statistics class: 499 students agree to participate; after the midterm, 5 are randomized to the treatment group, and 49 to the control group. The treatment group is fed Wheaties for breakfast 7 days a week. The control group gets ugar Pops. a) Final scores averaged 66 for the treatment group; the D was. For the control group, the figures were 59 and. What do you conclude? b) What aspects of the study could have been done blind? Note: Wheaties and ugar Pops are registered 7 trademarks. The study is hypothetical. 9

Chapter 7 More Tests for verages (cont.) Wheaties ugar Pops n 5 X 66 n 49 X 59 H : μ μ H : μ μ 8 Chapter 7 More Tests for verages (cont.) a) X X 7 est. E (Diff) est. E(Diff).764.66 5 49 3.37.835 9 Chapter 7 More Tests for verages (cont.) b) 99.986% est. E(Diff).835 7-3.8 3.8 X X ( X Z X ) n n 3

Chapter 7 More Tests for verages (cont.) c) Z ( X X ) n n 7.835 3.8 d) P-value.4%, since it is less than 5%, reject H Wheaties are more powerful! 3