BIOS 4110: Introduction to Biostatistics. Breheny. Lab #9

Similar documents
Overview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions

Interval Estimation (Confidence Interval = C.I.): An interval estimate of some population parameter is an interval of the form (, ),

April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE

HYPOTHESIS TESTS FOR ONE POPULATION MEAN WORKSHEET MTH 1210, FALL 2018

Common Large/Small Sample Tests 1/55

Recall the study where we estimated the difference between mean systolic blood pressure levels of users of oral contraceptives and non-users, x - y.

Data Analysis and Statistical Methods Statistics 651

Chapter 1 (Definitions)

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

Chapter 6 Sampling Distributions

1 Inferential Methods for Correlation and Regression Analysis

A quick activity - Central Limit Theorem and Proportions. Lecture 21: Testing Proportions. Results from the GSS. Statistics and the General Population

Section 9.2. Tests About a Population Proportion 12/17/2014. Carrying Out a Significance Test H A N T. Parameters & Hypothesis

Comparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading

This chapter focuses on two experimental designs that are crucial to comparative studies: (1) independent samples and (2) matched pair samples.

Class 23. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Successful HE applicants. Information sheet A Number of applicants. Gender Applicants Accepts Applicants Accepts. Age. Domicile

- E < p. ˆ p q ˆ E = q ˆ = 1 - p ˆ = sample proportion of x failures in a sample size of n. where. x n sample proportion. population proportion

Estimating the Population Mean - when a sample average is calculated we can create an interval centered on this average

Exam II Covers. STA 291 Lecture 19. Exam II Next Tuesday 5-7pm Memorial Hall (Same place as exam I) Makeup Exam 7:15pm 9:15pm Location CB 234

Expectation and Variance of a random variable

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4

MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND.

Inferential Statistics. Inference Process. Inferential Statistics and Probability a Holistic Approach. Inference Process.

Chapter 20. Comparing Two Proportions. BPS - 5th Ed. Chapter 20 1

Frequentist Inference

Final Examination Solutions 17/6/2010

University of California, Los Angeles Department of Statistics. Hypothesis testing

STATISTICAL INFERENCE

7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals

Statistics 511 Additional Materials

Class 27. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Agreement of CI and HT. Lecture 13 - Tests of Proportions. Example - Waiting Times

This is an introductory course in Analysis of Variance and Design of Experiments.

Announcements. Unit 5: Inference for Categorical Data Lecture 1: Inference for a single proportion

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara

Chapter 8: Estimating with Confidence

Properties and Hypothesis Testing

Working with Two Populations. Comparing Two Means

Sample Size Determination (Two or More Samples)

Topic 9: Sampling Distributions of Estimators

Instructor: Judith Canner Spring 2010 CONFIDENCE INTERVALS How do we make inferences about the population parameters?

Chapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures

Topic 9: Sampling Distributions of Estimators

Continuous Data that can take on any real number (time/length) based on sample data. Categorical data can only be named or categorised

Chapter 22: What is a Test of Significance?

AP Statistics Review Ch. 8

MATH/STAT 352: Lecture 15

6 Sample Size Calculations

MA238 Assignment 4 Solutions (part a)

1036: Probability & Statistics

Lecture 5. Materials Covered: Chapter 6 Suggested Exercises: 6.7, 6.9, 6.17, 6.20, 6.21, 6.41, 6.49, 6.52, 6.53, 6.62, 6.63.

LESSON 20: HYPOTHESIS TESTING

Confidence Intervals รศ.ดร. อน นต ผลเพ ม Assoc.Prof. Anan Phonphoem, Ph.D. Intelligent Wireless Network Group (IWING Lab)

Chapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc.

Hypothesis Testing. Evaluation of Performance of Learned h. Issues. Trade-off Between Bias and Variance

Lecture 7: Non-parametric Comparison of Location. GENOME 560, Spring 2016 Doug Fowler, GS

Module 1 Fundamentals in statistics

UNIVERSITY OF TORONTO Faculty of Arts and Science APRIL/MAY 2009 EXAMINATIONS ECO220Y1Y PART 1 OF 2 SOLUTIONS

Mathacle. PSet Stats, Concepts In Statistics Level Number Name: Date: Confidence Interval Guesswork with Confidence

Statistical Inference About Means and Proportions With Two Populations

Stat 421-SP2012 Interval Estimation Section

Statistical Inference Procedures

Topic 9: Sampling Distributions of Estimators

Introduction to Econometrics (3 rd Updated Edition) Solutions to Odd- Numbered End- of- Chapter Exercises: Chapter 3

Confidence Intervals QMET103

5. A formulae page and two tables are provided at the end of Part A of the examination PART A

Stat 200 -Testing Summary Page 1

Read through these prior to coming to the test and follow them when you take your test.

Chapter 13, Part A Analysis of Variance and Experimental Design

Binomial Distribution

Math 140 Introductory Statistics

Chapter 23: Inferences About Means

1 Constructing and Interpreting a Confidence Interval

Statistical inference: example 1. Inferential Statistics

Computing Confidence Intervals for Sample Data

CHAPTER 8 FUNDAMENTAL SAMPLING DISTRIBUTIONS AND DATA DESCRIPTIONS. 8.1 Random Sampling. 8.2 Some Important Statistics

Exam II Review. CEE 3710 November 15, /16/2017. EXAM II Friday, November 17, in class. Open book and open notes.

If, for instance, we were required to test whether the population mean μ could be equal to a certain value μ

Lecture 5: Parametric Hypothesis Testing: Comparing Means. GENOME 560, Spring 2016 Doug Fowler, GS

a.) If random samples of size n=16 are selected, can we say anything about the x~ distribution of sample means?

STAT 155 Introductory Statistics Chapter 6: Introduction to Inference. Lecture 18: Estimation with Confidence

2 1. The r.s., of size n2, from population 2 will be. 2 and 2. 2) The two populations are independent. This implies that all of the n1 n2

Notes on Hypothesis Testing, Type I and Type II Errors

STA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to:

Because it tests for differences between multiple pairs of means in one test, it is called an omnibus test.

Mathacle. PSet Stats, Concepts In Statistics Level Number Name: Date:

Homework 5 Solutions

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics

Topic 6 Sampling, hypothesis testing, and the central limit theorem

Chapter two: Hypothesis testing

Parameter, Statistic and Random Samples

Worksheet 23 ( ) Introduction to Simple Linear Regression (continued)

Mathematical Notation Math Introduction to Applied Statistics

Direction: This test is worth 150 points. You are required to complete this test within 55 minutes.

Chapter 11: Asking and Answering Questions About the Difference of Two Proportions

Power and Type II Error

Statistical Intervals for a Single Sample

1 Constructing and Interpreting a Confidence Interval

Transcription:

BIOS 4110: Itroductio to Biostatistics Brehey Lab #9 The Cetral Limit Theorem is very importat i the realm of statistics, ad today's lab will explore the applicatio of it i both categorical ad cotiuous data. We will review usig it for hypothesis testig ad for cofidece iterval costructio. Recall these importat formulas: I. CATEGORICAL Z = p p 0 p 0 (1 p 0 ) p 0 (1 p 0 ) is the stadard error of the sample proportio p is the sample proportio p 0 is the hypothesized value of the proportio This formula is used for hypothesis testig. The stadard error uses the hypothesized value of the true proportio i the populatio of iterest. Recall i the large sample settig, the distributio of the sample proportio is approximately ormal; therefore we ca use the Z distributio to approximate the probability of observig a sample proportio as extreme, or more extreme, give that the true proportio is equal to p 0. The followig figure shows the exact probability mass fuctio as the bars, ad the ormal probability desity fuctio as the lie. Notice how they are approximately the same. NOTE: = 6, p = 0.5

(1- α/2)x100% Cofidece iterval for the true proportio, p p (1 p ) p (1 p ) p Z α/2, p + Z α/2 p is the sample proportio Z α/2 is the critical value (depeds o the value of α) Oly works i large sample settig; p ca t be close to 0 or 1 ad has to be relatively large BOUNDED BY 0 AND 1 If we are iterested i providig a cofidece iterval for the true proportio i the populatio, we ca use the formula above. We are (1- α/2)x100% cofidet that the true proportio of [INSERT] i [POPULATION] is betwee [LOWER, UPPER]. II. CONTINUOUS T = x μ 0 s/ x is the sample mea s is the sample stadard deviatio is the sample size μ 0 is the hypothesized mea df is the degrees of freedom, which is equal to - 1 NOTES: This statistic is based off the Studet s curve, which is very similar to the ormal curve. However, the tails are thicker ad we have to accout for the degrees of freedom. This accouts for the extra variability because we have to estimate the stadard deviatio. Whe is greater tha about 50, the ormal distributio ad the Studet s curve are about the same. I practice, Studet s curve is used much more ofte. Note the degrees of freedom for T. (1- α/2)x100% Cofidece iterval for the true mea, µ x T α/2,df s/, x + T α/2,df s/ We are (1- α/2)x100% cofidet that the true mea [INSERT] i [POPULATION] is betwee [LOWER, UPPER]

Cofidece Itervals Iterpretatio If you were to repeat this process a ifiite umber of times, 95% of iterval estimates for μ created this way will cotai the true parameter value μ. We treat the populatio mea μ as beig fixed. Ay particular iterval may or may ot cotai the true populatio mea μ. We say we are 95% cofidet the iterval does cotai μ because the procedure used to costruct this iterval produces a correct iterval estimate 95% of the time. We do ot say there is a 95% probability that μ lies betwee these values. III. PRACTICE PROBLEMS 1. A sample of Alzheimer's patiets are tested to assess the amout of time i stage IV sleep. It has bee hypothesized that idividuals sufferig from Alzheimer's Disease may sped less time per ight i the deeper stages of sleep. Number of miutes spet is Stage IV sleep is recorded for sixty-oe patiets. The sample produced a mea of 45 miutes (S=14 miutes) of stage IV sleep over a 24 hour period of time. a) Will we eed to kow the uderlyig distributio of Stage IV sleep of the populatio of idividuals sufferig from Alzheimer s Disease to make ifereces o the sample mea? Explai. No we do ot. We have a sufficietly large sample size, therefore the sample mea will be approximately ormal. b) Suppose we wat to test if the true mea amout of time i stage IV sleep for this populatio is 49 miutes, which is the mea of the geeral populatio. Set up the hypothesis test, fid the critical value, report the p-value ad coclusio. t = 45 49 14/ 61. = -2.232 P(T < -2.32)*2 = 2*pt(-2.232, df = 60) = 0.029 = pvalue. We have sufficiet evidece that it is i fact differet from the geeral populatio (pvalue = 0.029).

c) Compute a 95 percet cofidece iterval for this data. What does this iformatio tell you about a particular idividual's (a Alzheimer's patiet) stage IV sleep? Critical value for T at sigificace level 0.05 ad df = 60 is 2. 45 ± 2 14 = (41.41, 48.59). The ull hypothesis of 49 miutes is ot i the 95% cofidece 61 iterval. Therefore, we coclude the average amout of sleep spet i stage IV i Alzheimer s patiets is less tha the geeral populatio mea. 2. The distributio of weights for the populatio of males i the Uited States is approximately ormal with mea µ = 172.2 ad stadard deviatio σ = 29.8 pouds. a) What is the probability that a radomly selected ma weighs less tha 130 pouds? z = 130 172.2 29.8 = -1.416. P(Z < -1.416) = 0.078 b) What is the probability that he weighs more tha 210 pouds? z = 210 172.2 29.8 = 1.268. P(Z > 1.268 ) = 0.102 c) What is the probability that amog five males selected at radom from the populatio, at least oe will have a weight outside the rage 130 to 210 pouds? 1 dbiom(0,5,0.18) = 0.629 3. Suppose a ew flu vaccie is developed for the state of Iowa; however, there are adverse side effects associated with it. We wish to test the hypothesis that the proportio of people who experiece a adverse effect is the same as it is with the stadard flu vaccie, which is 30%. Suppose we have a radom sample of size 1000, ad 270 people had a adverse effect to the ew flu vaccie. a) Set up the hypothesis test ad calculate the sample proportio. H0: p = 0.3 H 1 : p 0.3 p = 270/1000 = 0.27 b) Use biom.test(). Test the hypothesis stated i the problem at the 5% sigificace level ad calculate a cofidece iterval. What do you coclude? biom.test(270, 1000, 0.3) : pvalue = 0.038; 95% cofidece iterval is (0.243, 0.299). We have sufficiet evidece to coclude the true proportio of people who experiece a adverse effect for

this ew flu vaccie is ideed lower tha the true proportio of adverse effects from the stadard flu vaccie (p = 0.038). c) Use the ormal approximatio to fid the p-value ad cofidece iterval. z = 0.27 0.3 = -2.07 P(Z < -2.07)*2 = 2*porm(-2.07) = 0.038 0.3(1 0.3) 1000 95% C.I. = 0.27 ± 1.96 0.27(1 0.27) 1000 = (0.24, 0.298). We ca see it is very similar to the exact cofidece iterval ad we draw the same coclusio.