x z Increasing the size of the sample increases the power (reduces the probability of a Type II error) when the significance level remains fixed.

Similar documents
Statistical Inference Procedures

TESTS OF SIGNIFICANCE

Tables and Formulas for Sullivan, Fundamentals of Statistics, 2e Pearson Education, Inc.

Comments on Discussion Sheet 18 and Worksheet 18 ( ) An Introduction to Hypothesis Testing

Chapter 9. Key Ideas Hypothesis Test (Two Populations)

REVIEW OF SIMPLE LINEAR REGRESSION SIMPLE LINEAR REGRESSION

Tools Hypothesis Tests

COMPARISONS INVOLVING TWO SAMPLE MEANS. Two-tail tests have these types of hypotheses: H A : 1 2

IntroEcono. Discrete RV. Continuous RV s

M227 Chapter 9 Section 1 Testing Two Parameters: Means, Variances, Proportions

STA 4032 Final Exam Formula Sheet

CHAPTER 6. Confidence Intervals. 6.1 (a) y = 1269; s = 145; n = 8. The standard error of the mean is = s n = = 51.3 ng/gm.

VIII. Interval Estimation A. A Few Important Definitions (Including Some Reminders)

Chapter 8.2. Interval Estimation

11/19/ Chapter 10 Overview. Chapter 10: Two-Sample Inference. + The Big Picture : Inference for Mean Difference Dependent Samples

Chapter 9: Hypothesis Testing

Grant MacEwan University STAT 151 Formula Sheet Final Exam Dr. Karen Buro

S T A T R a c h e l L. W e b b, P o r t l a n d S t a t e U n i v e r s i t y P a g e 1. = Population Variance

UNIVERSITY OF CALICUT

STUDENT S t-distribution AND CONFIDENCE INTERVALS OF THE MEAN ( )

Statistics and Chemical Measurements: Quantifying Uncertainty. Normal or Gaussian Distribution The Bell Curve

Confidence Intervals. Confidence Intervals

20. CONFIDENCE INTERVALS FOR THE MEAN, UNKNOWN VARIANCE

Difference tests (1): parametric

Stat 3411 Spring 2011 Assignment 6 Answers

CE3502 Environmental Monitoring, Measurements, and Data Analysis (EMMA) Spring 2008 Final Review

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics

100(1 α)% confidence interval: ( x z ( sample size needed to construct a 100(1 α)% confidence interval with a margin of error of w:

Chapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc.

Chapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.

Estimation Theory. goavendaño. Estimation Theory

Chapter 8 Part 2. Unpaired t-test With Equal Variances With Unequal Variances

STA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to:

2 1. The r.s., of size n2, from population 2 will be. 2 and 2. 2) The two populations are independent. This implies that all of the n1 n2

Confidence Intervals: Three Views Class 23, Jeremy Orloff and Jonathan Bloom

MTH 212 Formulas page 1 out of 7. Sample variance: s = Sample standard deviation: s = s

SOLUTION: The 95% confidence interval for the population mean µ is x ± t 0.025; 49

Questions about the Assignment. Describing Data: Distributions and Relationships. Measures of Spread Standard Deviation. One Quantitative Variable

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics

ON THE SCALE PARAMETER OF EXPONENTIAL DISTRIBUTION

Statistical Equations

Statistics Parameters

TI-83/84 Calculator Instructions for Math Elementary Statistics

MATH/STAT 352: Lecture 15

A quick activity - Central Limit Theorem and Proportions. Lecture 21: Testing Proportions. Results from the GSS. Statistics and the General Population

Chapter 8: Estimating with Confidence

Common Large/Small Sample Tests 1/55

Formula Sheet. December 8, 2011

Chapters 5 and 13: REGRESSION AND CORRELATION. Univariate data: x, Bivariate data (x,y).

Agreement of CI and HT. Lecture 13 - Tests of Proportions. Example - Waiting Times

Overview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions

Statistics 20: Final Exam Solutions Summer Session 2007

(7 One- and Two-Sample Estimation Problem )

MATHEMATICS LW Quantitative Methods II Martin Huard Friday April 26, 2013 TEST # 4 SOLUTIONS

ME 410 MECHANICAL ENGINEERING SYSTEMS LABORATORY REGRESSION ANALYSIS

Statistical Inference for Two Samples. Applied Statistics and Probability for Engineers. Chapter 10 Statistical Inference for Two Samples

Chapter 6 Sampling Distributions

18.05 Problem Set 9, Spring 2014 Solutions

This chapter focuses on two experimental designs that are crucial to comparative studies: (1) independent samples and (2) matched pair samples.

Working with Two Populations. Comparing Two Means

BIOS 4110: Introduction to Biostatistics. Breheny. Lab #9

Stat 200 -Testing Summary Page 1

Describing the Relation between Two Variables

Comparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading

LECTURE 13 SIMULTANEOUS EQUATIONS

Widely used? average out effect Discrete Prior. Examplep. More than one observation. using MVUE (sample mean) yy 1 = 3.2, y 2 =2.2, y 3 =3.6, y 4 =4.

Mathacle PSet Stats, Confidence Intervals and Estimation Level Number Name: Date: Unbiased Estimators So we don t have favorite.

10-716: Advanced Machine Learning Spring Lecture 13: March 5

Review - Week 10. There are two types of errors one can make when performing significance tests:

- E < p. ˆ p q ˆ E = q ˆ = 1 - p ˆ = sample proportion of x failures in a sample size of n. where. x n sample proportion. population proportion

Chapter 1 (Definitions)

Reasons for Sampling. Forest Sampling. Scales of Measurement. Scales of Measurement. Sampling Error. Sampling - General Approach

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

Fig. 1: Streamline coordinates

Goodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen)

Read through these prior to coming to the test and follow them when you take your test.

Important Formulas. Expectation: E (X) = Σ [X P(X)] = n p q σ = n p q. P(X) = n! X1! X 2! X 3! X k! p X. Chapter 6 The Normal Distribution.

Section 9.2. Tests About a Population Proportion 12/17/2014. Carrying Out a Significance Test H A N T. Parameters & Hypothesis

Statistics. Chapter 10 Two-Sample Tests. Copyright 2013 Pearson Education, Inc. publishing as Prentice Hall. Chap 10-1

Statistics - Lying without sinning? Statistics - Lying without sinning?

Statistical Intervals Based on a Single Sample (Devore Chapter Seven)

Sample Size Determination (Two or More Samples)

October 25, 2018 BIM 105 Probability and Statistics for Biomedical Engineers 1

Formulas and Tables for Gerstman

Interval Estimation (Confidence Interval = C.I.): An interval estimate of some population parameter is an interval of the form (, ),

Chapter 20. Comparing Two Proportions. BPS - 5th Ed. Chapter 20 1

1 Inferential Methods for Correlation and Regression Analysis

m = Statistical Inference Estimators Sampling Distribution of Mean (Parameters) Sampling Distribution s = Sampling Distribution & Confidence Interval

Statistics Problem Set - modified July 25, _. d Q w. i n

Mathematical Notation Math Introduction to Applied Statistics

Isolated Word Recogniser

Expectation and Variance of a random variable

SOLUTIONS y n. n 1 = 605, y 1 = 351. y1. p y n. n 2 = 195, y 2 = 41. y p H 0 : p 1 = p 2 vs. H 1 : p 1 p 2.

Statistical treatment of test results

Inferential Statistics. Inference Process. Inferential Statistics and Probability a Holistic Approach. Inference Process.

Class 23. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

MidtermII Review. Sta Fall Office Hours Wednesday 12:30-2:30pm Watch linear regression videos before lab on Thursday

Announcements. Unit 5: Inference for Categorical Data Lecture 1: Inference for a single proportion

Class 27. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Chapter 13: Tests of Hypothesis Section 13.1 Introduction

Transcription:

] z-tet for the mea, μ If the P-value i a mall or maller tha a pecified value, the data are tatitically igificat at igificace level. Sigificace tet for the hypothei H 0: = 0 cocerig the ukow mea of a populatio are baed o the z tatitic x z 0 coditio: Radom data come from a SRS of the populatio. Normal populatio follow a ormal ditributio or becaue of the CLT thi rule ca be broke if () if <5, ca ue z if the data are roughly ormal (ad o outlier); () if 5 < 30, ca ue z eve if the data are lightly kewed (o outlier); (3) if 30, ca ue z eve if the data are trogly kewed (o outlier). Idepedet 0% rule i atified. H 0 true H a true Reject H 0 Type I Error Accept H 0 Type II Error The power of a igificace tet meaure it ability to detect a alterative hypothei that i, reject H 0 whe H 0 i fale. The power agait a pecific alterative i the probability that the tet will reject H 0 whe the alterative i true. I a fixed level igificace tet, the igificace level i the probability of a Type I error, ad the power agait a pecific alterative i miu the probability of a Type II error for that alterative. α = probability of Type I error; β = probability of Type II error; power = β. A α decreae, β icreae, ad power decreae. Icreaig the ize of the ample icreae the power (reduce the probability of a Type II error) whe the igificace level remai fixed. ] t-tet for the mea, μ I practice we do ot kow the populatio tadard deviatio, σ. I thi cae ubtitute x (the ample tadard deviatio) for the x 0 populatio tadard deviatio. I other word, ue the tatitic t. Thi tatitic ha a approximate t-ditributio with degree of freedom. coditio: Guidelie to uig t-procedure are Radom data mut come from a SRS of the populatio Normal Populatio mut follow a ormal ditributio. Careful here () if <5, ca ue t if the data are roughly ormal (ad o outlier); () if 5 < 30, ca ue t eve if the data are lightly kewed (o outlier); (3) if 30, ca ue t eve if the data are trogly kewed (o outlier). Idepedet ample ize mut be le tha 0% of the populatio ize or the data mut be collected i uch a way o that obervatio are idepedet of oe aother. 3] z-cofidece iterval for μ A level C cofidece iterval for the mea of a ormal populatio with kow tadard deviatio, baed o a SRS of ize, i give by * x z x Here z* i choe o that the tadard ormal curve ha area C betwee -z* ad z*. Becaue of the cetral limit theorem, thi iterval i approximately correct for large ample whe the populatio i ot ormal. The umber z* i called the upper critical value of the tadard ormal ditributio. You ca fid z* for C = 90% by uig z*=ivorm(.95); if C=95% the ue z*=ivorm(.975); i geeral ue z*=ivorm(c + (-C)/)=ivorm((+C)/) Other thig beig equal, the margi of error of a cofidece iterval get maller a

the cofidece level C decreae, the populatio tadard deviatio decreae, ad the ample ize icreae. The ample ize required to obtai a cofidece iterval with pecified margi of error m for a ormal mea i foud by ettig * z m ad olvig for, where z* i the critical value for the deired level of cofidece. Alway roud up whe you ue thi procedure. 4] t-cofidece iterval for μ I practice, we do ot kow. Replace the tadard deviatio tatitic of x by the tadard error to get the oe-ample t x t 0 The t tatitic ha the t ditributio with - degree of freedom. coditio: Radom data mut come from a SRS of the populatio Normal Populatio mut follow a ormal ditributio. Careful here () if <5, ca ue t if the data are roughly ormal (ad o outlier); () if 5 < 30, ca ue t eve if the data are lightly kewed (o outlier); (3) if 30, ca ue t eve if the data are trogly kewed (o outlier). Idepedet ample ize mut be le tha 0% of the populatio ize or the data mut be collected i uch a way o that obervatio are idepedet of oe aother. There i a t ditributio for every poitive degree of freedom k. All are ymmetric ditributio imilar i hape to the tadard ormal ditributio, The t(k) ditributio approache the N(0, ) ditributio a k icreae. x A exact level C cofidece iterval for the mea of a ormal populatio i * x t where the area betwee t* ad t* equal C. The value t* i called a critical value, pecifically it i the upper ( - C)/ critical value of the t( ) ditributio. Comparig Two Mea The data i a two-ample problem are two idepedet SRS, each draw from a eparate ormally ditributed populatio. Tet ad cofidece iterval for the differece betwee the mea ad of the two populatio tart from the differece x x of the two ample mea. Becaue of the cetral limit theorem, the reultig procedure are approximately correct for other populatio ditributio whe the ample ize are large (at leat 30 each). Draw idepedet SRS of ize ad from two ormal populatio with parameter,, ad,. The two-ample t tatitic i x x t The tatitic t doe ot exactly have a t ditributio. For coervative iferece procedure to compare ad, ue the two ample t tatitic with the t(k) ditributio. The degree of freedom k i the maller of ad. For more accurate probability value, ue the t(k) ditributio with degree of freedom k etimated from the data. Thi i the uual procedure i tatitical oftware or the TI83. I thi cae

but for implicity you ca alway ue df = mi(, ) 5] The cofidece iterval for - give by df = x x ha cofidece level at leat C if t* i the upper ( - C)/ critical value for t (k) with k the maller of ad. coditio: Guidelie to uig t-procedure are a before but replace with ad. Coditio mut hold for EACH ample. t *, 6] Sigificace tet for H0: = baed o x t x 0 have a true P-value o higher tha that calculated from t(k). The guidelie for practical ue of two-ample t procedure are imilar to thoe for oe-ample t procedure. Equal ample ize are recommeded. coditio: Guidelie to uig t-procedure are a before but replace with ad. Coditio mut hold for EACH ample. 7] -proportio z-iterval for p Tet ad cofidece iterval for a populatio proportio p whe the data are a SRS of ize are baed o the ample proportio. Whe i large, ha approximately the ormal ditributio with mea p ad tadard deviatio p p. The level C cofidece iterval for p i * z where z* i the upper ( C)/ tadard ormal critical value. (The area betwee z* ad z* i C). Tet ad cofidece iterval for a populatio proportio p whe the data are a SRS of ize are baed o the ample proportio. Whe i large, ha approximately the ormal ditributio with mea p ad tadard deviatio p p. The level C cofidece iterval for p i * z where z* i the upper ( C)/ tadard ormal critical value. (The area betwee z* ad z* i C). coditio: Radom data mut come from a SRS of the populatio. Normal both p ad ( p ) mut be 0 or larger for a cofidece iterval Idepedece 0% rule mut be atified.

8] -proportio z-tet for H 0 : p = p 0 Tet of H 0: p = p 0 are baed o the z tatitic z p p 0 p 0 0 with P-value calculated from the tadard ormal ditributio. The coditio are idetical to the coditio eceary to carry out a proportio z-iterval. coditio: Radom data mut come from a SRS of the populatio. Normal both p 0 ad ( p 0 ) mut be 0 or larger for a igificace tet. Idepedece 0% rule mut be atified. The ample ize eeded to obtai a cofidece iterval with approximate margi of error m for a populatio proportio ivolve olvig * * p z p * m for, where p* i a gueed value for the ample proportio ad z* i the tadard ormal critical poit for the level of cofidece you wat. If you ue p* =.5 i thi formula, the margi of error of the iterval will be le tha or equal to m o matter what the value of i. 9] cofidece iterval for p p, the differece betwee two populatio proportio We wat to compare the proportio p ad p of uccee i two populatio. The compario i baed o the differece p betwee the ample proportio of uccee. Whe the ample ize ad are large eough, we ca ue z procedure ˆ becaue the amplig ditributio of i cloe to ormal. A approximate level C cofidece iterval for p p i where the tadard error of i SE = * SE z p ˆ ad z* i a tadard ormal critical value. coditio: Radom each ample i a SRS from it repective populatio Normal -- Each populatio mut be at leat te time a large a the ample take from that populatio ad the ample have to be large eough to atify p 0, ) 0, p 0, ad ) 0 for a cofidece iterval; ˆ ( ˆ Idepedet each ample atifie the 0% rule eparately. ( 0] igificace tet for H 0 : p = p Sigificace tet of H 0: p = p ue the pooled ample proportio ad the z tatitic cout of uccee i both ample combied c cout of obervatio i both ample combied z P-value come from the tadard ormal table. Coditio: Radom each ample i a SRS from it repective populatio c c c c

Normal -- p ˆ 0 c, ˆ ( p c ) 0, p ˆ 0 c, ad ˆ ( p c ) 0, where c i the pooled ample proportio for a igificace tet. (Accordig to our book, you ca alo ue p 0, etc. for a hypothei tet.) Idepedet each ample atifie the 0% rule eparately. ] Tet for Goode of Fit The chi-quare tet for goode of fit tet the ull hypothei that a populatio ditributio i the ame a a referece ditributio (hypotheized ditributio). The expected cout for ay variable category i obtaied by multiplyig the percet of the ditributio for each category time the ample ize. The chi-quare tatitic i X = (0 - E) /E, where the um i over variable categorie. The chi-quare tet compare the value of the tatitic X with critical value from the chi-quare ditributio with degree of freedom. A chi-quare goode of fit tet take the form: H 0: the populatio ditributio (or percet) i the ame a the hypotheized ditributio. H a: the populatio percet are differet from the hypotheized value the P-value i the area uder the chi-quare deity curve to the right of X. Large value of X are evidece agait H 0. coditio: Radom the igle ample i a SRS of the populatio Large ample all of the expected cout are 5 or larger. Idepedet the 0% rule hold FOR EACH SAMPLE SEPARATELY! ] Iferece for Two-Way Table (homogeeity & idepedece/aociatio) The chi-quare tet for homogeeity (TWO or more populatio with or more ample & ONE categorical variable. Thi i alo ued i radomized experimet with or more treatmet where the treatmet form the colum of the two-way table ad the row i the repoe (relape ye or o, heart attack ye or o, etc.) take the followig form: H 0: the ditributio (or percet) of the categorical variable for populatio i the ame a the ditributio of the categorical variable for populatio (or for experimet the ditributio of the repoe variable i the ame for all treatmet group) Ha: the ditributio (or percet) of the categorical variable for populatio i NOT the ame a the ditributio of the categorical variable for populatio (or for experimet the ditributio of the repoe variable i NOT the ame for all treatmet group The chi-quare tet for idepedece/aociatio (ONE populatio with ONE large ample ad TWO categorical variable) take the form: H0: there i o aociatio betwee the row categorical variable ad the colum categorical variable Ha: there i a aociatio betwee the row categorical variable ad the colum categorical variable OR H 0: the row categorical variable i idepedet of the colum categorical variable H a: the row categorical variable i depedet o the colum categorical variable Oe commo ue of the chi-quare tet i to compare everal populatio proportio. The ull hypothei tate that all of the populatio proportio are equal. The alterative hypothei tate that they are ot all equal but allow ay other relatiohip amog the populatio proportio. The expected cout i ay cell of a two-way table whe H 0 i true i expected cout = (row total colum total) / table total The chi-quare tatitic i X oberved cout expected cout expected cout The chi-quare tet compare the value of the tatitic X with critical value from the chi-quare ditributio with

df = (r - )(c - ) degree of freedom. Large value of X are evidece agait H 0 o the P-value i the area uder the chi-quare deity curve to the right of X. coditio: Radom goode of fit or aociatio -> the igle ample i a SRS of the populatio; homogeeity -> the ample are SRS from their repective populatio. Large ample all of the expected cout are 5 or larger. Idepedet the 0% rule hold FOR EACH SAMPLE SEPARATELY! If the chi-quare tet fid a tatitically igificat P-value, do a follow-up aalyi that compare the oberved cout with the expected cout ad that look for the larget compoet of chi-quare. 3] Iferece for the Slope of a Regreio Lie repreet the lope of the TRUE regreio lie that relate two quatitative variable. However, a with ay parameter, i typically ukow, therefore a ample of data poit i take to etimate it. The leat quare regreio lie i computed uig the ample data poit ad the cofidece iterval etimate for i b ± t SE b Where b i the lope of the leat quare regreio lie computed from a ample ad SE b i the tadard error for b. The formula for the tadard error, SE i give below SE t* i computed uig degree of freedom. reidual b ad x x x I tet of igificace, H 0 alway tate that there i o liear relatiohip betwee the two quatitative variable. That i, H 0: = 0. A before, the alterative hypothei H a ca be < 0, > 0, or 0. The t tatitic i computed uig the formula b 0 t = SE Whe computig the probability be ure to ue degree of freedom. coditio: ) for ay fixed x, the y value vary ormally ad are idepedet of each other. ) the mea repoe y, y, ha a liear relatiohip with x: y = + x 3) the tadard deviatio of y (call it ) i the ame for all value of x. The INFERENCE CONDITIONS ARE L.I.N.E.R L (Liear) the catterplot how a liear patter ad the reidual plot how NO curved patter I (idepedet) the obervatio are idepedet of oe aother 0% rule hold N (reidual are ormal) the ormal probability plot for the reidual how a liear patter idicatig that the reidual are approximately ormal R (radom) the data were collected radomly; SRS take b