MidtermII Review. Sta Fall Office Hours Wednesday 12:30-2:30pm Watch linear regression videos before lab on Thursday

Similar documents
MidtermII Review. Sta Spring Tables will be provided. PS 5 due March 31 at 11:55 PM. PA 5 is due April 2 at 11:55 PM.

Announcements. Unit 5: Inference for Categorical Data Lecture 1: Inference for a single proportion

Agreement of CI and HT. Lecture 13 - Tests of Proportions. Example - Waiting Times

1 Inferential Methods for Correlation and Regression Analysis

Chapter 1 (Definitions)

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics

A quick activity - Central Limit Theorem and Proportions. Lecture 21: Testing Proportions. Results from the GSS. Statistics and the General Population

Sample Size Determination (Two or More Samples)

Section 9.2. Tests About a Population Proportion 12/17/2014. Carrying Out a Significance Test H A N T. Parameters & Hypothesis

Statistics 20: Final Exam Solutions Summer Session 2007

STA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to:

ST 305: Exam 3 ( ) = P(A)P(B A) ( ) = P(A) + P(B) ( ) = 1 P( A) ( ) = P(A) P(B) σ X 2 = σ a+bx. σ ˆp. σ X +Y. σ X Y. σ X. σ Y. σ n.

Common Large/Small Sample Tests 1/55

Goodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen)

Class 23. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics

Inferential Statistics. Inference Process. Inferential Statistics and Probability a Holistic Approach. Inference Process.

Chapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc.

Class 27. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Overview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions

Chapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.

MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND.

Chapter 13, Part A Analysis of Variance and Experimental Design

Continuous Data that can take on any real number (time/length) based on sample data. Categorical data can only be named or categorised

Comparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading

Hypothesis Testing. Evaluation of Performance of Learned h. Issues. Trade-off Between Bias and Variance

Stat 200 -Testing Summary Page 1

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

2 1. The r.s., of size n2, from population 2 will be. 2 and 2. 2) The two populations are independent. This implies that all of the n1 n2

- E < p. ˆ p q ˆ E = q ˆ = 1 - p ˆ = sample proportion of x failures in a sample size of n. where. x n sample proportion. population proportion

Agenda: Recap. Lecture. Chapter 12. Homework. Chapt 12 #1, 2, 3 SAS Problems 3 & 4 by hand. Marquette University MATH 4740/MSCS 5740

Important Formulas. Expectation: E (X) = Σ [X P(X)] = n p q σ = n p q. P(X) = n! X1! X 2! X 3! X k! p X. Chapter 6 The Normal Distribution.

DS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10

SOLUTIONS y n. n 1 = 605, y 1 = 351. y1. p y n. n 2 = 195, y 2 = 41. y p H 0 : p 1 = p 2 vs. H 1 : p 1 p 2.

Read through these prior to coming to the test and follow them when you take your test.

Final Review. Fall 2013 Prof. Yao Xie, H. Milton Stewart School of Industrial Systems & Engineering Georgia Tech

Lecture 8: Non-parametric Comparison of Location. GENOME 560, Spring 2016 Doug Fowler, GS

This chapter focuses on two experimental designs that are crucial to comparative studies: (1) independent samples and (2) matched pair samples.

Recall the study where we estimated the difference between mean systolic blood pressure levels of users of oral contraceptives and non-users, x - y.

STAT431 Review. X = n. n )

Rule of probability. Let A and B be two events (sets of elementary events). 11. If P (AB) = P (A)P (B), then A and B are independent.

Stat 319 Theory of Statistics (2) Exercises

Introduction to Econometrics (3 rd Updated Edition) Solutions to Odd- Numbered End- of- Chapter Exercises: Chapter 3

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators

University of California, Los Angeles Department of Statistics. Hypothesis testing

UNIVERSITY OF TORONTO Faculty of Arts and Science APRIL/MAY 2009 EXAMINATIONS ECO220Y1Y PART 1 OF 2 SOLUTIONS

Chapters 5 and 13: REGRESSION AND CORRELATION. Univariate data: x, Bivariate data (x,y).

Statistics Lecture 27. Final review. Administrative Notes. Outline. Experiments. Sampling and Surveys. Administrative Notes

Because it tests for differences between multiple pairs of means in one test, it is called an omnibus test.

Linear Regression Models

Unit 5: Assumptions and Robustness of t-based Inference. Chapter 3 in the Text

Biostatistics for Med Students. Lecture 2

Lecture 5: Parametric Hypothesis Testing: Comparing Means. GENOME 560, Spring 2016 Doug Fowler, GS

Data Analysis and Statistical Methods Statistics 651

Interval Estimation (Confidence Interval = C.I.): An interval estimate of some population parameter is an interval of the form (, ),

Good luck! School of Business and Economics. Business Statistics E_BK1_BS / E_IBA1_BS. Date: 25 May, Time: 12:00. Calculator allowed:

April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE

ENGI 4421 Confidence Intervals (Two Samples) Page 12-01

1 Constructing and Interpreting a Confidence Interval

Chapter 23: Inferences About Means

Mathematical Notation Math Introduction to Applied Statistics

Describing the Relation between Two Variables

Chapter 20. Comparing Two Proportions. BPS - 5th Ed. Chapter 20 1

This is an introductory course in Analysis of Variance and Design of Experiments.

Topic 9: Sampling Distributions of Estimators

Properties and Hypothesis Testing

Chapter 8: Estimating with Confidence

Comparing your lab results with the others by one-way ANOVA

Tests of Hypotheses Based on a Single Sample (Devore Chapter Eight)

Instructor: Judith Canner Spring 2010 CONFIDENCE INTERVALS How do we make inferences about the population parameters?

Module 1 Fundamentals in statistics

HYPOTHESIS TESTS FOR ONE POPULATION MEAN WORKSHEET MTH 1210, FALL 2018

Chapter 11: Asking and Answering Questions About the Difference of Two Proportions

Statistics. Chapter 10 Two-Sample Tests. Copyright 2013 Pearson Education, Inc. publishing as Prentice Hall. Chap 10-1

MA238 Assignment 4 Solutions (part a)

GG313 GEOLOGICAL DATA ANALYSIS

(7 One- and Two-Sample Estimation Problem )

BIOS 4110: Introduction to Biostatistics. Breheny. Lab #9

Statistics 511 Additional Materials

October 25, 2018 BIM 105 Probability and Statistics for Biomedical Engineers 1

STAC51: Categorical data Analysis

TMA4245 Statistics. Corrected 30 May and 4 June Norwegian University of Science and Technology Department of Mathematical Sciences.

Frequentist Inference

Expectation and Variance of a random variable

Median and IQR The median is the value which divides the ordered data values in half.

Simulation. Two Rule For Inverting A Distribution Function

There is no straightforward approach for choosing the warmup period l.

Understanding Samples

Circle the single best answer for each multiple choice question. Your choice should be made clearly.

Final Examination Solutions 17/6/2010

7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals

Direction: This test is worth 150 points. You are required to complete this test within 55 minutes.

Regression, Inference, and Model Building

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures

Lesson 2. Projects and Hand-ins. Hypothesis testing Chaptre 3. { } x=172.0 = 3.67

1 Models for Matched Pairs

Lecture 7: Non-parametric Comparison of Location. GENOME 560, Spring 2016 Doug Fowler, GS

Topic 10: Introduction to Estimation

Transcription:

Aoucemets MidtermII Review Sta 101 - Fall 2016 Duke Uiversity, Departmet of Statistical Sciece Office Hours Wedesday 12:30-2:30pm Watch liear regressio videos before lab o Thursday Dr. Abrahamse Slides posted at https://stat.duke.edu/courses/fall16/sta101.002 1 Midterm 2 Exam Format Whe: Wedesday, November 9 - I class What to brig: Scietific calculator (graphig calculator ok, No Phoes!) Cheat sheet (ca be typed) Provided: Z, t ad χ 2 tables Covers HT from Uit 3, Uits 4 ad Uit 5 3 writte questios 70 pts 10 multiple choice questios - 3pts each 2 3

Uit 4.1 - Iferece for Numerical Variables What should you kow? Two mea testig problems Idepedet meas Paired (depedet) meas Coditios Idepedece Approximate Normality 4 5 All other details of the iferetial framework is the same... Oe mea: df = 1 HT: H 0 : µ = µ 0 T df = x µ s CI: x ± t df s HT : test statistic = poit estimate ull SE CI : poit estimate ± critical value SE Paired meas: df = diff 1 HT: H 0 : µ diff = 0 T df = x diff 0 s diff CI: x diff ± t df diff s Idepedet meas: df = mi( 1 1, 2 1) HT: H 0 : µ 1 µ 2 = 0 T df = x 1 x 2 s 2 1 + s2 2 1 2 CI: x 1 x 2 ± t df s 2 1 1 + s2 2 2 Clicker questio A study examiig the relatioship betwee weights of school childre ad abseces foud a 95% cofidece iterval for the differece betwee the average umber of days missed by overweight ad ormal weight childre (µ overweight µ ormal ) to be 1.3 days to 2.8 days. Accordig to this iterval, we are 95% cofidet that overweight childre o average miss 1. 1.3 days fewer to 2.8 days more 2. 1.3 to 2.8 days more 3. 1.3 to 2.8 days fewer 4. 1.3 days more to 2.8 days fewer tha childre with ormal weight. 6 7

Uit 4.2 - Bootstrappig Bootstrap iterval, stadard error Bootstrappig works as follows: (1) take a bootstrap sample - a radom sample take with replacemet from the origial sample, of the same size as the origial sample (2) calculate the bootstrap statistic - a statistic such as mea, media, proportio, etc. computed o the bootstrap samples (3) repeat steps (1) ad (2) may times to create a bootstrap distributio - a distributio of bootstrap statistics The XX% bootstrap cofidece iterval ca be estimated by the cutoff values for the middle XX% of the bootstrap distributio, OR poit estimate ± t SE boot For a radom sample of 20 Horror movies, the dot plot below shows the distributio of 100 bootstrap medias of the Rotte Tomatoes audiece scores. The media of the origial sample is 43.5 ad the bootstrap stadard error is 4.88. Estimate the 90% bootstrap cofidece iterval for the media RT score of horror movies usig the stadard error method. 30 35 40 45 50 55 bootstrap medias 8 9 Uit 4.3: Power Uit 4.4: Aalysis of VAriace (ANOVA) Decisio fail to reject H 0 reject H 0 H 0 true 1 α Type 1 Error, α Truth HA true Type 2 Error, β Power, 1 β Type 1 error is rejectig H 0 whe you should t have, ad the probability of doig so is α (sigificace level) Type 2 error is failig to reject H 0 whe you should have, ad the probability of doig so is β (a little more complicated to calculate) Power of a test is the probability of correctly rejectig H 0, ad the probability of doig so is 1 β I hypothesis testig, we wat to keep α ad β low, but there are iheret trade-offs. ANOVA tests for some differece i meas of may differet groups Coditios 1. Idepedece: (a) withi group: sampled observatios must be idepedet (b) betwee group: groups must be idepedet of each other 2. Approximate ormality: distributio should be early ormal withi each group 3. Equal variace: groups should have roughly equal variability (Youwill otbeaskedtocheckthisfromthedata) 10 11

ANOVA tests for some differece i meas of may differet groups Null hypothesis: H 0 : µ placebo = µ purple = µ brow =... = µ peach = µ orage. Clicker questio Which of the followig is a correct statemet of the alterative hypothesis? (a) For ay two groups, icludig the placebo group, o two group meas are the same. (b) For ay two groups, ot icludig the placebo group, o two group meas are the same. (c) Amogst the jelly bea groups, there are at least two groups that have differet group meas from each other. (d) Amogst all groups, there are at least two groups that have differet group meas from each other. F-statistic: F = k: # of groups; : # of obs. SSG / (k 1) SSE / ( k) = MSG MSE Df Sum Sq Mea Sq F value Pr(>F) Betwee groups k 1 SSG MSG F obs p obs Withi groups k SSE MSE Total 1 SSG+SSE Note: F distributio is defied by two dfs: df G = k 1 ad df E = k Thep-valuewillbegiveoexam, comparewiththestadard α level. 12 13 To idetify which meas are differet, use t-tests ad the Boferroi correctio To idetify which meas are differet, use t-tests ad the Boferroi correctio If the ANOVA yields a sigificat results, ext atural questio is: Which meas are differet? Use t-tests comparig each pair of meas to each other, with a commo variace (MSE from the ANOVA table) istead of each group s variaces i the calculatio of the stadard error, ad with a commo degrees of freedom (df E from the ANOVA table) Compare resultig p-values to a modified sigificace level Youwillotbeaskedtoperformtheactualtests, butyou shouldkow: How to compute the adjusted Boferoi sigificace level α. How to compute the stadard error for this test. The associated degrees of freedom for the test statistic. α = α K where K = k(k 1) 2 is the total umber of pairwise tests 14 15

Uit 4.4: ANOVA Uit 4.4: ANOVA ApplicatioExercise4.4 Df Sum Sq Mea Sq F p- value Rak 2 1.59 0.795 2.74 0.066 Residuals 460 135.07 0.29 Total 462 136.66 Whatpercetofthetotalvariabilityievaluatioscoresis explaiedbyistructorrak? ApplicatioExercise4.4 Df Sum Sq Mea Sq F p- value Rak 2 1.59 0.795 2.74 0.066 Residuals 460 135.07 0.29 Total 462 136.66 Whatsigificacelevelshouldbeusedforapair-wisepost hoctestcomparigtheevaluatioscoresofteachig professorsadteuredprofessors? 16 17 Uit 5.1: Iferece for a Sigle Proportio Uit 5.1: Iferece for a Sigle Proportio Distributioof ˆp Cetral limit theorem for proportios: Sample proportios will be early ormally distributed with mea equal to the populatio mea, p (1 p) p, ad stadard error equal to. ( ) p (1 p) ˆp N mea = p, SE = Coditios: Idepedece: Radom sample/assigmet + 10% rule At least 10 successes ad failures HT vs. CI foraproportio Success-failure coditio: CI: At least 10 observed successes ad failures HT: At least 10 expected successes ad failures, calculated usig the ull value Stadard error: CI: calculate usig observed sample proportio: SE = HT: calculate usig the ull value: SE = p 0 (1 p 0 ) ˆp(1 ˆp) 18 19

Recap o simulatio methods Radomizatio Test If the S-F coditio is ot met HT: Radomizatio test simulate uder the assumptio that H 0 is true, the fid the p-value as proportio of simulatios where the simulated ˆp is at least as extreme as the oe observed. CI: Bootstrap iterval resample with replacemet from the origial sample, ad costruct iterval usig percetile or stadard error method. Clicker questio A report o your local TV statio says that 60% of the city s residets support usig limited city fuds to hire ad trai more police officers. A secod local ews statio has picked up this story, ad they claim that certaily less tha 60% of residets support the additioal hirig ad traiig. I order to test this claim the secod ews statio takes a radom sample of 100 residets ad fids that 57 of them (57%) support the use of limited fuds to hire additioal police officers. 20 21 Uit 5.2: Iferece for Two Proportios Clicker questio Which of the followig is the correct set-up for calculatig the p-value for this test? (a) Roll a 10-sided die (outcomes 1-10) 100 times ad record the proportio of times you get a 6 or lower. Repeat this may times, ad calculate the proportio of simulatios where the sample proportio is 57% or less. (b) Roll a 10-sided die (outcomes 1-10) 100 times ad record the proportio of times you get a 6 or lower. Repeat this may times, ad calculate the proportio of simulatios where the sample proportio is 60% or less. (c) I a bag place 100 chips, 57 red ad 43 blue. Radomly sample 100 chips, with replacemet, ad record the proportio of red chips i the sample. Repeat this may times, ad calculate the proportio of samples where 57% or more of the chips are red. (d) Radomly sample 100 residets of a earby city, record how may of the them who support the hirig ad traiig of additioal police officers. Repeat this may times ad calculate the proportio of samples where at least 57% of the residets support additioal hirig ad traiig. CLT alsodescribesthedistributioof ˆp 1 ˆp 2 (ˆp 1 ˆp 2 ) N mea = (p 1 p 2 ), SE = Coditios: + p 2(1 p 2 ) 1 p 1 (1 p 1 ) Idepedece: Radom sample/assigmet + 10% rule Sample size / skew: At least 10 successes ad failures 2 22 23

Uit 5.2: Iferece for Two Proportios Summary ForHT where H 0 : p 1 = p 2, pool! As with workig with a sigle proportio, Whe doig a HT where H 0 : p 1 = p 2 (almost always for HT), use expected couts / proportios for S-F coditio ad calculatio of the stadard error. Otherwise use observed couts / proportios for S-F coditio ad calculatio of the stadard error. Expected proportio of success for both groups whe H 0 : p 1 = p 2 is defied as the pooled proportio: ˆp pool = total successes total sample size = suc 1 + suc 2 1 + 2 Type Parameter Estimator SE SampligDist. Oe mea µ x s/ t 1 Two meas µ diff x diff s d / t 1 Paired data Two meas t df s µ 1 µ 2 x 1 x 2 1 2 + s2 2 1 2 for df use Idepedet mi{ 1 1, 2 1} ˆp(1 ˆp) C.I. Oe prop p ˆp Z p0 (1 p H.T. 0 ) ˆp1 (1 ˆp C.I. 1 ) + ˆp 2 (1 ˆp 2 ) 1 2 Two prop p 1 p 2 ˆp 1 ˆp 2 Z ˆppool (1 ˆp H.T. pool ) + ˆp pool (1 ˆp pool ) 1 2 24 25 Uit 5.3: χ 2 Tests The χ 2 statistic Categoricaldatawithmoretha2levels χ 2 oe variable: χ 2 test of goodess of fit, o CI two variables: χ 2 test of idepedece, o CI Coditiosfor χ 2 testig 1. Idepedece: I additio to what we previously discussed for idepedece, each case that cotributes a cout to the table must be idepedet of all the other cases i the table. 2. Sample size / distributio: Each cell must have at least 5 expected cases. χ 2 statistic: Whe dealig with couts ad ivestigatig how far the observed couts are from the expected couts, we use a ew test statistic called the chi-square (χ 2 ) statistic: χ 2 = k (O E) 2 i=1 Importatpoits: E where k = total umber of cells Use couts (ot proportios) i the calculatio of the text statistic, eve though we re truly iterested i the proportios for iferece Expected couts are calculated assumig the ull hypothesis is true 26 27

The χ 2 distributio The χ 2 distributio has just oe parameter, degrees of freedom (df), which iflueces the shape, ceter, ad spread of the distributio. For χ 2 GOF test: df = k 1 For χ 2 idepedece test: df = (R 1) (C 1) Degrees of Freedom 2 4 9 0 5 10 15 20 25 28