Grant MacEwan University STAT 252 Dr. Karen Buro Formula Sheet

Similar documents
Describing the Relation between Two Variables

Grant MacEwan University STAT 151 Formula Sheet Final Exam Dr. Karen Buro

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics

Regression, Inference, and Model Building

Simple Linear Regression

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics

1 The Randomized Block Design

Formulas and Tables for Gerstman

Important Formulas. Expectation: E (X) = Σ [X P(X)] = n p q σ = n p q. P(X) = n! X1! X 2! X 3! X k! p X. Chapter 6 The Normal Distribution.

Agenda: Recap. Lecture. Chapter 12. Homework. Chapt 12 #1, 2, 3 SAS Problems 3 & 4 by hand. Marquette University MATH 4740/MSCS 5740

Final Review. Fall 2013 Prof. Yao Xie, H. Milton Stewart School of Industrial Systems & Engineering Georgia Tech

Chapters 5 and 13: REGRESSION AND CORRELATION. Univariate data: x, Bivariate data (x,y).

1 Inferential Methods for Correlation and Regression Analysis

University of California, Los Angeles Department of Statistics. Simple regression analysis

Linear Regression Models

n but for a small sample of the population, the mean is defined as: n 2. For a lognormal distribution, the median equals the mean.

Simple Regression. Acknowledgement. These slides are based on presentations created and copyrighted by Prof. Daniel Menasce (GMU) CS 700

3/3/2014. CDS M Phil Econometrics. Types of Relationships. Types of Relationships. Types of Relationships. Vijayamohanan Pillai N.

(all terms are scalars).the minimization is clearer in sum notation:

SIMPLE LINEAR REGRESSION AND CORRELATION ANALYSIS

Worksheet 23 ( ) Introduction to Simple Linear Regression (continued)

Assessment and Modeling of Forests. FR 4218 Spring Assignment 1 Solutions

UNIVERSITY OF TORONTO Faculty of Arts and Science APRIL/MAY 2009 EXAMINATIONS ECO220Y1Y PART 1 OF 2 SOLUTIONS

Regression. Correlation vs. regression. The parameters of linear regression. Regression assumes... Random sample. Y = α + β X.

Stat 139 Homework 7 Solutions, Fall 2015

ST 305: Exam 3 ( ) = P(A)P(B A) ( ) = P(A) + P(B) ( ) = 1 P( A) ( ) = P(A) P(B) σ X 2 = σ a+bx. σ ˆp. σ X +Y. σ X Y. σ X. σ Y. σ n.

Chapter 13, Part A Analysis of Variance and Experimental Design

Statistics 20: Final Exam Solutions Summer Session 2007

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

S Y Y = ΣY 2 n. Using the above expressions, the correlation coefficient is. r = SXX S Y Y

y ij = µ + α i + ɛ ij,

Open book and notes. 120 minutes. Cover page and six pages of exam. No calculators.

Good luck! School of Business and Economics. Business Statistics E_BK1_BS / E_IBA1_BS. Date: 25 May, Time: 12:00. Calculator allowed:

Correlation Regression

Pearson Edexcel Level 3 Advanced Subsidiary and Advanced GCE in Statistics

Sample Size Determination (Two or More Samples)

Statistics 203 Introduction to Regression and Analysis of Variance Assignment #1 Solutions January 20, 2005

Linear Regression Analysis. Analysis of paired data and using a given value of one variable to predict the value of the other

Statistics Lecture 27. Final review. Administrative Notes. Outline. Experiments. Sampling and Surveys. Administrative Notes

Lecture 8: Non-parametric Comparison of Location. GENOME 560, Spring 2016 Doug Fowler, GS

TAMS24: Notations and Formulas

MA 575, Linear Models : Homework 3

Topic 10: Introduction to Estimation

MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND.

CE3502 Environmental Monitoring, Measurements, and Data Analysis (EMMA) Spring 2008 Final Review

Tables and Formulas for Sullivan, Fundamentals of Statistics, 2e Pearson Education, Inc.

11 Correlation and Regression

Lecture 1, Jan 19. i=1 p i = 1.

Continuous Data that can take on any real number (time/length) based on sample data. Categorical data can only be named or categorised

Chapter 20. Comparing Two Proportions. BPS - 5th Ed. Chapter 20 1

Stat 200 -Testing Summary Page 1

Additional Notes and Computational Formulas CHAPTER 3

REGRESSION MODELS ANOVA

Question 1: Exercise 8.2

Comparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading

Lecture 11 Simple Linear Regression

Definitions of terms and examples. Experimental Design. Sampling versus experiments. For each experimental unit, measures of the variables of

Mathematical Notation Math Introduction to Applied Statistics

Chapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc.

MidtermII Review. Sta Fall Office Hours Wednesday 12:30-2:30pm Watch linear regression videos before lab on Thursday

This chapter focuses on two experimental designs that are crucial to comparative studies: (1) independent samples and (2) matched pair samples.

[ ] ( ) ( ) [ ] ( ) 1 [ ] [ ] Sums of Random Variables Y = a 1 X 1 + a 2 X 2 + +a n X n The expected value of Y is:

Correlation. Two variables: Which test? Relationship Between Two Numerical Variables. Two variables: Which test? Contingency table Grouped bar graph

STA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to:

Lecture 7: Non-parametric Comparison of Location. GENOME 560 Doug Fowler, GS

Dr. Maddah ENMG 617 EM Statistics 11/26/12. Multiple Regression (2) (Chapter 15, Hines)

Final Examination Solutions 17/6/2010

Common Large/Small Sample Tests 1/55

Introduction to Econometrics (3 rd Updated Edition) Solutions to Odd- Numbered End- of- Chapter Exercises: Chapter 3

Lesson 2. Projects and Hand-ins. Hypothesis testing Chaptre 3. { } x=172.0 = 3.67

A Relationship Between the One-Way MANOVA Test Statistic and the Hotelling Lawley Trace Test Statistic

TMA4245 Statistics. Corrected 30 May and 4 June Norwegian University of Science and Technology Department of Mathematical Sciences.

Chapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.

Topic 9: Sampling Distributions of Estimators

Comparing your lab results with the others by one-way ANOVA

INSTRUCTIONS (A) 1.22 (B) 0.74 (C) 4.93 (D) 1.18 (E) 2.43

2 1. The r.s., of size n2, from population 2 will be. 2 and 2. 2) The two populations are independent. This implies that all of the n1 n2

ECON 3150/4150, Spring term Lecture 3

Correlation and Regression

There is no straightforward approach for choosing the warmup period l.

STP 226 ELEMENTARY STATISTICS

STP 226 EXAMPLE EXAM #1

Interval Estimation (Confidence Interval = C.I.): An interval estimate of some population parameter is an interval of the form (, ),

Circle the single best answer for each multiple choice question. Your choice should be made clearly.

Two-Way Analysis of Variance - no interaction

Matrix Representation of Data in Experiment

Properties and Hypothesis Testing

University of California, Los Angeles Department of Statistics. Practice problems - simple regression 2 - solutions

Design of Engineering Experiments Chapter 2 Basic Statistical Concepts

Because it tests for differences between multiple pairs of means in one test, it is called an omnibus test.

bwght = cigs

CS 147: Computer Systems Performance Analysis

Rule of probability. Let A and B be two events (sets of elementary events). 11. If P (AB) = P (A)P (B), then A and B are independent.

Statistics. Chapter 10 Two-Sample Tests. Copyright 2013 Pearson Education, Inc. publishing as Prentice Hall. Chap 10-1

Simple Linear Regression

Lecture 10: Performance Evaluation of ML Methods

Random Variables, Sampling and Estimation

Lecture 7: Non-parametric Comparison of Location. GENOME 560, Spring 2016 Doug Fowler, GS

BIOSTATISTICS. Lecture 5 Interval Estimations for Mean and Proportion. dr. Petr Nazarov

REVIEW OF SIMPLE LINEAR REGRESSION SIMPLE LINEAR REGRESSION

Transcription:

Grat MacEwa Uiversity STAT 5 Dr. Kare Buro Formula Sheet Descriptive Statistics Sample Mea: x = x i i= Sample Variace: s = i= (x i x) = Σ i=x i (Σ i= x i) Sample Stadard Deviatio: s = Sample Variace = s Estimatio Parameter Estimator SE(Estimator) Approximate Cofidece Iterval µ x σ s x ± t α/ σ µ µ x x + σ s ( x x ) ± t α/ + s µ µ x d σ d x d ± t α/ s d t-test Statistics Test Statistic for -sample t-test cocerig µ if σ is ukow t 0 = x µ 0 s df = Test Statistic for two sample t-test for Comparig Two Populatio Meas: t 0 = ( x x ) d 0, df = mi(, ) s + s Test Statistic for paired t-test for Comparig Two Populatio Meas: t 0 = x d d 0 ( sd ) df =

Oe-way Aalysis of Variace (-way ANOVA) Total Sum of Squares: T otalss = (x ij x) = x ij ( x ij ), df T otal = where x is the overall mea, from all k samples ad = + +... + k. Sum of Squares for treatmets: SST = i ( x i x), df T = k with x i = sample mea of observatios i sample i. Sum of Squares for Error: SSE = ( )s + ( )s +... + ( k )s k, with s i is the stadard deviatio of observatio from sample i. df E = k ANOVA Table Treatmets/Groups k SST MST = SST/(k ) MST/MSE Error k SSE MSE = SSE/( k) Total T otalss Aalysis of cotrasts i the ANOVA model: Correspodig estimator γ = C µ + C µ... + C k µ k, with C + C +... C k = 0 c = C x + C x... + C k x k, SE(c) = s p C + C +... + C k k with the pooled estimator for σ: s p = MSE. ( α)00% cofidece iterval for γ: c ± t SE(c), df = k Test statistic for a t-test cocerig γ: t 0 = c SE(c), df = k Multiple Comparisos (Boferroi Procedure) Experimet wise error rate =α, m = umber of comparisos, compariso wise error rate α = α/m, The the margi of error (ME) for comparig treatmet i with treatmet j is: ME ij = t df E α / MSE + i j

The Radomized Block Desig k treatmets, b blocks Total Sum of Squares: T otalss = (x ij x) = x ij ( x ij ), df T otal = where x is the overall sample mea. Sum of Squares for treatmets: SST = b( x T i x), df T = k with x T i = sample mea of observatios for treatmet i. Sum of Squares for blocks: SSB = k( x Bj x), df B = b with x Bj = sample mea of observatios for block j. Sum of Squares for Error: SSE = T otalss SST SSB, df E = k b + ANOVA Table for RBD Treatmets k SST MST = SST/(k ) MST/MSE Blocks b SSB MSB = SSB/(b ) MSB/MSE Error k b + SSE MSE = SSE/( k b + ) Total T otalss Factorial Desig (here: -way ANOVA) Factor A at a levels, factor B at b levels ANOVA Table for factorial desig Factor A a SS(A) MS(A) = SS(A)/(a ) MS(A)/MSE Factor B b SS(B) MS(B) = SS(B)/(b ) MS(B)/MSE Iter A B (a )(b ) SS(A B) MS(A B) = SS(A B)/(a )(b ) MS(A B)/MSE Error df E SSE MSE = SSE/df E Total T otalss df E = a b (a )(b ) + Test statistic for Model Utility Test: F = (SS(A) + SS(B) + SS(A B)) (a +b +(a )(b )) MSE, df = a + b + (a )(b ), df d = df E 3

No Parametric Methods Sig Test: Test Statistic S 0 = umber of measuremets supportig H a (for -tailed, the larger of the two). Wilcoxo Rak Sum Test Test statistic: Both sample sizes are at least 0, T =sum of raks for sample. Wilcoxo Siged Rak Test z 0 = T ( + +), ( + +) Test statistic: Sample size of pairs is at least 0, T + =sum of raks for positive differeces. Kruskal-Wallis H Test Test statistic: H 0 = R i = mea rak for sample i. ( + ) z 0 = T + (+) 4 (+)(+) 4 i ( R i + ), df = k, Multiple Compariso for Kruskal Wallis Test (Boferroi procedure) Choose experimet wise error rate α.. Whe comparig k distributios let m = k(k )/ ad the compariso wise error rate α = α/m.. Use the Wilcoxo Rak Sum test to compare every pair of treatmets at sigificace level α. The all sigificat results hold simultaeously at experimet wise error rate of α. Partial Correlatio: r yx x = r yx r x x r yx r x x r yx 4

Simple Liear Regressio SS xy = x i y i ( x i )( y i ), = x i ( x i ) ad SS yy = yi ( y i ) ( ) SSxy Equatio of least squares lie (regressio lie): slope: ˆβ =, itercept: ˆβ0 = ȳ ˆβ x ŷ = ˆβ 0 + ˆβ x Estimate for σ: SSE = SS yy ˆβ SS xy, df E =, s = Pearso s Correlatio Coefficiet: r = SS xy SSxx SS yy SSE Coefficiet of determiatio: R = SS yy SSE SS yy Test statistic for model utility test: t 0 = ˆβ s/, df = ( α) 00% CI for β : ˆβ ± t s, df E = SSxx A ( α)00% CI for E(y) at x = x p : ŷ ± t s A ( α)00% Predictio Iterval for y at x = x p : ŷ ± t s + (x p x), df E = + + (x p x), df E = Multiple Liear Regressio SSE = (y i ŷ i ), df E = k ˆσ = s = MSE = SSE k Adjusted coefficiet of determiatio: R a = k ( ) SSE SS yy Test statistic for model utility test: F 0 = (SS yy SSE)/k MSE df d = df E = k (from ANOVA table), df = k, ( α) 00% Cofidece Iterval for a slope parameter β i : ˆβi ± t s ˆβi, df = df E = k Test statistic for test about slope parameter β i : t 0 = ˆβ i s ˆβi, df = k 5