Ch. 1: Data and Distributions

Size: px
Start display at page:

Download "Ch. 1: Data and Distributions"

Transcription

1 Ch. 1: Data and Distributions Populations vs. Samples How to graphically display data Histograms, dot plots, stem plots, etc Helps to show how samples are distributed Distributions of both continuous and discrete variables Density functions and Mass functions Three basic properties Shows the distribution of the entire population or process Some important distributions and associated Probability Continuous: Exponential, Normal, Uniform Discrete: Binomial, Poisson 4/4/1 H.X. Lecture 30: Final Summary 1

2 Ch. : Numerical Summary Measures Measure of center of Data (Sample) Sample mean Sample median, midpoint Trimmed means x x1 + x xn 1 = = xi n n Measure of variability for Data (Sample) ( x1 x) + ( x x) ( xn Sample variance s = n 1 Sample Standard deviation 1 = n 1 ( x x) Quartiles; Five-number-Summary; IQR and Outliers Graphical Display: Boxplots; Modified Version; Side- By-Side Boxplots s = s x) i 4/4/1 H.X. Lecture 30: Final Summary

3 Ch. (Cont.): Numerical Summary Measures Measures of Center (Distributions) Continuous: µ X = x f ( x) dx Discrete: = x p(x) µ X Measure of variability (Distributions) Continuous: Discrete: ( ) X x µ X f ( x) = dx Normal Quantile (QQ) plot σ ( x X ) σ X = µ p( x) 4/4/1 H.X. Lecture 30: Final Summary 3

4 Ch.3: Bivariate Data Scatterplots: Visually Display Bivariate data, y vs. x Pearson s Correlation Coefficient (between X and Y, both quantitative), r : r measures the strength and direction of the linear relationship, other convenient formulas for Sxy, Sxx and Syy Takes values between -1 and 1, inclusive Sign indicates type/direction of relationship (positive, negative) Value indicates strength: farther from 0 is stronger If switch roles of X and Y à r doesn t change Unit free unaffected by linear transformations Affected by Outliers, Not a resistant measure Correlation Causaiton 4/4/1 H.X. Lecture 30: Final Summary 4

5 Ch. 3: LS (Least Square) Regression Line Estimated straight line Equation: y = a + b x a is the intercept (where it crosses the y-axis) b is the slope (rate) b = r s s y x Predicted value of y Residual from the fit (or regression line) Breaking up Sum of Squares: SSR, SSE, SST Coefficient of Determination: SSR = = 1 SST Percent of variation explained by the linear regression between Y and X r SSE SST 4/4/1 H.X. Lecture 30: Final Summary 5

6 Ch. 3 (Cont.): MSE and Residual Plot Mean Squared Error about the LS line: Standard Deviation about the LS line: Also called root MSE in SAS output. Residual: e ˆ i = yi yi A residual plot, plotting the residuals against x. The residual plot should not have any pattern but a random scattering of points If a pattern is observed, the linear regression model is probably not appropriate. 4/4/1 H.X. Lecture 30: Final Summary 6

7 Ch. 5: Probability and Sampling Distributions Chance Experiments: Simple Events: individual outcomes Events: collections of simple events Sample Space: Venn Diagrams Tree Diagrams Complex Events: Event A or B, Event A and B, Event A (Complement of A) Disjoint Events (Mutually Exclusive) Independent Events 4/4/1 H.X. Lecture 30: Final Summary 7

8 Probability Basic Rules Probability Axioms: 0 P(A) 1 for any event A P(S) = 1, where S is the sample space Addition Rule - For any disjoint events A and B, P(A or B) = P(A)+P(B) Complementary Events: P(A ) = 1 - P(A) General Addition Rule: (for any events A and B) P(A or B) = P(A)+P(B)-P(A and B) Independence Rule: P (A and B) = P(A) P(B) Conditional Probability: P(A B) = P (A and B) / P(B) Bayes Rule for Calculation of Conditional Probability, Tree Diagrams 4/4/1 H.X. Lecture 30: Final Summary 8

9 Random Variables and Sampling Distribution Random Variables Discrete Distribution Table, Prob. Histogram Continuous Distribution Curve, density function Independent R.V.s Sampling Distribution of a Sample Mean Sampling Distribution of a Sample Proportion (rule of thumb for Normal Appox.) Central Limit Theorem Continuity Correction (from Binomial to Normal Appox.) 4/4/1 H.X. Lecture 30: Final Summary 9

10 Ch 7: Estimation and Statistical Inference by C.I. s (Unbiased, Consistent) Point Estimation Large-Sample C.I.s for a Population Mean (Normality Assumption) s X ± (z critical value) one-sided C.I.s: Upper or Lower bound C.I. Interpretation of Confidence Level. Necessary sample size for a desired Bound (round up): ZCrits n = B Small-Sample C.I. s X ± (t critical value) n t-crit is associated with d.f. = n -1 Normailty Assumption still holds. 4/4/1 H.X. Lecture 30: Final Summary 10 n

11 C.I. for a Population Proportion Point Estimation for a Population Proportion Large-Sample C.I.s for a Population Proportion pˆ (1 pˆ ) pˆ ± Zcrit n Necessary sample size for a desired Bound (round up for not-an-integer): z _ critical n= p*(1 p*) B p* = pˆ, or 0.5 if p-hat is unavailable. Small-Sample C.I. replaces z-crit by t-crit 4/4/1 H.X. Lecture 30: Final Summary 11

12 C.I. for two Population Means Difference Large-Sample C.I.s for Difference between two Population Means (Normality Assumption) 1 X 1 X ± Zcrit + n1 Small-Sample C.I., Zcrit replaced by t-crit, with (round down for non-integer) ( ) s1 n1 + s n df = ( ) ( ) s1 n1 s n + n n /4/1 H.X. Lecture 30: Final Summary 1 s s n

13 t C.I. for Paired Data 4/4/1 H.X. Lecture 30: Final Summary 13

14 Ch. 8: Hypotheses Testing State Hypotheses Both Null and Alternative (one or two-sided) Determine an appropriate α level. If not specified, use 5% Type I error; Significance Level. Understand it. Calculate the appropriate test statistic Find the P-value, the probability of the as extreme or more extreme than the test statistic Reject H 0, when the P-value is smaller than the significance level α. Otherwise: Fail to reject H 0 State a conclusion in layman s terms 4/4/1 H.X. Lecture 30: Final Summary 14

15 One-sample t Test for a Population Mean: The null hypothesis is H 0 : µ = µ 0 The alternative hypothesis could be: H a : µ µ 0 (two-sided) H a : µ > µ 0 (one-sided) H a : µ < µ 0 (one-sided) Test Statistic X t ~ Student s t-distribution df = n 1 s µ 0 t = n If n is large ( 30), CLT guarantees an approximate normal distribution and the t can be replaced with z, where z follows a standard normal distribution. 4/4/1 H.X. Lecture 30: Final Summary 15

16 P-value tied to H a Two-sided (both tails) H a : µ µ 0 One-sided (right tail) H a : µ > µ 0 One-sided (left tail) H a : µ < µ 0 4/4/1 H.X. Lecture 30: Final Summary 16

17 Other Tests or Remarks Two-Sample z (or t, depending on sample sizes) test for Two Population Means When using t, the d.f. calculation One-Sample t Test with (Matched) Paired Data Focus on two population means difference A two-sided significance test <-> A two-sided C.I. for the same parameter If the claimed value is in the CI à fail to reject H 0 If the claimed is not in the CI à reject H 0 NOTE: must have in H a! Statistical Significance Practical Sig. 4/4/1 H.X. Lecture 30: Final Summary 17

18 Cautions (for both C.I. and tests of significance): Data: assume SRS (random sampling) Population need to be If n < 30, have to check normality (by Normal QQ-plot) With n 30, CLT can give us approximate normality in most situations. 4/4/1 H.X. Lecture 30: Final Summary 18

19 Ch. 9: One Way ANOVA Hypotheses: H 0 : µ 1 = µ = = µ k vs. H a : At least one µ i is different F test statistic ANOVA table test statistic = between - samples variation within - samplesvariation Source DF SS MS Model (Between) Error (Within) k 1 n k SSM (formula) SSE (formula) Total n 1 SST = SSM + SSE SSM/k 1 SSE/n k P-value is always the upper tail of the F distribution with (k 1, n k) degrees of freedom. Tables of critical values for F distribution: (Table VIII) F statistic > F critical value <=> P-value < α => Reject H 0 4/4/1 H.X. Lecture 30: Final Summary 19

20 Assumptions (prior to Running one-way ANOVA) 1. Constant variance: The variances of the k populations are the same. Check this with the ratio of the largest and smallest standard deviations, the ratio must be <. Each of the k populations follows a normal distribution. Check this by looking at QQplots for each group Remark: statistical significance practical significance 4/4/1 H.X. Lecture 30: Final Summary 0

21 Ch. 9: Multiple Comparison If insignificant in one-way ANOVA, we don t have to try further steps Otherwise, run Multiple Comparison to see which explicitly means are different. Tukey s Mehtod ( cldiff or lines format) Dunnett s Method (only if there s a control group) 4/4/1 H.X. Lecture 30: Final Summary 1

22 9.4: Randomized Complete Block Design RCBD (both treatment and block factor must be categorical) Source DF SS MS Factor A (treatment) Factor B (block) a 1 SSA MSA b 1 SSB MSB Error (a 1)(b 1) SSE MSE Total ab 1 SST In RCBD, we are only interested in the treatment factor The block factor might affect response but that s not of interest. Two F tests Blocking Effect? Use test statistic and P-value to conclude Treatment Effect? Use test statistic and P-value to conclude 4/4/1 H.X. Lecture 30: Final Summary

23 Necessary Assumptions for RCBD Similar to one-way ANOVA 1. Constant variance. Each of the k populations follows a normal distribution One additional assumption 3. There is no interaction between the treatment and blocking variables Can assess just using common sense (Just ask: Do/ should they interact?) OR check by a Two-way ANOVA model Interaction Plot 4/4/1 H.X. Lecture 30: Final Summary 3

24 Ch. 10: Two-Way ANOVA Testing Two factors and their interaction s effect to the response variable Source DF SS MS Factor A a 1 SSA MSA Factor B b 1 SSB MSB AB interaction (a 1)(b 1) SSAB MSAB Error ab(r 1) SSE MSE Total abr 1 SST Test First, Interaction (of the most interest). Then Factor A and B, respectively. If Interaction significant, still run slicing for Factor A and B. If Interaction insignificant while a single Factor significant, run one-way ANOVA and multiple comparison. 4/4/1 H.X. Lecture 30: Final Summary 4

25 Ch. 10 (Cont.): Two-Way ANOVA Interaction plot Roughly speaking, there s no Interaction effect if all lines are parallel to each other In summary, for Ch. 9 and 10 we should know: All of One-way ANOVA (Ch. 9) By hand and/or using SAS Most of randomized Blocking design (Sec 9.4), Two-way ANOVA (Ch. 10, Section ) For both: Complete ANOVA tables, calculate DFs and F test statistic Perform F tests using F table Interpret SAS output Know the general concept of a higher order (multi-way) ANOVA model. 4/4/1 H.X. Lecture 30: Final Summary 5

26 Ch. 11: Inferential Methods in Regression and Slopes (Correlations) Normal Error Regression Model Error Item (3 assumptions: Independence, Normality and Constant Variance) SSE, MSE, and Root MSE Coefficient of Determination, R^ % of variation explained by the regression model Simply by squaring r Statistical Inference about the slope in SLR Model: C.I. for β (the slope): b ± (t crit) * s b Hypotheses Testing w.r.t. the slope, i.e. test of Linear Relationship Remark: t~student s t-distribution with d.f. = n 4/4/1 H.X. Lecture 30: Final Summary 6

27 Using ANOVA table to test SLR Source DF SS MS Model (Regression) 1 SSM (or SSR) SSM/1 = MSM (or MSR) Error n SSE (or SSResid) Total n 1 SST = SSM + SSE SSE/n = MSE Remark: d.f. of F test statistic = (1, n ) 4/4/1 H.X. Lecture 30: Final Summary 7

28 Multiple Linear Regression Model MLR Model: Y X X X e i = α + β1 1+ β βp p + i Test the above linear relationship H 0 : All β i s = 0 vs. H α : At least one β i 0 A rejection of the null indicates that collectively the Xs do well at explaining Y; otherwise don t have to run the following step But it doesn t show which explicit Xi s are doing the explaining Model Selection, especially Backward Elimination The Estimated Line, from SAS output Use it to Predict Yi; Get residual by Actual Y_i Predicted Value 4/4/1 H.X. Lecture 30: Final Summary 8

29 After Class Review Notes, practices, Hw, Labs and previous tests. Wed, Lab#8 (optional) Final Exam (Close book, Close notes) Next Wed, 8-10am Student ID, a calculator (SAT policy, NO QWERTY keyboard) and pencils, two-page crib sheet (8 by 11 ) handwritten by yourself, two-sided. SEE CALCULATOR POLICY and crib sheet (on Syllabus) from course website. No electronics except a calculator. Not allowed to exchange calculator or crib sheet during the exam. Not allowed to type/print your crib sheet. 4/4/1 H.X. Lecture 30: Final Summary 9

Sociology 6Z03 Review II

Sociology 6Z03 Review II Sociology 6Z03 Review II John Fox McMaster University Fall 2016 John Fox (McMaster University) Sociology 6Z03 Review II Fall 2016 1 / 35 Outline: Review II Probability Part I Sampling Distributions Probability

More information

Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z).

Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z). Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z). For example P(X 1.04) =.8508. For z < 0 subtract the value from

More information

Inference for Regression Simple Linear Regression

Inference for Regression Simple Linear Regression Inference for Regression Simple Linear Regression IPS Chapter 10.1 2009 W.H. Freeman and Company Objectives (IPS Chapter 10.1) Simple linear regression p Statistical model for linear regression p Estimating

More information

Ch 2: Simple Linear Regression

Ch 2: Simple Linear Regression Ch 2: Simple Linear Regression 1. Simple Linear Regression Model A simple regression model with a single regressor x is y = β 0 + β 1 x + ɛ, where we assume that the error ɛ is independent random component

More information

Inferences for Regression

Inferences for Regression Inferences for Regression An Example: Body Fat and Waist Size Looking at the relationship between % body fat and waist size (in inches). Here is a scatterplot of our data set: Remembering Regression In

More information

Inference for the Regression Coefficient

Inference for the Regression Coefficient Inference for the Regression Coefficient Recall, b 0 and b 1 are the estimates of the slope β 1 and intercept β 0 of population regression line. We can shows that b 0 and b 1 are the unbiased estimates

More information

Review for Final. Chapter 1 Type of studies: anecdotal, observational, experimental Random sampling

Review for Final. Chapter 1 Type of studies: anecdotal, observational, experimental Random sampling Review for Final For a detailed review of Chapters 1 7, please see the review sheets for exam 1 and. The following only briefly covers these sections. The final exam could contain problems that are included

More information

Lecture 10 Multiple Linear Regression

Lecture 10 Multiple Linear Regression Lecture 10 Multiple Linear Regression STAT 512 Spring 2011 Background Reading KNNL: 6.1-6.5 10-1 Topic Overview Multiple Linear Regression Model 10-2 Data for Multiple Regression Y i is the response variable

More information

Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z).

Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z). Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z). For example P(X.04) =.8508. For z < 0 subtract the value from,

More information

Simple Linear Regression: One Quantitative IV

Simple Linear Regression: One Quantitative IV Simple Linear Regression: One Quantitative IV Linear regression is frequently used to explain variation observed in a dependent variable (DV) with theoretically linked independent variables (IV). For example,

More information

Lectures on Simple Linear Regression Stat 431, Summer 2012

Lectures on Simple Linear Regression Stat 431, Summer 2012 Lectures on Simple Linear Regression Stat 43, Summer 0 Hyunseung Kang July 6-8, 0 Last Updated: July 8, 0 :59PM Introduction Previously, we have been investigating various properties of the population

More information

ECN221 Exam 1 VERSION B Fall 2017 (Modules 1-4), ASU-COX VERSION B

ECN221 Exam 1 VERSION B Fall 2017 (Modules 1-4), ASU-COX VERSION B ECN221 Exam 1 VERSION B Fall 2017 (Modules 1-4), ASU-COX VERSION B Choose the best answer. Do not write letters in the margin or communicate with other students in any way; if you do you will receive a

More information

Estimating σ 2. We can do simple prediction of Y and estimation of the mean of Y at any value of X.

Estimating σ 2. We can do simple prediction of Y and estimation of the mean of Y at any value of X. Estimating σ 2 We can do simple prediction of Y and estimation of the mean of Y at any value of X. To perform inferences about our regression line, we must estimate σ 2, the variance of the error term.

More information

Business Statistics. Lecture 10: Course Review

Business Statistics. Lecture 10: Course Review Business Statistics Lecture 10: Course Review 1 Descriptive Statistics for Continuous Data Numerical Summaries Location: mean, median Spread or variability: variance, standard deviation, range, percentiles,

More information

ANOVA: Analysis of Variation

ANOVA: Analysis of Variation ANOVA: Analysis of Variation The basic ANOVA situation Two variables: 1 Categorical, 1 Quantitative Main Question: Do the (means of) the quantitative variables depend on which group (given by categorical

More information

Statistics for Managers using Microsoft Excel 6 th Edition

Statistics for Managers using Microsoft Excel 6 th Edition Statistics for Managers using Microsoft Excel 6 th Edition Chapter 13 Simple Linear Regression 13-1 Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value of

More information

Inference for Regression Inference about the Regression Model and Using the Regression Line

Inference for Regression Inference about the Regression Model and Using the Regression Line Inference for Regression Inference about the Regression Model and Using the Regression Line PBS Chapter 10.1 and 10.2 2009 W.H. Freeman and Company Objectives (PBS Chapter 10.1 and 10.2) Inference about

More information

Linear Regression. Simple linear regression model determines the relationship between one dependent variable (y) and one independent variable (x).

Linear Regression. Simple linear regression model determines the relationship between one dependent variable (y) and one independent variable (x). Linear Regression Simple linear regression model determines the relationship between one dependent variable (y) and one independent variable (x). A dependent variable is a random variable whose variation

More information

AP Statistics Cumulative AP Exam Study Guide

AP Statistics Cumulative AP Exam Study Guide AP Statistics Cumulative AP Eam Study Guide Chapters & 3 - Graphs Statistics the science of collecting, analyzing, and drawing conclusions from data. Descriptive methods of organizing and summarizing statistics

More information

Dover- Sherborn High School Mathematics Curriculum Probability and Statistics

Dover- Sherborn High School Mathematics Curriculum Probability and Statistics Mathematics Curriculum A. DESCRIPTION This is a full year courses designed to introduce students to the basic elements of statistics and probability. Emphasis is placed on understanding terminology and

More information

DETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics

DETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics DETAILED CONTENTS About the Author Preface to the Instructor To the Student How to Use SPSS With This Book PART I INTRODUCTION AND DESCRIPTIVE STATISTICS 1. Introduction to Statistics 1.1 Descriptive and

More information

Unit 10: Simple Linear Regression and Correlation

Unit 10: Simple Linear Regression and Correlation Unit 10: Simple Linear Regression and Correlation Statistics 571: Statistical Methods Ramón V. León 6/28/2004 Unit 10 - Stat 571 - Ramón V. León 1 Introductory Remarks Regression analysis is a method for

More information

SIMPLE REGRESSION ANALYSIS. Business Statistics

SIMPLE REGRESSION ANALYSIS. Business Statistics SIMPLE REGRESSION ANALYSIS Business Statistics CONTENTS Ordinary least squares (recap for some) Statistical formulation of the regression model Assessing the regression model Testing the regression coefficients

More information

Basic Business Statistics 6 th Edition

Basic Business Statistics 6 th Edition Basic Business Statistics 6 th Edition Chapter 12 Simple Linear Regression Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value of a dependent variable based

More information

Mathematical Notation Math Introduction to Applied Statistics

Mathematical Notation Math Introduction to Applied Statistics Mathematical Notation Math 113 - Introduction to Applied Statistics Name : Use Word or WordPerfect to recreate the following documents. Each article is worth 10 points and should be emailed to the instructor

More information

FRANKLIN UNIVERSITY PROFICIENCY EXAM (FUPE) STUDY GUIDE

FRANKLIN UNIVERSITY PROFICIENCY EXAM (FUPE) STUDY GUIDE FRANKLIN UNIVERSITY PROFICIENCY EXAM (FUPE) STUDY GUIDE Course Title: Probability and Statistics (MATH 80) Recommended Textbook(s): Number & Type of Questions: Probability and Statistics for Engineers

More information

STATISTICS 141 Final Review

STATISTICS 141 Final Review STATISTICS 141 Final Review Bin Zou bzou@ualberta.ca Department of Mathematical & Statistical Sciences University of Alberta Winter 2015 Bin Zou (bzou@ualberta.ca) STAT 141 Final Review Winter 2015 1 /

More information

Inference for Regression

Inference for Regression Inference for Regression Section 9.4 Cathy Poliak, Ph.D. cathy@math.uh.edu Office in Fleming 11c Department of Mathematics University of Houston Lecture 13b - 3339 Cathy Poliak, Ph.D. cathy@math.uh.edu

More information

STAT Chapter 11: Regression

STAT Chapter 11: Regression STAT 515 -- Chapter 11: Regression Mostly we have studied the behavior of a single random variable. Often, however, we gather data on two random variables. We wish to determine: Is there a relationship

More information

Correlation Analysis

Correlation Analysis Simple Regression Correlation Analysis Correlation analysis is used to measure strength of the association (linear relationship) between two variables Correlation is only concerned with strength of the

More information

Correlation and the Analysis of Variance Approach to Simple Linear Regression

Correlation and the Analysis of Variance Approach to Simple Linear Regression Correlation and the Analysis of Variance Approach to Simple Linear Regression Biometry 755 Spring 2009 Correlation and the Analysis of Variance Approach to Simple Linear Regression p. 1/35 Correlation

More information

Simple Linear Regression

Simple Linear Regression Simple Linear Regression In simple linear regression we are concerned about the relationship between two variables, X and Y. There are two components to such a relationship. 1. The strength of the relationship.

More information

y ˆ i = ˆ " T u i ( i th fitted value or i th fit)

y ˆ i = ˆ  T u i ( i th fitted value or i th fit) 1 2 INFERENCE FOR MULTIPLE LINEAR REGRESSION Recall Terminology: p predictors x 1, x 2,, x p Some might be indicator variables for categorical variables) k-1 non-constant terms u 1, u 2,, u k-1 Each u

More information

Mathematics for Economics MA course

Mathematics for Economics MA course Mathematics for Economics MA course Simple Linear Regression Dr. Seetha Bandara Simple Regression Simple linear regression is a statistical method that allows us to summarize and study relationships between

More information

Statistics For Economics & Business

Statistics For Economics & Business Statistics For Economics & Business Analysis of Variance In this chapter, you learn: Learning Objectives The basic concepts of experimental design How to use one-way analysis of variance to test for differences

More information

Review of Statistics 101

Review of Statistics 101 Review of Statistics 101 We review some important themes from the course 1. Introduction Statistics- Set of methods for collecting/analyzing data (the art and science of learning from data). Provides methods

More information

Chapter 16. Simple Linear Regression and Correlation

Chapter 16. Simple Linear Regression and Correlation Chapter 16 Simple Linear Regression and Correlation 16.1 Regression Analysis Our problem objective is to analyze the relationship between interval variables; regression analysis is the first tool we will

More information

Multiple Regression. Inference for Multiple Regression and A Case Study. IPS Chapters 11.1 and W.H. Freeman and Company

Multiple Regression. Inference for Multiple Regression and A Case Study. IPS Chapters 11.1 and W.H. Freeman and Company Multiple Regression Inference for Multiple Regression and A Case Study IPS Chapters 11.1 and 11.2 2009 W.H. Freeman and Company Objectives (IPS Chapters 11.1 and 11.2) Multiple regression Data for multiple

More information

STAT 350 Final (new Material) Review Problems Key Spring 2016

STAT 350 Final (new Material) Review Problems Key Spring 2016 1. The editor of a statistics textbook would like to plan for the next edition. A key variable is the number of pages that will be in the final version. Text files are prepared by the authors using LaTeX,

More information

Week 12 Hypothesis Testing, Part II Comparing Two Populations

Week 12 Hypothesis Testing, Part II Comparing Two Populations Week 12 Hypothesis Testing, Part II Week 12 Hypothesis Testing, Part II Week 12 Objectives 1 The principle of Analysis of Variance is introduced and used to derive the F-test for testing the model utility

More information

Review of Statistics

Review of Statistics Review of Statistics Topics Descriptive Statistics Mean, Variance Probability Union event, joint event Random Variables Discrete and Continuous Distributions, Moments Two Random Variables Covariance and

More information

4.1. Introduction: Comparing Means

4.1. Introduction: Comparing Means 4. Analysis of Variance (ANOVA) 4.1. Introduction: Comparing Means Consider the problem of testing H 0 : µ 1 = µ 2 against H 1 : µ 1 µ 2 in two independent samples of two different populations of possibly

More information

Lecture 3: Inference in SLR

Lecture 3: Inference in SLR Lecture 3: Inference in SLR STAT 51 Spring 011 Background Reading KNNL:.1.6 3-1 Topic Overview This topic will cover: Review of hypothesis testing Inference about 1 Inference about 0 Confidence Intervals

More information

Chapter 16. Simple Linear Regression and dcorrelation

Chapter 16. Simple Linear Regression and dcorrelation Chapter 16 Simple Linear Regression and dcorrelation 16.1 Regression Analysis Our problem objective is to analyze the relationship between interval variables; regression analysis is the first tool we will

More information

Interpret Standard Deviation. Outlier Rule. Describe the Distribution OR Compare the Distributions. Linear Transformations SOCS. Interpret a z score

Interpret Standard Deviation. Outlier Rule. Describe the Distribution OR Compare the Distributions. Linear Transformations SOCS. Interpret a z score Interpret Standard Deviation Outlier Rule Linear Transformations Describe the Distribution OR Compare the Distributions SOCS Using Normalcdf and Invnorm (Calculator Tips) Interpret a z score What is an

More information

Math Review Sheet, Fall 2008

Math Review Sheet, Fall 2008 1 Descriptive Statistics Math 3070-5 Review Sheet, Fall 2008 First we need to know about the relationship among Population Samples Objects The distribution of the population can be given in one of the

More information

Keller: Stats for Mgmt & Econ, 7th Ed July 17, 2006

Keller: Stats for Mgmt & Econ, 7th Ed July 17, 2006 Chapter 17 Simple Linear Regression and Correlation 17.1 Regression Analysis Our problem objective is to analyze the relationship between interval variables; regression analysis is the first tool we will

More information

Simple Linear Regression. (Chs 12.1, 12.2, 12.4, 12.5)

Simple Linear Regression. (Chs 12.1, 12.2, 12.4, 12.5) 10 Simple Linear Regression (Chs 12.1, 12.2, 12.4, 12.5) Simple Linear Regression Rating 20 40 60 80 0 5 10 15 Sugar 2 Simple Linear Regression Rating 20 40 60 80 0 5 10 15 Sugar 3 Simple Linear Regression

More information

REVIEW: Midterm Exam. Spring 2012

REVIEW: Midterm Exam. Spring 2012 REVIEW: Midterm Exam Spring 2012 Introduction Important Definitions: - Data - Statistics - A Population - A census - A sample Types of Data Parameter (Describing a characteristic of the Population) Statistic

More information

Formal Statement of Simple Linear Regression Model

Formal Statement of Simple Linear Regression Model Formal Statement of Simple Linear Regression Model Y i = β 0 + β 1 X i + ɛ i Y i value of the response variable in the i th trial β 0 and β 1 are parameters X i is a known constant, the value of the predictor

More information

STAT 512 MidTerm I (2/21/2013) Spring 2013 INSTRUCTIONS

STAT 512 MidTerm I (2/21/2013) Spring 2013 INSTRUCTIONS STAT 512 MidTerm I (2/21/2013) Spring 2013 Name: Key INSTRUCTIONS 1. This exam is open book/open notes. All papers (but no electronic devices except for calculators) are allowed. 2. There are 5 pages in

More information

1-Way ANOVA MATH 143. Spring Department of Mathematics and Statistics Calvin College

1-Way ANOVA MATH 143. Spring Department of Mathematics and Statistics Calvin College 1-Way ANOVA MATH 143 Department of Mathematics and Statistics Calvin College Spring 2010 The basic ANOVA situation Two variables: 1 Categorical, 1 Quantitative Main Question: Do the (means of) the quantitative

More information

Confidence Intervals, Testing and ANOVA Summary

Confidence Intervals, Testing and ANOVA Summary Confidence Intervals, Testing and ANOVA Summary 1 One Sample Tests 1.1 One Sample z test: Mean (σ known) Let X 1,, X n a r.s. from N(µ, σ) or n > 30. Let The test statistic is H 0 : µ = µ 0. z = x µ 0

More information

Psychology 282 Lecture #4 Outline Inferences in SLR

Psychology 282 Lecture #4 Outline Inferences in SLR Psychology 282 Lecture #4 Outline Inferences in SLR Assumptions To this point we have not had to make any distributional assumptions. Principle of least squares requires no assumptions. Can use correlations

More information

Multiple linear regression

Multiple linear regression Multiple linear regression Course MF 930: Introduction to statistics June 0 Tron Anders Moger Department of biostatistics, IMB University of Oslo Aims for this lecture: Continue where we left off. Repeat

More information

Lecture 11: Simple Linear Regression

Lecture 11: Simple Linear Regression Lecture 11: Simple Linear Regression Readings: Sections 3.1-3.3, 11.1-11.3 Apr 17, 2009 In linear regression, we examine the association between two quantitative variables. Number of beers that you drink

More information

1-1. Chapter 1. Sampling and Descriptive Statistics by The McGraw-Hill Companies, Inc. All rights reserved.

1-1. Chapter 1. Sampling and Descriptive Statistics by The McGraw-Hill Companies, Inc. All rights reserved. 1-1 Chapter 1 Sampling and Descriptive Statistics 1-2 Why Statistics? Deal with uncertainty in repeated scientific measurements Draw conclusions from data Design valid experiments and draw reliable conclusions

More information

Analysis of Variance and Design of Experiments-I

Analysis of Variance and Design of Experiments-I Analysis of Variance and Design of Experiments-I MODULE VIII LECTURE - 35 ANALYSIS OF VARIANCE IN RANDOM-EFFECTS MODEL AND MIXED-EFFECTS MODEL Dr. Shalabh Department of Mathematics and Statistics Indian

More information

Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z).

Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z). Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z). For example P(X 1.04) =.8508. For z < 0 subtract the value from

More information

Regression Analysis. Regression: Methodology for studying the relationship among two or more variables

Regression Analysis. Regression: Methodology for studying the relationship among two or more variables Regression Analysis Regression: Methodology for studying the relationship among two or more variables Two major aims: Determine an appropriate model for the relationship between the variables Predict the

More information

Correlation and regression

Correlation and regression 1 Correlation and regression Yongjua Laosiritaworn Introductory on Field Epidemiology 6 July 2015, Thailand Data 2 Illustrative data (Doll, 1955) 3 Scatter plot 4 Doll, 1955 5 6 Correlation coefficient,

More information

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages: Glossary The ISI glossary of statistical terms provides definitions in a number of different languages: http://isi.cbs.nl/glossary/index.htm Adjusted r 2 Adjusted R squared measures the proportion of the

More information

If we have many sets of populations, we may compare the means of populations in each set with one experiment.

If we have many sets of populations, we may compare the means of populations in each set with one experiment. Statistical Methods in Business Lecture 3. Factorial Design: If we have many sets of populations we may compare the means of populations in each set with one experiment. Assume we have two factors with

More information

" M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2

 M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2 Notation and Equations for Final Exam Symbol Definition X The variable we measure in a scientific study n The size of the sample N The size of the population M The mean of the sample µ The mean of the

More information

STAT 4385 Topic 03: Simple Linear Regression

STAT 4385 Topic 03: Simple Linear Regression STAT 4385 Topic 03: Simple Linear Regression Xiaogang Su, Ph.D. Department of Mathematical Science University of Texas at El Paso xsu@utep.edu Spring, 2017 Outline The Set-Up Exploratory Data Analysis

More information

Regression used to predict or estimate the value of one variable corresponding to a given value of another variable.

Regression used to predict or estimate the value of one variable corresponding to a given value of another variable. CHAPTER 9 Simple Linear Regression and Correlation Regression used to predict or estimate the value of one variable corresponding to a given value of another variable. X = independent variable. Y = dependent

More information

AIM HIGH SCHOOL. Curriculum Map W. 12 Mile Road Farmington Hills, MI (248)

AIM HIGH SCHOOL. Curriculum Map W. 12 Mile Road Farmington Hills, MI (248) AIM HIGH SCHOOL Curriculum Map 2923 W. 12 Mile Road Farmington Hills, MI 48334 (248) 702-6922 www.aimhighschool.com COURSE TITLE: Statistics DESCRIPTION OF COURSE: PREREQUISITES: Algebra 2 Students will

More information

Contents. Acknowledgments. xix

Contents. Acknowledgments. xix Table of Preface Acknowledgments page xv xix 1 Introduction 1 The Role of the Computer in Data Analysis 1 Statistics: Descriptive and Inferential 2 Variables and Constants 3 The Measurement of Variables

More information

Section 4.6 Simple Linear Regression

Section 4.6 Simple Linear Regression Section 4.6 Simple Linear Regression Objectives ˆ Basic philosophy of SLR and the regression assumptions ˆ Point & interval estimation of the model parameters, and how to make predictions ˆ Point and interval

More information

Glossary for the Triola Statistics Series

Glossary for the Triola Statistics Series Glossary for the Triola Statistics Series Absolute deviation The measure of variation equal to the sum of the deviations of each value from the mean, divided by the number of values Acceptance sampling

More information

Mathematical Notation Math Introduction to Applied Statistics

Mathematical Notation Math Introduction to Applied Statistics Mathematical Notation Math 113 - Introduction to Applied Statistics Name : Use Word or WordPerfect to recreate the following documents. Each article is worth 10 points and can be printed and given to the

More information

Multiple Linear Regression

Multiple Linear Regression Multiple Linear Regression Simple linear regression tries to fit a simple line between two variables Y and X. If X is linearly related to Y this explains some of the variability in Y. In most cases, there

More information

Chapter 4. Regression Models. Learning Objectives

Chapter 4. Regression Models. Learning Objectives Chapter 4 Regression Models To accompany Quantitative Analysis for Management, Eleventh Edition, by Render, Stair, and Hanna Power Point slides created by Brian Peterson Learning Objectives After completing

More information

Ch Inference for Linear Regression

Ch Inference for Linear Regression Ch. 12-1 Inference for Linear Regression ACT = 6.71 + 5.17(GPA) For every increase of 1 in GPA, we predict the ACT score to increase by 5.17. population regression line β (true slope) μ y = α + βx mean

More information

Stat 2300 International, Fall 2006 Sample Midterm. Friday, October 20, Your Name: A Number:

Stat 2300 International, Fall 2006 Sample Midterm. Friday, October 20, Your Name: A Number: Stat 2300 International, Fall 2006 Sample Midterm Friday, October 20, 2006 Your Name: A Number: The Midterm consists of 35 questions: 20 multiple-choice questions (with exactly 1 correct answer) and 15

More information

Simple Linear Regression for the Climate Data

Simple Linear Regression for the Climate Data Prediction Prediction Interval Temperature 0.2 0.0 0.2 0.4 0.6 0.8 320 340 360 380 CO 2 Simple Linear Regression for the Climate Data What do we do with the data? y i = Temperature of i th Year x i =CO

More information

df=degrees of freedom = n - 1

df=degrees of freedom = n - 1 One sample t-test test of the mean Assumptions: Independent, random samples Approximately normal distribution (from intro class: σ is unknown, need to calculate and use s (sample standard deviation)) Hypotheses:

More information

Chapter 12 - Lecture 2 Inferences about regression coefficient

Chapter 12 - Lecture 2 Inferences about regression coefficient Chapter 12 - Lecture 2 Inferences about regression coefficient April 19th, 2010 Facts about slope Test Statistic Confidence interval Hypothesis testing Test using ANOVA Table Facts about slope In previous

More information

CS 5014: Research Methods in Computer Science

CS 5014: Research Methods in Computer Science Computer Science Clifford A. Shaffer Department of Computer Science Virginia Tech Blacksburg, Virginia Fall 2010 Copyright c 2010 by Clifford A. Shaffer Computer Science Fall 2010 1 / 207 Correlation and

More information

Ch 3: Multiple Linear Regression

Ch 3: Multiple Linear Regression Ch 3: Multiple Linear Regression 1. Multiple Linear Regression Model Multiple regression model has more than one regressor. For example, we have one response variable and two regressor variables: 1. delivery

More information

Simple linear regression

Simple linear regression Simple linear regression Biometry 755 Spring 2008 Simple linear regression p. 1/40 Overview of regression analysis Evaluate relationship between one or more independent variables (X 1,...,X k ) and a single

More information

Wed, June 26, (Lecture 8-2). Nonlinearity. Significance test for correlation R-squared, SSE, and SST. Correlation in SPSS.

Wed, June 26, (Lecture 8-2). Nonlinearity. Significance test for correlation R-squared, SSE, and SST. Correlation in SPSS. Wed, June 26, (Lecture 8-2). Nonlinearity. Significance test for correlation R-squared, SSE, and SST. Correlation in SPSS. Last time, we looked at scatterplots, which show the interaction between two variables,

More information

Lecture 9: Linear Regression

Lecture 9: Linear Regression Lecture 9: Linear Regression Goals Develop basic concepts of linear regression from a probabilistic framework Estimating parameters and hypothesis testing with linear models Linear regression in R Regression

More information

Business Statistics. Chapter 14 Introduction to Linear Regression and Correlation Analysis QMIS 220. Dr. Mohammad Zainal

Business Statistics. Chapter 14 Introduction to Linear Regression and Correlation Analysis QMIS 220. Dr. Mohammad Zainal Department of Quantitative Methods & Information Systems Business Statistics Chapter 14 Introduction to Linear Regression and Correlation Analysis QMIS 220 Dr. Mohammad Zainal Chapter Goals After completing

More information

IT 403 Statistics and Data Analysis Final Review Guide

IT 403 Statistics and Data Analysis Final Review Guide IT 403 Statistics and Data Analysis Final Review Guide Exam Schedule and Format Date: 11/15 (Wed) for Section 702 (Loop); between 11/15 (Wed) and 11/18 (Sat) for Section 711 (Online). Location: CDM 224

More information

Lecture 15 Multiple regression I Chapter 6 Set 2 Least Square Estimation The quadratic form to be minimized is

Lecture 15 Multiple regression I Chapter 6 Set 2 Least Square Estimation The quadratic form to be minimized is Lecture 15 Multiple regression I Chapter 6 Set 2 Least Square Estimation The quadratic form to be minimized is Q = (Y i β 0 β 1 X i1 β 2 X i2 β p 1 X i.p 1 ) 2, which in matrix notation is Q = (Y Xβ) (Y

More information

Summary of Chapter 7 (Sections ) and Chapter 8 (Section 8.1)

Summary of Chapter 7 (Sections ) and Chapter 8 (Section 8.1) Summary of Chapter 7 (Sections 7.2-7.5) and Chapter 8 (Section 8.1) Chapter 7. Tests of Statistical Hypotheses 7.2. Tests about One Mean (1) Test about One Mean Case 1: σ is known. Assume that X N(µ, σ

More information

Simple Linear Regression

Simple Linear Regression 9-1 l Chapter 9 l Simple Linear Regression 9.1 Simple Linear Regression 9.2 Scatter Diagram 9.3 Graphical Method for Determining Regression 9.4 Least Square Method 9.5 Correlation Coefficient and Coefficient

More information

From Practical Data Analysis with JMP, Second Edition. Full book available for purchase here. About This Book... xiii About The Author...

From Practical Data Analysis with JMP, Second Edition. Full book available for purchase here. About This Book... xiii About The Author... From Practical Data Analysis with JMP, Second Edition. Full book available for purchase here. Contents About This Book... xiii About The Author... xxiii Chapter 1 Getting Started: Data Analysis with JMP...

More information

In a one-way ANOVA, the total sums of squares among observations is partitioned into two components: Sums of squares represent:

In a one-way ANOVA, the total sums of squares among observations is partitioned into two components: Sums of squares represent: Activity #10: AxS ANOVA (Repeated subjects design) Resources: optimism.sav So far in MATH 300 and 301, we have studied the following hypothesis testing procedures: 1) Binomial test, sign-test, Fisher s

More information

Linear regression. We have that the estimated mean in linear regression is. ˆµ Y X=x = ˆβ 0 + ˆβ 1 x. The standard error of ˆµ Y X=x is.

Linear regression. We have that the estimated mean in linear regression is. ˆµ Y X=x = ˆβ 0 + ˆβ 1 x. The standard error of ˆµ Y X=x is. Linear regression We have that the estimated mean in linear regression is The standard error of ˆµ Y X=x is where x = 1 n s.e.(ˆµ Y X=x ) = σ ˆµ Y X=x = ˆβ 0 + ˆβ 1 x. 1 n + (x x)2 i (x i x) 2 i x i. The

More information

Lecture 18 MA Applied Statistics II D 2004

Lecture 18 MA Applied Statistics II D 2004 Lecture 18 MA 2612 - Applied Statistics II D 2004 Today 1. Examples of multiple linear regression 2. The modeling process (PNC 8.4) 3. The graphical exploration of multivariable data (PNC 8.5) 4. Fitting

More information

STA 108 Applied Linear Models: Regression Analysis Spring Solution for Homework #6

STA 108 Applied Linear Models: Regression Analysis Spring Solution for Homework #6 STA 8 Applied Linear Models: Regression Analysis Spring 011 Solution for Homework #6 6. a) = 11 1 31 41 51 1 3 4 5 11 1 31 41 51 β = β1 β β 3 b) = 1 1 1 1 1 11 1 31 41 51 1 3 4 5 β = β 0 β1 β 6.15 a) Stem-and-leaf

More information

We like to capture and represent the relationship between a set of possible causes and their response, by using a statistical predictive model.

We like to capture and represent the relationship between a set of possible causes and their response, by using a statistical predictive model. Statistical Methods in Business Lecture 5. Linear Regression We like to capture and represent the relationship between a set of possible causes and their response, by using a statistical predictive model.

More information

Statistics and Quantitative Analysis U4320

Statistics and Quantitative Analysis U4320 Statistics and Quantitative Analysis U3 Lecture 13: Explaining Variation Prof. Sharyn O Halloran Explaining Variation: Adjusted R (cont) Definition of Adjusted R So we'd like a measure like R, but one

More information

We need to define some concepts that are used in experiments.

We need to define some concepts that are used in experiments. Chapter 0 Analysis of Variance (a.k.a. Designing and Analysing Experiments) Section 0. Introduction In Chapter we mentioned some different ways in which we could get data: Surveys, Observational Studies,

More information

The Multiple Regression Model

The Multiple Regression Model Multiple Regression The Multiple Regression Model Idea: Examine the linear relationship between 1 dependent (Y) & or more independent variables (X i ) Multiple Regression Model with k Independent Variables:

More information

Unit 27 One-Way Analysis of Variance

Unit 27 One-Way Analysis of Variance Unit 27 One-Way Analysis of Variance Objectives: To perform the hypothesis test in a one-way analysis of variance for comparing more than two population means Recall that a two sample t test is applied

More information

Exam details. Final Review Session. Things to Review

Exam details. Final Review Session. Things to Review Exam details Final Review Session Short answer, similar to book problems Formulae and tables will be given You CAN use a calculator Date and Time: Dec. 7, 006, 1-1:30 pm Location: Osborne Centre, Unit

More information

Announcements. Final Review: Units 1-7

Announcements. Final Review: Units 1-7 Announcements Announcements Final : Units 1-7 Statistics 104 Mine Çetinkaya-Rundel June 24, 2013 Final on Wed: cheat sheet (one sheet, front and back) and calculator Must have webcam + audio on at all

More information