Statistics - Lecture 04

Size: px
Start display at page:

Download "Statistics - Lecture 04"

Transcription

1 Statistics - Lecture 04 Nicodème Paul Faculté de médecine, Université de Strasbourg file:///users/home/npaul/enseignement/esbs/ /cours/04/index.html#40 1/40

2 Correlation In many situations the objective in studying the joint behavior of two variables is to see whether they are related. - X: temperature and Y : body length in drosophila - X: Height and Y : Weight - X: parent Height and Y : child Height - X: eyes Color and Y : hair Color - X: Color and Y : perceived Taste Given n pairs of observations ( x 1, y 1 ), ( x 2, y 2 ),..., ( x n, y n ), it is natural to speak of X and Y having a positive relationship if large x's are paired with large y's and small x's with small y's. Similarly, if large x's are paired with small y's and small x's with large y's, then a negative relationship between the variables is implied. file:///users/home/npaul/enseignement/esbs/ /cours/04/index.html#40 2/40 2/40

3 Correlation In a population, we collect a random sample of 38 individuals. For each individual, we measure his or her height (mm) and his or her weight. Height Weight file:///users/home/npaul/enseignement/esbs/ /cours/04/index.html#40 3/40 3/40

4 Independence Is the variable Height is independent of the variable Weight? No Yes I cannot tell Submit Show Hint Show Answer Clear file:///users/home/npaul/enseignement/esbs/ /cours/04/index.html#40 4/40 4/40

5 Correlation A bivariate data set consists of measurements or observations on two variables, X and Y file:///users/home/npaul/enseignement/esbs/ /cours/04/index.html#40 5/40 5/40

6 Sample correlation The Pearson's sample correlation coe cient r of a bivariate data ( x i, y i ) for i = 1,..., n is given by: r = n i=1 x i xˉ y i ȳ ( )( ) ( ( )( ( ) n i=1 x i xˉ) 2 n i=1 y i ȳ ) 2 file:///users/home/npaul/enseignement/esbs/ /cours/04/index.html#40 6/40 6/40

7 Sample correlation Some properties of the Pearson correlation coe cent are as following: The value of r does not depend on the unit of measurement for either variable The value of r does not depend on which of the two variables is considered X The value of r is between 1 and -1 A correlation coe cient of r = 1 occurs only when all the points in a scatterplot of the data lie exactly on a straight line that slopes upward. Similarly, r = 1 only when all the points lie exactly on a downward-sloping line. file:///users/home/npaul/enseignement/esbs/ /cours/04/index.html#40 7/40 7/40

8 Examples of scatterplots Valeurs possibles: -1, -0.8, -0.4, 0, 0.4, 0.8, 1 file:///users/home/npaul/enseignement/esbs/ /cours/04/index.html#40 8/40 8/40

9 Examples of scatterplots with coe cient correlations file:///users/home/npaul/enseignement/esbs/ /cours/04/index.html#40 9/40 9/40

10 Hypothesis testing on ρ X i and Y i be jointly normal random variables R(X, Y ) = n ( X )( ) i=1 i Xˉ Y i Ȳ n i=1 X i Xˉ) 2 n i=1 Y i Ȳ ) 2 ( ( )( ( ) is an estimator for ρ. Hypotheses: H 0 : ρ = 0 H 1 : ρ 0 Test statistic: T = R n 2 1 R 2 follows a t distribution with n 2 degrees of freedoom Decision is based on the critical value t n 2 1 α/2 or p value = P H0 (T > t n 2 ) file:///users/home/npaul/enseignement/esbs/ /cours/04/index.html#40 10/40 10/40

11 Example - linear relationship r = As n = 38, the value of the t statistic is: t n 2 t = r = r 2 For α = 0.05, = As 4.74 > 2.028, we reject the null hypothesis, meaning that there is strong evidence that ρ, the population parameter is signi cantly di erent from 0. Note that you cannot nd this value neither calculate the p-value using the t table provided. Indeed, there is no entry for the t distribution with 36 degrees of freedom in the table. file:///users/home/npaul/enseignement/esbs/ /cours/04/index.html#40 11/40 11/40

12 If a signi cant sample correlation coe meaning? cient between two variables X and Y is observed, what could be its X causes Y Y causes X Some third factor, either directly or indirectly, causes both X and Y An unlikely event has occurred and a large sample correlation coe cient has been generated by chance from a population in which X and Y are, in fact, not correlated The correlation is purely nonsensical, a situation that may arise when measurements of X and Y are not taken on a common unit of association Submit Show Hint Show Answer Clear file:///users/home/npaul/enseignement/esbs/ /cours/04/index.html#40 12/40 12/40

13 Goodness of t - checking for normality A normal probability plot is a scatterplot of the (normal score z i, observed value x i ) pairs for i = 1, 2,..., n. The empirical cummulative distribution is de ned as: P(X x i ) = /n 0.5 1/n i n if i = 1 if i = n otherwise And the z i scores are found such that P(Z ) = P(X ). z i x i In software packages, the graphical representation can be obtained using a quantile - quantile plot. A strong linear pattern in a normal probability plot suggests that population normality is plausible. On the other hand, systematic departure from a straight-line pattern (such as curvature in the plot) indicates that it is not reasonable to assume that the population distribution is normal. file:///users/home/npaul/enseignement/esbs/ /cours/04/index.html#40 13/40 13/40

14 Checking for normality file:///users/home/npaul/enseignement/esbs/ /cours/04/index.html#40 14/40 14/40

15 Assess normality with Shapiro test Hypotheses: H 0 : L(X) = N H 1 : L(X) N Shapiro-Wilk normality test data: rnorm (50, mean = 1, sd = 3) W = , p-value = Shapiro-Wilk normality test data: rf (50, df1 = 3, df2 = 2) W = , p-value = 2.687e-09 file:///users/home/npaul/enseignement/esbs/ /cours/04/index.html#40 15/40 15/40

16 Assess normality with Shapiro test Hypotheses: H 0 : L(X) = N H 1 : L(X) N Shapiro-Wilk normality test data: scatdat$height W = , p-value = Shapiro-Wilk normality test data: scatdat$weight W = , p-value = file:///users/home/npaul/enseignement/esbs/ /cours/04/index.html#40 16/40 16/40

17 Dealing with non normal data Transform the data with the log or the square root function. Wilcoxon signed rank test can be used in place of a 1-sample or paired t-test Wilcoxon rank sum test substitutes for the 2-sample t-test Bootstrappping (testing for a population mean) 1. Extract a new sample of n observations from the original set of n. 2. Calculate the mean of this new sample of n. 3. Repeat steps (1) and (2) an arbitrarily large number of times, say 5000 times. 4. Use the estimated distribution of the sample mean for inference file:///users/home/npaul/enseignement/esbs/ /cours/04/index.html#40 17/40 17/40

18 Goodness of t Suppose data is generated from an experiment, given the following frequencies for the values of X. X k Frequencies n 1 n 2... n k We want to test the hypotheses: : L(X) = (,,..., ) : L(X) (,,..., ) H 0 p 1 p 2 p k H 1 p 1 p 2 p k with a signi cant level α. file:///users/home/npaul/enseignement/esbs/ /cours/04/index.html#40 18/40 18/40

19 Goodness of t Genetics Consider two di erent characteristics of tomatoes: leaf shape and plant size. The leaf shape may be potato-leafed or cut-leafed, and the plant may be tall or dwarf. file:///users/home/npaul/enseignement/esbs/ /cours/04/index.html#40 19/40 19/40

20 Goodness of t - Genetics Tall cut-leaf (TTCC, TTCc, TtCc, TtCC): 9 16, Tall potato-leaf (TTcc, Ttcc): 3 16 Dwarf cut-leaf (ttcc, ttcc): 3 16, Dwarf potato-leaf (ttcc): 1 16 file:///users/home/npaul/enseignement/esbs/ /cours/04/index.html#40 20/40 20/40

21 Notation Given X or cells): a categorical random variable with k possible values (k di erent levels or categories L(X) = P 0 = ( p 1, p 2,..., p k ) p i is the true proportion of category i and we have p 1 + p p k = 1. X = {1, 2, 3, 4} with tall cut-leaf: 1, tall potato-leaf: 2, dwarf cut-leaf: 3, dwarf potato-leaf: 4 file:///users/home/npaul/enseignement/esbs/ /cours/04/index.html#40 21/40 21/40

22 Goodness of t Suppose data is generated from an experiment, given the following frequencies for the values of X. X k Frequencies n 1 n 2... n k We want to test the hypotheses: : L(X) = (,,..., ) : L(X) (,,..., ) H 0 p 1 p 2 p k H 1 p 1 p 2 p k with a signi cant level α. file:///users/home/npaul/enseignement/esbs/ /cours/04/index.html#40 22/40 22/40

23 Goodness of t Under the null hypothesis, given a sample size n, the expected count for category i is: n p i. We can de ne the goodness-of- t statistic, KH as: k KH = i=1 ( n i n p i ) 2 n p i Expressed di erently and referring to category as cell, we write: KH = all cells (observed cell count expected cell count) 2 expected cell count Under H 0, with additionnal conditions such as: n 50, n 5, for i = 1,..., k we have: p i L(KH) χ 2 k 1 d d is the number of estimated parameters in the null hypothesis. file:///users/home/npaul/enseignement/esbs/ /cours/04/index.html#40 23/40 23/40

24 Goodness of t - Example Suppose we perform an experiment with tomato plants, and we observed the following data : We want to test : H 0 : L(X) = (,,, ) H 1 : L(X) (,,, ) with a level of signi cance α = file:///users/home/npaul/enseignement/esbs/ /cours/04/index.html#40 24/40 24/40

25 Goodness of t The value of the test statistic: ( ) 2 ( ) 2 kh = ( ) ( ) We verify: 25/40 file:///users/home/npaul/enseignement/esbs/ /cours/04/index.html#40 25/40

26 Goodness of t For a signi cance level α, we calculate the critical value c α such that P( > ) = α. We would fail to reject H 0 si kh < c α. χ 2 k 1 d In the example, kh 1.47, d = 0, α = 0.01, number of degrees of freedom is 3. c α The critical value is Therefore, we fail to reject the null hypothesis. The p-value is approximately file:///users/home/npaul/enseignement/esbs/ /cours/04/index.html#40 26/40 26/40

27 Contingency table A rectangular table used to summarize a categorical data set; two-way tables are used to compare several populations on the basis of a categorical variable or to determine if an association exists between two categorical variables. Example of a contingency table : file:///users/home/npaul/enseignement/esbs/ /cours/04/index.html#40 27/40 27/40

28 Review Given two discrete random variables X and Y, the joint distribution is de ned by: = P (X =, Y = ) p ij x i y j Y X x 1 x 2 y 1 p 11 p 21 y 2 p 12 p y m p 1m p 2m x l p l1 p l2... p lm The marginal distribution of X : P(X = x i ) = p = m p i. j=1 ij The marginal distribution of Y : P(X = y j ) = p = n p.j i=1 ij If X and Y are independent : p ij = p i. p.j file:///users/home/npaul/enseignement/esbs/ /cours/04/index.html#40 28/40 28/40

29 Testing independence The sample data observed from (X, Y ) is a contingency table: Y X x 1 x 2 y 1 n 11 n 21 y 2 n 12 n y m n 1m n 2m x l n l1 n l2... n lm n i. The marginal frequencies for X : = n.j The marginal frequencies for Y : = m j=1 n ij n i=1 n ij If X and Y are independent : = n i. n.j n ij n file:///users/home/npaul/enseignement/esbs/ /cours/04/index.html#40 29/40 29/40

30 Testing independence H 0 H 1 : X and Y are independent : X and Y are not independent p ij = p i. p.j avec i = 1, 2,..., let j = 1, 2,..., m The number of parameters to estimate is l 1 + m 1. Test statistic: KH = n i. n.j l m ( n i. n.j n ij n )2 n i. n.j i=1 j=1 n Under H 0, when n 50 and 5 for all i, j: n L(KH) χ 2 (l 1)(m 1) file:///users/home/npaul/enseignement/esbs/ /cours/04/index.html#40 30/40 30/40

31 Example Let X a random variables with values F (Family) and S (alone) and Y another random variable with values P (Present) et A (Absent). From the sample, we can estimate: 40 P(F, P ) P(F, A) P(S, P ) P(S, A) P(F ) P(S) P(P ) P(A) file:///users/home/npaul/enseignement/esbs/ /cours/04/index.html#40 31/40 31/40

32 Example H 0 : X and Y are independent H 1 : X and Y are not independent X and Y are independent if: P(X = x, Y = y) = P(X = x) P(Y = y) file:///users/home/npaul/enseignement/esbs/ /cours/04/index.html#40 32/40 32/40

33 Example 100 P(F ) P(P ) P(F ) P(A) P(S) P(P ) P(S) P(A) file:///users/home/npaul/enseignement/esbs/ /cours/04/index.html#40 33/40 33/40

34 Example expected cell count = (row marginal total) (column marginal total) grand total df = (number of rows 1) (number of columns 1) (40 54) 2 (60 46) 2 ( ) 2 ( ) 2 kh = The χ 2 distribution has (2 1) (2 1) = 1 degrees of freedom. c α = for α = We reject then the null hypothesis. file:///users/home/npaul/enseignement/esbs/ /cours/04/index.html#40 34/40 34/40

35 Can she taste the di erence? Tea-tasting experiment: a woman claimed to be able to judge whether tea or milk was poured in a cup rst. Fisher designed an experiment to test her ability What should be done about chance variations in the temperature, sweetness, and so on? How many cups should be used in the test? Should they be paired? In what order should the cups be presented? Fisher suggests: if discrimination of the kind under test is absent, the result of the experiment will be wholly governed by the laws of chance. What conclusion could be drawn from a perfect score or from one with one or more errors? file:///users/home/npaul/enseignement/esbs/ /cours/04/index.html#40 35/40 35/40

36 Fisher's exact test Tea-tasting experiment: a woman claimed to be able to judge whether tea or milk was poured in a cup rst. The woman was given eight cups of tea, in four of which milk was poured rst, and was told to guess which four had milk poured rst. The contingency for this design is: p 1 p 2 : probability to select a cup milk rst : probability to select a cup tea rst The hypotheses to test: H 0 : p 1 = p 2 H 1 : p 1 > p 2 file:///users/home/npaul/enseignement/esbs/ /cours/04/index.html#40 36/40 36/40

37 Fisher's exact test Tea-tasting experiment: a woman claimed to be able to judge whether tea or milk was poured in a cup rst. The woman was given eight cups of tea, in four of which tea was poured rst, and was told to guess which four had tea poured rst. X random variable with avlues x = 0, 1, 2, 3, 4 The Hypergeometric Distribution: 4 4 ( )( ) x 4 x P H0 ( ) (X = x) = x = 0, 1, 2, 3, ( ) = 70 There are 4 possible ways to classify 4 of the 8 cups as milk rst. file:///users/home/npaul/enseignement/esbs/ /cours/04/index.html#40 37/40 37/40

38 Check yourself Tea-tasting experiment: a woman claimed to be able to judge whether tea or milk was poured in a cup rst. The woman was given eight cups of tea, in four of which milk was poured rst, and was told to guess which four had milk poured rst. Suppose she identi es 3 cups milk rst correctly. The p-value is: 1/ /70 17/70 Submit Show Hint Show Answer Clear file:///users/home/npaul/enseignement/esbs/ /cours/04/index.html#40 38/40 38/40

39 Fisher's exact test The marginal counts a + b, c + d, a + c, and b + d are xed The hypotheses to test: H 0 : p 1 = p 2 H 1 : p 1 > p 2 or p 1 < p 2 or p 1 p 2 The test statistic is T the number of observations cell (1, 1) and a+c b+d ( )( ) x a+b x P H0 n ( ) (T = x) = x = 0, 1, 2,..., min{a + b, a + c} a+b file:///users/home/npaul/enseignement/esbs/ /cours/04/index.html#40 39/40 39/40

40 See you next time file:///users/home/npaul/enseignement/esbs/ /cours/04/index.html#40 40/40 40/40

Statistics. Nicodème Paul Faculté de médecine, Université de Strasbourg. 9/5/2018 Statistics

Statistics. Nicodème Paul Faculté de médecine, Université de Strasbourg. 9/5/2018 Statistics Statistics Nicodème Paul Faculté de médecine, Université de Strasbourg file:///users/home/npaul/enseignement/esbs/2018-2019/cours/01/index.html#21 1/62 Course logistics Statistics Course website: http://statnipa.appspot.com/

More information

Statistics. Nicodème Paul Faculté de médecine, Université de Strasbourg

Statistics. Nicodème Paul Faculté de médecine, Université de Strasbourg Statistics Nicodème Paul Faculté de médecine, Université de Strasbourg Course logistics Statistics & Experimental plani cation Course website: http://statnipa.appspot.com/ (http://statnipa.appspot.com/)

More information

Multiple Sample Categorical Data

Multiple Sample Categorical Data Multiple Sample Categorical Data paired and unpaired data, goodness-of-fit testing, testing for independence University of California, San Diego Instructor: Ery Arias-Castro http://math.ucsd.edu/~eariasca/teaching.html

More information

Statistics - Lecture 05

Statistics - Lecture 05 Statistics - Lecture 05 Nicodème Paul Faculté de médecine, Université de Strasbourg http://statnipa.appspot.com/cours/05/index.html#47 1/47 Descriptive statistics and probability Data description and graphical

More information

HYPOTHESIS TESTING: THE CHI-SQUARE STATISTIC

HYPOTHESIS TESTING: THE CHI-SQUARE STATISTIC 1 HYPOTHESIS TESTING: THE CHI-SQUARE STATISTIC 7 steps of Hypothesis Testing 1. State the hypotheses 2. Identify level of significant 3. Identify the critical values 4. Calculate test statistics 5. Compare

More information

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007)

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007) FROM: PAGANO, R. R. (007) I. INTRODUCTION: DISTINCTION BETWEEN PARAMETRIC AND NON-PARAMETRIC TESTS Statistical inference tests are often classified as to whether they are parametric or nonparametric Parameter

More information

Review of Statistics 101

Review of Statistics 101 Review of Statistics 101 We review some important themes from the course 1. Introduction Statistics- Set of methods for collecting/analyzing data (the art and science of learning from data). Provides methods

More information

Psychology 282 Lecture #4 Outline Inferences in SLR

Psychology 282 Lecture #4 Outline Inferences in SLR Psychology 282 Lecture #4 Outline Inferences in SLR Assumptions To this point we have not had to make any distributional assumptions. Principle of least squares requires no assumptions. Can use correlations

More information

Lecture 28 Chi-Square Analysis

Lecture 28 Chi-Square Analysis Lecture 28 STAT 225 Introduction to Probability Models April 23, 2014 Whitney Huang Purdue University 28.1 χ 2 test for For a given contingency table, we want to test if two have a relationship or not

More information

THE PEARSON CORRELATION COEFFICIENT

THE PEARSON CORRELATION COEFFICIENT CORRELATION Two variables are said to have a relation if knowing the value of one variable gives you information about the likely value of the second variable this is known as a bivariate relation There

More information

Statistical Inference: Estimation and Confidence Intervals Hypothesis Testing

Statistical Inference: Estimation and Confidence Intervals Hypothesis Testing Statistical Inference: Estimation and Confidence Intervals Hypothesis Testing 1 In most statistics problems, we assume that the data have been generated from some unknown probability distribution. We desire

More information

x i = 1 yi 2 = 55 with N = 30. Use the above sample information to answer all the following questions. Show explicitly all formulas and calculations.

x i = 1 yi 2 = 55 with N = 30. Use the above sample information to answer all the following questions. Show explicitly all formulas and calculations. Exercises for the course of Econometrics Introduction 1. () A researcher is using data for a sample of 30 observations to investigate the relationship between some dependent variable y i and independent

More information

Statistics Introductory Correlation

Statistics Introductory Correlation Statistics Introductory Correlation Session 10 oscardavid.barrerarodriguez@sciencespo.fr April 9, 2018 Outline 1 Statistics are not used only to describe central tendency and variability for a single variable.

More information

1 A Non-technical Introduction to Regression

1 A Non-technical Introduction to Regression 1 A Non-technical Introduction to Regression Chapters 1 and Chapter 2 of the textbook are reviews of material you should know from your previous study (e.g. in your second year course). They cover, in

More information

Lecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2

Lecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2 Lecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2 Fall, 2013 Page 1 Random Variable and Probability Distribution Discrete random variable Y : Finite possible values {y

More information

7.2 One-Sample Correlation ( = a) Introduction. Correlation analysis measures the strength and direction of association between

7.2 One-Sample Correlation ( = a) Introduction. Correlation analysis measures the strength and direction of association between 7.2 One-Sample Correlation ( = a) Introduction Correlation analysis measures the strength and direction of association between variables. In this chapter we will test whether the population correlation

More information

Business Statistics. Lecture 10: Correlation and Linear Regression

Business Statistics. Lecture 10: Correlation and Linear Regression Business Statistics Lecture 10: Correlation and Linear Regression Scatterplot A scatterplot shows the relationship between two quantitative variables measured on the same individuals. It displays the Form

More information

Mathematical Notation Math Introduction to Applied Statistics

Mathematical Notation Math Introduction to Applied Statistics Mathematical Notation Math 113 - Introduction to Applied Statistics Name : Use Word or WordPerfect to recreate the following documents. Each article is worth 10 points and should be emailed to the instructor

More information

E509A: Principle of Biostatistics. GY Zou

E509A: Principle of Biostatistics. GY Zou E509A: Principle of Biostatistics (Week 4: Inference for a single mean ) GY Zou gzou@srobarts.ca Example 5.4. (p. 183). A random sample of n =16, Mean I.Q is 106 with standard deviation S =12.4. What

More information

Review of One-way Tables and SAS

Review of One-way Tables and SAS Stat 504, Lecture 7 1 Review of One-way Tables and SAS In-class exercises: Ex1, Ex2, and Ex3 from http://v8doc.sas.com/sashtml/proc/z0146708.htm To calculate p-value for a X 2 or G 2 in SAS: http://v8doc.sas.com/sashtml/lgref/z0245929.htmz0845409

More information

Chi-Square Analyses Stat 251

Chi-Square Analyses Stat 251 Chi-Square Analyses Stat 251 While we have analyses for comparing more than 2 means, we cannot use them when trying to compare more than one proportion. However, there is a distribution that is related

More information

Topic 21 Goodness of Fit

Topic 21 Goodness of Fit Topic 21 Goodness of Fit Contingency Tables 1 / 11 Introduction Two-way Table Smoking Habits The Hypothesis The Test Statistic Degrees of Freedom Outline 2 / 11 Introduction Contingency tables, also known

More information

Sociology 6Z03 Review II

Sociology 6Z03 Review II Sociology 6Z03 Review II John Fox McMaster University Fall 2016 John Fox (McMaster University) Sociology 6Z03 Review II Fall 2016 1 / 35 Outline: Review II Probability Part I Sampling Distributions Probability

More information

Chapter Eight: Assessment of Relationships 1/42

Chapter Eight: Assessment of Relationships 1/42 Chapter Eight: Assessment of Relationships 1/42 8.1 Introduction 2/42 Background This chapter deals, primarily, with two topics. The Pearson product-moment correlation coefficient. The chi-square test

More information

ST3241 Categorical Data Analysis I Two-way Contingency Tables. Odds Ratio and Tests of Independence

ST3241 Categorical Data Analysis I Two-way Contingency Tables. Odds Ratio and Tests of Independence ST3241 Categorical Data Analysis I Two-way Contingency Tables Odds Ratio and Tests of Independence 1 Inference For Odds Ratio (p. 24) For small to moderate sample size, the distribution of sample odds

More information

Goodness of Fit Tests

Goodness of Fit Tests Goodness of Fit Tests Marc H. Mehlman marcmehlman@yahoo.com University of New Haven (University of New Haven) Goodness of Fit Tests 1 / 38 Table of Contents 1 Goodness of Fit Chi Squared Test 2 Tests of

More information

Bivariate Data: Graphical Display The scatterplot is the basic tool for graphically displaying bivariate quantitative data.

Bivariate Data: Graphical Display The scatterplot is the basic tool for graphically displaying bivariate quantitative data. Bivariate Data: Graphical Display The scatterplot is the basic tool for graphically displaying bivariate quantitative data. Example: Some investors think that the performance of the stock market in January

More information

Chapter 10. Chapter 10. Multinomial Experiments and. Multinomial Experiments and Contingency Tables. Contingency Tables.

Chapter 10. Chapter 10. Multinomial Experiments and. Multinomial Experiments and Contingency Tables. Contingency Tables. Chapter 10 Multinomial Experiments and Contingency Tables 1 Chapter 10 Multinomial Experiments and Contingency Tables 10-1 1 Overview 10-2 2 Multinomial Experiments: of-fitfit 10-3 3 Contingency Tables:

More information

Testing Independence

Testing Independence Testing Independence Dipankar Bandyopadhyay Department of Biostatistics, Virginia Commonwealth University BIOS 625: Categorical Data & GLM 1/50 Testing Independence Previously, we looked at RR = OR = 1

More information

Contents. Acknowledgments. xix

Contents. Acknowledgments. xix Table of Preface Acknowledgments page xv xix 1 Introduction 1 The Role of the Computer in Data Analysis 1 Statistics: Descriptive and Inferential 2 Variables and Constants 3 The Measurement of Variables

More information

CIVL /8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8

CIVL /8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8 CIVL - 7904/8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8 Chi-square Test How to determine the interval from a continuous distribution I = Range 1 + 3.322(logN) I-> Range of the class interval

More information

Formulas and Tables by Mario F. Triola

Formulas and Tables by Mario F. Triola Copyright 010 Pearson Education, Inc. Ch. 3: Descriptive Statistics x f # x x f Mean 1x - x s - 1 n 1 x - 1 x s 1n - 1 s B variance s Ch. 4: Probability Mean (frequency table) Standard deviation P1A or

More information

Business Statistics. Lecture 10: Course Review

Business Statistics. Lecture 10: Course Review Business Statistics Lecture 10: Course Review 1 Descriptive Statistics for Continuous Data Numerical Summaries Location: mean, median Spread or variability: variance, standard deviation, range, percentiles,

More information

Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami

Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami Parametric Assumptions The observations must be independent. Dependent variable should be continuous

More information

Contingency Tables. Safety equipment in use Fatal Non-fatal Total. None 1, , ,128 Seat belt , ,878

Contingency Tables. Safety equipment in use Fatal Non-fatal Total. None 1, , ,128 Seat belt , ,878 Contingency Tables I. Definition & Examples. A) Contingency tables are tables where we are looking at two (or more - but we won t cover three or more way tables, it s way too complicated) factors, each

More information

MA : Introductory Probability

MA : Introductory Probability MA 320-001: Introductory Probability David Murrugarra Department of Mathematics, University of Kentucky http://www.math.uky.edu/~dmu228/ma320/ Spring 2017 David Murrugarra (University of Kentucky) MA 320:

More information

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages: Glossary The ISI glossary of statistical terms provides definitions in a number of different languages: http://isi.cbs.nl/glossary/index.htm Adjusted r 2 Adjusted R squared measures the proportion of the

More information

1. The Multivariate Classical Linear Regression Model

1. The Multivariate Classical Linear Regression Model Business School, Brunel University MSc. EC550/5509 Modelling Financial Decisions and Markets/Introduction to Quantitative Methods Prof. Menelaos Karanasos (Room SS69, Tel. 08956584) Lecture Notes 5. The

More information

Econometrics Midterm Examination Answers

Econometrics Midterm Examination Answers Econometrics Midterm Examination Answers March 4, 204. Question (35 points) Answer the following short questions. (i) De ne what is an unbiased estimator. Show that X is an unbiased estimator for E(X i

More information

We know from STAT.1030 that the relevant test statistic for equality of proportions is:

We know from STAT.1030 that the relevant test statistic for equality of proportions is: 2. Chi 2 -tests for equality of proportions Introduction: Two Samples Consider comparing the sample proportions p 1 and p 2 in independent random samples of size n 1 and n 2 out of two populations which

More information

Multiple Choice Questions (circle one part) 1: a b c d e 2: a b c d e 3: a b c d e 4: a b c d e 5: a b c d e

Multiple Choice Questions (circle one part) 1: a b c d e 2: a b c d e 3: a b c d e 4: a b c d e 5: a b c d e Economics 102: Analysis of Economic Data Cameron Fall 2005 Department of Economics, U.C.-Davis Final Exam (A) Tuesday December 16 Compulsory. Closed book. Total of 56 points and worth 40% of course grade.

More information

Important note: Transcripts are not substitutes for textbook assignments. 1

Important note: Transcripts are not substitutes for textbook assignments. 1 In this lesson we will cover correlation and regression, two really common statistical analyses for quantitative (or continuous) data. Specially we will review how to organize the data, the importance

More information

NONPARAMETRIC TESTS. LALMOHAN BHAR Indian Agricultural Statistics Research Institute Library Avenue, New Delhi-12

NONPARAMETRIC TESTS. LALMOHAN BHAR Indian Agricultural Statistics Research Institute Library Avenue, New Delhi-12 NONPARAMETRIC TESTS LALMOHAN BHAR Indian Agricultural Statistics Research Institute Library Avenue, New Delhi-1 lmb@iasri.res.in 1. Introduction Testing (usually called hypothesis testing ) play a major

More information

THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE

THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE THE ROYAL STATISTICAL SOCIETY 004 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE PAPER II STATISTICAL METHODS The Society provides these solutions to assist candidates preparing for the examinations in future

More information

When is a copula constant? A test for changing relationships

When is a copula constant? A test for changing relationships When is a copula constant? A test for changing relationships Fabio Busetti and Andrew Harvey Bank of Italy and University of Cambridge November 2007 usetti and Harvey (Bank of Italy and University of Cambridge)

More information

Discrete Multivariate Statistics

Discrete Multivariate Statistics Discrete Multivariate Statistics Univariate Discrete Random variables Let X be a discrete random variable which, in this module, will be assumed to take a finite number of t different values which are

More information

Formulas and Tables. for Elementary Statistics, Tenth Edition, by Mario F. Triola Copyright 2006 Pearson Education, Inc. ˆp E p ˆp E Proportion

Formulas and Tables. for Elementary Statistics, Tenth Edition, by Mario F. Triola Copyright 2006 Pearson Education, Inc. ˆp E p ˆp E Proportion Formulas and Tables for Elementary Statistics, Tenth Edition, by Mario F. Triola Copyright 2006 Pearson Education, Inc. Ch. 3: Descriptive Statistics x Sf. x x Sf Mean S(x 2 x) 2 s Å n 2 1 n(sx 2 ) 2 (Sx)

More information

The goodness-of-fit test Having discussed how to make comparisons between two proportions, we now consider comparisons of multiple proportions.

The goodness-of-fit test Having discussed how to make comparisons between two proportions, we now consider comparisons of multiple proportions. The goodness-of-fit test Having discussed how to make comparisons between two proportions, we now consider comparisons of multiple proportions. A common problem of this type is concerned with determining

More information

psychological statistics

psychological statistics psychological statistics B Sc. Counselling Psychology 011 Admission onwards III SEMESTER COMPLEMENTARY COURSE UNIVERSITY OF CALICUT SCHOOL OF DISTANCE EDUCATION CALICUT UNIVERSITY.P.O., MALAPPURAM, KERALA,

More information

One-Way Tables and Goodness of Fit

One-Way Tables and Goodness of Fit Stat 504, Lecture 5 1 One-Way Tables and Goodness of Fit Key concepts: One-way Frequency Table Pearson goodness-of-fit statistic Deviance statistic Pearson residuals Objectives: Learn how to compute the

More information

This gives us an upper and lower bound that capture our population mean.

This gives us an upper and lower bound that capture our population mean. Confidence Intervals Critical Values Practice Problems 1 Estimation 1.1 Confidence Intervals Definition 1.1 Margin of error. The margin of error of a distribution is the amount of error we predict when

More information

Review for Final. Chapter 1 Type of studies: anecdotal, observational, experimental Random sampling

Review for Final. Chapter 1 Type of studies: anecdotal, observational, experimental Random sampling Review for Final For a detailed review of Chapters 1 7, please see the review sheets for exam 1 and. The following only briefly covers these sections. The final exam could contain problems that are included

More information

Chi Square Analysis M&M Statistics. Name Period Date

Chi Square Analysis M&M Statistics. Name Period Date Chi Square Analysis M&M Statistics Name Period Date Have you ever wondered why the package of M&Ms you just bought never seems to have enough of your favorite color? Or, why is it that you always seem

More information

STAT 135 Lab 11 Tests for Categorical Data (Fisher s Exact test, χ 2 tests for Homogeneity and Independence) and Linear Regression

STAT 135 Lab 11 Tests for Categorical Data (Fisher s Exact test, χ 2 tests for Homogeneity and Independence) and Linear Regression STAT 135 Lab 11 Tests for Categorical Data (Fisher s Exact test, χ 2 tests for Homogeneity and Independence) and Linear Regression Rebecca Barter April 20, 2015 Fisher s Exact Test Fisher s Exact Test

More information

Chapter 2: Looking at Data Relationships (Part 3)

Chapter 2: Looking at Data Relationships (Part 3) Chapter 2: Looking at Data Relationships (Part 3) Dr. Nahid Sultana Chapter 2: Looking at Data Relationships 2.1: Scatterplots 2.2: Correlation 2.3: Least-Squares Regression 2.5: Data Analysis for Two-Way

More information

Economics 326 Methods of Empirical Research in Economics. Lecture 14: Hypothesis testing in the multiple regression model, Part 2

Economics 326 Methods of Empirical Research in Economics. Lecture 14: Hypothesis testing in the multiple regression model, Part 2 Economics 326 Methods of Empirical Research in Economics Lecture 14: Hypothesis testing in the multiple regression model, Part 2 Vadim Marmer University of British Columbia May 5, 2010 Multiple restrictions

More information

Solutions exercises of Chapter 7

Solutions exercises of Chapter 7 Solutions exercises of Chapter 7 Exercise 1 a. These are paired samples: each pair of half plates will have about the same level of corrosion, so the result of polishing by the two brands of polish are

More information

The t-test: A z-score for a sample mean tells us where in the distribution the particular mean lies

The t-test: A z-score for a sample mean tells us where in the distribution the particular mean lies The t-test: So Far: Sampling distribution benefit is that even if the original population is not normal, a sampling distribution based on this population will be normal (for sample size > 30). Benefit

More information

Gov 2000: 6. Hypothesis Testing

Gov 2000: 6. Hypothesis Testing Gov 2000: 6. Hypothesis Testing Matthew Blackwell October 11, 2016 1 / 55 1. Hypothesis Testing Examples 2. Hypothesis Test Nomenclature 3. Conducting Hypothesis Tests 4. p-values 5. Power Analyses 6.

More information

Psych 230. Psychological Measurement and Statistics

Psych 230. Psychological Measurement and Statistics Psych 230 Psychological Measurement and Statistics Pedro Wolf December 9, 2009 This Time. Non-Parametric statistics Chi-Square test One-way Two-way Statistical Testing 1. Decide which test to use 2. State

More information

The scatterplot is the basic tool for graphically displaying bivariate quantitative data.

The scatterplot is the basic tool for graphically displaying bivariate quantitative data. Bivariate Data: Graphical Display The scatterplot is the basic tool for graphically displaying bivariate quantitative data. Example: Some investors think that the performance of the stock market in January

More information

ANALYSIS OF VARIANCE OF BALANCED DAIRY SCIENCE DATA USING SAS

ANALYSIS OF VARIANCE OF BALANCED DAIRY SCIENCE DATA USING SAS ANALYSIS OF VARIANCE OF BALANCED DAIRY SCIENCE DATA USING SAS Ravinder Malhotra and Vipul Sharma National Dairy Research Institute, Karnal-132001 The most common use of statistics in dairy science is testing

More information

Statistics. Statistics

Statistics. Statistics The main aims of statistics 1 1 Choosing a model 2 Estimating its parameter(s) 1 point estimates 2 interval estimates 3 Testing hypotheses Distributions used in statistics: χ 2 n-distribution 2 Let X 1,

More information

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS In our work on hypothesis testing, we used the value of a sample statistic to challenge an accepted value of a population parameter. We focused only

More information

Unit 27 One-Way Analysis of Variance

Unit 27 One-Way Analysis of Variance Unit 27 One-Way Analysis of Variance Objectives: To perform the hypothesis test in a one-way analysis of variance for comparing more than two population means Recall that a two sample t test is applied

More information

2.3 Analysis of Categorical Data

2.3 Analysis of Categorical Data 90 CHAPTER 2. ESTIMATION AND HYPOTHESIS TESTING 2.3 Analysis of Categorical Data 2.3.1 The Multinomial Probability Distribution A mulinomial random variable is a generalization of the binomial rv. It results

More information

Math 180B Problem Set 3

Math 180B Problem Set 3 Math 180B Problem Set 3 Problem 1. (Exercise 3.1.2) Solution. By the definition of conditional probabilities we have Pr{X 2 = 1, X 3 = 1 X 1 = 0} = Pr{X 3 = 1 X 2 = 1, X 1 = 0} Pr{X 2 = 1 X 1 = 0} = P

More information

i) the probability of type I error; ii) the 95% con dence interval; iii) the p value; iv) the probability of type II error; v) the power of a test.

i) the probability of type I error; ii) the 95% con dence interval; iii) the p value; iv) the probability of type II error; v) the power of a test. 1. Explain what is: i) the probability of type I error; ii) the 95% con dence interval; iii) the p value; iv) the probability of type II error; v) the power of a test. Answer: i) It is the probability

More information

Ch. 11 Inference for Distributions of Categorical Data

Ch. 11 Inference for Distributions of Categorical Data Ch. 11 Inference for Distributions of Categorical Data CH. 11 2 INFERENCES FOR RELATIONSHIPS The two sample z procedures from Ch. 10 allowed us to compare proportions of successes in two populations or

More information

401 Review. 6. Power analysis for one/two-sample hypothesis tests and for correlation analysis.

401 Review. 6. Power analysis for one/two-sample hypothesis tests and for correlation analysis. 401 Review Major topics of the course 1. Univariate analysis 2. Bivariate analysis 3. Simple linear regression 4. Linear algebra 5. Multiple regression analysis Major analysis methods 1. Graphical analysis

More information

Ordinal Variables in 2 way Tables

Ordinal Variables in 2 way Tables Ordinal Variables in 2 way Tables Edps/Psych/Soc 589 Carolyn J. Anderson Department of Educational Psychology c Board of Trustees, University of Illinois Fall 2018 C.J. Anderson (Illinois) Ordinal Variables

More information

STAT Chapter 13: Categorical Data. Recall we have studied binomial data, in which each trial falls into one of 2 categories (success/failure).

STAT Chapter 13: Categorical Data. Recall we have studied binomial data, in which each trial falls into one of 2 categories (success/failure). STAT 515 -- Chapter 13: Categorical Data Recall we have studied binomial data, in which each trial falls into one of 2 categories (success/failure). Many studies allow for more than 2 categories. Example

More information

ME3620. Theory of Engineering Experimentation. Spring Chapter IV. Decision Making for a Single Sample. Chapter IV

ME3620. Theory of Engineering Experimentation. Spring Chapter IV. Decision Making for a Single Sample. Chapter IV Theory of Engineering Experimentation Chapter IV. Decision Making for a Single Sample Chapter IV 1 4 1 Statistical Inference The field of statistical inference consists of those methods used to make decisions

More information

Case-Control Association Testing. Case-Control Association Testing

Case-Control Association Testing. Case-Control Association Testing Introduction Association mapping is now routinely being used to identify loci that are involved with complex traits. Technological advances have made it feasible to perform case-control association studies

More information

11-2 Multinomial Experiment

11-2 Multinomial Experiment Chapter 11 Multinomial Experiments and Contingency Tables 1 Chapter 11 Multinomial Experiments and Contingency Tables 11-11 Overview 11-2 Multinomial Experiments: Goodness-of-fitfit 11-3 Contingency Tables:

More information

Statistics 3858 : Contingency Tables

Statistics 3858 : Contingency Tables Statistics 3858 : Contingency Tables 1 Introduction Before proceeding with this topic the student should review generalized likelihood ratios ΛX) for multinomial distributions, its relation to Pearson

More information

Econ 325: Introduction to Empirical Economics

Econ 325: Introduction to Empirical Economics Econ 325: Introduction to Empirical Economics Chapter 9 Hypothesis Testing: Single Population Ch. 9-1 9.1 What is a Hypothesis? A hypothesis is a claim (assumption) about a population parameter: population

More information

Application of Variance Homogeneity Tests Under Violation of Normality Assumption

Application of Variance Homogeneity Tests Under Violation of Normality Assumption Application of Variance Homogeneity Tests Under Violation of Normality Assumption Alisa A. Gorbunova, Boris Yu. Lemeshko Novosibirsk State Technical University Novosibirsk, Russia e-mail: gorbunova.alisa@gmail.com

More information

Economics Introduction to Econometrics - Fall 2007 Final Exam - Answers

Economics Introduction to Econometrics - Fall 2007 Final Exam - Answers Student Name: Economics 4818 - Introduction to Econometrics - Fall 2007 Final Exam - Answers SHOW ALL WORK! Evaluation: Problems: 3, 4C, 5C and 5F are worth 4 points. All other questions are worth 3 points.

More information

Bivariate Relationships Between Variables

Bivariate Relationships Between Variables Bivariate Relationships Between Variables BUS 735: Business Decision Making and Research 1 Goals Specific goals: Detect relationships between variables. Be able to prescribe appropriate statistical methods

More information

Psych Jan. 5, 2005

Psych Jan. 5, 2005 Psych 124 1 Wee 1: Introductory Notes on Variables and Probability Distributions (1/5/05) (Reading: Aron & Aron, Chaps. 1, 14, and this Handout.) All handouts are available outside Mija s office. Lecture

More information

Lecture 7: Hypothesis Testing and ANOVA

Lecture 7: Hypothesis Testing and ANOVA Lecture 7: Hypothesis Testing and ANOVA Goals Overview of key elements of hypothesis testing Review of common one and two sample tests Introduction to ANOVA Hypothesis Testing The intent of hypothesis

More information

Inferences for Correlation

Inferences for Correlation Inferences for Correlation Quantitative Methods II Plan for Today Recall: correlation coefficient Bivariate normal distributions Hypotheses testing for population correlation Confidence intervals for population

More information

STT 843 Key to Homework 1 Spring 2018

STT 843 Key to Homework 1 Spring 2018 STT 843 Key to Homework Spring 208 Due date: Feb 4, 208 42 (a Because σ = 2, σ 22 = and ρ 2 = 05, we have σ 2 = ρ 2 σ σ22 = 2/2 Then, the mean and covariance of the bivariate normal is µ = ( 0 2 and Σ

More information

Chapter 9 Inferences from Two Samples

Chapter 9 Inferences from Two Samples Chapter 9 Inferences from Two Samples 9-1 Review and Preview 9-2 Two Proportions 9-3 Two Means: Independent Samples 9-4 Two Dependent Samples (Matched Pairs) 9-5 Two Variances or Standard Deviations Review

More information

Black White Total Observed Expected χ 2 = (f observed f expected ) 2 f expected (83 126) 2 ( )2 126

Black White Total Observed Expected χ 2 = (f observed f expected ) 2 f expected (83 126) 2 ( )2 126 Psychology 60 Fall 2013 Practice Final Actual Exam: This Wednesday. Good luck! Name: To view the solutions, check the link at the end of the document. This practice final should supplement your studying;

More information

Finite-sample quantiles of the Jarque-Bera test

Finite-sample quantiles of the Jarque-Bera test Finite-sample quantiles of the Jarque-Bera test Steve Lawford Department of Economics and Finance, Brunel University First draft: February 2004. Abstract The nite-sample null distribution of the Jarque-Bera

More information

Tentative solutions TMA4255 Applied Statistics 16 May, 2015

Tentative solutions TMA4255 Applied Statistics 16 May, 2015 Norwegian University of Science and Technology Department of Mathematical Sciences Page of 9 Tentative solutions TMA455 Applied Statistics 6 May, 05 Problem Manufacturer of fertilizers a) Are these independent

More information

Ch. 1: Data and Distributions

Ch. 1: Data and Distributions Ch. 1: Data and Distributions Populations vs. Samples How to graphically display data Histograms, dot plots, stem plots, etc Helps to show how samples are distributed Distributions of both continuous and

More information

Contingency Tables. Contingency tables are used when we want to looking at two (or more) factors. Each factor might have two more or levels.

Contingency Tables. Contingency tables are used when we want to looking at two (or more) factors. Each factor might have two more or levels. Contingency Tables Definition & Examples. Contingency tables are used when we want to looking at two (or more) factors. Each factor might have two more or levels. (Using more than two factors gets complicated,

More information

Inferential statistics

Inferential statistics Inferential statistics Inference involves making a Generalization about a larger group of individuals on the basis of a subset or sample. Ahmed-Refat-ZU Null and alternative hypotheses In hypotheses testing,

More information

Homework 9 Sample Solution

Homework 9 Sample Solution Homework 9 Sample Solution # 1 (Ex 9.12, Ex 9.23) Ex 9.12 (a) Let p vitamin denote the probability of having cold when a person had taken vitamin C, and p placebo denote the probability of having cold

More information

Problem set 1 - Solutions

Problem set 1 - Solutions EMPIRICAL FINANCE AND FINANCIAL ECONOMETRICS - MODULE (8448) Problem set 1 - Solutions Exercise 1 -Solutions 1. The correct answer is (a). In fact, the process generating daily prices is usually assumed

More information

Unit 6 - Introduction to linear regression

Unit 6 - Introduction to linear regression Unit 6 - Introduction to linear regression Suggested reading: OpenIntro Statistics, Chapter 7 Suggested exercises: Part 1 - Relationship between two numerical variables: 7.7, 7.9, 7.11, 7.13, 7.15, 7.25,

More information

Introduction to Statistical Data Analysis Lecture 7: The Chi-Square Distribution

Introduction to Statistical Data Analysis Lecture 7: The Chi-Square Distribution Introduction to Statistical Data Analysis Lecture 7: The Chi-Square Distribution James V. Lambers Department of Mathematics The University of Southern Mississippi James V. Lambers Statistical Data Analysis

More information

ij i j m ij n ij m ij n i j Suppose we denote the row variable by X and the column variable by Y ; We can then re-write the above expression as

ij i j m ij n ij m ij n i j Suppose we denote the row variable by X and the column variable by Y ; We can then re-write the above expression as page1 Loglinear Models Loglinear models are a way to describe association and interaction patterns among categorical variables. They are commonly used to model cell counts in contingency tables. These

More information

Chapte The McGraw-Hill Companies, Inc. All rights reserved.

Chapte The McGraw-Hill Companies, Inc. All rights reserved. er15 Chapte Chi-Square Tests d Chi-Square Tests for -Fit Uniform Goodness- Poisson Goodness- Goodness- ECDF Tests (Optional) Contingency Tables A contingency table is a cross-tabulation of n paired observations

More information

AP Statistics Cumulative AP Exam Study Guide

AP Statistics Cumulative AP Exam Study Guide AP Statistics Cumulative AP Eam Study Guide Chapters & 3 - Graphs Statistics the science of collecting, analyzing, and drawing conclusions from data. Descriptive methods of organizing and summarizing statistics

More information

Summary of Chapter 7 (Sections ) and Chapter 8 (Section 8.1)

Summary of Chapter 7 (Sections ) and Chapter 8 (Section 8.1) Summary of Chapter 7 (Sections 7.2-7.5) and Chapter 8 (Section 8.1) Chapter 7. Tests of Statistical Hypotheses 7.2. Tests about One Mean (1) Test about One Mean Case 1: σ is known. Assume that X N(µ, σ

More information

Y (Nominal/Categorical) 1. Metric (interval/ratio) data for 2+ IVs, and categorical (nominal) data for a single DV

Y (Nominal/Categorical) 1. Metric (interval/ratio) data for 2+ IVs, and categorical (nominal) data for a single DV 1 Neuendorf Discriminant Analysis The Model X1 X2 X3 X4 DF2 DF3 DF1 Y (Nominal/Categorical) Assumptions: 1. Metric (interval/ratio) data for 2+ IVs, and categorical (nominal) data for a single DV 2. Linearity--in

More information