Understand the difference between symmetric and asymmetric measures

Similar documents
Chapter 16: Correlation

Textbook Examples of. SPSS Procedure

Ø Set of mutually exclusive categories. Ø Classify or categorize subject. Ø No meaningful order to categorization.

Bivariate Relationships Between Variables

Ø Set of mutually exclusive categories. Ø Classify or categorize subject. Ø No meaningful order to categorization.

NON-PARAMETRIC STATISTICS * (

Lecture 8: Summary Measures

Frequency Distribution Cross-Tabulation

Finding Relationships Among Variables

Chapter 16: Correlation

Correlation. A statistics method to measure the relationship between two variables. Three characteristics

Measuring Associations : Pearson s correlation

CORELATION - Pearson-r - Spearman-rho

UNIT 4 RANK CORRELATION (Rho AND KENDALL RANK CORRELATION

Association Between Variables Measured at the Ordinal Level

Chapter 9: Association Between Variables Measured at the Ordinal Level

Can you tell the relationship between students SAT scores and their college grades?

Overview. Overview. Overview. Specific Examples. General Examples. Bivariate Regression & Correlation

THE PEARSON CORRELATION COEFFICIENT

Statistics Introductory Correlation

Last Lecture. Distinguish Populations from Samples. Knowing different Sampling Techniques. Distinguish Parameters from Statistics

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007)

Correlation & Linear Regression. Slides adopted fromthe Internet

CHAPTER 14: SUPPLEMENT

Chapter 8. Linear Regression. Copyright 2010 Pearson Education, Inc.

Nonparametric statistic methods. Waraphon Phimpraphai DVM, PhD Department of Veterinary Public Health

Bivariate statistics: correlation

Spearman Rho Correlation

Dependence. Practitioner Course: Portfolio Optimization. John Dodson. September 10, Dependence. John Dodson. Outline.

Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami

Intro to Parametric & Nonparametric Statistics

The Flight of the Space Shuttle Challenger

Data Analysis as a Decision Making Process

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

About Bivariate Correlations and Linear Regression

After completing this chapter, you should be able to:

Contents. Acknowledgments. xix

Correlation: From Picture to Formula

Stages in scientific investigation: Frequency distributions and graphing data: Levels of measurement:

Slide 7.1. Theme 7. Correlation

Dependence. MFM Practitioner Module: Risk & Asset Allocation. John Dodson. September 11, Dependence. John Dodson. Outline.

Correlation: Relationships between Variables

Chapter 8. Linear Regression. The Linear Model. Fat Versus Protein: An Example. The Linear Model (cont.) Residuals

Non-parametric tests, part A:

CHAPTER 3. YAKUP ARI,Ph.D.(C)

ADMS2320.com. We Make Stats Easy. Chapter 4. ADMS2320.com Tutorials Past Tests. Tutorial Length 1 Hour 45 Minutes

2 Describing Contingency Tables

Business Statistics. Lecture 10: Correlation and Linear Regression

Chapter Four. Numerical Descriptive Techniques. Range, Standard Deviation, Variance, Coefficient of Variation

Correlation and Linear Regression

An Overview of Methods in the Analysis of Dependent Ordered Categorical Data: Assumptions and Implications

Descriptive Statistics-I. Dr Mahmoud Alhussami

Chapter 5 Statistical Analysis of Cross-Tabs D. White and A. Korotayev May 2003 Introduction

Review. Number of variables. Standard Scores. Anecdotal / Clinical. Bivariate relationships. Ch. 3: Correlation & Linear Regression

CDA Chapter 3 part II

Notes 21: Scatterplots, Association, Causation

Unit 2. Describing Data: Numerical

Introduction to inferential statistics. Alissa Melinger IGK summer school 2006 Edinburgh

Wed, June 26, (Lecture 8-2). Nonlinearity. Significance test for correlation R-squared, SSE, and SST. Correlation in SPSS.

Linear Regression. Linear Regression. Linear Regression. Did You Mean Association Or Correlation?

Regression Analysis. BUS 735: Business Decision Making and Research. Learn how to detect relationships between ordinal and categorical variables.

Unit 14: Nonparametric Statistical Methods

Chapter 5 Statistical Analysis of Cross-Tabs Html links are live Some new text added in Blue 30 Oct 2004 Introduction Descriptive statistics

BIOL 4605/7220 CH 20.1 Correlation

Class 11 Maths Chapter 15. Statistics

Reminder: Student Instructional Rating Surveys

Statistics for Managers using Microsoft Excel 6 th Edition

DETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics

Correlation and Regression Bangkok, 14-18, Sept. 2015

Readings Howitt & Cramer (2014) Overview

Notes 6: Correlation

Readings Howitt & Cramer (2014)

LOOKING FOR RELATIONSHIPS

Data files for today. CourseEvalua2on2.sav pontokprediktorok.sav Happiness.sav Ca;erplot.sav

Association Between Variables Measured at the Interval-Ratio Level: Bivariate Correlation and Regression

Research Methodology: Tools

Biostatistics: Correlations

Age 55 (x = 1) Age < 55 (x = 0)

Instrumentation (cont.) Statistics vs. Parameters. Descriptive Statistics. Types of Numerical Data

Measuring relationships among multiple responses

Non-parametric methods

Analyzing Small Sample Experimental Data

DISCUSSION PAPER 2016/46

Workshop Research Methods and Statistical Analysis

Financial Econometrics and Volatility Models Copulas

psychological statistics

Centrality Measures. Leonid E. Zhukov

Hypothesis Testing hypothesis testing approach

Probability Methods in Civil Engineering Prof. Rajib Maity Department of Civil Engineering Indian Institute of Technology, Kharagpur

Ch. 16: Correlation and Regression

Correlation. We don't consider one variable independent and the other dependent. Does x go up as y goes up? Does x go down as y goes up?

Regression Analysis. BUS 735: Business Decision Making and Research

Glossary for the Triola Statistics Series

1 A Review of Correlation and Regression

Multiple Variable Analysis

Centrality Measures. Leonid E. Zhukov

Chapter 13 Correlation

Relationships between variables. Visualizing Bivariate Distributions: Scatter Plots

In many situations, there is a non-parametric test that corresponds to the standard test, as described below:

Introduction and Descriptive Statistics p. 1 Introduction to Statistics p. 3 Statistics, Science, and Observations p. 5 Populations and Samples p.

Transcription:

Chapter 9 Measures of Strength of a Relationship Learning Objectives Understand the strength of association between two variables Explain an association from a table of joint frequencies Understand a proportional reduction of errors Understand the difference between symmetric and asymmetric measures Learning Objectives Calculate the strength of association for nominal, ordinal, interval, and ratio level data Interpret the strength of association for using lamba, Somers d, Spearman s rho, and Pearson s r Explain and interpret a correlation matrix Introduction Measure of association Important to know the strength of association Descriptive statistical procedures, not inferential ones What is Association? Improve prediction of a value based on X Predict category Y and are told category X Proportional error reduction Calculating how much better the prediction of DV with the knowledge of some IV over knowledge of DV alone.

Nominal Level Data Nominal Level Data Calculating Lambda Number of errors made predicting Y with no information Minus number of errors made predicting Y with information from X Divide by number of errors made predicting Y with no information Nominal Level Data Interpreting Lambda Range values = 0 to 1 0 = no strength 1 = perfect association Between 0 and 1 shows error predicting Y mode X and Y should be either: Ordinal Dichotomized nominal Categorized interval Can tell you both strength and direction of association Goal: Predict Rank order score pairs rather than modal value

1 if rank case knowledge of one perfectly predicts the other 0 if rank case knowledge of one cannot predict the other Nothing gained by including IV Five possibilities of pairing: Concordant Pairs: X and Y ranked same order Discordant Pairs: Ranked oppositely Pairs tied on Y but not X Pairs tied on X but not Y Pairs tied on X and Y Values for ordinal level variables: Range = ± 1 to 0 Positive values = positive relationship Negative values = negative relationship Increased # concordant and discordant = MOA closer to 1 If same #, MOA = 0 Options for analyzing ordinal variables: Partially ordered and ordinal: Tau b and c Somers d

Fully ordered: Spearman s Rho All ordinal MOA have same numerator Tau Three measures for ordinal variables: Tau a = simplest MOA [squared tables] Tau b = square root of two Somers d s [squared tables] % reduction in errors Tau c [for unequal tables] Tau a, b and c range = 1 to 0 to +1 Positive value: more concordant than discordant Negative value: more discordant than concordant Compares the number of times variables are the same with the number times they are different PRE measure only for untied pairs If ties exist, Gamma better than Tau If no ties, Gamma and Tau same If many ties, Gamma gives conservative assessment Can be used with any size table If all other pairs tied, value of 1 based one pair Symmetric measure of association

Doesn t specify which variable is the IV Calculating Gamma Ratio of concordant minus discordant pairs, divided by concordant pairs plus discordant pairs Interpreting Gamma Interpreted essentially same as Tau except with no ties Ratio of Gamma determines sign of Gamma Higher values = stronger positive or negative association Range from 1 to 0 to +1 Somers d Asymmetric counterpart to Gamma Used on any size table Interpreted essentially same as Tau and Gamma Range = 1 to 0 to +1 +1 when all pairs concordant 1 when all pairs discordant Spearman s Rho Measures differences in ranks of individual cases Measures strength, significance, and direction Not a PRE measure Pearson s r applied to ranks of data points Spearman s Rho

+1 if ranks match perfectly 1 if ranked inversely 0 if no association Limitation influenced by a large number of ties of the ranks Interval Level Data Advanced, interval analyses possible if both variables are interval or ratio Supports conclusion X might be cause of Y Proportion of change in one variable explained by change in another Requires linearity Interval Level Data Goals and methods Better prediction (than mean) for exact score of Y Use straight line Best straight line summarizes linear relationship Least squares method Establishes line where sum of squares is smallest Must be straight line Pearson s r It does the following: Measures strength Examines existence Determines direction

Least squares line (LSL) Measures spread around LSL and line slope Spread amount shows association strength Pearson s r Value range = ±1 +1 = perfect positive relationship 1 = perfect negative relationship 0 usually = no relationship If relationship is not curvilinear r 2 also called coefficient of determination Pearson s r Limitations May show low strength though definite patterns Influenced by sample size Weak correlations may be significant when sample size is large Unstable correlations when sample is small Conclusions: Selecting the Most Appropriate Measure of Strength Determine if the relationship has any ability to predict values of the dependent variable Measures of significance do not provide all needed information Measure of strength determines ability to predict mode, median, or mean Conclusions: Selecting the Most Appropriate Measure of Strength Choose measure that represents the data Advisable to examine the direction and nature of the data