Measures of Association for I J tables based on Pearson's 2 Φ 2 = Note that I 2 = I where = n J i=1 j=1 J i=1 j=1 I i=1 j=1 (ß ij ß i+ ß +j ) 2 ß i+ ß

Size: px
Start display at page:

Download "Measures of Association for I J tables based on Pearson's 2 Φ 2 = Note that I 2 = I where = n J i=1 j=1 J i=1 j=1 I i=1 j=1 (ß ij ß i+ ß +j ) 2 ß i+ ß"

Transcription

1 Correlation Coefficient Y = 0 Y = 1 = 0 ß11 ß12 = 1 ß21 ß22 Product moment correlation coefficient: ρ = Corr(; Y ) E() = ß 2+ = ß 21 + ß 22 = E(Y ) E()E(Y ) q V ()V (Y ) E(Y ) = ß 2+ = ß 21 + ß 22 = ß 22 ß2+ ß +2 p ß 1+ ß 2+ ß +1 ß +2 E(Y ) = ß 22 V () = ß 2+ (1 ß 2+ ) = ß 1+ ß 2+ V (Y ) = ß +2 (1 ß +2 ) = ß +1 ß Estimation: Properties: r = p 11p22 p12p21 p p 1+ p 2+ p +1 p +2 = Y 11Y22 Y12Y21 r Y 1+ Y 2+ Y +1 Y » ρ» 1 2. Independence model ) ρ = 0 3. ρ = 1 when ß12 = ß21 = 0 ρ = 01 when ß11 = ß22 = 0 4. ρ is margin sensitive Note that and r 2 = 2 n 2 = n(y 11Y22 Y12Y21) 2 r Y1+ Y 2+ Y +1 Y +2 for 2 2 tables

2 Measures of Association for I J tables based on Pearson's 2 Φ 2 = Note that I 2 = I where = n J i=1 j=1 J i=1 j=1 I i=1 j=1 (ß ij ß i+ ß +j ) 2 ß i+ ß +j Y ij Y i+ Y +j J n Y i+ Y +j n 2 (P ij P i+ P +j ) 2 P i+ P +j Then ^Φ 2 = 2 n is a consistent estimator of Φ 2, but 0» Φ 2» minfi 1; J 1g n = Y ++ P ij = Y ij =n P i+ = Y i+ =n P +j = Y +j =n Pearson's measure of mean square contingency Cramer's V Note that 0» P» r I 1 I Estimation: vu u P = t Φ2 Φ for I I tables vu ^P = u 2 t 2 + n vu V = u t Esitmation: vu ^V = u t Φ 2 minfi 1; J 1g 0» V» 1 2 =n minfi 1; J 1g

3 /* This program is stored as assoc.sas */ Example: Association between diagnosis and treatment prescribed by psychiatrists in New Haven, Conn. (1950) /* This program uses PROC FREQ in SAS to test for independence between diagnosis and perscribed Treatment Psycho- Organic Custodial therapy Therapy Care Affective Alcoholic Organic Schizophrenic Senile treatment in the 1950 New Haven study. */ DATA SET1; INPUT D T ; LABEL D = DIAGNOSIS T = TREATMENT; CARDS; RUN; PROC PRINT DATA=SET1; TITLE 'DATA FOR THE NEW HAVEN STUDY'; PROC FORMAT; VALUE DFMT 1='AFFECTIVE' 2='ALCOHOLIC' 3='ORGANIC' 4='SCHIZOPHRENIC' 5='SENILE'; VALUE TFMT 1='PSYCHOTHERAPY' 2='ORGANIC THERAPY' 3='CUSTODIAL CARE'; RUN; 539 PROC FREQ DATA=SET1 ORDER=INTERNAL; TABLES D*T / CHISQ MEASURES SCORES=TABLE ALPHA=.05 NOPERCENT NOCOL EPECTED; WEIGHT ; FORMAT D DFMT.; FORMAT T TFMT.; TITLE 'ANALYSIS OF THE NEW HAVEN DATA'; RUN; 540

4 ANALYSIS OF THE NEW HAVEN DATA DATA FOR THE NEW HAVEN STUDY Table of D by T Obs D T D(DIAGNOSIS) T(TREATMENT) Frequency Expected Row Pct PSYCHO- ORGANIC CUSTODIAL Total THERAPY THERAPY CARE AFFECTIVE ALCOHOLIC ORGANIC SCHIZOPHRENIC SENILE Total Statistics for Table of D by T Statistic DF Value Prob Chi-Square <.0001 Likelihood Ratio Chi-Square <.0001 Mantel-Haenszel Chi-Square Phi Coefficient Contingency Coefficient Cramer's V Statistic Value ASE Gamma Kendall's Tau-b Stuart's Tau-c Somers' D C R Somers' D R C Pearson Correlation Spearman Correlation Lambda Asymmetric C R Lambda Asymmetric R C Lambda Symmetric Uncertainty Coefficient C R Uncertainty Coefficient R C Uncertainty Coefficient Sym Sample Size =

5 Proportional Reduction in Error (PRE) PRE = minimum prob. of erroneous prediction assuming independence minimum prob. of erroneous prediction under the alternative 2 minimum probability of 6 4 erroneous prediction assuming independence with estimate (1 P +;max ) (1 I ^ C=R = 1 P +;max P i;max ) i=1 The large sample variance is estimated as Predicting the column category from the row category C=R = (1 ß +;max ) (1 I 1 ß +;max ß i;max ) i=1 ^ff 2 1(^ C=R) = 1 N 2 (1 I i=1 6 4 P i;max )( i P i;max + P +;max 2 (1 P +;max ) 3 Λ i P i;max ) New Haven (1950) study: P 11 = :0177 P 12 = :0602 P 13 = :1653 ψ P 1;max P 21 = :0283 P 22 = :0136 P 23 = :0118 P 31 = :0112 P 32 = :0472 P 32 = :0443 P 41 = :0714 P 42 = :2031 P 42 = :2255 P 51 = :0106 P 52 = :0065 P 52 = :0832 P +1 = :1393 P +2 = :3306 P +3 = 5301 ψ P +;max ^ R=C = = 0 and (1 :5000) (1 [: : ]) (1 :5000) ^ff 2 1(^ R=C ) = 0 ^ C=R = (1 :5301) (1 (: : : : :0832)) = :4699 :0505 = 041 :4699 ^ff1;^ C=R = 018 (1 :5301) In this case the 4-th row contains the largest proportion of patients for each column. So it appears that C=R >

6 Properties: 1. 0» C=R» 1 2. C=R = 0 if knowledge of the row category does not improve the probability of correctly predicting column category 3. C=R = 1 0 1/12 0 1/ / / C=R is not always equal to R=C 5. C=R is not affected by permuting rows and/or columns. This makes C=R a suitable measure for tabales defined by nominal variables. Other measures of association for nominal categorical variables: Agresti(1990, pp 22-26) Lloyd(1999, pp 70-71) Concentration coefficient: ß 2 ij =ß i+ ß 2 +j i j j fi C=R = 1 ß 2 +j j Uncertainty coefficient: U C=R = i j ß ij log(ß ij =ß i+ ß+j) j ß+j log(ß+j)

7 Measures of association for I J tables defined by two ordinal variables. Example: Each student in a random" sample of N = 21 sociology students at the Univ. of Michigan was cross-classified with respect to responses to two items. Concern with Willingness proper behavior to join an organization Low moderate high Low moderate high Consider the responses to these two items for two different students (i 1 ; j 1 ) gives the row and column categories for the responses of student 1. (i 2 ; j 2 ) gives the row and column categories for the responses of student 2. Concordant Pairs Low! High Low 22 # High ( ) Concordant pair of students: either i 1 > i 2 j 1 > j 2 or i 1 < i 2 j 1 < j 2 Discordant pair of students: either i 1 > i 2 j 1 > j 2 or i 1 < i 2 j 1 < j 2 Neither concordant nor discordant: i 1 = i 2 and/or j 1 = J ( ) 556

8 Low! High 5 Low 22 # 60 none High 18 60( ) ( ) 27( ) None ( ) 31( )

9 Discordant Pairs None 5( ) (26 + 8) 26(20 + 5) (8 + 31) 8 60(5 + 8) ( )

10 (5 + 27) Number of concordant pairs: 2 P = I J 6 Y ij 4 Y k` + i=1 j=1 k>i `>j k<i `<j = 12; 492 Y k`3 7 5 None 18 Number of discordant pairs: 2 Q = I J 6 Y ij 4 Y k` + i=1 j=1 k>i `<j k<i `>j = 5; 676 Y k` Concern with Willingness proper behavior to join an organization Low moderate high Low moderate high Kendall's Tau b : fi b = r N(N 1) i Y i+(y i+ 1) s P Q N(N 1) j Goodman-Kruskal Gamma: ^fl = P Q P + Q = Y +j(y +j 1) 12; 492 5; ; ; 676 = 0:

11 Properties of ^fl: Large sample variance estimate: ^ff 2 1;^fl = h 16 (P +Q) 4 j j P k>i Y ij nq k>i `<j Y k` + k<i `>j Y k` + k<i 2 i Y k` o `>j `<j Y k` 1. ^fl = 1 2. ^fl = 1 Low! High Low # # High Low! High Low # # High ^fl = 0 for the case where P ij = P i+ P +j. Also for other cases, e.g., 4. j^flj j^fi b j Spearman's rho (corrected for ties) ^ρ = 1 12 h Y ij i j k<i vu u t h N 3 N i Y k+ + Y i+ 2 N 2 Y 3 i+ Y i+ ih i h `<j Y +e + Y i +j N 2 2 i N 3 N J Y 3 +j Y +j j=1 5. For 2 2 tables, ^fl is the estimate of Yule's Q

12 Product-moment correlation coefficient: i 1 j1 r = vu u t I Y i+ i1 I J Y ij N (r i μr)(c j μc) vu N (r i μr) 2 u t J j=1 Y +j N (c j μc) 2 where r 1 ; r 2 ; ; r I are the row scores and c 1 ; c 2 ; ; c J are the column scores μr = 1 N μc = 1 N I Y i+ r i i=1 J Y +j c j j=1 Measure of agreement for I I tables Judge 2 A B C D A ß 11 ß 12 ß 13 ß 14 ß 1+ B ß 21 ß 22 ß 23 ß 24 ß 2+ Judge 1 C ß 31 ß 32 ß 33 ß 34 ß 3+ D ß 41 ß 42 ß 43 ß 44 ß 4+ ß +1 ß +2 ß +3 ß +4 Consider r i = i c j = j i = 1; 2; : : : ; I j = 1; 2; : : : ; J Cohen's Kappa: i=1 K = where 1 = I 2 = I I ß ii I 1 I ß ii i=1 i=1 ß i+ ß +i i=1 1 2 = 1 2 ß i+ ß +i i=1 actual" probability of agreement ß i+ ß +i probability of chance agreement for independent classificatons Estimation: ^K = = I i=1 P ii I i=1 P i+ P +i 1 I i=1 P i+ P +i N P i Y ii Y i+ Y+i i N 2 Y i+ Y+i i

13 Estimate of the large sample variance: ^ff 2 1;^k ^ 1 = i ^ 2 = i ^ 3 = i ^ 4 = i P ii P i+ P+i P ii (P i+ + P+i) j P ij (P+i + P j+ ) 2 = 1 N [^ 1(1 ^ 1) (1 ^ 2) 2 + 2(1 ^ 1)(2^ 1^ 2 ^ 3) (1 ^ 2) 3 + (1 ^ 1) 2 (^ 4 4^ 2 2 ) (1 ^ 2) 4 ] Rating of student teachers by two supervisors (Gross, 1971, BFH, Chap. 11) Supervisor 2 Authori- Supervisor 1 tarian democratic Permissive Authoritarian Democratic Permissive N = 72 ^» = 0:361 ^ff 1;^» = p :0084 = : Properties of Kappa: An approximate 95% confidence interval for Kappa: 1. ^» = 0 if P ii = P i+ P+i ; i = 1; 2; : : : ^» ± (1:96)^ff 1;^» :361 ± : » ^»» 1 3. ^» = 1 if I i=1 P ii = 1 ) (:18; :54) P P P

14 ^» = :70 P 11 + P 22 = : P i+ P+i ^» = i 1 P i+ P+i i when there is no agreement". Kappa is sensitive to marginal distributions ^» = :32 P 11 + P 22 = : ^» = 0:13 P 11 + P 22 = : ^» = 0:26 P 11 + P 22 = : Weighted Kappa /* This program is stored as kappa.sas */ w ij ß ij w ij ß i+ ß+j i j i j» w = 1 w ij ß i+ ß+j j j Choices of weights: w ij = w ij = 8 >< >: 8 >< >: 1 i = j ) Kappa 0 i 6= j 1 i = j 1=2 j = i + 1 or j = j 1 0 otherwise (i j)2 w ij = 1 (I 1) /* First Use PROC FREG in SAS to compute kappa for the student teacher ratings. There are two options for specifying weights */ data set1; input sup1 sup2 count; cards; run; 584

15 proc format; value rating 1=Authoritarian 2=Democratic 3=Permissive; run; proc freq data=set1; tables sup1*sup2 / agree(wt=ca) printkwt alpha=.05 nocol norow; weight count; format sup1 rating. sup2 rating.; run; proc freq data=set1; tables sup1*sup2 / agree(wt=fc) printkwt alpha=.05 nocol norow; weight count; format sup1 rating. sup2 rating.; run; /* This part of the program uses PROC IML in SAS to compute either Kappa or a weighted kappa and the corresponding standard errors. It is applied to the student teacher rating data. */ PROC IML; START KAPPA; /* ENTER TABLE OF COUNTS */ = , , }; /* ENTER THE TABLE OF WEIGHTS; USE AN IDENTITY MATRI IF YOU DO NOT WANT WEIGHTED KAPPA */ W = , , }; /* BEGINNING OF MODULE TO COMPUTE KAPPA AND WEIGHTED KAPPA */ /* COMPUTE NUMBER OF ROWS AND NUMBER OF COLUMNS FOR */ NR = NROW(); NC = NCOL(); /* COMPUTE ROW AND COLUMN TOTALS FOR THE MATRI OF COUNTS */ R = (,+ ); C = ( +, ); /* COMPUTE TABLE OF EPECTED COUNTS FOR INDEPENDENT RANDOM AGREEMENT */ E = R*C; /* COMPUTE OVERALL TOTAL COUNT */ T = SUM(); 587 /* COMPUTE KAPPA */ K1 = SUM(DIAG())/T; K2 = SUM(DIAG(E))/(T**2); K3 = SUM(DIAG()*DIAG(R+C`))/(T**2); J1 = J(1,NR); J2 = J(NC,1); TT1 = ((C`*J1+J2*R`)##2)#; K4 = SUM(TT1)/(T**3); KAPPA = (K1 - K2)/(1 - K2); /* COMPUTE STANDARD ERRORS: S1 DOES NOT ASSUME INDEPENDENCE: S2 ASSUMES THE NULL HYPOTHESIS OF INDEPENDENCE IS TRUE */ S1 = (K1*(1-K1)/((1-K2)**2)+2* (1-K1)*(2*K1*K2-K3)/((1-K2)**3)+ ((1-K1)**2)*(K4-4*K2*K2)/ ((1-K2)**4))/T; S1 = SQRT(S1); S2 = (K2 + K2*K2 -(SUM(DIAG(E)*DIAG(R+C`)) /(T**3)))/(T*(1-K2)**2); S2 = SQRT(S2); 588

16 /* COMPUTE WEIGHTED KAPPA */ W = #W; EW = E#W; WR = (W*C`) / T; WC = (R`*W) / T; KW1 = SUM(W)/T; KW2 = SUM(EW)/(T**2); KAPPAW = (KW1 - KW2)/(1 - KW2); TT2 = (WR*J2`+J1`*WC); TT3 = (W*(1-KW2)-TT2*(1-KW1))##2; /* COMPUTE STANDARD ERRORS: SW1 DOES NOT ASSUME INDEPENDENCE: SW2 ASSUMES THE NULL HYPOTHESIS OF INDEPENDENCE IS TRUE */ SW1 = SUM(#TT3)/T; SW1 = (SW1 -(KW1*KW2-2*KW2+KW1)##2) /(T*(1-KW2)**4); SW1 = SQRT(SW1); SW2 = (W-TT2)##2; SW2 = ((SUM(E#SW2)/T**2)-(KW2##2)) /(T*(1-KW2)**2); SW2 = SQRT(SW2); 589 /* COMPUTE 95% CONFIDENCE INTERVALS AND TESTS OF THE HYPOTHESIS THAT THERE IS ONLY RANDOM AGRREMENT */ TK = KAPPA/S2; TKW = KAPPAW/SW2; TT4 = TK**2; PK = 1 - PROBCHI(TT4,1); TT4 = TKW**2; PKW = 1 - PROBCHI(TT4,1); CKL = KAPPA - (1.96)*S1; CKU = KAPPA + (1.96)*S1; CKWL = KAPPAW - (1.96)*SW1; CKWU = KAPPAW + (1.96)*SW1; 590 /* PRINT RESULTS */ The FREQ Procedure PRINT,,,"Unweighted Kappa statistic " KAPPA; PRINT," Standard error " S1; PRINT,"95% confidence interval (" CKL "," CKU ")"; PRINT,,,"Standard error when there "; PRINT "is only random agreement " S2; PRINT,,,"P-value for test of "; PRINT "completely random agreement" PK; PRINT,,," Weighted Kappa statistic " KAPPAW; PRINT," Standard error " SW1; PRINT,"95% confidence interval (" CKWL "," CKWU ")"; PRINT,,,"Standard error when there"; PRINT "is only random agreement " SW2; PRINT,,,"P-value for test of "; PRINT "completely random agreement" PKW; FINISH; RUN KAPPA; Table of sup1 by sup2 Frequency Percent Authori Democra Permiss Total tarian tic ive Authoritarian Democratic Permissive Total

17 Statistics for Table of sup1 by sup2 Test of Symmetry Statistic (S) DF 3 Pr > S Kappa Coefficient Weights (Fleiss-Cohen Form) Authori Democra Permiss tarian tic ive Kappa Coefficient Weights Authori Democra Permiss tarian tic ive Authoritarian Democratic Permissive Authoritarian Democratic Permissive Kappa Statistics Statistic Value ASE 95% Confidence Limits Simple Kappa Weighted Kappa Kappa Statistics Statistic Value ASE 95% Confidence Limits Simple Kappa Weighted Kappa Sample Size = KAPPA Unweighted Kappa statistic KAPPAW Weighted Kappa statistic S1 Standard error SW1 Standard error CKL CKU 95% confidence interval ( , ) CKWL CKWU 95% confidence interval ( , ) Standard error when there S2 is only random agreement Standard error when there SW2 is only random agreement P-value for test of PK completely random agreement P-value for test of PKW completely random agreement

18 # This file contains Splus code # for computing a Kappa statistic, # or weighted Kappa statistic, # standard errors and confidence # intervals. It is applied to the # student teacher data. # The file is stored as kappa.ssc # Enter the observed counts x <- matrix(c(17, 4, 8, 5, 12, 0, 10, 3, 13), 3, 3, byrow=t) # Enter the weights w<-matrix(c(1.0, 0.5, 0.0, 0.5, 1.0, 0.5, 0.0, 0.5, 1.0), 3, 3, byrow=t) # Compute expected counts for random # agreement n <- sum(x) xr <- apply(x, 1, sum) xc <- apply(x, 2, sum) one <- rep(1, length(xr)) e <- outer(xr, xc)/n # Compute Kappa k1 <- sum(diag(x))/n k2 <- sum(diag(e))/n k3 <- sum(diag(x)*diag(xr+xc))/(n*n) k4 <- sum(((outer(xc, one)+ outer(one, xr))**2)*x)/(n**3) kappa <- (k1-k2)/(1-k2) # Compute weighted Kappa # Compute standard errors: # s1 does not assume random agreement # s2 assumes only random agreement s11 <- (k1*(1-k1)/((1-k2)**2)+2*(1-k1)* (2*k1*k2-k3)/((1-k2)**3)+ ((1-k1)**2)*(k4-4*k2*k2)/((1-k2)**4))/n s1 <- s11**.5 s22 <- (k2+k2*k2-(sum(diag(e)*diag(xr+xc)) /(n**2)))/(n*(1-k2)**2) s2 <- s22** xw <- x*w ew <- e*w wr <- apply(w*xc, 2, sum)/n wc <- apply(w*xr, 2, sum)/n kw1 <- sum(xw)/n kw2 <- sum(ew)/n tt2 <- outer(wr, one)+outer(one, wc) tt3 <- ((w*(1-kw2))-(tt2*(1-kw1)))**2 kappaw <- (kw1-kw2)/(1-kw2) # Compute standard errors: # sw11 does not assume random agreement # sw22 assumes only random agreement sw11 <- sum(x*tt3)/n sw11 <- (sw11-(kw1*kw2-2*kw2+kw1)**2)/ (n*(1-kw2)**4) sw1 <- sw11**.5 sw22 <- (w-tt2)**2 sw22 <- ((sum(e*sw22)/n)-(kw2**2))/ (n*(1-kw2)**2) sw2 <- sw22**.5 600

19 # print results # Construct 95% confidence intervals # and tests for random agreement tk <- kappa/s2 tkw <- kappaw/sw2 tt4 <- tk**2 pk <- (1-pchisq(tt4, 1)) tt4 <- tkw**2 pkw <-(1-pchisq(tt4, 1)) ckl <- kappa-(1.96)*s1 cku <- kappa+(1.96)*s1 ckwl <- kappaw-(1.96)*sw1 ckwu <- kappaw+(1.96)*sw1 cat(" n", " Unweighted Kappa = ", signif(kappa,5)) cat(" n", " Standard error = ", signif(s1,5)) cat(" n", "95% confidence interval: ", signif(ckl,5), signif(cku,5)) cat(" n", "p-value for test of random ", "agreement = ", signif(pk,5)) cat(" n", " Weighted Kappa = ", signif(kappaw,5)) cat(" n", " Standard error = ", signif(sw1,5)) cat(" n", "95% confidence interval: ", signif(ckwl,5), signif(ckwu,5)) cat(" n", "p-value for test of random ", " agreement = ", signif(pkw,5)) You can source this code into S-PLUS and obtain the results by issuing the following command at the prompt in the S-PLUS command window: source("yourdirectory/kappa.ssc") The results are shown below. Unweighted Kappa = Standard error = % confidence interval: p-value for test of random agreement = e-005 Weighted Kappa = Standard error = % confidence interval: p-value for test of random agreement =

Means or "expected" counts: j = 1 j = 2 i = 1 m11 m12 i = 2 m21 m22 True proportions: The odds that a sampled unit is in category 1 for variable 1 giv

Means or expected counts: j = 1 j = 2 i = 1 m11 m12 i = 2 m21 m22 True proportions: The odds that a sampled unit is in category 1 for variable 1 giv Measures of Association References: ffl ffl ffl Summarize strength of associations Quantify relative risk Types of measures odds ratio correlation Pearson statistic ediction concordance/discordance Goodman,

More information

2 Describing Contingency Tables

2 Describing Contingency Tables 2 Describing Contingency Tables I. Probability structure of a 2-way contingency table I.1 Contingency Tables X, Y : cat. var. Y usually random (except in a case-control study), response; X can be random

More information

Lecture 8: Summary Measures

Lecture 8: Summary Measures Lecture 8: Summary Measures Dipankar Bandyopadhyay, Ph.D. BMTRY 711: Analysis of Categorical Data Spring 2011 Division of Biostatistics and Epidemiology Medical University of South Carolina Lecture 8:

More information

Ordinal Variables in 2 way Tables

Ordinal Variables in 2 way Tables Ordinal Variables in 2 way Tables Edps/Psych/Soc 589 Carolyn J. Anderson Department of Educational Psychology c Board of Trustees, University of Illinois Fall 2018 C.J. Anderson (Illinois) Ordinal Variables

More information

Chapter 19. Agreement and the kappa statistic

Chapter 19. Agreement and the kappa statistic 19. Agreement Chapter 19 Agreement and the kappa statistic Besides the 2 2contingency table for unmatched data and the 2 2table for matched data, there is a third common occurrence of data appearing summarised

More information

E509A: Principle of Biostatistics. (Week 11(2): Introduction to non-parametric. methods ) GY Zou.

E509A: Principle of Biostatistics. (Week 11(2): Introduction to non-parametric. methods ) GY Zou. E509A: Principle of Biostatistics (Week 11(2): Introduction to non-parametric methods ) GY Zou gzou@robarts.ca Sign test for two dependent samples Ex 12.1 subj 1 2 3 4 5 6 7 8 9 10 baseline 166 135 189

More information

Textbook Examples of. SPSS Procedure

Textbook Examples of. SPSS Procedure Textbook s of IBM SPSS Procedures Each SPSS procedure listed below has its own section in the textbook. These sections include a purpose statement that describes the statistical test, identification of

More information

ST3241 Categorical Data Analysis I Two-way Contingency Tables. Odds Ratio and Tests of Independence

ST3241 Categorical Data Analysis I Two-way Contingency Tables. Odds Ratio and Tests of Independence ST3241 Categorical Data Analysis I Two-way Contingency Tables Odds Ratio and Tests of Independence 1 Inference For Odds Ratio (p. 24) For small to moderate sample size, the distribution of sample odds

More information

Testing Independence

Testing Independence Testing Independence Dipankar Bandyopadhyay Department of Biostatistics, Virginia Commonwealth University BIOS 625: Categorical Data & GLM 1/50 Testing Independence Previously, we looked at RR = OR = 1

More information

BIOMETRICS INFORMATION

BIOMETRICS INFORMATION BIOMETRICS INFORMATION (You re 95% likely to need this information) PAMPHLET NO. # 41 DATE: September 18, 1992 SUBJECT: Power Analysis and Sample Size Determination for Contingency Table Tests Statistical

More information

Understand the difference between symmetric and asymmetric measures

Understand the difference between symmetric and asymmetric measures Chapter 9 Measures of Strength of a Relationship Learning Objectives Understand the strength of association between two variables Explain an association from a table of joint frequencies Understand a proportional

More information

Epidemiology Wonders of Biostatistics Chapter 13 - Effect Measures. John Koval

Epidemiology Wonders of Biostatistics Chapter 13 - Effect Measures. John Koval Epidemiology 9509 Wonders of Biostatistics Chapter 13 - Effect Measures John Koval Department of Epidemiology and Biostatistics University of Western Ontario What is being covered 1. risk factors 2. risk

More information

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007)

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007) FROM: PAGANO, R. R. (007) I. INTRODUCTION: DISTINCTION BETWEEN PARAMETRIC AND NON-PARAMETRIC TESTS Statistical inference tests are often classified as to whether they are parametric or nonparametric Parameter

More information

BIOS 625 Fall 2015 Homework Set 3 Solutions

BIOS 625 Fall 2015 Homework Set 3 Solutions BIOS 65 Fall 015 Homework Set 3 Solutions 1. Agresti.0 Table.1 is from an early study on the death penalty in Florida. Analyze these data and show that Simpson s Paradox occurs. Death Penalty Victim's

More information

Sections 3.4, 3.5. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 770: Categorical Data Analysis

Sections 3.4, 3.5. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 770: Categorical Data Analysis Sections 3.4, 3.5 Timothy Hanson Department of Statistics, University of South Carolina Stat 770: Categorical Data Analysis 1 / 22 3.4 I J tables with ordinal outcomes Tests that take advantage of ordinal

More information

Inter-Rater Agreement

Inter-Rater Agreement Engineering Statistics (EGC 630) Dec., 008 http://core.ecu.edu/psyc/wuenschk/spss.htm Degree of agreement/disagreement among raters Inter-Rater Agreement Psychologists commonly measure various characteristics

More information

Tests for Two Coefficient Alphas

Tests for Two Coefficient Alphas Chapter 80 Tests for Two Coefficient Alphas Introduction Coefficient alpha, or Cronbach s alpha, is a popular measure of the reliability of a scale consisting of k parts. The k parts often represent k

More information

ST3241 Categorical Data Analysis I Two-way Contingency Tables. 2 2 Tables, Relative Risks and Odds Ratios

ST3241 Categorical Data Analysis I Two-way Contingency Tables. 2 2 Tables, Relative Risks and Odds Ratios ST3241 Categorical Data Analysis I Two-way Contingency Tables 2 2 Tables, Relative Risks and Odds Ratios 1 What Is A Contingency Table (p.16) Suppose X and Y are two categorical variables X has I categories

More information

Cohen s s Kappa and Log-linear Models

Cohen s s Kappa and Log-linear Models Cohen s s Kappa and Log-linear Models HRP 261 03/03/03 10-11 11 am 1. Cohen s Kappa Actual agreement = sum of the proportions found on the diagonals. π ii Cohen: Compare the actual agreement with the chance

More information

Lecture 25: Models for Matched Pairs

Lecture 25: Models for Matched Pairs Lecture 25: Models for Matched Pairs Dipankar Bandyopadhyay, Ph.D. BMTRY 711: Analysis of Categorical Data Spring 2011 Division of Biostatistics and Epidemiology Medical University of South Carolina Lecture

More information

Decomposition of Parsimonious Independence Model Using Pearson, Kendall and Spearman s Correlations for Two-Way Contingency Tables

Decomposition of Parsimonious Independence Model Using Pearson, Kendall and Spearman s Correlations for Two-Way Contingency Tables International Journal of Statistics and Probability; Vol. 7 No. 3; May 208 ISSN 927-7032 E-ISSN 927-7040 Published by Canadian Center of Science and Education Decomposition of Parsimonious Independence

More information

Frequency Distribution Cross-Tabulation

Frequency Distribution Cross-Tabulation Frequency Distribution Cross-Tabulation 1) Overview 2) Frequency Distribution 3) Statistics Associated with Frequency Distribution i. Measures of Location ii. Measures of Variability iii. Measures of Shape

More information

Chi-Square. Heibatollah Baghi, and Mastee Badii

Chi-Square. Heibatollah Baghi, and Mastee Badii 1 Chi-Square Heibatollah Baghi, and Mastee Badii Different Scales, Different Measures of Association Scale of Both Variables Nominal Scale Measures of Association Pearson Chi-Square: χ 2 Ordinal Scale

More information

Count data page 1. Count data. 1. Estimating, testing proportions

Count data page 1. Count data. 1. Estimating, testing proportions Count data page 1 Count data 1. Estimating, testing proportions 100 seeds, 45 germinate. We estimate probability p that a plant will germinate to be 0.45 for this population. Is a 50% germination rate

More information

Small n, σ known or unknown, underlying nongaussian

Small n, σ known or unknown, underlying nongaussian READY GUIDE Summary Tables SUMMARY-1: Methods to compute some confidence intervals Parameter of Interest Conditions 95% CI Proportion (π) Large n, p 0 and p 1 Equation 12.11 Small n, any p Figure 12-4

More information

Unit 14: Nonparametric Statistical Methods

Unit 14: Nonparametric Statistical Methods Unit 14: Nonparametric Statistical Methods Statistics 571: Statistical Methods Ramón V. León 8/8/2003 Unit 14 - Stat 571 - Ramón V. León 1 Introductory Remarks Most methods studied so far have been based

More information

Research Methodology: Tools

Research Methodology: Tools MSc Business Administration Research Methodology: Tools Applied Data Analysis (with SPSS) Lecture 05: Contingency Analysis March 2014 Prof. Dr. Jürg Schwarz Lic. phil. Heidi Bruderer Enzler Contents Slide

More information

In many situations, there is a non-parametric test that corresponds to the standard test, as described below:

In many situations, there is a non-parametric test that corresponds to the standard test, as described below: There are many standard tests like the t-tests and analyses of variance that are commonly used. They rest on assumptions like normality, which can be hard to assess: for example, if you have small samples,

More information

The concord Package. August 20, 2006

The concord Package. August 20, 2006 The concord Package August 20, 2006 Version 1.4-6 Date 2006-08-15 Title Concordance and reliability Author , Ian Fellows Maintainer Measures

More information

CDA Chapter 3 part II

CDA Chapter 3 part II CDA Chapter 3 part II Two-way tables with ordered classfications Let u 1 u 2... u I denote scores for the row variable X, and let ν 1 ν 2... ν J denote column Y scores. Consider the hypothesis H 0 : X

More information

Inference for Binomial Parameters

Inference for Binomial Parameters Inference for Binomial Parameters Dipankar Bandyopadhyay, Ph.D. Department of Biostatistics, Virginia Commonwealth University D. Bandyopadhyay (VCU) BIOS 625: Categorical Data & GLM 1 / 58 Inference for

More information

Didacticiel - Études de cas. In this tutorial, we show how to use TANAGRA ( and higher) for measuring the association between ordinal variables.

Didacticiel - Études de cas. In this tutorial, we show how to use TANAGRA ( and higher) for measuring the association between ordinal variables. Subject Association measures for ordinal variables. In this tutorial, we show how to use TANAGRA (1.4.19 and higher) for measuring the association between ordinal variables. All the measures that we present

More information

SAS/STAT 14.1 User s Guide. Introduction to Nonparametric Analysis

SAS/STAT 14.1 User s Guide. Introduction to Nonparametric Analysis SAS/STAT 14.1 User s Guide Introduction to Nonparametric Analysis This document is an individual chapter from SAS/STAT 14.1 User s Guide. The correct bibliographic citation for this manual is as follows:

More information

UNIT 4 RANK CORRELATION (Rho AND KENDALL RANK CORRELATION

UNIT 4 RANK CORRELATION (Rho AND KENDALL RANK CORRELATION UNIT 4 RANK CORRELATION (Rho AND KENDALL RANK CORRELATION Structure 4.0 Introduction 4.1 Objectives 4. Rank-Order s 4..1 Rank-order data 4.. Assumptions Underlying Pearson s r are Not Satisfied 4.3 Spearman

More information

SAS Analysis Examples Replication C8. * SAS Analysis Examples Replication for ASDA 2nd Edition * Berglund April 2017 * Chapter 8 ;

SAS Analysis Examples Replication C8. * SAS Analysis Examples Replication for ASDA 2nd Edition * Berglund April 2017 * Chapter 8 ; SAS Analysis Examples Replication C8 * SAS Analysis Examples Replication for ASDA 2nd Edition * Berglund April 2017 * Chapter 8 ; libname ncsr "P:\ASDA 2\Data sets\ncsr\" ; data c8_ncsr ; set ncsr.ncsr_sub_13nov2015

More information

N Utilization of Nursing Research in Advanced Practice, Summer 2008

N Utilization of Nursing Research in Advanced Practice, Summer 2008 University of Michigan Deep Blue deepblue.lib.umich.edu 2008-07 536 - Utilization of ursing Research in Advanced Practice, Summer 2008 Tzeng, Huey-Ming Tzeng, H. (2008, ctober 1). Utilization of ursing

More information

HYPOTHESIS TESTING: THE CHI-SQUARE STATISTIC

HYPOTHESIS TESTING: THE CHI-SQUARE STATISTIC 1 HYPOTHESIS TESTING: THE CHI-SQUARE STATISTIC 7 steps of Hypothesis Testing 1. State the hypotheses 2. Identify level of significant 3. Identify the critical values 4. Calculate test statistics 5. Compare

More information

UNIVERSITY OF TORONTO. Faculty of Arts and Science APRIL 2010 EXAMINATIONS STA 303 H1S / STA 1002 HS. Duration - 3 hours. Aids Allowed: Calculator

UNIVERSITY OF TORONTO. Faculty of Arts and Science APRIL 2010 EXAMINATIONS STA 303 H1S / STA 1002 HS. Duration - 3 hours. Aids Allowed: Calculator UNIVERSITY OF TORONTO Faculty of Arts and Science APRIL 2010 EXAMINATIONS STA 303 H1S / STA 1002 HS Duration - 3 hours Aids Allowed: Calculator LAST NAME: FIRST NAME: STUDENT NUMBER: There are 27 pages

More information

Epidemiology Principle of Biostatistics Chapter 14 - Dependent Samples and effect measures. John Koval

Epidemiology Principle of Biostatistics Chapter 14 - Dependent Samples and effect measures. John Koval Epidemiology 9509 Principle of Biostatistics Chapter 14 - Dependent Samples and effect measures John Koval Department of Epidemiology and Biostatistics University of Western Ontario What is being covered

More information

Optimal exact tests for complex alternative hypotheses on cross tabulated data

Optimal exact tests for complex alternative hypotheses on cross tabulated data Optimal exact tests for complex alternative hypotheses on cross tabulated data Daniel Yekutieli Statistics and OR Tel Aviv University CDA course 29 July 2017 Yekutieli (TAU) Optimal exact tests for complex

More information

Epidemiology Wonders of Biostatistics Chapter 11 (continued) - probability in a single population. John Koval

Epidemiology Wonders of Biostatistics Chapter 11 (continued) - probability in a single population. John Koval Epidemiology 9509 Wonders of Biostatistics Chapter 11 (continued) - probability in a single population John Koval Department of Epidemiology and Biostatistics University of Western Ontario What is being

More information

You can specify the response in the form of a single variable or in the form of a ratio of two variables denoted events/trials.

You can specify the response in the form of a single variable or in the form of a ratio of two variables denoted events/trials. The GENMOD Procedure MODEL Statement MODEL response = < effects > < /options > ; MODEL events/trials = < effects > < /options > ; You can specify the response in the form of a single variable or in the

More information

MARGINAL HOMOGENEITY MODEL FOR ORDERED CATEGORIES WITH OPEN ENDS IN SQUARE CONTINGENCY TABLES

MARGINAL HOMOGENEITY MODEL FOR ORDERED CATEGORIES WITH OPEN ENDS IN SQUARE CONTINGENCY TABLES REVSTAT Statistical Journal Volume 13, Number 3, November 2015, 233 243 MARGINAL HOMOGENEITY MODEL FOR ORDERED CATEGORIES WITH OPEN ENDS IN SQUARE CONTINGENCY TABLES Authors: Serpil Aktas Department of

More information

Q30b Moyale Observed counts. The FREQ Procedure. Table 1 of type by response. Controlling for site=moyale. Improved (1+2) Same (3) Group only

Q30b Moyale Observed counts. The FREQ Procedure. Table 1 of type by response. Controlling for site=moyale. Improved (1+2) Same (3) Group only Moyale Observed counts 12:28 Thursday, December 01, 2011 1 The FREQ Procedure Table 1 of by Controlling for site=moyale Row Pct Improved (1+2) Same () Worsened (4+5) Group only 16 51.61 1.2 14 45.16 1

More information

6 Single Sample Methods for a Location Parameter

6 Single Sample Methods for a Location Parameter 6 Single Sample Methods for a Location Parameter If there are serious departures from parametric test assumptions (e.g., normality or symmetry), nonparametric tests on a measure of central tendency (usually

More information

STAT 705: Analysis of Contingency Tables

STAT 705: Analysis of Contingency Tables STAT 705: Analysis of Contingency Tables Timothy Hanson Department of Statistics, University of South Carolina Stat 705: Analysis of Contingency Tables 1 / 45 Outline of Part I: models and parameters Basic

More information

Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami

Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami Parametric Assumptions The observations must be independent. Dependent variable should be continuous

More information

Data are sometimes not compatible with the assumptions of parametric statistical tests (i.e. t-test, regression, ANOVA)

Data are sometimes not compatible with the assumptions of parametric statistical tests (i.e. t-test, regression, ANOVA) BSTT523 Pagano & Gauvreau Chapter 13 1 Nonparametric Statistics Data are sometimes not compatible with the assumptions of parametric statistical tests (i.e. t-test, regression, ANOVA) In particular, data

More information

Tests for Two Correlated Proportions in a Matched Case- Control Design

Tests for Two Correlated Proportions in a Matched Case- Control Design Chapter 155 Tests for Two Correlated Proportions in a Matched Case- Control Design Introduction A 2-by-M case-control study investigates a risk factor relevant to the development of a disease. A population

More information

ssh tap sas913, sas https://www.statlab.umd.edu/sasdoc/sashtml/onldoc.htm

ssh tap sas913, sas https://www.statlab.umd.edu/sasdoc/sashtml/onldoc.htm Kedem, STAT 430 SAS Examples: Logistic Regression ==================================== ssh abc@glue.umd.edu, tap sas913, sas https://www.statlab.umd.edu/sasdoc/sashtml/onldoc.htm a. Logistic regression.

More information

Topic 21 Goodness of Fit

Topic 21 Goodness of Fit Topic 21 Goodness of Fit Contingency Tables 1 / 11 Introduction Two-way Table Smoking Habits The Hypothesis The Test Statistic Degrees of Freedom Outline 2 / 11 Introduction Contingency tables, also known

More information

Models for Binary Outcomes

Models for Binary Outcomes Models for Binary Outcomes Introduction The simple or binary response (for example, success or failure) analysis models the relationship between a binary response variable and one or more explanatory variables.

More information

Two Correlated Proportions Non- Inferiority, Superiority, and Equivalence Tests

Two Correlated Proportions Non- Inferiority, Superiority, and Equivalence Tests Chapter 59 Two Correlated Proportions on- Inferiority, Superiority, and Equivalence Tests Introduction This chapter documents three closely related procedures: non-inferiority tests, superiority (by a

More information

Data Analysis as a Decision Making Process

Data Analysis as a Decision Making Process Data Analysis as a Decision Making Process I. Levels of Measurement A. NOIR - Nominal Categories with names - Ordinal Categories with names and a logical order - Intervals Numerical Scale with logically

More information

Measures of Agreement

Measures of Agreement Measures of Agreement An interesting application is to measure how closely two individuals agree on a series of assessments. A common application for this is to compare the consistency of judgments of

More information

Covariance Structure Approach to Within-Cases

Covariance Structure Approach to Within-Cases Covariance Structure Approach to Within-Cases Remember how the data file grapefruit1.data looks: Store sales1 sales2 sales3 1 62.1 61.3 60.8 2 58.2 57.9 55.1 3 51.6 49.2 46.2 4 53.7 51.5 48.3 5 61.4 58.7

More information

An Overview of Methods in the Analysis of Dependent Ordered Categorical Data: Assumptions and Implications

An Overview of Methods in the Analysis of Dependent Ordered Categorical Data: Assumptions and Implications WORKING PAPER SERIES WORKING PAPER NO 7, 2008 Swedish Business School at Örebro An Overview of Methods in the Analysis of Dependent Ordered Categorical Data: Assumptions and Implications By Hans Högberg

More information

Unit 9: Inferences for Proportions and Count Data

Unit 9: Inferences for Proportions and Count Data Unit 9: Inferences for Proportions and Count Data Statistics 571: Statistical Methods Ramón V. León 12/15/2008 Unit 9 - Stat 571 - Ramón V. León 1 Large Sample Confidence Interval for Proportion ( pˆ p)

More information

Relate Attributes and Counts

Relate Attributes and Counts Relate Attributes and Counts This procedure is designed to summarize data that classifies observations according to two categorical factors. The data may consist of either: 1. Two Attribute variables.

More information

Introduction to Nonparametric Analysis (Chapter)

Introduction to Nonparametric Analysis (Chapter) SAS/STAT 9.3 User s Guide Introduction to Nonparametric Analysis (Chapter) SAS Documentation This document is an individual chapter from SAS/STAT 9.3 User s Guide. The correct bibliographic citation for

More information

Unit 9: Inferences for Proportions and Count Data

Unit 9: Inferences for Proportions and Count Data Unit 9: Inferences for Proportions and Count Data Statistics 571: Statistical Methods Ramón V. León 1/15/008 Unit 9 - Stat 571 - Ramón V. León 1 Large Sample Confidence Interval for Proportion ( pˆ p)

More information

Multinomial Logistic Regression Models

Multinomial Logistic Regression Models Stat 544, Lecture 19 1 Multinomial Logistic Regression Models Polytomous responses. Logistic regression can be extended to handle responses that are polytomous, i.e. taking r>2 categories. (Note: The word

More information

Bivariate Paired Numerical Data

Bivariate Paired Numerical Data Bivariate Paired Numerical Data Pearson s correlation, Spearman s ρ and Kendall s τ, tests of independence University of California, San Diego Instructor: Ery Arias-Castro http://math.ucsd.edu/~eariasca/teaching.html

More information

Age 55 (x = 1) Age < 55 (x = 0)

Age 55 (x = 1) Age < 55 (x = 0) Logistic Regression with a Single Dichotomous Predictor EXAMPLE: Consider the data in the file CHDcsv Instead of examining the relationship between the continuous variable age and the presence or absence

More information

ST3241 Categorical Data Analysis I Multicategory Logit Models. Logit Models For Nominal Responses

ST3241 Categorical Data Analysis I Multicategory Logit Models. Logit Models For Nominal Responses ST3241 Categorical Data Analysis I Multicategory Logit Models Logit Models For Nominal Responses 1 Models For Nominal Responses Y is nominal with J categories. Let {π 1,, π J } denote the response probabilities

More information

τ xy N c N d n(n 1) / 2 n +1 y = n 1 12 v n n(n 1) ρ xy Brad Thiessen

τ xy N c N d n(n 1) / 2 n +1 y = n 1 12 v n n(n 1) ρ xy Brad Thiessen Introduction: Effect of Correlation Types on Nonmetric Multidimensional Scaling of ITED Subscore Data Brad Thiessen In a discussion of potential data types that can be used as proximity measures, Coxon

More information

1 A Review of Correlation and Regression

1 A Review of Correlation and Regression 1 A Review of Correlation and Regression SW, Chapter 12 Suppose we select n = 10 persons from the population of college seniors who plan to take the MCAT exam. Each takes the test, is coached, and then

More information

ij i j m ij n ij m ij n i j Suppose we denote the row variable by X and the column variable by Y ; We can then re-write the above expression as

ij i j m ij n ij m ij n i j Suppose we denote the row variable by X and the column variable by Y ; We can then re-write the above expression as page1 Loglinear Models Loglinear models are a way to describe association and interaction patterns among categorical variables. They are commonly used to model cell counts in contingency tables. These

More information

Nonparametric Statistics. Leah Wright, Tyler Ross, Taylor Brown

Nonparametric Statistics. Leah Wright, Tyler Ross, Taylor Brown Nonparametric Statistics Leah Wright, Tyler Ross, Taylor Brown Before we get to nonparametric statistics, what are parametric statistics? These statistics estimate and test population means, while holding

More information

Module 10: Analysis of Categorical Data Statistics (OA3102)

Module 10: Analysis of Categorical Data Statistics (OA3102) Module 10: Analysis of Categorical Data Statistics (OA3102) Professor Ron Fricker Naval Postgraduate School Monterey, California Reading assignment: WM&S chapter 14.1-14.7 Revision: 3-12 1 Goals for this

More information

SPSS LAB FILE 1

SPSS LAB FILE  1 SPSS LAB FILE www.mcdtu.wordpress.com 1 www.mcdtu.wordpress.com 2 www.mcdtu.wordpress.com 3 OBJECTIVE 1: Transporation of Data Set to SPSS Editor INPUTS: Files: group1.xlsx, group1.txt PROCEDURE FOLLOWED:

More information

Sections 2.3, 2.4. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 770: Categorical Data Analysis 1 / 21

Sections 2.3, 2.4. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 770: Categorical Data Analysis 1 / 21 Sections 2.3, 2.4 Timothy Hanson Department of Statistics, University of South Carolina Stat 770: Categorical Data Analysis 1 / 21 2.3 Partial association in stratified 2 2 tables In describing a relationship

More information

Chapter 2: Describing Contingency Tables - II

Chapter 2: Describing Contingency Tables - II : Describing Contingency Tables - II Dipankar Bandyopadhyay Department of Biostatistics, Virginia Commonwealth University BIOS 625: Categorical Data & GLM [Acknowledgements to Tim Hanson and Haitao Chu]

More information

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages: Glossary The ISI glossary of statistical terms provides definitions in a number of different languages: http://isi.cbs.nl/glossary/index.htm Adjusted r 2 Adjusted R squared measures the proportion of the

More information

Statistics of Contingency Tables - Extension to I x J. stat 557 Heike Hofmann

Statistics of Contingency Tables - Extension to I x J. stat 557 Heike Hofmann Statistics of Contingency Tables - Extension to I x J stat 557 Heike Hofmann Outline Testing Independence Local Odds Ratios Concordance & Discordance Intro to GLMs Simpson s paradox Simpson s paradox:

More information

Discrete Multivariate Statistics

Discrete Multivariate Statistics Discrete Multivariate Statistics Univariate Discrete Random variables Let X be a discrete random variable which, in this module, will be assumed to take a finite number of t different values which are

More information

BIOL 4605/7220 CH 20.1 Correlation

BIOL 4605/7220 CH 20.1 Correlation BIOL 4605/70 CH 0. Correlation GPT Lectures Cailin Xu November 9, 0 GLM: correlation Regression ANOVA Only one dependent variable GLM ANCOVA Multivariate analysis Multiple dependent variables (Correlation)

More information

Logistic Regression Analyses in the Water Level Study

Logistic Regression Analyses in the Water Level Study Logistic Regression Analyses in the Water Level Study A. Introduction. 166 students participated in the Water level Study. 70 passed and 96 failed to correctly draw the water level in the glass. There

More information

Chapter 9: Association Between Variables Measured at the Ordinal Level

Chapter 9: Association Between Variables Measured at the Ordinal Level Chapter 9: Association Between Variables Measured at the Ordinal Level After this week s class: SHOULD BE ABLE TO COMPLETE ALL APLIA ASSIGNMENTS DUE END OF THIS WEEK: APLIA ASSIGNMENTS 5-7 DUE: Friday

More information

One-Way ANOVA. Some examples of when ANOVA would be appropriate include:

One-Way ANOVA. Some examples of when ANOVA would be appropriate include: One-Way ANOVA 1. Purpose Analysis of variance (ANOVA) is used when one wishes to determine whether two or more groups (e.g., classes A, B, and C) differ on some outcome of interest (e.g., an achievement

More information

Some comments on Partitioning

Some comments on Partitioning Some comments on Partitioning Dipankar Bandyopadhyay Department of Biostatistics, Virginia Commonwealth University BIOS 625: Categorical Data & GLM 1/30 Partitioning Chi-Squares We have developed tests

More information

CSSS/STAT/SOC 321 Case-Based Social Statistics I. Levels of Measurement

CSSS/STAT/SOC 321 Case-Based Social Statistics I. Levels of Measurement CSSS/STAT/SOC 321 Case-Based Social Statistics I Levels of Measurement Christopher Adolph Department of Political Science and Center for Statistics and the Social Sciences University of Washington, Seattle

More information

3 Way Tables Edpsy/Psych/Soc 589

3 Way Tables Edpsy/Psych/Soc 589 3 Way Tables Edpsy/Psych/Soc 589 Carolyn J. Anderson Department of Educational Psychology I L L I N O I S university of illinois at urbana-champaign c Board of Trustees, University of Illinois Spring 2017

More information

Good Confidence Intervals for Categorical Data Analyses. Alan Agresti

Good Confidence Intervals for Categorical Data Analyses. Alan Agresti Good Confidence Intervals for Categorical Data Analyses Alan Agresti Department of Statistics, University of Florida visiting Statistics Department, Harvard University LSHTM, July 22, 2011 p. 1/36 Outline

More information

Entering and recoding variables

Entering and recoding variables Entering and recoding variables To enter: You create a New data file Define the variables on Variable View Enter the values on Data View To create the dichotomies: Transform -> Recode into Different Variable

More information

Section 4.6 Simple Linear Regression

Section 4.6 Simple Linear Regression Section 4.6 Simple Linear Regression Objectives ˆ Basic philosophy of SLR and the regression assumptions ˆ Point & interval estimation of the model parameters, and how to make predictions ˆ Point and interval

More information

Measuring relationships among multiple responses

Measuring relationships among multiple responses Measuring relationships among multiple responses Linear association (correlation, relatedness, shared information) between pair-wise responses is an important property used in almost all multivariate analyses.

More information

Joseph O. Marker Marker Actuarial a Services, LLC and University of Michigan CLRS 2010 Meeting. J. Marker, LSMWP, CLRS 1

Joseph O. Marker Marker Actuarial a Services, LLC and University of Michigan CLRS 2010 Meeting. J. Marker, LSMWP, CLRS 1 Joseph O. Marker Marker Actuarial a Services, LLC and University of Michigan CLRS 2010 Meeting J. Marker, LSMWP, CLRS 1 Expected vs Actual Distribution Test distributions of: Number of claims (frequency)

More information

Nonparametric statistic methods. Waraphon Phimpraphai DVM, PhD Department of Veterinary Public Health

Nonparametric statistic methods. Waraphon Phimpraphai DVM, PhD Department of Veterinary Public Health Nonparametric statistic methods Waraphon Phimpraphai DVM, PhD Department of Veterinary Public Health Measurement What are the 4 levels of measurement discussed? 1. Nominal or Classificatory Scale Gender,

More information

Statistics Introductory Correlation

Statistics Introductory Correlation Statistics Introductory Correlation Session 10 oscardavid.barrerarodriguez@sciencespo.fr April 9, 2018 Outline 1 Statistics are not used only to describe central tendency and variability for a single variable.

More information

Correction for Ascertainment

Correction for Ascertainment Correction for Ascertainment Michael C. Neale International Workshop on Methodology for Genetic Studies of Twins and Families Boulder CO 2004 Virginia Institute for Psychiatric and Behavioral Genetics

More information

Three-Way Contingency Tables

Three-Way Contingency Tables Newsom PSY 50/60 Categorical Data Analysis, Fall 06 Three-Way Contingency Tables Three-way contingency tables involve three binary or categorical variables. I will stick mostly to the binary case to keep

More information

NON-PARAMETRIC STATISTICS * (http://www.statsoft.com)

NON-PARAMETRIC STATISTICS * (http://www.statsoft.com) NON-PARAMETRIC STATISTICS * (http://www.statsoft.com) 1. GENERAL PURPOSE 1.1 Brief review of the idea of significance testing To understand the idea of non-parametric statistics (the term non-parametric

More information

Readings Howitt & Cramer (2014) Overview

Readings Howitt & Cramer (2014) Overview Readings Howitt & Cramer (4) Ch 7: Relationships between two or more variables: Diagrams and tables Ch 8: Correlation coefficients: Pearson correlation and Spearman s rho Ch : Statistical significance

More information

Lecture 11: Simple Linear Regression

Lecture 11: Simple Linear Regression Lecture 11: Simple Linear Regression Readings: Sections 3.1-3.3, 11.1-11.3 Apr 17, 2009 In linear regression, we examine the association between two quantitative variables. Number of beers that you drink

More information

Statistical. Psychology

Statistical. Psychology SEVENTH у *i km m it* & П SB Й EDITION Statistical M e t h o d s for Psychology D a v i d C. Howell University of Vermont ; \ WADSWORTH f% CENGAGE Learning* Australia Biaall apan Korea Меяко Singapore

More information

Intro to Parametric & Nonparametric Statistics

Intro to Parametric & Nonparametric Statistics Kinds of variable The classics & some others Intro to Parametric & Nonparametric Statistics Kinds of variables & why we care Kinds & definitions of nonparametric statistics Where parametric stats come

More information

Adejumo, Heumann, Toutenburg: A review of agreement measure as a subset of association measure between raters

Adejumo, Heumann, Toutenburg: A review of agreement measure as a subset of association measure between raters Adejumo, Heumann, Toutenburg: A review of agreement measure as a subset of association measure between raters Sonderforschungsbereich 386, Paper 385 (2004) Online unter: http://epub.ub.uni-muenchen.de/

More information

Passing-Bablok Regression for Method Comparison

Passing-Bablok Regression for Method Comparison Chapter 313 Passing-Bablok Regression for Method Comparison Introduction Passing-Bablok regression for method comparison is a robust, nonparametric method for fitting a straight line to two-dimensional

More information