Means or "expected" counts: j = 1 j = 2 i = 1 m11 m12 i = 2 m21 m22 True proportions: The odds that a sampled unit is in category 1 for variable 1 giv

Size: px
Start display at page:

Download "Means or "expected" counts: j = 1 j = 2 i = 1 m11 m12 i = 2 m21 m22 True proportions: The odds that a sampled unit is in category 1 for variable 1 giv"

Transcription

1 Measures of Association References: ffl ffl ffl Summarize strength of associations Quantify relative risk Types of measures odds ratio correlation Pearson statistic ediction concordance/discordance Goodman, L.A. and Kruskal, W. H. (17) Measures of Association for Cross Classifications, Springer, New York. Bishop, Fienberg and Holland (175). Discrete Multivariate Analysis: Theory and actice, MIT ess, Cambridge, MA (Chapter 11). Brown, M. B. and Benedetti (177). Sampling behavior of tests for correlation in two-way contingency tables, Journal of the American Statistical Association, 72, Cohen, J. (16). A coefficient of agreement for nominal scales, Educational and Psychological Measurement, 2, Mantel, N. and Haenszel, W. (15). Statistical aspects of the analysis of data from retrospective studies of disease, Journal of the National Cancer Institute, 22, Agresti, A. (14) Analysis of Ordinal Categorical Data, Wiley, New York, (Chapters 2 & 3). Agresti, A. (22) Categorical Data Analysis, 2nd edition, Wiley, New York, (Chapter 2). The odds ratio: The most frequently used measure for 2 2 tables. Observed counts: Variable 2 j = 1 j = 2 Variable 1 i = 1 Y11 Y12 i = 2 Y21 Y

2 Means or "expected" counts: j = 1 j = 2 i = 1 m11 m12 i = 2 m21 m22 True proportions: The odds that a sampled unit is in category 1 for variable 1 given that it is in category j for variable 2: < variable 1 is variable 2 is = : in category 1 in category j ; < variable 1 is variable 2 is = : in category 2 in category j ; = ß 1j ß 1j +ß 2j ß 2j = ß 1j ß 2j ß 1j +ß 2j j = 1 j = 2 i = 1 ß11 ß12 i = 2 ß21 ß22 46 Odds ratio ff = ß11 ß 21 ß12 = ß 22 ß 11ß22 ß21ß12 = m 11m22 m21m12 also called the cross-product ratio. 47 Example: Chinook Salmon Early run (1) Odds of capturing a female: hook & line: Hook & Line Net Female Male net: = :511 :4 = 1:45 = :446 :554 = :

3 Estimated odds ratio: ^ff odds of capturing a female with hook & odds of capturing a female with a net = 1:45 :2 = 1:77 An approximate 5% confidence interval for ff is Conclusion: (1:3; 2:42) 1 A 1 A The odds that a captured fish is female are about 3 to 14 percenct greater with hook & line than with using a net. 473 Late run: Hook & Line Net Female 1 16 Male Estimated odds ratio: ff = :5512 :44 :353 : = 1:23 :63 = 1:6 An approximate 5% confidence interval for ff is (1.4, 2.6) Conclusion: 474 /* ogram to analyze the 1 Chinook salmon data. This program is stored in the file chinook2.sas */ /* Attach labels to categories */ data set1; infile 'chinook.dat'; input (year month day biweek run gear age sexa length) ( $1. 4.); rage=int(age/1); oage=age-(1*rage); if(sexa = 'F') then sex=1; proc format; value run 1 = 'Early' 2 = 'Late'; value sex 1 = 'Female' 2 = 'Male'; value gear 1 = 'Hook' 2 = 'Net'; run; else sex=2; run;

4 run=early proc sort data=set1; by run; run; The FREQ ocedure Table of gear by sex /* Examine partial association between gear sex sex and method of capture within each run. */ proc freq data=set1; by run; table gear*sex / chisq Fisher all nopercent nocol expected; format sex sex. gear gear. run run.; run; Frequency Expected Row Pct Female Male Total Hook Net Total Statistics for Table of gear by sex Statistic DF Value ob Chi-Square Likelihood Ratio Chi-Square Continuity Adj. Chi-Square Mantel-Haenszel Chi-Square Phi Coefficient.146 Contingency Coefficient.132 Cramer's V.146 Statistics for Table of gear by sex Statistic Value ASE Gamma Kendall's Tau-b Stuart's Tau-c Somers' D C R Somers' D R C Fisher's Exact Test Pearson Correlation Spearman Correlation Cell (1,1) Frequency (F) 172 Left-sided <= F. Right-sided >= F 2.5E-4 Lambda Asymmetric C R Lambda Asymmetric R C Lambda Symmetric Table obability (P) Two-sided <= P.352E E-4 Uncertainty Coefficient C R Uncertainty Coefficient R C.145. Uncertainty Coefficient Sym

5 run=late Table of gear by sex gear sex Estimates of the Relative Risk (Row1/Row2) Type of Study Value 5% Confidence Limits Case-Control (Odds Ratio) Cohort (Col1 Risk) Cohort (Col2 Risk) Sample Size = 65 Frequency Expected Row Pct Female Male Total Hook Net Total Statistics for Table of gear by sex Statistics for Table of gear by sex Statistic DF Value ob Statistic Value ASE Chi-Square <.1 Likelihood Ratio Chi-Square <.1 Continuity Adj. Chi-Square <.1 Mantel-Haenszel Chi-Square <.1 Phi Coefficient.1657 Contingency Coefficient.1635 Cramer's V.1657 Fisher's Exact Test Cell (1,1) Frequency (F) 1 Left-sided <= F 1. Right-sided >= F 1.E-6 Table obability (P).75E-7 Two-sided <= P 3.361E-6 Gamma Kendall's Tau-b Stuart's Tau-c Somers' D C R Somers' D R C Pearson Correlation Spearman Correlation Lambda Asymmetric C R.1.41 Lambda Asymmetric R C.5.57 Lambda Symmetric Uncertainty Coefficient C R.2.5 Uncertainty Coefficient R C.2.5 Uncertainty Coefficient Sym

6 operties Estimates of the Relative Risk (Row1/Row2) (i) The odds ratio is not margin sensitive". Type of Study Value 5% Confidence Limits Case-Control (Odds Ratio) Cohort (Col1 Risk) Cohort (Col2 Risk) Sample Size = 77 Choose t1; t2; s1; s2 such that t1+t2 = 1 and s1+s2 = 1. Then the odds ratio for s1t1ß11 s1t2ß12 s2t1ß21 s2t2ß is (iv) Interchanging the rows of the table produces 1=ff. s1t1ß11 s2t2ß22 s1t2ß12 s2t1ß21 = ß 11ß22 ß12ß21 = ff (v) Interchanging the columns of the table produces 1=ff. (ii)» ff» 1 (iii) ff = 1 corresponds to independence (vi) Interchanging both the rows and the columns of the 2 2 table produces ff. So, ff = 4 indicates the same level of association as ff = 1 4 = :

7 Estimation: Substitute Y ij for m ij in ff: ^ff = Y 11Y22 Y12Y21 ^ff has a large sample" N(ff; ff 2 1) distribution with ff1 2 = ff 2 B 1 C A m22 m11 m12 m21 (when each m ij is large). Estimation of the large sample variance: ff 2 1;^ff = ^ff 1 Y11 Y12 Y21 1 A For small samples use and ^ff Λ = (Y 11 + :5)(Y22 + :5) (Y12 + :5)(Y21 + :5) Y22 ^ff 2 1;^ff = (^ff Λ ) 1 Y11 + :5 Y12 + :5 1 Y21 + :5 A Y22 + :5 4 4 Other smooth functions of ff are often used as measures of assocation, say f(ff). The large sample distribution for f(^ff) is N f(ff); [f (ff)] 2 ff 2 1;^ff % The asymptotic variance is obtained from the delta method log-odds ratio: log(ff) (i) 1 < log(ff) < 1 (ii) independence, ff = 1, log(ff) = (iii) log(ff) is not margin sensitive" (iv) log(ff) and log(ff) = log(1=ff) imply equal levels of association (v) log(^ff) has a more nearly symmetric distribution than ^ff for smaller sample sizes 41 42

8 log(^ff) dist! N log(ff); 1 m 11 m 12 m 21 m 22 as m ij! 1 for all (i; j) Yule's Q: Yule (1) Q = ff 1 ff operties: Approximate 5% confidence intervals: For log(ff): log(^ff) ± (1:6)s 1 Y 11 Y 12 Y 21 Y 22 = [A; B] for ff: [exp(a), exp(b)] 1. 1» Q» 1 2. Q = for the independence model 3. Q = 1 when either ß12 = or ß21 = Q = 1 when either ß11 = or ß22 = 5. Q is symmetric". When the columns (or rows) or a 2 2 table are interchanged, then ff ) 1 and Q ) Q ff Q is the Goodman-Kruskal Gamma statistic for 2 2 tables. Q = ß11ß22 ß11ß22 + ß12ß21 Estimation: ß12ß21 ß11ß22 + ß12ß21 ^Q = ^ff 1 ^ff = Y 11Y22 Y12Y21 Y11Y22 + Y12Y21 6. Q is a margin free measure 45 46

9 What is a large or substantial association? Large sample distribution: As m ij! 1 for all i = 1; 2; j = 1; 2 ^Q ο ff 1 N ff ; (1 Q2 ) 2 4 % ( 1 m 11 m 12 m 21 m 22 ) This is [f (ff)] 2 ff 2 1 m 11 m 12 m 21 m 22 where f(ff) = ff+1 ff 1 What is a large value of ff? `n(ff)? Q? 1. Use large sample distributions to construct confidence intervals or test hypotheses H : ff = 1 versus H A : ff 6= 1. Reject H if Z = `n(^ff) ^ff`n(^ff) > Z ff= Suppose you discover that ^ff = 2:4 is significantly different" from zero. Is an odds ratio of 2.4 large enough to have practical importance? There are no absolute guidelines. It depends on the subject matter or field of study. 4 A useful application of measures of association for two-way tables is to assess differences in levels of association across time, or locations. College Graduate Yes No 15 male ^ff 15 = 1:1 female Yes No 16 male ^ff 16 = 4:2 female Yes No 17 male ^ff 17 = 2:7 female Yes No 1 male ^ff 1 = 1: female 5

10 Relative Risk Heart attack No heart attack Placebo ß11 ß12 n1 Aspirin ß21 ß22 n2 Relative risk of a heart attack fheart attack jplacebog fheart attack jaspiring = ß 11 when ß11 and ß21 are small. : = = ff ß 1 ß 1 22A ß21 ß12 51 In that case, ß22 = 1 ß21 = : 1: and ß12 = 1 ß11 = : 1:. Data: Heart No heart attack attack Placebo n1 = 1134 Aspirin n2 = Estimated relative risk of heart attack for those taking the placebo versus those taking the aspirin: Odds ratio: ^ff = = : :4 = 1:2 odds of a heart attack for placebo users odds of a heart attack for aspirin users = (1)(133) (14)(145) = 1:3 Confidence interval: (case-control) ^ff = 1:325 log(^ff) = : S 2 log(^ff) = X i X 1 = :15 Y ij Then log(^ff) ± z ff=2 S log(^ff) 2 ) :654 ± (1:6) p :15 ) (:36467; :4621) ) (e :36467 ; e :4621 ) ) (1:44; 2:331) j 53 54

11 Relative risk: 1 RR col1 = = 1:17 log(rr col1 ) = : S 2 log(rr col1 ) = 1 ^ß 11 n1^ß11 ^ß 21 n2^ß21 = Y 12 n1y11 + Y 22 n2y21 = : log(rr col1 ) ± (1:6) s S 2 log(rr) ) : ± (1:6) p : ) (:357; :35467) ) An approximate 5% confidence interval is (e :357 ; e :35467 ) ) (1:433; 2:36) log(rr) = log ß 1 11C A ß21 = log(ß11) log(ß21) g(ß11; ß21) Independent binomial experiments ^ß11 = Y 11 n1 ^ß21 = Y 21 n2 Y11 ο Bin(n1; ß11) Y21 ο Bin(n2; ß21) 57 V = ψ^ß11! V ^ß21 2 ß11(1 ß11 = 6 n1 4 Delta method: V (g(^ß11; ^ß21)) = " 1 ß11 # 1 ß21 V = 1 ß 11 n1ß11 ß 21 n2ß ß11 1 ß21 ß21(

12 SAS Code /* Assign labels to values */ /* This program computes the odds ratio for a 2x2 table. It is stored in the file aspirin.sas */ DATA SET1; INPUT ROW COL COUNT; LABEL ROW = 'Treatment' COL = 'Heart attack'; CARDS; run; 5 PROC FORMAT; VALUE RFMT 1 = 'Placebo' 2 = 'Aspirin'; VALUE CFMT 1 = 'Yes' 2 = 'No'; run; /* Analyze the table of counts */ PROC FREQ DATA=SET1; TABLES ROW*COL / CHISQ ALL NOPERCENT NOCOL EXPECTED; WEIGHT COUNT; FORMAT ROW RFMT. COL CFMT.; run; 51 The FREQ ocedure Table of ROW by COL ROW(Treatment) COL(Heart attack) Frequency Expected Row Pct Yes No Total Placebo Statistics for Table of ROW by COL Statistic DF Value ob Chi-Square <.1 Likelihood Ratio Chi-Square <.1 Continuity Adj. Chi-Square <.1 Mantel-Haenszel Chi-Square <.1 Phi Coefficient.337 Contingency Coefficient.336 Cramer's V.337 Fisher's Exact Test Aspirin Cell (1,1) Frequency (F) 1 Left-sided <= F 1. Right-sided >= F 3.253E-7 Total Table obability (P) Two-sided <= P 1.516E E

13 S-PLUS Code # An S-PLUS function to compute # an odds ratio and construct an Estimates of the Relative Risk (Row1/Row2) Type of Study Value 5% Confidence Limits Case-Control (Odds Ratio) Cohort (Col1 Risk) Cohort (Col2 Risk) Sample Size = 2271 # approximate confidence interval # This code is posted int he file # oddsratio.ssc oddsratio <- function(table,conf=.5, cont=.) level <- 1-(1-conf)/2 tablec <- table + cont alpha <- tablec[1,1]*tablec[2,2]/ (tablec[1,2]*tablec[2,1]) la <- log(alpha) sla <- sqrt(sum(tablec^(-1))) sa <- alpha*sla lowera <- round(exp(la-qnorm(level)*sla),4) uppera <- round(exp(la+qnorm(level)*sla),4) confper <- round(conf*1,1) alphar <- round(alpha,4) sar <- round(sa,4) cat(" n", "odds-ratio = ", alphar) cat(" n", "std. error = ", sar) cat(" n",confper,"% confidence interval") cat(" n"," lower limit upper limit") cat(" n", " ", lowera, " ", uppera) } 515 # To run this function, first create a # table of counts # # aspirin <- matrix(c(1, 145, 14, # 133), 2, 2, byrow=t) # # Then source this function into the # command window # # source("yourdirectory/oddsratio.ssc") # # Then execute the function # # oddsratio(aspirin,conf=.5,cont=.) # 516

14 The heart attack study is an example of a propspective study odds-ratio = std. error = % confidence interval lower limit upper limit In such studies: ffl Patients are randomly assigned to treatment groups. The treatments are administered. 517 ffl The proportion that give a certain response is recorded for each treatment group. 51 placebo: Y11 n1 aspirin: Y21 n2 = observed proportion that experience a heart attack = observed proportion that experience a heart attack These are direct estimates of population proportions needed to determine relative risk. Retrospective studies: (case-control) Examine what has happened in the past Example: Take a simple random sample of n1 patient records (cases), e.g. women who have experienced a heart attack 51 52

15 Classify each women according to whether or not she ever used oral contraception. Take an independent simple random sample of n2 controls, and classify each woman in the same way. oral contraceptive Heart No heart use attack attack Y 11 n 1 Yes Y11 Y12 No Y21 Y22 estimates < : n1 n2 used oral experienced a contraceptive heart attack = ; 521 Y 12 n 2 estimates < : used oral never had a contraceptive heart attack 522 = ; These do not provide a direct estimate of RR = >< P r >: >< P r >: heart used oral attack contraception heart do not use oral attack contraception >= >; >= >; Bayes Rule to the rescue? >< P r >: >< >: heart use attack o.c. use heart o.c. attack >< >: >= >; = >= >; use o.c. >< >: >= >; heart attack >= >;

16 Then ( ) use heart. ( ) use o.c. attack o.c. RR = ( ) do not heart. ( ) do not use o.c. attack use o.c. Relative risk of heart attack cannot be estimated without additional information on n use o.c. o the proportion of women in the population who use oral contraceptives. Approximate the relative risk with an odds ratio: 2 < : 6 < 4 : ff = 2 < : 6 < 4 : 3 heart use = attack o.c. ; no heart use = 7 5 attack o.c. ; 3 heart do not = attack use o.c. ; no heart do not = 7 5 attack use o.c. ; Which is equal to Which is equal to ( ) use heart o.c. attack >< no >= use heart >: o.c. attack ( ) do not heart use o.c. attack >< no >= do not heart >: use o.c. attack >; >; ( ) heart. ( ) use attack o.c. >< no >=. ( ) heart use >: attack >; o.c. ( ) heart. ( ) do not attack use o.c. >< no >=. ( ) heart do not >: attack >; use o.c < use heart = : o.c. attack ; < do not heart = 7 5 : use o.c. attack ; 3 < use no heart = : o.c. attack ; < do not heart = 7 5 : use o.c. attack ; An estimate is (Y11=n1) (Y21=n1) (Y12=n2) (Y22=n2) = Y 11Y22 Y12Y

Testing Independence

Testing Independence Testing Independence Dipankar Bandyopadhyay Department of Biostatistics, Virginia Commonwealth University BIOS 625: Categorical Data & GLM 1/50 Testing Independence Previously, we looked at RR = OR = 1

More information

Measures of Association for I J tables based on Pearson's 2 Φ 2 = Note that I 2 = I where = n J i=1 j=1 J i=1 j=1 I i=1 j=1 (ß ij ß i+ ß +j ) 2 ß i+ ß

Measures of Association for I J tables based on Pearson's 2 Φ 2 = Note that I 2 = I where = n J i=1 j=1 J i=1 j=1 I i=1 j=1 (ß ij ß i+ ß +j ) 2 ß i+ ß Correlation Coefficient Y = 0 Y = 1 = 0 ß11 ß12 = 1 ß21 ß22 Product moment correlation coefficient: ρ = Corr(; Y ) E() = ß 2+ = ß 21 + ß 22 = E(Y ) E()E(Y ) q V ()V (Y ) E(Y ) = ß 2+ = ß 21 + ß 22 = ß

More information

2 Describing Contingency Tables

2 Describing Contingency Tables 2 Describing Contingency Tables I. Probability structure of a 2-way contingency table I.1 Contingency Tables X, Y : cat. var. Y usually random (except in a case-control study), response; X can be random

More information

Epidemiology Wonders of Biostatistics Chapter 13 - Effect Measures. John Koval

Epidemiology Wonders of Biostatistics Chapter 13 - Effect Measures. John Koval Epidemiology 9509 Wonders of Biostatistics Chapter 13 - Effect Measures John Koval Department of Epidemiology and Biostatistics University of Western Ontario What is being covered 1. risk factors 2. risk

More information

ST3241 Categorical Data Analysis I Two-way Contingency Tables. 2 2 Tables, Relative Risks and Odds Ratios

ST3241 Categorical Data Analysis I Two-way Contingency Tables. 2 2 Tables, Relative Risks and Odds Ratios ST3241 Categorical Data Analysis I Two-way Contingency Tables 2 2 Tables, Relative Risks and Odds Ratios 1 What Is A Contingency Table (p.16) Suppose X and Y are two categorical variables X has I categories

More information

Lecture 8: Summary Measures

Lecture 8: Summary Measures Lecture 8: Summary Measures Dipankar Bandyopadhyay, Ph.D. BMTRY 711: Analysis of Categorical Data Spring 2011 Division of Biostatistics and Epidemiology Medical University of South Carolina Lecture 8:

More information

ST3241 Categorical Data Analysis I Two-way Contingency Tables. Odds Ratio and Tests of Independence

ST3241 Categorical Data Analysis I Two-way Contingency Tables. Odds Ratio and Tests of Independence ST3241 Categorical Data Analysis I Two-way Contingency Tables Odds Ratio and Tests of Independence 1 Inference For Odds Ratio (p. 24) For small to moderate sample size, the distribution of sample odds

More information

Frequency Distribution Cross-Tabulation

Frequency Distribution Cross-Tabulation Frequency Distribution Cross-Tabulation 1) Overview 2) Frequency Distribution 3) Statistics Associated with Frequency Distribution i. Measures of Location ii. Measures of Variability iii. Measures of Shape

More information

STAT 705: Analysis of Contingency Tables

STAT 705: Analysis of Contingency Tables STAT 705: Analysis of Contingency Tables Timothy Hanson Department of Statistics, University of South Carolina Stat 705: Analysis of Contingency Tables 1 / 45 Outline of Part I: models and parameters Basic

More information

BIOS 625 Fall 2015 Homework Set 3 Solutions

BIOS 625 Fall 2015 Homework Set 3 Solutions BIOS 65 Fall 015 Homework Set 3 Solutions 1. Agresti.0 Table.1 is from an early study on the death penalty in Florida. Analyze these data and show that Simpson s Paradox occurs. Death Penalty Victim's

More information

STA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis. 1. Indicate whether each of the following is true (T) or false (F).

STA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis. 1. Indicate whether each of the following is true (T) or false (F). STA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis 1. Indicate whether each of the following is true (T) or false (F). (a) (b) (c) (d) (e) In 2 2 tables, statistical independence is equivalent

More information

Ordinal Variables in 2 way Tables

Ordinal Variables in 2 way Tables Ordinal Variables in 2 way Tables Edps/Psych/Soc 589 Carolyn J. Anderson Department of Educational Psychology c Board of Trustees, University of Illinois Fall 2018 C.J. Anderson (Illinois) Ordinal Variables

More information

STA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis. 1. Indicate whether each of the following is true (T) or false (F).

STA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis. 1. Indicate whether each of the following is true (T) or false (F). STA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis 1. Indicate whether each of the following is true (T) or false (F). (a) T In 2 2 tables, statistical independence is equivalent to a population

More information

E509A: Principle of Biostatistics. (Week 11(2): Introduction to non-parametric. methods ) GY Zou.

E509A: Principle of Biostatistics. (Week 11(2): Introduction to non-parametric. methods ) GY Zou. E509A: Principle of Biostatistics (Week 11(2): Introduction to non-parametric methods ) GY Zou gzou@robarts.ca Sign test for two dependent samples Ex 12.1 subj 1 2 3 4 5 6 7 8 9 10 baseline 166 135 189

More information

Textbook Examples of. SPSS Procedure

Textbook Examples of. SPSS Procedure Textbook s of IBM SPSS Procedures Each SPSS procedure listed below has its own section in the textbook. These sections include a purpose statement that describes the statistical test, identification of

More information

One-Way ANOVA. Some examples of when ANOVA would be appropriate include:

One-Way ANOVA. Some examples of when ANOVA would be appropriate include: One-Way ANOVA 1. Purpose Analysis of variance (ANOVA) is used when one wishes to determine whether two or more groups (e.g., classes A, B, and C) differ on some outcome of interest (e.g., an achievement

More information

Small n, σ known or unknown, underlying nongaussian

Small n, σ known or unknown, underlying nongaussian READY GUIDE Summary Tables SUMMARY-1: Methods to compute some confidence intervals Parameter of Interest Conditions 95% CI Proportion (π) Large n, p 0 and p 1 Equation 12.11 Small n, any p Figure 12-4

More information

n y π y (1 π) n y +ylogπ +(n y)log(1 π).

n y π y (1 π) n y +ylogπ +(n y)log(1 π). Tests for a binomial probability π Let Y bin(n,π). The likelihood is L(π) = n y π y (1 π) n y and the log-likelihood is L(π) = log n y +ylogπ +(n y)log(1 π). So L (π) = y π n y 1 π. 1 Solving for π gives

More information

3 Way Tables Edpsy/Psych/Soc 589

3 Way Tables Edpsy/Psych/Soc 589 3 Way Tables Edpsy/Psych/Soc 589 Carolyn J. Anderson Department of Educational Psychology I L L I N O I S university of illinois at urbana-champaign c Board of Trustees, University of Illinois Spring 2017

More information

Discrete Multivariate Statistics

Discrete Multivariate Statistics Discrete Multivariate Statistics Univariate Discrete Random variables Let X be a discrete random variable which, in this module, will be assumed to take a finite number of t different values which are

More information

Research Methodology: Tools

Research Methodology: Tools MSc Business Administration Research Methodology: Tools Applied Data Analysis (with SPSS) Lecture 05: Contingency Analysis March 2014 Prof. Dr. Jürg Schwarz Lic. phil. Heidi Bruderer Enzler Contents Slide

More information

Sections 2.3, 2.4. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 770: Categorical Data Analysis 1 / 21

Sections 2.3, 2.4. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 770: Categorical Data Analysis 1 / 21 Sections 2.3, 2.4 Timothy Hanson Department of Statistics, University of South Carolina Stat 770: Categorical Data Analysis 1 / 21 2.3 Partial association in stratified 2 2 tables In describing a relationship

More information

BIOMETRICS INFORMATION

BIOMETRICS INFORMATION BIOMETRICS INFORMATION (You re 95% likely to need this information) PAMPHLET NO. # 41 DATE: September 18, 1992 SUBJECT: Power Analysis and Sample Size Determination for Contingency Table Tests Statistical

More information

Chapter 2: Describing Contingency Tables - II

Chapter 2: Describing Contingency Tables - II : Describing Contingency Tables - II Dipankar Bandyopadhyay Department of Biostatistics, Virginia Commonwealth University BIOS 625: Categorical Data & GLM [Acknowledgements to Tim Hanson and Haitao Chu]

More information

Some comments on Partitioning

Some comments on Partitioning Some comments on Partitioning Dipankar Bandyopadhyay Department of Biostatistics, Virginia Commonwealth University BIOS 625: Categorical Data & GLM 1/30 Partitioning Chi-Squares We have developed tests

More information

Categorical Data Analysis Chapter 3

Categorical Data Analysis Chapter 3 Categorical Data Analysis Chapter 3 The actual coverage probability is usually a bit higher than the nominal level. Confidence intervals for association parameteres Consider the odds ratio in the 2x2 table,

More information

Sections 3.4, 3.5. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 770: Categorical Data Analysis

Sections 3.4, 3.5. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 770: Categorical Data Analysis Sections 3.4, 3.5 Timothy Hanson Department of Statistics, University of South Carolina Stat 770: Categorical Data Analysis 1 / 22 3.4 I J tables with ordinal outcomes Tests that take advantage of ordinal

More information

Person-Time Data. Incidence. Cumulative Incidence: Example. Cumulative Incidence. Person-Time Data. Person-Time Data

Person-Time Data. Incidence. Cumulative Incidence: Example. Cumulative Incidence. Person-Time Data. Person-Time Data Person-Time Data CF Jeff Lin, MD., PhD. Incidence 1. Cumulative incidence (incidence proportion) 2. Incidence density (incidence rate) December 14, 2005 c Jeff Lin, MD., PhD. c Jeff Lin, MD., PhD. Person-Time

More information

13.1 Categorical Data and the Multinomial Experiment

13.1 Categorical Data and the Multinomial Experiment Chapter 13 Categorical Data Analysis 13.1 Categorical Data and the Multinomial Experiment Recall Variable: (numerical) variable (i.e. # of students, temperature, height,). (non-numerical, categorical)

More information

MARGINAL HOMOGENEITY MODEL FOR ORDERED CATEGORIES WITH OPEN ENDS IN SQUARE CONTINGENCY TABLES

MARGINAL HOMOGENEITY MODEL FOR ORDERED CATEGORIES WITH OPEN ENDS IN SQUARE CONTINGENCY TABLES REVSTAT Statistical Journal Volume 13, Number 3, November 2015, 233 243 MARGINAL HOMOGENEITY MODEL FOR ORDERED CATEGORIES WITH OPEN ENDS IN SQUARE CONTINGENCY TABLES Authors: Serpil Aktas Department of

More information

Inference for Binomial Parameters

Inference for Binomial Parameters Inference for Binomial Parameters Dipankar Bandyopadhyay, Ph.D. Department of Biostatistics, Virginia Commonwealth University D. Bandyopadhyay (VCU) BIOS 625: Categorical Data & GLM 1 / 58 Inference for

More information

STAT Chapter 13: Categorical Data. Recall we have studied binomial data, in which each trial falls into one of 2 categories (success/failure).

STAT Chapter 13: Categorical Data. Recall we have studied binomial data, in which each trial falls into one of 2 categories (success/failure). STAT 515 -- Chapter 13: Categorical Data Recall we have studied binomial data, in which each trial falls into one of 2 categories (success/failure). Many studies allow for more than 2 categories. Example

More information

Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami

Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami Parametric Assumptions The observations must be independent. Dependent variable should be continuous

More information

Statistical. Psychology

Statistical. Psychology SEVENTH у *i km m it* & П SB Й EDITION Statistical M e t h o d s for Psychology D a v i d C. Howell University of Vermont ; \ WADSWORTH f% CENGAGE Learning* Australia Biaall apan Korea Меяко Singapore

More information

Chi-Square. Heibatollah Baghi, and Mastee Badii

Chi-Square. Heibatollah Baghi, and Mastee Badii 1 Chi-Square Heibatollah Baghi, and Mastee Badii Different Scales, Different Measures of Association Scale of Both Variables Nominal Scale Measures of Association Pearson Chi-Square: χ 2 Ordinal Scale

More information

Relate Attributes and Counts

Relate Attributes and Counts Relate Attributes and Counts This procedure is designed to summarize data that classifies observations according to two categorical factors. The data may consist of either: 1. Two Attribute variables.

More information

UNIVERSITY OF TORONTO. Faculty of Arts and Science APRIL 2010 EXAMINATIONS STA 303 H1S / STA 1002 HS. Duration - 3 hours. Aids Allowed: Calculator

UNIVERSITY OF TORONTO. Faculty of Arts and Science APRIL 2010 EXAMINATIONS STA 303 H1S / STA 1002 HS. Duration - 3 hours. Aids Allowed: Calculator UNIVERSITY OF TORONTO Faculty of Arts and Science APRIL 2010 EXAMINATIONS STA 303 H1S / STA 1002 HS Duration - 3 hours Aids Allowed: Calculator LAST NAME: FIRST NAME: STUDENT NUMBER: There are 27 pages

More information

Cohen s s Kappa and Log-linear Models

Cohen s s Kappa and Log-linear Models Cohen s s Kappa and Log-linear Models HRP 261 03/03/03 10-11 11 am 1. Cohen s Kappa Actual agreement = sum of the proportions found on the diagonals. π ii Cohen: Compare the actual agreement with the chance

More information

Measuring relationships among multiple responses

Measuring relationships among multiple responses Measuring relationships among multiple responses Linear association (correlation, relatedness, shared information) between pair-wise responses is an important property used in almost all multivariate analyses.

More information

A bias-correction for Cramér s V and Tschuprow s T

A bias-correction for Cramér s V and Tschuprow s T A bias-correction for Cramér s V and Tschuprow s T Wicher Bergsma London School of Economics and Political Science Abstract Cramér s V and Tschuprow s T are closely related nominal variable association

More information

STAC51: Categorical data Analysis

STAC51: Categorical data Analysis STAC51: Categorical data Analysis Mahinda Samarakoon January 26, 2016 Mahinda Samarakoon STAC51: Categorical data Analysis 1 / 32 Table of contents Contingency Tables 1 Contingency Tables Mahinda Samarakoon

More information

CDA Chapter 3 part II

CDA Chapter 3 part II CDA Chapter 3 part II Two-way tables with ordered classfications Let u 1 u 2... u I denote scores for the row variable X, and let ν 1 ν 2... ν J denote column Y scores. Consider the hypothesis H 0 : X

More information

LISA Short Course Series Generalized Linear Models (GLMs) & Categorical Data Analysis (CDA) in R. Liang (Sally) Shan Nov. 4, 2014

LISA Short Course Series Generalized Linear Models (GLMs) & Categorical Data Analysis (CDA) in R. Liang (Sally) Shan Nov. 4, 2014 LISA Short Course Series Generalized Linear Models (GLMs) & Categorical Data Analysis (CDA) in R Liang (Sally) Shan Nov. 4, 2014 L Laboratory for Interdisciplinary Statistical Analysis LISA helps VT researchers

More information

Review of One-way Tables and SAS

Review of One-way Tables and SAS Stat 504, Lecture 7 1 Review of One-way Tables and SAS In-class exercises: Ex1, Ex2, and Ex3 from http://v8doc.sas.com/sashtml/proc/z0146708.htm To calculate p-value for a X 2 or G 2 in SAS: http://v8doc.sas.com/sashtml/lgref/z0245929.htmz0845409

More information

Statistics in medicine

Statistics in medicine Statistics in medicine Lecture 3: Bivariate association : Categorical variables Proportion in one group One group is measured one time: z test Use the z distribution as an approximation to the binomial

More information

Three-Way Contingency Tables

Three-Way Contingency Tables Newsom PSY 50/60 Categorical Data Analysis, Fall 06 Three-Way Contingency Tables Three-way contingency tables involve three binary or categorical variables. I will stick mostly to the binary case to keep

More information

Analysis of Categorical Data. Nick Jackson University of Southern California Department of Psychology 10/11/2013

Analysis of Categorical Data. Nick Jackson University of Southern California Department of Psychology 10/11/2013 Analysis of Categorical Data Nick Jackson University of Southern California Department of Psychology 10/11/2013 1 Overview Data Types Contingency Tables Logit Models Binomial Ordinal Nominal 2 Things not

More information

The concord Package. August 20, 2006

The concord Package. August 20, 2006 The concord Package August 20, 2006 Version 1.4-6 Date 2006-08-15 Title Concordance and reliability Author , Ian Fellows Maintainer Measures

More information

Measure for No Three-Factor Interaction Model in Three-Way Contingency Tables

Measure for No Three-Factor Interaction Model in Three-Way Contingency Tables American Journal of Biostatistics (): 7-, 00 ISSN 948-9889 00 Science Publications Measure for No Three-Factor Interaction Model in Three-Way Contingency Tables Kouji Yamamoto, Kyoji Hori and Sadao Tomizawa

More information

Decomposition of Parsimonious Independence Model Using Pearson, Kendall and Spearman s Correlations for Two-Way Contingency Tables

Decomposition of Parsimonious Independence Model Using Pearson, Kendall and Spearman s Correlations for Two-Way Contingency Tables International Journal of Statistics and Probability; Vol. 7 No. 3; May 208 ISSN 927-7032 E-ISSN 927-7040 Published by Canadian Center of Science and Education Decomposition of Parsimonious Independence

More information

Epidemiology Principle of Biostatistics Chapter 14 - Dependent Samples and effect measures. John Koval

Epidemiology Principle of Biostatistics Chapter 14 - Dependent Samples and effect measures. John Koval Epidemiology 9509 Principle of Biostatistics Chapter 14 - Dependent Samples and effect measures John Koval Department of Epidemiology and Biostatistics University of Western Ontario What is being covered

More information

Suppose that we are concerned about the effects of smoking. How could we deal with this?

Suppose that we are concerned about the effects of smoking. How could we deal with this? Suppose that we want to study the relationship between coffee drinking and heart attacks in adult males under 55. In particular, we want to know if there is an association between coffee drinking and heart

More information

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007)

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007) FROM: PAGANO, R. R. (007) I. INTRODUCTION: DISTINCTION BETWEEN PARAMETRIC AND NON-PARAMETRIC TESTS Statistical inference tests are often classified as to whether they are parametric or nonparametric Parameter

More information

Multinomial Logistic Regression Models

Multinomial Logistic Regression Models Stat 544, Lecture 19 1 Multinomial Logistic Regression Models Polytomous responses. Logistic regression can be extended to handle responses that are polytomous, i.e. taking r>2 categories. (Note: The word

More information

Describing Stratified Multiple Responses for Sparse Data

Describing Stratified Multiple Responses for Sparse Data Describing Stratified Multiple Responses for Sparse Data Ivy Liu School of Mathematical and Computing Sciences Victoria University Wellington, New Zealand June 28, 2004 SUMMARY Surveys often contain qualitative

More information

Analysis of Categorical Data Three-Way Contingency Table

Analysis of Categorical Data Three-Way Contingency Table Yu Lecture 4 p. 1/17 Analysis of Categorical Data Three-Way Contingency Table Yu Lecture 4 p. 2/17 Outline Three way contingency tables Simpson s paradox Marginal vs. conditional independence Homogeneous

More information

SAS/STAT 14.1 User s Guide. Introduction to Nonparametric Analysis

SAS/STAT 14.1 User s Guide. Introduction to Nonparametric Analysis SAS/STAT 14.1 User s Guide Introduction to Nonparametric Analysis This document is an individual chapter from SAS/STAT 14.1 User s Guide. The correct bibliographic citation for this manual is as follows:

More information

Didacticiel - Études de cas. In this tutorial, we show how to use TANAGRA ( and higher) for measuring the association between ordinal variables.

Didacticiel - Études de cas. In this tutorial, we show how to use TANAGRA ( and higher) for measuring the association between ordinal variables. Subject Association measures for ordinal variables. In this tutorial, we show how to use TANAGRA (1.4.19 and higher) for measuring the association between ordinal variables. All the measures that we present

More information

Chapter 19. Agreement and the kappa statistic

Chapter 19. Agreement and the kappa statistic 19. Agreement Chapter 19 Agreement and the kappa statistic Besides the 2 2contingency table for unmatched data and the 2 2table for matched data, there is a third common occurrence of data appearing summarised

More information

Chapter 2: Describing Contingency Tables - I

Chapter 2: Describing Contingency Tables - I : Describing Contingency Tables - I Dipankar Bandyopadhyay Department of Biostatistics, Virginia Commonwealth University BIOS 625: Categorical Data & GLM [Acknowledgements to Tim Hanson and Haitao Chu]

More information

8 Nominal and Ordinal Logistic Regression

8 Nominal and Ordinal Logistic Regression 8 Nominal and Ordinal Logistic Regression 8.1 Introduction If the response variable is categorical, with more then two categories, then there are two options for generalized linear models. One relies on

More information

E509A: Principle of Biostatistics. GY Zou

E509A: Principle of Biostatistics. GY Zou E509A: Principle of Biostatistics (Week 4: Inference for a single mean ) GY Zou gzou@srobarts.ca Example 5.4. (p. 183). A random sample of n =16, Mean I.Q is 106 with standard deviation S =12.4. What

More information

Analytic Methods for Applied Epidemiology: Framework and Contingency Table Analysis

Analytic Methods for Applied Epidemiology: Framework and Contingency Table Analysis Analytic Methods for Applied Epidemiology: Framework and Contingency Table Analysis 2014 Maternal and Child Health Epidemiology Training Pre-Training Webinar: Friday, May 16 2-4pm Eastern Kristin Rankin,

More information

Statistical Methods in Clinical Trials Categorical Data

Statistical Methods in Clinical Trials Categorical Data Statistical Methods in Clinical Trials Categorical Data Types of Data quantitative Continuous Blood pressure Time to event Categorical sex qualitative Discrete No of relapses Ordered Categorical Pain level

More information

Hypothesis Testing, Power, Sample Size and Confidence Intervals (Part 2)

Hypothesis Testing, Power, Sample Size and Confidence Intervals (Part 2) Hypothesis Testing, Power, Sample Size and Confidence Intervals (Part 2) B.H. Robbins Scholars Series June 23, 2010 1 / 29 Outline Z-test χ 2 -test Confidence Interval Sample size and power Relative effect

More information

Nemours Biomedical Research Statistics Course. Li Xie Nemours Biostatistics Core October 14, 2014

Nemours Biomedical Research Statistics Course. Li Xie Nemours Biostatistics Core October 14, 2014 Nemours Biomedical Research Statistics Course Li Xie Nemours Biostatistics Core October 14, 2014 Outline Recap Introduction to Logistic Regression Recap Descriptive statistics Variable type Example of

More information

STATISTICS ( CODE NO. 08 ) PAPER I PART - I

STATISTICS ( CODE NO. 08 ) PAPER I PART - I STATISTICS ( CODE NO. 08 ) PAPER I PART - I 1. Descriptive Statistics Types of data - Concepts of a Statistical population and sample from a population ; qualitative and quantitative data ; nominal and

More information

Introduction to Statistical Data Analysis Lecture 7: The Chi-Square Distribution

Introduction to Statistical Data Analysis Lecture 7: The Chi-Square Distribution Introduction to Statistical Data Analysis Lecture 7: The Chi-Square Distribution James V. Lambers Department of Mathematics The University of Southern Mississippi James V. Lambers Statistical Data Analysis

More information

STAT 7030: Categorical Data Analysis

STAT 7030: Categorical Data Analysis STAT 7030: Categorical Data Analysis 5. Logistic Regression Peng Zeng Department of Mathematics and Statistics Auburn University Fall 2012 Peng Zeng (Auburn University) STAT 7030 Lecture Notes Fall 2012

More information

Unit 9: Inferences for Proportions and Count Data

Unit 9: Inferences for Proportions and Count Data Unit 9: Inferences for Proportions and Count Data Statistics 571: Statistical Methods Ramón V. León 12/15/2008 Unit 9 - Stat 571 - Ramón V. León 1 Large Sample Confidence Interval for Proportion ( pˆ p)

More information

Procedia - Social and Behavioral Sciences 109 ( 2014 )

Procedia - Social and Behavioral Sciences 109 ( 2014 ) Available online at www.sciencedirect.com ScienceDirect Procedia - Social and Behavioral Sciences 09 ( 04 ) 730 736 nd World Conference On Business, Economics And Management - WCBEM 03 Categorical Principal

More information

In many situations, there is a non-parametric test that corresponds to the standard test, as described below:

In many situations, there is a non-parametric test that corresponds to the standard test, as described below: There are many standard tests like the t-tests and analyses of variance that are commonly used. They rest on assumptions like normality, which can be hard to assess: for example, if you have small samples,

More information

Readings Howitt & Cramer (2014) Overview

Readings Howitt & Cramer (2014) Overview Readings Howitt & Cramer (4) Ch 7: Relationships between two or more variables: Diagrams and tables Ch 8: Correlation coefficients: Pearson correlation and Spearman s rho Ch : Statistical significance

More information

Logistic regression analysis. Birthe Lykke Thomsen H. Lundbeck A/S

Logistic regression analysis. Birthe Lykke Thomsen H. Lundbeck A/S Logistic regression analysis Birthe Lykke Thomsen H. Lundbeck A/S 1 Response with only two categories Example Odds ratio and risk ratio Quantitative explanatory variable More than one variable Logistic

More information

Confidence Intervals of the Simple Difference between the Proportions of a Primary Infection and a Secondary Infection, Given the Primary Infection

Confidence Intervals of the Simple Difference between the Proportions of a Primary Infection and a Secondary Infection, Given the Primary Infection Biometrical Journal 42 (2000) 1, 59±69 Confidence Intervals of the Simple Difference between the Proportions of a Primary Infection and a Secondary Infection, Given the Primary Infection Kung-Jong Lui

More information

Optimal exact tests for complex alternative hypotheses on cross tabulated data

Optimal exact tests for complex alternative hypotheses on cross tabulated data Optimal exact tests for complex alternative hypotheses on cross tabulated data Daniel Yekutieli Statistics and OR Tel Aviv University CDA course 29 July 2017 Yekutieli (TAU) Optimal exact tests for complex

More information

Epidemiology Wonders of Biostatistics Chapter 11 (continued) - probability in a single population. John Koval

Epidemiology Wonders of Biostatistics Chapter 11 (continued) - probability in a single population. John Koval Epidemiology 9509 Wonders of Biostatistics Chapter 11 (continued) - probability in a single population John Koval Department of Epidemiology and Biostatistics University of Western Ontario What is being

More information

Introduction to Nonparametric Analysis (Chapter)

Introduction to Nonparametric Analysis (Chapter) SAS/STAT 9.3 User s Guide Introduction to Nonparametric Analysis (Chapter) SAS Documentation This document is an individual chapter from SAS/STAT 9.3 User s Guide. The correct bibliographic citation for

More information

Unit 9: Inferences for Proportions and Count Data

Unit 9: Inferences for Proportions and Count Data Unit 9: Inferences for Proportions and Count Data Statistics 571: Statistical Methods Ramón V. León 1/15/008 Unit 9 - Stat 571 - Ramón V. León 1 Large Sample Confidence Interval for Proportion ( pˆ p)

More information

SAS Analysis Examples Replication C8. * SAS Analysis Examples Replication for ASDA 2nd Edition * Berglund April 2017 * Chapter 8 ;

SAS Analysis Examples Replication C8. * SAS Analysis Examples Replication for ASDA 2nd Edition * Berglund April 2017 * Chapter 8 ; SAS Analysis Examples Replication C8 * SAS Analysis Examples Replication for ASDA 2nd Edition * Berglund April 2017 * Chapter 8 ; libname ncsr "P:\ASDA 2\Data sets\ncsr\" ; data c8_ncsr ; set ncsr.ncsr_sub_13nov2015

More information

Describing Contingency tables

Describing Contingency tables Today s topics: Describing Contingency tables 1. Probability structure for contingency tables (distributions, sensitivity/specificity, sampling schemes). 2. Comparing two proportions (relative risk, odds

More information

ij i j m ij n ij m ij n i j Suppose we denote the row variable by X and the column variable by Y ; We can then re-write the above expression as

ij i j m ij n ij m ij n i j Suppose we denote the row variable by X and the column variable by Y ; We can then re-write the above expression as page1 Loglinear Models Loglinear models are a way to describe association and interaction patterns among categorical variables. They are commonly used to model cell counts in contingency tables. These

More information

Lecture 22. December 19, Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University.

Lecture 22. December 19, Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University. Lecture 22 Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University December 19, 2007 1 2 3 4 5 6 7 8 9 1 tests for equivalence of two binomial 2 tests for,

More information

Entering and recoding variables

Entering and recoding variables Entering and recoding variables To enter: You create a New data file Define the variables on Variable View Enter the values on Data View To create the dichotomies: Transform -> Recode into Different Variable

More information

Non-parametric methods

Non-parametric methods Eastern Mediterranean University Faculty of Medicine Biostatistics course Non-parametric methods March 4&7, 2016 Instructor: Dr. Nimet İlke Akçay (ilke.cetin@emu.edu.tr) Learning Objectives 1. Distinguish

More information

Good Confidence Intervals for Categorical Data Analyses. Alan Agresti

Good Confidence Intervals for Categorical Data Analyses. Alan Agresti Good Confidence Intervals for Categorical Data Analyses Alan Agresti Department of Statistics, University of Florida visiting Statistics Department, Harvard University LSHTM, July 22, 2011 p. 1/36 Outline

More information

Statistics 3858 : Contingency Tables

Statistics 3858 : Contingency Tables Statistics 3858 : Contingency Tables 1 Introduction Before proceeding with this topic the student should review generalized likelihood ratios ΛX) for multinomial distributions, its relation to Pearson

More information

Count data page 1. Count data. 1. Estimating, testing proportions

Count data page 1. Count data. 1. Estimating, testing proportions Count data page 1 Count data 1. Estimating, testing proportions 100 seeds, 45 germinate. We estimate probability p that a plant will germinate to be 0.45 for this population. Is a 50% germination rate

More information

(Where does Ch. 7 on comparing 2 means or 2 proportions fit into this?)

(Where does Ch. 7 on comparing 2 means or 2 proportions fit into this?) 12. Comparing Groups: Analysis of Variance (ANOVA) Methods Response y Explanatory x var s Method Categorical Categorical Contingency tables (Ch. 8) (chi-squared, etc.) Quantitative Quantitative Regression

More information

N Utilization of Nursing Research in Advanced Practice, Summer 2008

N Utilization of Nursing Research in Advanced Practice, Summer 2008 University of Michigan Deep Blue deepblue.lib.umich.edu 2008-07 536 - Utilization of ursing Research in Advanced Practice, Summer 2008 Tzeng, Huey-Ming Tzeng, H. (2008, ctober 1). Utilization of ursing

More information

Readings Howitt & Cramer (2014)

Readings Howitt & Cramer (2014) Readings Howitt & Cramer (014) Ch 7: Relationships between two or more variables: Diagrams and tables Ch 8: Correlation coefficients: Pearson correlation and Spearman s rho Ch 11: Statistical significance

More information

Short Course Introduction to Categorical Data Analysis

Short Course Introduction to Categorical Data Analysis Short Course Introduction to Categorical Data Analysis Alan Agresti Distinguished Professor Emeritus University of Florida, USA Presented for ESALQ/USP, Piracicaba Brazil March 8-10, 2016 c Alan Agresti,

More information

Data are sometimes not compatible with the assumptions of parametric statistical tests (i.e. t-test, regression, ANOVA)

Data are sometimes not compatible with the assumptions of parametric statistical tests (i.e. t-test, regression, ANOVA) BSTT523 Pagano & Gauvreau Chapter 13 1 Nonparametric Statistics Data are sometimes not compatible with the assumptions of parametric statistical tests (i.e. t-test, regression, ANOVA) In particular, data

More information

1 A Review of Correlation and Regression

1 A Review of Correlation and Regression 1 A Review of Correlation and Regression SW, Chapter 12 Suppose we select n = 10 persons from the population of college seniors who plan to take the MCAT exam. Each takes the test, is coached, and then

More information

Tests for Two Correlated Proportions in a Matched Case- Control Design

Tests for Two Correlated Proportions in a Matched Case- Control Design Chapter 155 Tests for Two Correlated Proportions in a Matched Case- Control Design Introduction A 2-by-M case-control study investigates a risk factor relevant to the development of a disease. A population

More information

Passing-Bablok Regression for Method Comparison

Passing-Bablok Regression for Method Comparison Chapter 313 Passing-Bablok Regression for Method Comparison Introduction Passing-Bablok regression for method comparison is a robust, nonparametric method for fitting a straight line to two-dimensional

More information

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages: Glossary The ISI glossary of statistical terms provides definitions in a number of different languages: http://isi.cbs.nl/glossary/index.htm Adjusted r 2 Adjusted R squared measures the proportion of the

More information

General Linear Model (Chapter 4)

General Linear Model (Chapter 4) General Linear Model (Chapter 4) Outcome variable is considered continuous Simple linear regression Scatterplots OLS is BLUE under basic assumptions MSE estimates residual variance testing regression coefficients

More information

Two Correlated Proportions Non- Inferiority, Superiority, and Equivalence Tests

Two Correlated Proportions Non- Inferiority, Superiority, and Equivalence Tests Chapter 59 Two Correlated Proportions on- Inferiority, Superiority, and Equivalence Tests Introduction This chapter documents three closely related procedures: non-inferiority tests, superiority (by a

More information