Session 3 The proportional odds model and the Mann-Whitney test

Similar documents
Session 9 Power and sample size

Introduction and Descriptive Statistics p. 1 Introduction to Statistics p. 3 Statistics, Science, and Observations p. 5 Populations and Samples p.

CDA Chapter 3 part II

Topic 21 Goodness of Fit

APPENDIX B Sample-Size Calculation Methods: Classical Design

Recall that in order to prove Theorem 8.8, we argued that under certain regularity conditions, the following facts are true under H 0 : 1 n

Central Limit Theorem ( 5.3)

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007)

A Very Brief Summary of Statistical Inference, and Examples

A3. Statistical Inference Hypothesis Testing for General Population Parameters

Statistics Handbook. All statistical tables were computed by the author.

DETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics

Inferential Statistics

PSY 307 Statistics for the Behavioral Sciences. Chapter 20 Tests for Ranked Data, Choosing Statistical Tests

Nonparametric Location Tests: k-sample

Unit 14: Nonparametric Statistical Methods

Categorical Data Analysis Chapter 3

STATISTICS 4, S4 (4769) A2

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics

Reports of the Institute of Biostatistics

simple if it completely specifies the density of x

4.5.1 The use of 2 log Λ when θ is scalar

Ling 289 Contingency Table Statistics

Summary of Chapters 7-9

Unit 9: Inferences for Proportions and Count Data

Lecture 7: Hypothesis Testing and ANOVA

TUTORIAL 8 SOLUTIONS #

Wald s theorem and the Asimov data set

STAT 135 Lab 7 Distributions derived from the normal distribution, and comparing independent samples.

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

STA 303 H1S / 1002 HS Winter 2011 Test March 7, ab 1cde 2abcde 2fghij 3

Introduction to Statistical Analysis. Cancer Research UK 12 th of February 2018 D.-L. Couturier / M. Eldridge / M. Fernandes [Bioinformatics core]

Transition Passage to Descriptive Statistics 28

Modeling and inference for an ordinal effect size measure

BIOS 625 Fall 2015 Homework Set 3 Solutions

Inferences for Proportions and Count Data

Statistical Inference: Estimation and Confidence Intervals Hypothesis Testing

Testing Independence

THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE

Simple logistic regression

Fish SR P Diff Sgn rank Fish SR P Diff Sng rank

3 Joint Distributions 71

Psychology 282 Lecture #4 Outline Inferences in SLR

= 1 i. normal approximation to χ 2 df > df

ST3241 Categorical Data Analysis I Two-way Contingency Tables. Odds Ratio and Tests of Independence

Linear rank statistics

PhD course in Advanced survival analysis. One-sample tests. Properties. Idea: (ABGK, sect. V.1.1) Counting process N(t)

Two Correlated Proportions Non- Inferiority, Superiority, and Equivalence Tests

Maximum-Likelihood Estimation: Basic Ideas

Statistics 135 Fall 2008 Final Exam

MISCELLANEOUS TOPICS RELATED TO LIKELIHOOD. Copyright c 2012 (Iowa State University) Statistics / 30

Comparison of Two Samples

Contents. Acknowledgments. xix

Institute of Actuaries of India

Sections 3.4, 3.5. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 770: Categorical Data Analysis

TA: Sheng Zhgang (Th 1:20) / 342 (W 1:20) / 343 (W 2:25) / 344 (W 12:05) Haoyang Fan (W 1:20) / 346 (Th 12:05) FINAL EXAM

SAS Analysis Examples Replication C8. * SAS Analysis Examples Replication for ASDA 2nd Edition * Berglund April 2017 * Chapter 8 ;

ST3241 Categorical Data Analysis I Generalized Linear Models. Introduction and Some Examples

Parametric Modelling of Over-dispersed Count Data. Part III / MMath (Applied Statistics) 1

1/24/2008. Review of Statistical Inference. C.1 A Sample of Data. C.2 An Econometric Model. C.4 Estimating the Population Variance and Other Moments

Statistical Procedures for Testing Homogeneity of Water Quality Parameters

DA Freedman Notes on the MLE Fall 2003

Kumaun University Nainital

Unit 9: Inferences for Proportions and Count Data

STA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis. 1. Indicate whether each of the following is true (T) or false (F).

APPENDICES APPENDIX A. STATISTICAL TABLES AND CHARTS 651 APPENDIX B. BIBLIOGRAPHY 677 APPENDIX C. ANSWERS TO SELECTED EXERCISES 679

PRINCIPLES OF STATISTICAL INFERENCE

Lecture 26. December 19, Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University.

Lecture 32: Asymptotic confidence sets and likelihoods

ST4241 Design and Analysis of Clinical Trials Lecture 7: N. Lecture 7: Non-parametric tests for PDG data

Frequency table: Var2 (Spreadsheet1) Count Cumulative Percent Cumulative From To. Percent <x<=

Review for Final. Chapter 1 Type of studies: anecdotal, observational, experimental Random sampling

Bayesian inference for sample surveys. Roderick Little Module 2: Bayesian models for simple random samples

Non-parametric (Distribution-free) approaches p188 CN

Master s Written Examination - Solution

UNIVERSITY OF TORONTO. Faculty of Arts and Science APRIL 2010 EXAMINATIONS STA 303 H1S / STA 1002 HS. Duration - 3 hours. Aids Allowed: Calculator

Textbook Examples of. SPSS Procedure

Nonparametric Statistics. Leah Wright, Tyler Ross, Taylor Brown

Master s Written Examination

Analyzing Small Sample Experimental Data

Formulas and Tables by Mario F. Triola

4 Hypothesis testing. 4.1 Types of hypothesis and types of error 4 HYPOTHESIS TESTING 49

STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots. March 8, 2015

UNIVERSITY OF TORONTO Faculty of Arts and Science

Statistics 3858 : Contingency Tables

Introduction to Statistical Analysis

1 Comparing two binomials

TMA4255 Applied Statistics V2016 (23)

Chapter 4: Constrained estimators and tests in the multiple linear regression model (Part III)

Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami

5 Introduction to the Theory of Order Statistics and Rank Statistics

Nonparametric tests. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 704: Data Analysis I

Non-parametric Inference and Resampling

Statistics of Contingency Tables - Extension to I x J. stat 557 Heike Hofmann

Confidence Intervals, Testing and ANOVA Summary

A Review of the Behrens-Fisher Problem and Some of Its Analogs: Does the Same Size Fit All?

Physics 509: Non-Parametric Statistics and Correlation Testing

Agonistic Display in Betta splendens: Data Analysis I. Betta splendens Research: Parametric or Non-parametric Data?

Classical Inference for Gaussian Linear Models

Topic 15: Simple Hypotheses

Transcription:

Session 3 The proportional odds model and the Mann-Whitney test 3.1 A unified approach to inference 3.2 Analysis via dichotomisation 3.3 Proportional odds 3.4 Relationship with the Mann-Whitney test Session 3 1

3.1 A unified approach to inference Data (assumed here to be discrete): x = (x 1,, x n ) A single unknown parameter: θ Likelihood: L(θ; x) = probability of x, given θ Log-likelihood: l(θ) = log L(θ; x) Session 3 2

Suppose that θ is small (it might be a treatment effect) Taylor s expansion: l l l l 1 2 3 ( θ ) = (0) + θ (0) + 2 θ (0) + O( θ ) + θ θ 1 2 const. Z 2 V where Z = l (0), efficient score V = l (0), Fisher s information Session 3 3

For large samples and small θ Z ~ N(θV, V) approximately (Scharfstein et al., 1997) This is the basis for many common statistical tests: Pearson s Chi-squared test Armitage s trend test The logrank test The Mann-Whitney (or Wilcoxon) test and it leads to asymptotically efficient methods Session 3 4

Estimation of θ 1. Maximum likelihood estimate where ˆ θ l ( θ ˆ) = 0 2. Estimate based on score statistics Z V which has variance 1 V Session 3 5

Hypothesis testing To test the null hypothesis of H 0 : θ = 0 1. Likelihood ratio test 2. Score test 2 Z ~ V 3. Wald test χ 2 1 { ˆ } 2 1 W = 2log L(0) / L( θ) ~ χ θˆ s.e. ( θˆ ) 2 ~ χ 2 1 Session 3 6

For likelihoods with nuisance parameters: φ Replace log-likelihood with profile log-likelihood l( θ, φ) l( θ, φˆ ( θ)) where ˆφ(θ) is the maximum likelihood estimate of φ, given the value of θ l( θ, φˆ ( θ) ) is a function of θ only Session 3 7

Example: Binary data Control Group Treated Group Total Success s C s T s Failure f C f T f Total n C n T n Probability of success: p C and p T on C and T respectively log p (1 p ) T C θ = e p C(1 p T ) (log - odds ratio) Session 3 8

The unconditional likelihood of θ (comparison of binomial observations) leads to (see Session 2) Control Group Treated Group Total Success s C s T s Failure f C f T f Total n C n T n Z s f n s f n n sf = n T C C T C T = V 3 Session 3 9

The conditional likelihood of θ given s successes in total (hypergeometric distribution) leads to Control Group Treated Group Total Success s C s T s Failure f C f T f Total n C n T n Z = s f T C n s f C T n n sf V C T = n 2 (n 1) Session 3 10

3.2 Analysis via dichotomisation Head injury data from Example 1 GOS at 3 months GCS on entry 3-5 6-8 Total 1. Good Recovery 73 219 292 2. Moderate Disability 55 118 173 3. Severe Disability 79 66 145 4. Vegetative State 37 10 47 5. Dead 358 92 450 Total 602 505 1107 Session 3 11

1. Success is Good GOS GCS on entry 3-5 6-8 Total Success 73 219 292 Failure 529 286 815 Total 602 505 1107 θ = log odds of success for (GCS = 6-8) versus (GCS = 3-5) Z 1 = 85.8, V 1 = 53.4 Estimate of θ = Z 1 /V 1 = 1.61 Score test: Z 12 /V 1 = 138.1 (c.f. χ 2 on 1 df) Session 3 12

2. Success is Good or Moderate GOS GCS on entry 3-5 6-8 Total Success 128 337 465 Failure 474 168 642 Total 602 505 1107 θ = log odds of success for (GCS = 6-8) versus (GCS = 3-5) Z 2 = 124.9, V 2 = 67.0 Estimate of θ = Z 2 /V 2 = 1.87 Score test: Z 22 /V 2 = 232.8 (c.f. χ 2 on 1 df) Session 3 13

3. Failure is Vegetative or Dead GCS on entry 3-5 6-8 Total Success 207 403 610 Failure 395 102 497 Total 602 505 1107 θ = log odds of success for (GCS = 6-8) versus (GCS = 3-5) Z 3 = 124.7, V 3 = 68.0 Estimate of θ = Z 3 /V 3 = 1.83 Score test: Z 32 /V 3 = 228.7 (c.f. χ 2 on 1 df) Session 3 14

4. Failure is Dead GCS on entry 3-5 6-8 Total Success 224 413 657 Failure 358 92 450 Total 602 505 1107 θ = log odds of success for (GCS = 6-8) versus (GCS = 3-5) Z 4 = 113.3, V 4 = 66.3 Estimate of θ = Z 4 /V 4 = 1.71 Score test: Z 42 /V 4 = 193.6 (c.f. χ 2 on 1 df) Session 3 15

Session 3 16

Session 3 17

Analyses each indicate that GCS 6-8 is preferable to GCS 3-5 Magnitude of advantage, on the log-odds ratio scale, is consistent How can these four analyses be combined? Session 3 18

3.3 Proportional odds Notation Category Control Group Treated Group Total C 1 n 1C n 1T n 1 C 2 n 2C n 2T n 2 M M M M C m n mc n mt n m Total n C n T n Session 3 19

Let L kt = n 1T + + n (k-1)t, U kt = n (k+1)t + + n mt L kc = n 1C + + n (k-1)c, U kc = n (k+1)c + + n mc Thus, if Success is {C 1,, C k }, the derived 2 2 table is Control Group Treated Group Total Success L (k+1)c L (k+1)t s Failure U kc U kt f Total n C n T n Session 3 20

Let p kc = P(C k ; Control Group) Q kc = P(C k or Better; Control Group) = p 1C + + p kc, k = 1,, m; so that Q mc = 1 p kt and Q kt are defined similarly for treated group and Q kt (1 Q kc) θ k = log Q kc (1 Q kt ) k = 1,, m 1 Session 3 21

θ k is the log-odds ratio of Success where Success is {C 1,,C k } The proportional odds assumption is θ 1 = θ 2 = = θ m 1 = θ The common value, θ, is a measure of the advantage of being in the Treated Group > 0 Treated Group better θ = 0 no difference < 0 Treated Group worse Session 3 22

Score and information Using a marginal likelihood based on the ranks, with allowance for ties (Jones and Whitehead (1979)) the efficient score for θ is m 1 Z = n kc(lkt U kt ) n + 1 k = 1 and Fisher s information is m ntncn nk V = 1 2 3(n + 1) k= 1 n 3 Session 3 23

Application to Head Injury data 1 Z = {73 (0 118 66 10 92) 1108 + 55 (219 66 10 92) + 79 (219 + 118 10 92) + 37 (219 + 118 + 66 92) + 358 (219 + 118 + 66 + 10 0)} = 144.3 3 3 292 173 1 V = 602 505 1107 1107 1107 2 3 3 3 3 1108 145 47 450 1107 1107 1107 = 91.38 (1 0.0917) = 83.0 Session 3 24

Hence 2 Z 250.9 V = 2 larger than any individual 2 2 table, and c.f. very highly significant χ 1 Estimate of θ = Z V = 1.74 between the values from the 2 2 tables 95% confidence interval for θ is Z V ± 1.96 V = (1.52, 1.96) Note: The 2 2 tables contain 64%, 81%, 82% and 80% of total information respectively Session 3 25

The 2 2 table as a special case (m = 2) Control Group Treated Group Total Success s C s T s Failure f C f T f Total n C n T n 1 s f s f Z = sc ( 0 ft ) + fc ( st 0) = n + 1 n + 1 { } T C C T Note: n+1 instead of n in denominator Session 3 26

The 2 2 table as a special case (m = 2) Control Group Treated Group Total Success s C s T s Failure f C f T f Total n C n T n 3 3 ncn T s f ncntsf V = 1 2 = 2 3(n + 1) n n (n + 1) n Note: (n+1) 2 n instead of (n) 2 (n-1) in denominator Session 3 27

3.4 Relationship with the Mann-Whitney test (Wilcoxon, 1945; Mann and Whitney, 1947) Samples: x 1,..., x a (low values y 1,..., y b are good) Scores: Mann-Whitney statistic: 1if xi < y d = 0 if x = y + 1if xi > y ij i j M a b i= 1 j= 1 ij (other variations exist) j j = d var M = ab( a + b + 1) 3 Mann-Whitney test: 2 2 M var M c.f. χ 1 Session 3 28

Mann-Whitney test with ties Values : u 1 u 2... u m Total x s n 1C n 2C n mc n C y s n 1T n 2T n mt n T Total n 1 n 2 n m n M = (n + 1)Z Siegel (1957) gives variance of M with ties as m ( ) ( 3 ) ( ) C T C T j j j= 1 { } var M = n n n + 1 3 n n n n 3n n 1 n n n n n n j ( ) m m C T 3 3 = j + 3n n 1 j= 1 j= 1 Session 3 29

3 3 m n C n Tn j = 1 (n + 1) 3n(n 1) j 1 n = n 2 V Thus, the score test under the proportional odds model is the Mann-Whitney test Session 3 30