Outline. The ordinal package: Analyzing ordinal data. Ordinal scales are commonly used (attribute rating or liking) Ordinal data the wine data
|
|
- Nathaniel Payne
- 6 years ago
- Views:
Transcription
1 Outline Outline The ordinal package: Analyzing ordinal data Per Bruun Brockhoff DTU Compute Section for Statistics Technical University of Denmark August c Per Bruun Brockhoff (DTU) The ordinal package: Analyzing ordinal data DTU Sensometrics / 34 1 Income group data Soup data (Paired) Degree-of-difference data -Alternative choice Cumulative link models 3 4 Cumulative Link Mixed Models 5 Thurstonian -AC model via CLMMs c Per Bruun Brockhoff (DTU) The ordinal package: Analyzing ordinal data DTU Sensometrics 015 / 34 Ordinal scales are commonly used (attribute rating or liking) Ordinal data the wine data 5-point scales: 7-point scales: Randall, J (1989). The analysis of sensory data by generalised linear model. Biometrical journal 7, AND: included in ordinal pacakge. A sensory experiment: Temperature and contact between juice and skins can be controlled when crushing grapes during wine production. These factors are thought to affect the bitterness of the wine. 9-point scales: Objective: How does perceived bitterness depend on temperature and contact? c Per Bruun Brockhoff (DTU) The ordinal package: Analyzing ordinal data DTU Sensometrics / 34 c Per Bruun Brockhoff (DTU) The ordinal package: Analyzing ordinal data DTU Sensometrics / 34
2 Ordinal data the wine data Data for the bitterness of white wines Objective: How does perceived bitternes depend on temperature and contact? Table: (Randall, 1989), N=7 Variables Type Values bitterness response 1,, 3,4, 5 less more temperature predictor cold, warm contact predictor no, yes judges random 1,..., 9 Temperature and contact between juice and skins can be controlled when cruching grapes during wine production. Table: Ratings of the bitterness of some white wines. Data are adopted from Randall (1989). Judge Temperature Contact Bottle cold no cold no cold yes cold yes warm no warm no warm yes warm yes Bitterness ratings: 1(least),, 3, 4, 5(most) c Per Bruun Brockhoff (DTU) The ordinal package: Analyzing ordinal data DTU Sensometrics / 34 c Per Bruun Brockhoff (DTU) The ordinal package: Analyzing ordinal data DTU Sensometrics / 34 Income group data Soup data Ordinal data Income group data McCullagh, P. (1980) Regression Models for Ordinal Data. Journal of the Royal Statistical Society. Series B (Methodological), Vol. 4, No.., pp AND: included in ordinal pacakge. head(income) year pct income c Per Bruun Brockhoff (DTU) The ordinal package: Analyzing ordinal data DTU Sensometrics / 34 Ordinal data Soup data Christensen, R. H. B., Cleaver, G., & Brockhoff, P. B. (011). Statistical and Thurstonian models for the A-not A protocol with and without sureness. Food Quality and Preference,, Industrial product development experiment - Unilever. AND: included in ordinal pacakge. A-not A with sureness scale: head(soup) Reference Not Reference Sure Not Sure Guess Guess Not Sure Sure RESP PROD PRODID SURENESS DAY SOUPTYPE SOUPFREQ COLD EASY GENDER 1 1 Ref Canned >1/week Yes 7 Female 1 Test 5 1 Canned >1/week Yes 7 Female 3 1 Ref Canned >1/week Yes 7 Female 4 1 Test Canned >1/week Yes 7 Female 5 1 Ref 1 5 Canned >1/week Yes 7 Female 6 1 Test 6 5 Canned >1/week Yes 7 Female AGEGROUP LOCATION Region Region 1 c 3 Per Bruun Region Brockhoff 1 (DTU) Region 1 The ordinal package: Analyzing ordinal data DTU Sensometrics / Region 1
3 (Paired) Degree-of-difference data -Alternative choice Ordinal data (Paired) Degree-of-difference data Ordinal data The -Alternative choice test (-AC) 5 panelists 8 replications. Table: Paired degree-of-difference test, data adopted from (?) Pair Identical Uncertain Different Total Identical pair Different pair Christensen, R. H. B. and P. B. Brockhoff (011) Analysis of replicated categorical ratings data from sensory experiments. Journal of the French Statistical Society, SFdS, 154(3), Alternative Choice (-AC): Do you prefer A or B or do you not have a preference? Which is strongest, A or B, or is there no difference? Table: 08 consumers with 4 replications Condition Prefer A No-preference Prefer B Total A B Christensen, R. H. B., H.-S. Lee and P. B. Brockhoff (01) Estimation of the Thurstonian model for the -AC protocol. Food Quality and Preference, 4, c Per Bruun Brockhoff (DTU) The ordinal package: Analyzing ordinal data DTU Sensometrics / 34 c Per Bruun Brockhoff (DTU) The ordinal package: Analyzing ordinal data DTU Sensometrics / 34 -Alternative choice Cumulative link models Appropriate models for ordinal data Understanding the cumulative link model Y: Ordinal data not continuous data A linear regression model on the scores (1,...,5)? Breach of assumptions: The scores are not normally distributed A score of 4 is not twice as much as Variance not likely to be constant Our approach: A cumulative link model (CLM) Only use information about ordering Intuitively: A linear model that respects the ordinal nature of the response β P(Y = cold) θ 1 α θ θ 3 θ 4 warm cold Latent bitterness follows a linear model: S i = α + x i β + ε i, ε i N(0, σ ) = α + β(temp i ) + ε i We only observe a grouped version of S i : θ j 1 S i < θ j Y = j P (Y i j) = F (θ j x i β) c Per Bruun Brockhoff (DTU) The ordinal package: Analyzing ordinal data DTU Sensometrics / 34 c Per Bruun Brockhoff (DTU) The ordinal package: Analyzing ordinal data DTU Sensometrics / 34
4 Cumulative link models Fitting cumulative link models with clm data(wine) fm1 <- clm(rating ~ contact + temp, data=wine, link="probit") summary(fm1) formula: rating ~ contact + temp data: wine link threshold nobs loglik AIC niter max.grad cond.h probit flexible (0) 1.43e-13.e+01 Coefficients: Estimate Std. Error z value Pr(> z ) contactyes ** tempwarm e-07 *** --- Signif. codes: 0 '***' '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1 Threshold coefficients: Estimate Std. Error z value c Per Bruun Brockhoff (DTU) The ordinal package: Analyzing ordinal data DTU Sensometrics / 34 Cumulative link models A cumulative link model for the wine data Additive effects for temperature and contact: P (Y i j) = Φ (θ j β 1 (temp i ) β (contact i )) Is there an interaction between temp and contact? Table: ANODE table for the wine data. Source df deviance p value Total < Treatment < Temperature, T < Contact, C < Interaction, T C Residual c Per Bruun Brockhoff (DTU) The ordinal package: Analyzing ordinal data DTU Sensometrics / 34 An extended CLM Framework Understanding structured thresholds restrictions Standard CLM: F (θ j x i β) Y: β Extended CLM: ( g(θj ) wi F β j x i β ) Threshold effects Nominal effects exp(zi ζ) Scale effects CLMM (Mixed effects): F (θ j fixed Xβ random Zb ) P(Y = cold) warm cold The cumulative link model: P (Y i j) = F (θ j β(temp i )) θ j ordered, but otherwise not restricted Require symmetry? Require equidistance? θ 1 α θ θ 3 θ 4 c Per Bruun Brockhoff (DTU) The ordinal package: Analyzing ordinal data DTU Sensometrics / 34 c Per Bruun Brockhoff (DTU) The ordinal package: Analyzing ordinal data DTU Sensometrics / 34
5 Fitting models with structured thresholds Sensory applications of structured thresholds fm.equi <- clm(rating ~ contact + temp, data=wine, link="probit", threshold="equidistant") summary(fm.equi) formula: rating ~ contact + temp data: wine link threshold nobs loglik AIC niter max.grad cond.h probit equidistant (0) 1.40e-08 3.e+01 Coefficients: Estimate Std. Error z value Pr(> z ) contactyes ** tempwarm e-07 *** --- Signif. codes: 0 '***' '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1 Threshold coefficients: Estimate Std. Error z value threshold spacing Symmetric thresholds for sureness scales: Reference Not Reference Sure Not Sure Guess Guess Not Sure Sure Equidistant thresholds for 7- or 9-point preference scales: Equally spaced categories is a necessary condition for using linear models for continuous data on ordinal data. c Per Bruun Brockhoff (DTU) The ordinal package: Analyzing ordinal data DTU Sensometrics / 34 c Per Bruun Brockhoff (DTU) The ordinal package: Analyzing ordinal data DTU Sensometrics / 34 Understanding scale effects in CLMs Thurstonian model for the A-not A with sureness protocol Y: β Model for latent bitterness: S i = α + β 1 (temp i ) + β (contact i ) + ε i, Reference Test products products N(0, 1) N(δ, σ ) δ warm cold ε i N(0, σ (temp i )) Mccullagh, 1980, Cox, 1995, Agresti 00 ( ) θj β 1(temp γ ij = F i ) β (contact i) ζ 1(temp i ) θ 1 θ θ 3 θ 4 θ 5 Sensory intensity Reference Not Reference Product Sure Not Sure Guess Guess Not Sure Sure Reference Test θ 1 α θ θ 3 θ 4 Table: Discrimination of packet soup Christensen, Cleaver and Brockhoff, 011 c Per Bruun Brockhoff (DTU) The ordinal package: Analyzing ordinal data DTU Sensometrics / 34 c Per Bruun Brockhoff (DTU) The ordinal package: Analyzing ordinal data DTU Sensometrics / 34
6 Cumulative Link Mixed Models Including random effects in CLMs Cumulative Link Mixed Models Cumulative link mixed models β warm The cumulative link model: γ ij = F (θ j β 1 (temp i ) β (contact i )) Judges perceive wine bitterness differently Judges use the response scale differently Add random effects for judges: γ k = F (B k ψ Zv o k ) V N(0, Σ τ ) The log-likelihood function: l(ψ, τ ; y) = log p ψ (y v)p τ (v) dv R r Integration methods: Laplace approximation Tierney and Kadane, 1986, Pinheiro and Bates, 1995, Joe 008 cold γ ij = F (θ j β 1 (temp i ) β (contact i ) b(judge i )), b N(0, σ b ) Gauss-Hermite quadrature (GHQ) Hedeker and Gibbons, 1994 Adaptive Gauss-Hermite quadrature (AGQ) Liu and Pierce, 1994 θ 1 α θ θ 3 θ 4 A Newton-Raphson algorithm updates the conditional modes of the random effects (Laplace and AGQ) c Per Bruun Brockhoff (DTU) The ordinal package: Analyzing ordinal data DTU Sensometrics / 34 c Per Bruun Brockhoff (DTU) The ordinal package: Analyzing ordinal data DTU Sensometrics 015 / 34 Allowing for differences between judges ANODE for mixed effects CLM Research questions: Are judges rating the wines differently? Are there differences between bottles? Additive random effects for judges: P (Y i j) = Φ (θ j β 1 (temp i ) β (contact i ) u(judge i )) u(judge i ) N(0, σu) Additive random effects for judges and bottles: P (Y i j) = Φ (θ j β 1 (temp i ) β (contact i ) u(judge i ) b(bottle i )) u(judge i ) N(0, σu) b(bottle i ) N(0, σb ) Results: Table: ANODE table for the wine data with random effects. Source df deviance p value Total < Var(Judge) < Var(Bottle) Treatment < Temperature, T < Contact, C < Interaction, T C Bottles are probably not that different Judges do rate the wines differently c Per Bruun Brockhoff (DTU) The ordinal package: Analyzing ordinal data DTU Sensometrics / 34 c Per Bruun Brockhoff (DTU) The ordinal package: Analyzing ordinal data DTU Sensometrics / 34
7 Panel inference judge effects 5 panelists 8 replications. Judge effect Table: Paired degree-of-difference test, data adopted from (?) Pair Identical Uncertain Different Total Identical pair Different pair Judge Christensen, R. H. B. and P. B. Brockhoff (011) Analysis of replicated categorical ratings data from sensory experiments. Journal of the French Statistical Society, SFdS, 154(3), c Per Bruun Brockhoff (DTU) The ordinal package: Analyzing ordinal data DTU Sensometrics / 34 c Per Bruun Brockhoff (DTU) The ordinal package: Analyzing ordinal data DTU Sensometrics / 34 Thurstonian model for the A-not A with sureness protocol 5 panelists 8 replications. Table: Comparison of tests of product differences. Test χ -value df p-value Naive Pearson test Stuart-Maxwell test (?) LR test in CLMM Reference Test products products N(0, 1) N(δ, σ ) δ θ 1 θ θ 3 θ 4 θ 5 Sensory intensity Christensen, R. H. B. and P. B. Brockhoff (011) Analysis of replicated categorical ratings data from sensory experiments. Journal of the French Statistical Society, SFdS, 154(3), c Per Bruun Brockhoff (DTU) The ordinal package: Analyzing ordinal data DTU Sensometrics / 34 Reference Not Reference Product Sure Not Sure Guess Guess Not Sure Sure Reference Test Table: Discrimination of packet soup Christensen, Cleaver nd Brockhoff, 011 c Per Bruun Brockhoff (DTU) The ordinal package: Analyzing ordinal data DTU Sensometrics / 34
8 Including assessor effects Inference for respondents respondent-specific d s Assumptions: Assessors do not use the response scale differently 3 Assessors do not have different d s Accommodate this with mixed model extensions: Allow normally distributed random effects for assessors P (S i θ j ) = Φ (θ j δ(prod i ) u(assessor i )) u N(0, σu) d prime (confidence interval) 1 0 Note: This is similar to assessor effects in models for sensory profiling! c Per Bruun Brockhoff (DTU) The ordinal package: Analyzing ordinal data DTU Sensometrics / 34 Assessors c Per Bruun Brockhoff (DTU) The ordinal package: Analyzing ordinal data DTU Sensometrics / 34 Thurstonian -AC model via CLMMs Thurstonian -AC model via CLMMs The -Alternative choice test (-AC) Thurstonian model for the -AC protocol -Alternative Choice (-AC): θ 1 θ Do you prefer A or B or do you not have a preference? Which is strongest, A or B, or is there no difference? B A ~ N(d', ) B A d' ~ N(0, 1) π 1 π π 3 π 1 π π 3 Table: 08 consumers with 4 replications τ 0 τ d' A stronger than B no difference B stronger than A τ d' 0 τ d' Condition Prefer A No-preference Prefer B Total A B Christensen, R. H. B., H.-S. Lee and P. B. Brockhoff (01) Estimation of the Thurstonian model for the -AC protocol. Food Quality and Preference, 4, c Per Bruun Brockhoff (DTU) The ordinal package: Analyzing ordinal data DTU Sensometrics / 34 The Thurstonian model for the -AC protocol can be formulated as a cumulative link model: ˆτ = (ˆθ ˆθ 1 )/ ˆδ = ( ˆθ ˆθ 1 )/ se(ˆτ) = {var(θ ) + var(θ 1 ) cov(θ, θ 1 )}/ se(ˆδ) = {var(θ ) + var(θ 1 ) + cov(θ, θ 1 )}/ c Per Bruun Brockhoff (DTU) The ordinal package: Analyzing ordinal data DTU Sensometrics / 34
9 Thurstonian -AC model via CLMMs Outline Illustrating the model Outline Average consumer 5th percentile consumer Reference A Reference B 95th percentile consumer prefer A no preference prefer B 95% of population within ±1.96σ d = ±3.3 (d units) The largest effect is consumer differences: χ 1 = 153.6, p < Effect of reference in duo-trio test only for consumers with an average preference 1 Income group data Soup data (Paired) Degree-of-difference data -Alternative choice Cumulative link models 3 4 Cumulative Link Mixed Models 5 Thurstonian -AC model via CLMMs c Per Bruun Brockhoff (DTU) The ordinal package: Analyzing ordinal data DTU Sensometrics / 34 c Per Bruun Brockhoff (DTU) The ordinal package: Analyzing ordinal data DTU Sensometrics / 34 Excercises - Day afternoon Excercises - Day afternoon NO ordinal exercises IF you want: look at ordinal vignettes and/or manuel/help material Recommend instead - work with exercises from the previous three teaching blocks: sensr part 1 exercises sensr part exercises lmertest exercises SensMixed exercises c Per Bruun Brockhoff (DTU) The ordinal package: Analyzing ordinal data DTU Sensometrics / 34
A Tutorial on fitting Cumulative Link Models with the ordinal Package
A Tutorial on fitting Cumulative Link Models with the ordinal Package Rune Haubo B Christensen September 10, 2012 Abstract It is shown by example how a cumulative link mixed model is fitted with the clm
More informationChristine Borgen Linander 1,, Rune Haubo Bojesen Christensen 1, Rebecca Evans 2, Graham Cleaver 2, Per Bruun Brockhoff 1. University of Denmark
Individual differences in replicated multi-product 2-AFC data with and without supplementary difference scoring: Comparing Thurstonian mixed regression models for binary and ordinal data with linear mixed
More informationMain purposes of sensr. The sensr package: Difference and similarity testing. Main functions in sensr
Main purposes of sensr : Difference and similarity testing Per B Brockhoff DTU Compute Section for Statistics Technical University of Denmark perbb@dtu.dk August 17 2015 Statistical tests of sensory discrimation
More informationCumulative Link Models for Ordinal Regression with the R Package ordinal
Cumulative Link Models for Ordinal Regression with the R Package ordinal Rune Haubo B Christensen Technical University of Denmark & Christensen Statistics Abstract This paper introduces the R-package ordinal
More informationR-companion to: Estimation of the Thurstonian model for the 2-AC protocol
R-companion to: Estimation of the Thurstonian model for the 2-AC protocol Rune Haubo Bojesen Christensen, Hye-Seong Lee & Per Bruun Brockhoff August 24, 2017 This document describes how the examples in
More informationModels for Replicated Discrimination Tests: A Synthesis of Latent Class Mixture Models and Generalized Linear Mixed Models
Models for Replicated Discrimination Tests: A Synthesis of Latent Class Mixture Models and Generalized Linear Mixed Models Rune Haubo Bojesen Christensen & Per Bruun Brockhoff DTU Informatics Section for
More informationAdvances in Sensory Discrimination Testing: Thurstonian-Derived Models, Covariates, and Consumer Relevance
Advances in Sensory Discrimination Testing: Thurstonian-Derived Models, Covariates, and Consumer Relevance John C. Castura, Sara K. King, C. J. Findlay Compusense Inc. Guelph, ON, Canada Derivation of
More informationAnalysis of sensory ratings data with cumulative link models
Downloaded from orbit.dtu.dk on: Oct 02, 2018 Analysis of sensory ratings data with cumulative link models Christensen, Rune Haubo Bojesen; Brockhoff, Per B. Published in: Journal de la Societe Francaise
More informationThreshold models with fixed and random effects for ordered categorical data
Threshold models with fixed and random effects for ordered categorical data Hans-Peter Piepho Universität Hohenheim, Germany Edith Kalka Universität Kassel, Germany Contents 1. Introduction. Case studies
More informationData-analysis and Retrieval Ordinal Classification
Data-analysis and Retrieval Ordinal Classification Ad Feelders Universiteit Utrecht Data-analysis and Retrieval 1 / 30 Strongly disagree Ordinal Classification 1 2 3 4 5 0% (0) 10.5% (2) 21.1% (4) 42.1%
More informationOutline of GLMs. Definitions
Outline of GLMs Definitions This is a short outline of GLM details, adapted from the book Nonparametric Regression and Generalized Linear Models, by Green and Silverman. The responses Y i have density
More informationSTA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis. 1. Indicate whether each of the following is true (T) or false (F).
STA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis 1. Indicate whether each of the following is true (T) or false (F). (a) (b) (c) (d) (e) In 2 2 tables, statistical independence is equivalent
More informationReview of Multinomial Distribution If n trials are performed: in each trial there are J > 2 possible outcomes (categories) Multicategory Logit Models
Chapter 6 Multicategory Logit Models Response Y has J > 2 categories. Extensions of logistic regression for nominal and ordinal Y assume a multinomial distribution for Y. 6.1 Logit Models for Nominal Responses
More informationClass Notes: Week 8. Probit versus Logit Link Functions and Count Data
Ronald Heck Class Notes: Week 8 1 Class Notes: Week 8 Probit versus Logit Link Functions and Count Data This week we ll take up a couple of issues. The first is working with a probit link function. While
More informationMcGill University. Faculty of Science. Department of Mathematics and Statistics. Statistics Part A Comprehensive Exam Methodology Paper
Student Name: ID: McGill University Faculty of Science Department of Mathematics and Statistics Statistics Part A Comprehensive Exam Methodology Paper Date: Friday, May 13, 2016 Time: 13:00 17:00 Instructions
More informationLinear Regression Model. Badr Missaoui
Linear Regression Model Badr Missaoui Introduction What is this course about? It is a course on applied statistics. It comprises 2 hours lectures each week and 1 hour lab sessions/tutorials. We will focus
More informationModel Estimation Example
Ronald H. Heck 1 EDEP 606: Multivariate Methods (S2013) April 7, 2013 Model Estimation Example As we have moved through the course this semester, we have encountered the concept of model estimation. Discussions
More informationMixed Models for Longitudinal Ordinal and Nominal Outcomes
Mixed Models for Longitudinal Ordinal and Nominal Outcomes Don Hedeker Department of Public Health Sciences Biological Sciences Division University of Chicago hedeker@uchicago.edu Hedeker, D. (2008). Multilevel
More informationSTA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis. 1. Indicate whether each of the following is true (T) or false (F).
STA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis 1. Indicate whether each of the following is true (T) or false (F). (a) T In 2 2 tables, statistical independence is equivalent to a population
More informationUNIVERSITY OF TORONTO Faculty of Arts and Science
UNIVERSITY OF TORONTO Faculty of Arts and Science December 2013 Final Examination STA442H1F/2101HF Methods of Applied Statistics Jerry Brunner Duration - 3 hours Aids: Calculator Model(s): Any calculator
More informationStandard Errors & Confidence Intervals. N(0, I( β) 1 ), I( β) = [ 2 l(β, φ; y) β i β β= β j
Standard Errors & Confidence Intervals β β asy N(0, I( β) 1 ), where I( β) = [ 2 l(β, φ; y) ] β i β β= β j We can obtain asymptotic 100(1 α)% confidence intervals for β j using: β j ± Z 1 α/2 se( β j )
More informationSCHOOL OF MATHEMATICS AND STATISTICS. Linear and Generalised Linear Models
SCHOOL OF MATHEMATICS AND STATISTICS Linear and Generalised Linear Models Autumn Semester 2017 18 2 hours Attempt all the questions. The allocation of marks is shown in brackets. RESTRICTED OPEN BOOK EXAMINATION
More information8 Nominal and Ordinal Logistic Regression
8 Nominal and Ordinal Logistic Regression 8.1 Introduction If the response variable is categorical, with more then two categories, then there are two options for generalized linear models. One relies on
More informationMixed models in R using the lme4 package Part 7: Generalized linear mixed models
Mixed models in R using the lme4 package Part 7: Generalized linear mixed models Douglas Bates University of Wisconsin - Madison and R Development Core Team University of
More informationST3241 Categorical Data Analysis I Multicategory Logit Models. Logit Models For Nominal Responses
ST3241 Categorical Data Analysis I Multicategory Logit Models Logit Models For Nominal Responses 1 Models For Nominal Responses Y is nominal with J categories. Let {π 1,, π J } denote the response probabilities
More informationSingle-level Models for Binary Responses
Single-level Models for Binary Responses Distribution of Binary Data y i response for individual i (i = 1,..., n), coded 0 or 1 Denote by r the number in the sample with y = 1 Mean and variance E(y) =
More informationØ Set of mutually exclusive categories. Ø Classify or categorize subject. Ø No meaningful order to categorization.
Statistical Tools in Evaluation HPS 41 Dr. Joe G. Schmalfeldt Types of Scores Continuous Scores scores with a potentially infinite number of values. Discrete Scores scores limited to a specific number
More informationOverdispersion Workshop in generalized linear models Uppsala, June 11-12, Outline. Overdispersion
Biostokastikum Overdispersion is not uncommon in practice. In fact, some would maintain that overdispersion is the norm in practice and nominal dispersion the exception McCullagh and Nelder (1989) Overdispersion
More informationOn the Triangle Test with Replications
On the Triangle Test with Replications Joachim Kunert and Michael Meyners Fachbereich Statistik, University of Dortmund, D-44221 Dortmund, Germany E-mail: kunert@statistik.uni-dortmund.de E-mail: meyners@statistik.uni-dortmund.de
More informationLOGISTIC REGRESSION Joseph M. Hilbe
LOGISTIC REGRESSION Joseph M. Hilbe Arizona State University Logistic regression is the most common method used to model binary response data. When the response is binary, it typically takes the form of
More informationMAXIMUM LIKELIHOOD INFERENCE IN ROBUST LINEAR MIXED-EFFECTS MODELS USING MULTIVARIATE t DISTRIBUTIONS
Statistica Sinica 17(2007), 929-943 MAXIMUM LIKELIHOOD INFERENCE IN ROBUST LINEAR MIXED-EFFECTS MODELS USING MULTIVARIATE t DISTRIBUTIONS Peter X.-K. Song 1, Peng Zhang 2 and Annie Qu 3 1 University of
More informationFitting mixed-effects models for repeated ordinal outcomes with the NLMIXED procedure
Behavior Research Methods, Instruments, & Computers 00, 34 (), 151-157 ARTICLES FROM THE SCIP CONFERENCE Fitting mixed-effects models for repeated ordinal outcomes with the NLMIXED procedure CHING-FAN
More informationESP 178 Applied Research Methods. 2/23: Quantitative Analysis
ESP 178 Applied Research Methods 2/23: Quantitative Analysis Data Preparation Data coding create codebook that defines each variable, its response scale, how it was coded Data entry for mail surveys and
More informationMixed models in R using the lme4 package Part 5: Generalized linear mixed models
Mixed models in R using the lme4 package Part 5: Generalized linear mixed models Douglas Bates Madison January 11, 2011 Contents 1 Definition 1 2 Links 2 3 Example 7 4 Model building 9 5 Conclusions 14
More informationExam Applied Statistical Regression. Good Luck!
Dr. M. Dettling Summer 2011 Exam Applied Statistical Regression Approved: Tables: Note: Any written material, calculator (without communication facility). Attached. All tests have to be done at the 5%-level.
More information" M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2
Notation and Equations for Final Exam Symbol Definition X The variable we measure in a scientific study n The size of the sample N The size of the population M The mean of the sample µ The mean of the
More informationMixed effects models
Mixed effects models The basic theory and application in R Mitchel van Loon Research Paper Business Analytics Mixed effects models The basic theory and application in R Author: Mitchel van Loon Research
More informationSample size determination for logistic regression: A simulation study
Sample size determination for logistic regression: A simulation study Stephen Bush School of Mathematical Sciences, University of Technology Sydney, PO Box 123 Broadway NSW 2007, Australia Abstract This
More informationGeneralized Linear and Nonlinear Mixed-Effects Models
Generalized Linear and Nonlinear Mixed-Effects Models Douglas Bates University of Wisconsin - Madison and R Development Core Team University of Potsdam August 8, 2008 Outline
More informationMixed models in R using the lme4 package Part 5: Generalized linear mixed models
Mixed models in R using the lme4 package Part 5: Generalized linear mixed models Douglas Bates 2011-03-16 Contents 1 Generalized Linear Mixed Models Generalized Linear Mixed Models When using linear mixed
More informationEPSY 905: Fundamentals of Multivariate Modeling Online Lecture #7
Introduction to Generalized Univariate Models: Models for Binary Outcomes EPSY 905: Fundamentals of Multivariate Modeling Online Lecture #7 EPSY 905: Intro to Generalized In This Lecture A short review
More informationReview. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 770: Categorical Data Analysis
Review Timothy Hanson Department of Statistics, University of South Carolina Stat 770: Categorical Data Analysis 1 / 22 Chapter 1: background Nominal, ordinal, interval data. Distributions: Poisson, binomial,
More informationNATIONAL UNIVERSITY OF SINGAPORE EXAMINATION. ST3241 Categorical Data Analysis. (Semester II: ) April/May, 2011 Time Allowed : 2 Hours
NATIONAL UNIVERSITY OF SINGAPORE EXAMINATION Categorical Data Analysis (Semester II: 2010 2011) April/May, 2011 Time Allowed : 2 Hours Matriculation No: Seat No: Grade Table Question 1 2 3 4 5 6 Full marks
More informationNATIONAL UNIVERSITY OF SINGAPORE EXAMINATION (SOLUTIONS) ST3241 Categorical Data Analysis. (Semester II: )
NATIONAL UNIVERSITY OF SINGAPORE EXAMINATION (SOLUTIONS) Categorical Data Analysis (Semester II: 2010 2011) April/May, 2011 Time Allowed : 2 Hours Matriculation No: Seat No: Grade Table Question 1 2 3
More informationGoodness-of-Fit Tests for the Ordinal Response Models with Misspecified Links
Communications of the Korean Statistical Society 2009, Vol 16, No 4, 697 705 Goodness-of-Fit Tests for the Ordinal Response Models with Misspecified Links Kwang Mo Jeong a, Hyun Yung Lee 1, a a Department
More informationComputationally Efficient Estimation of Multilevel High-Dimensional Latent Variable Models
Computationally Efficient Estimation of Multilevel High-Dimensional Latent Variable Models Tihomir Asparouhov 1, Bengt Muthen 2 Muthen & Muthen 1 UCLA 2 Abstract Multilevel analysis often leads to modeling
More informationProportional Odds Logistic Regression. stat 557 Heike Hofmann
Proportional Odds Logistic Regression stat 557 Heike Hofmann Outline Proportional Odds Logistic Regression Model Definition Properties Latent Variables Intro to Loglinear Models Ordinal Response Y is categorical
More informationMatched Pair Data. Stat 557 Heike Hofmann
Matched Pair Data Stat 557 Heike Hofmann Outline Marginal Homogeneity - review Binary Response with covariates Ordinal response Symmetric Models Subject-specific vs Marginal Model conditional logistic
More informationLecture 14: Introduction to Poisson Regression
Lecture 14: Introduction to Poisson Regression Ani Manichaikul amanicha@jhsph.edu 8 May 2007 1 / 52 Overview Modelling counts Contingency tables Poisson regression models 2 / 52 Modelling counts I Why
More informationModelling counts. Lecture 14: Introduction to Poisson Regression. Overview
Modelling counts I Lecture 14: Introduction to Poisson Regression Ani Manichaikul amanicha@jhsph.edu Why count data? Number of traffic accidents per day Mortality counts in a given neighborhood, per week
More informationOn the Behavior of Marginal and Conditional Akaike Information Criteria in Linear Mixed Models
On the Behavior of Marginal and Conditional Akaike Information Criteria in Linear Mixed Models Thomas Kneib Institute of Statistics and Econometrics Georg-August-University Göttingen Department of Statistics
More informationR in Linguistic Analysis. Wassink 2012 University of Washington Week 6
R in Linguistic Analysis Wassink 2012 University of Washington Week 6 Overview R for phoneticians and lab phonologists Johnson 3 Reading Qs Equivalence of means (t-tests) Multiple Regression Principal
More informationLinear Regression Models P8111
Linear Regression Models P8111 Lecture 25 Jeff Goldsmith April 26, 2016 1 of 37 Today s Lecture Logistic regression / GLMs Model framework Interpretation Estimation 2 of 37 Linear regression Course started
More informationAnalysis of Categorical Data. Nick Jackson University of Southern California Department of Psychology 10/11/2013
Analysis of Categorical Data Nick Jackson University of Southern California Department of Psychology 10/11/2013 1 Overview Data Types Contingency Tables Logit Models Binomial Ordinal Nominal 2 Things not
More informationGeneralized linear models
Generalized linear models Douglas Bates November 01, 2010 Contents 1 Definition 1 2 Links 2 3 Estimating parameters 5 4 Example 6 5 Model building 8 6 Conclusions 8 7 Summary 9 1 Generalized Linear Models
More informationParametric Modelling of Over-dispersed Count Data. Part III / MMath (Applied Statistics) 1
Parametric Modelling of Over-dispersed Count Data Part III / MMath (Applied Statistics) 1 Introduction Poisson regression is the de facto approach for handling count data What happens then when Poisson
More informationIdentify the scale of measurement most appropriate for each of the following variables. (Use A = nominal, B = ordinal, C = interval, D = ratio.
Answers to Items from Problem Set 1 Item 1 Identify the scale of measurement most appropriate for each of the following variables. (Use A = nominal, B = ordinal, C = interval, D = ratio.) a. response latency
More informationGeneralized Estimating Equations
Outline Review of Generalized Linear Models (GLM) Generalized Linear Model Exponential Family Components of GLM MLE for GLM, Iterative Weighted Least Squares Measuring Goodness of Fit - Deviance and Pearson
More informationCh 6: Multicategory Logit Models
293 Ch 6: Multicategory Logit Models Y has J categories, J>2. Extensions of logistic regression for nominal and ordinal Y assume a multinomial distribution for Y. In R, we will fit these models using the
More informationReparametrization of COM-Poisson Regression Models with Applications in the Analysis of Experimental Count Data
Reparametrization of COM-Poisson Regression Models with Applications in the Analysis of Experimental Count Data Eduardo Elias Ribeiro Junior 1 2 Walmes Marques Zeviani 1 Wagner Hugo Bonat 1 Clarice Garcia
More informationWhy analyze as ordinal? Mixed Models for Longitudinal Ordinal Data Don Hedeker University of Illinois at Chicago
Why analyze as ordinal? Mixed Models for Longitudinal Ordinal Data Don Hedeker University of Illinois at Chicago hedeker@uic.edu www.uic.edu/ hedeker/long.html Efficiency: Armstrong & Sloan (1989, Amer
More informationMixed-effects Maximum Likelihood Difference Scaling
Mixed-effects Maximum Likelihood Difference Scaling Kenneth Knoblauch Inserm U 846 Stem Cell and Brain Research Institute Dept. Integrative Neurosciences Bron, France Laurence T. Maloney Department of
More informationLongitudinal Modeling with Logistic Regression
Newsom 1 Longitudinal Modeling with Logistic Regression Longitudinal designs involve repeated measurements of the same individuals over time There are two general classes of analyses that correspond to
More informationOutline. Statistical inference for linear mixed models. One-way ANOVA in matrix-vector form
Outline Statistical inference for linear mixed models Rasmus Waagepetersen Department of Mathematics Aalborg University Denmark general form of linear mixed models examples of analyses using linear mixed
More information9 Generalized Linear Models
9 Generalized Linear Models The Generalized Linear Model (GLM) is a model which has been built to include a wide range of different models you already know, e.g. ANOVA and multiple linear regression models
More informationLogistic & Tobit Regression
Logistic & Tobit Regression Different Types of Regression Binary Regression (D) Logistic transformation + e P( y x) = 1 + e! " x! + " x " P( y x) % ln$ ' = ( + ) x # 1! P( y x) & logit of P(y x){ P(y
More informationPackage HGLMMM for Hierarchical Generalized Linear Models
Package HGLMMM for Hierarchical Generalized Linear Models Marek Molas Emmanuel Lesaffre Erasmus MC Erasmus Universiteit - Rotterdam The Netherlands ERASMUSMC - Biostatistics 20-04-2010 1 / 52 Outline General
More informationOutline. Mixed models in R using the lme4 package Part 5: Generalized linear mixed models. Parts of LMMs carried over to GLMMs
Outline Mixed models in R using the lme4 package Part 5: Generalized linear mixed models Douglas Bates University of Wisconsin - Madison and R Development Core Team UseR!2009,
More informationStatistical methodology for sensory discrimination tests and its implementation in sensr
Statistical methodology for sensory discrimination tests and its implementation in sensr Rune Haubo Bojesen Christensen August 24, 2017 Abstract The statistical methodology of sensory discrimination analysis
More informationØ Set of mutually exclusive categories. Ø Classify or categorize subject. Ø No meaningful order to categorization.
Statistical Tools in Evaluation HPS 41 Fall 213 Dr. Joe G. Schmalfeldt Types of Scores Continuous Scores scores with a potentially infinite number of values. Discrete Scores scores limited to a specific
More informationLogistic Regressions. Stat 430
Logistic Regressions Stat 430 Final Project Final Project is, again, team based You will decide on a project - only constraint is: you are supposed to use techniques for a solution that are related to
More informationMS&E 226: Small Data. Lecture 11: Maximum likelihood (v2) Ramesh Johari
MS&E 226: Small Data Lecture 11: Maximum likelihood (v2) Ramesh Johari ramesh.johari@stanford.edu 1 / 18 The likelihood function 2 / 18 Estimating the parameter This lecture develops the methodology behind
More informationOn the Behavior of Marginal and Conditional Akaike Information Criteria in Linear Mixed Models
On the Behavior of Marginal and Conditional Akaike Information Criteria in Linear Mixed Models Thomas Kneib Department of Mathematics Carl von Ossietzky University Oldenburg Sonja Greven Department of
More informationCategorical Variables and Contingency Tables: Description and Inference
Categorical Variables and Contingency Tables: Description and Inference STAT 526 Professor Olga Vitek March 3, 2011 Reading: Agresti Ch. 1, 2 and 3 Faraway Ch. 4 3 Univariate Binomial and Multinomial Measurements
More informationSTAT 526 Advanced Statistical Methodology
STAT 526 Advanced Statistical Methodology Fall 2017 Lecture Note 10 Analyzing Clustered/Repeated Categorical Data 0-0 Outline Clustered/Repeated Categorical Data Generalized Linear Mixed Models Generalized
More informationClassification. Chapter Introduction. 6.2 The Bayes classifier
Chapter 6 Classification 6.1 Introduction Often encountered in applications is the situation where the response variable Y takes values in a finite set of labels. For example, the response Y could encode
More informationBeyond GLM and likelihood
Stat 6620: Applied Linear Models Department of Statistics Western Michigan University Statistics curriculum Core knowledge (modeling and estimation) Math stat 1 (probability, distributions, convergence
More informationUNIVERSITY OF MASSACHUSETTS. Department of Mathematics and Statistics. Basic Exam - Applied Statistics. Tuesday, January 17, 2017
UNIVERSITY OF MASSACHUSETTS Department of Mathematics and Statistics Basic Exam - Applied Statistics Tuesday, January 17, 2017 Work all problems 60 points are needed to pass at the Masters Level and 75
More informationData Analysis 1 LINEAR REGRESSION. Chapter 03
Data Analysis 1 LINEAR REGRESSION Chapter 03 Data Analysis 2 Outline The Linear Regression Model Least Squares Fit Measures of Fit Inference in Regression Other Considerations in Regression Model Qualitative
More informationInvestigating Models with Two or Three Categories
Ronald H. Heck and Lynn N. Tabata 1 Investigating Models with Two or Three Categories For the past few weeks we have been working with discriminant analysis. Let s now see what the same sort of model might
More informationANALYSING BINARY DATA IN A REPEATED MEASUREMENTS SETTING USING SAS
Libraries 1997-9th Annual Conference Proceedings ANALYSING BINARY DATA IN A REPEATED MEASUREMENTS SETTING USING SAS Eleanor F. Allan Follow this and additional works at: http://newprairiepress.org/agstatconference
More informationCategorical Data Analysis Chapter 3
Categorical Data Analysis Chapter 3 The actual coverage probability is usually a bit higher than the nominal level. Confidence intervals for association parameteres Consider the odds ratio in the 2x2 table,
More informationExercise 5.4 Solution
Exercise 5.4 Solution Niels Richard Hansen University of Copenhagen May 7, 2010 1 5.4(a) > leukemia
More informationFrequency Distribution Cross-Tabulation
Frequency Distribution Cross-Tabulation 1) Overview 2) Frequency Distribution 3) Statistics Associated with Frequency Distribution i. Measures of Location ii. Measures of Variability iii. Measures of Shape
More informationA Handbook of Statistical Analyses Using R. Brian S. Everitt and Torsten Hothorn
A Handbook of Statistical Analyses Using R Brian S. Everitt and Torsten Hothorn CHAPTER 6 Logistic Regression and Generalised Linear Models: Blood Screening, Women s Role in Society, and Colonic Polyps
More informationStat 8053, Fall 2013: Multinomial Logistic Models
Stat 8053, Fall 2013: Multinomial Logistic Models Here is the example on page 269 of Agresti on food preference of alligators: s is size class, g is sex of the alligator, l is name of the lake, and f is
More informationLecture 3.1 Basic Logistic LDA
y Lecture.1 Basic Logistic LDA 0.2.4.6.8 1 Outline Quick Refresher on Ordinary Logistic Regression and Stata Women s employment example Cross-Over Trial LDA Example -100-50 0 50 100 -- Longitudinal Data
More informationUsing the Delta Method to Construct Confidence Intervals for Predicted Probabilities, Rates, and Discrete Changes 1
Using the Delta Method to Construct Confidence Intervals for Predicted Probabilities, Rates, Discrete Changes 1 JunXuJ.ScottLong Indiana University 2005-02-03 1 General Formula The delta method is a general
More informationTopic 17 - Single Factor Analysis of Variance. Outline. One-way ANOVA. The Data / Notation. One way ANOVA Cell means model Factor effects model
Topic 17 - Single Factor Analysis of Variance - Fall 2013 One way ANOVA Cell means model Factor effects model Outline Topic 17 2 One-way ANOVA Response variable Y is continuous Explanatory variable is
More informationWORKSHOP 3 Measuring Association
WORKSHOP 3 Measuring Association Concepts Analysing Categorical Data o Testing of Proportions o Contingency Tables & Tests o Odds Ratios Linear Association Measures o Correlation o Simple Linear Regression
More informationGeneralized Models: Part 1
Generalized Models: Part 1 Topics: Introduction to generalized models Introduction to maximum likelihood estimation Models for binary outcomes Models for proportion outcomes Models for categorical outcomes
More informationCLUe Training An Introduction to Machine Learning in R with an example from handwritten digit recognition
CLUe Training An Introduction to Machine Learning in R with an example from handwritten digit recognition Ad Feelders Universiteit Utrecht Department of Information and Computing Sciences Algorithmic Data
More informationNELS 88. Latent Response Variable Formulation Versus Probability Curve Formulation
NELS 88 Table 2.3 Adjusted odds ratios of eighth-grade students in 988 performing below basic levels of reading and mathematics in 988 and dropping out of school, 988 to 990, by basic demographics Variable
More informationRegression Methods for Survey Data
Regression Methods for Survey Data Professor Ron Fricker! Naval Postgraduate School! Monterey, California! 3/26/13 Reading:! Lohr chapter 11! 1 Goals for this Lecture! Linear regression! Review of linear
More informationGeneralized linear models
Generalized linear models Outline for today What is a generalized linear model Linear predictors and link functions Example: estimate a proportion Analysis of deviance Example: fit dose- response data
More informationIntroduction to General and Generalized Linear Models
Introduction to General and Generalized Linear Models Generalized Linear Models - part II Henrik Madsen Poul Thyregod Informatics and Mathematical Modelling Technical University of Denmark DK-2800 Kgs.
More informationLast Lecture. Distinguish Populations from Samples. Knowing different Sampling Techniques. Distinguish Parameters from Statistics
Last Lecture Distinguish Populations from Samples Importance of identifying a population and well chosen sample Knowing different Sampling Techniques Distinguish Parameters from Statistics Knowing different
More informationErrata and Updates for ASM Exam MAS-I (First Edition) Sorted by Page
Errata for ASM Exam MAS-I Study Manual (First Edition) Sorted by Page 1 Errata and Updates for ASM Exam MAS-I (First Edition) Sorted by Page Practice Exam 5 Question 6 is defective. See the correction
More informationErrata and Updates for ASM Exam MAS-I (First Edition) Sorted by Date
Errata for ASM Exam MAS-I Study Manual (First Edition) Sorted by Date 1 Errata and Updates for ASM Exam MAS-I (First Edition) Sorted by Date Practice Exam 5 Question 6 is defective. See the correction
More informationWeighted Least Squares
Weighted Least Squares The standard linear model assumes that Var(ε i ) = σ 2 for i = 1,..., n. As we have seen, however, there are instances where Var(Y X = x i ) = Var(ε i ) = σ2 w i. Here w 1,..., w
More information