Models for Replicated Discrimination Tests: A Synthesis of Latent Class Mixture Models and Generalized Linear Mixed Models

Size: px
Start display at page:

Download "Models for Replicated Discrimination Tests: A Synthesis of Latent Class Mixture Models and Generalized Linear Mixed Models"

Transcription

1 Models for Replicated Discrimination Tests: A Synthesis of Latent Class Mixture Models and Generalized Linear Mixed Models Rune Haubo Bojesen Christensen & Per Bruun Brockhoff DTU Informatics Section for Statistics Technical University of Denmark rhbc@imm.dtu.dk August Rune H B Christensen (DTU) Models for Replicated Discrimination Tests UseR! 2008, Dortmund 1 / 19

2 Motivation This Talk is About A non-standard type of mixed models Applicable to a range of discrimination tests Focus: Insight into models not computational methods Motivated by examples from sensometrics and psychometrics (but also applicable in signal detection, medical decision making etc.) The models extend existing models by Modelling the covariance structure Having a close connection to psychological theory of cognitive decision making Providing inference for individuals via random effect estimates Rune H B Christensen (DTU) Models for Replicated Discrimination Tests UseR! 2008, Dortmund 2 / 19

3 Outline Outline 1 Background 2 Models for Independent Data 3 Models for Replicated Data 4 Examples 5 Summary Rune H B Christensen (DTU) Models for Replicated Discrimination Tests UseR! 2008, Dortmund 3 / 19

4 Background A Replicated Discrimination Test Example: Coke Inc. wants to substitute a sweetener in a diet coke, A with a cheaper alternative B. Coke Inc. employs 30 consumers in a discrimination (triangle) test Each consumer performs the test 10 times (replications) Can consumers distinguish between the two recipes? Why Replications? Advantages: Cheap and Easy: Substitute some assessors with replications Information on difference between assessors Challenges: Observations are often correlated and not independent Rune H B Christensen (DTU) Models for Replicated Discrimination Tests UseR! 2008, Dortmund 4 / 19

5 Background The Triangle Test Two regular products and one new product are presented to the consumer Two A- products (a1, a2) and one B-product (b1) are presented to the consumer Task: Identify the odd product δ = µ B µ A : A measure of discriminal ability and difference between products A δ a1 a2 b1 Sensory magnitude B Rune H B Christensen (DTU) Models for Replicated Discrimination Tests UseR! 2008, Dortmund 5 / 19

6 Background The Triangle Test π Triangle Psychometric Function A B Triangle Model Answers are binomial: A proportion of correct answers Y i Bin(π i ; n i ) π 0 π i < 1 Guessing probability: π 0 = 1/3 Relation between π i and δ i : π i = f (δ i ) = 0 +Φ { Φ ( z 3 + δ 2/3 ( z 3 δ 2/3 ) ) } φ(z)dz 0.0 a1 a2 b δ Psychometric function: Relates the probability of a correct answer to the ability to discriminate Rune H B Christensen (DTU) Models for Replicated Discrimination Tests UseR! 2008, Dortmund 6 / 19

7 Models for Independent Data The Basic (Naive) Model (GLM) Y i Bin(π i, n i ) π i = f triangle (δ i ) δ i = δ A Generalized Linear Model (GLM) with Binomial distribution Psychometric function as inverse link function Simple linear predictor Assumes π and δ identical for all individuals Family-object; triangle in package sensr for use with glm Extends discrimination tests to allow for explanatory variables Prepares the way for mixed effect models for discrimination tests Rune H B Christensen (DTU) Models for Replicated Discrimination Tests UseR! 2008, Dortmund 7 / 19

8 Models for Replicated Data Models for Replicated Discrimination Tests Ignore covariance structure in data Basic GLM Marginal Models (adjust se s for overdispersion) quasi-binomial GLM Latent Class Mixture model Conditional Models (model covariance structure) Generalized Linear Mixed Model (GLMM) Synthesis of Mixture and GLMM Rune H B Christensen (DTU) Models for Replicated Discrimination Tests UseR! 2008, Dortmund 8 / 19

9 Models for Replicated Data Models for Replicated Discrimination Tests Ignore covariance structure in data Basic GLM Marginal Models (adjust se s for overdispersion) quasi-binomial GLM Latent Class Mixture model Conditional Models (model covariance structure) Generalized Linear Mixed Model (GLMM) Synthesis of Mixture and GLMM Rune H B Christensen (DTU) Models for Replicated Discrimination Tests UseR! 2008, Dortmund 8 / 19

10 Models for Replicated Data Latent Class Mixture Models Two-class model for discriminal ability P i Bernoulli(p) Y i p i Bin(π i ; n i ) { 0 if pi = 0 π i = f triangle (δ i ) δ i = δ if p i = p = P(δ i = 0) p = P(δ i > 0) No dispersion among subjects with positive δ Often an inappropriate assumption! δ δ i Rune H B Christensen (DTU) Models for Replicated Discrimination Tests UseR! 2008, Dortmund 9 / 19

11 Models for Replicated Data Generalized Linear Mixed Model b i N(0, σ 2 δ ) Y i b i Bin(π i ; n i ) 0.4 π i = f triangle (δ i ) δ i = δ + b i 0.3 Assumes a continuous distribution for subjects Allows for dispersion among subjects Subjects can have δ < 0! Impossible in the triangle test δ δ i σ δ Rune H B Christensen (DTU) Models for Replicated Discrimination Tests UseR! 2008, Dortmund 10 / 19

12 Models for Replicated Data Generalized Linear Mixed Model b i N(0, σ 2 δ ) Y i b i Bin(π i ; n i ) 0.4 π i = f triangle (δ i ) δ i = δ + b i 0.3 Assumes a continuous distribution for subjects Allows for dispersion among subjects Subjects can have δ < 0! Impossible in the triangle test δ δ i σ δ Rune H B Christensen (DTU) Models for Replicated Discrimination Tests UseR! 2008, Dortmund 10 / 19

13 An Appropriate Alternative Latent Class Mixed Model Y i b i Bin(π i ; n i ) δ i F(δ, σ 2 δ ) 0.4 π i = f triangle (δ i ) δ i = δ + b i { 1 p, δi = 0 f (δ i ) = ( ) 1 σ δ φ δi δ σ δ, δ i > 0 p = 1 Φ( δ/σ δ ) 0.1 One-dimensional random effect with two attributes: 0.0 δ Class probabilities pi The magnitude of discriminal ability δ δ i i P(δ i = 0)= Φ( δ σ δ ) σ δ P(δ i > 0)= 1 Φ( δ σ δ ) Rune H B Christensen (DTU) Models for Replicated Discrimination Tests UseR! 2008, Dortmund 11 / 19

14 An Appropriate Alternative Estimation in Latent Class Mixed Model Likelihood function marginal density of y f (y i ) = (1 p)f 1 (y i ) + pf 2 (y i ) Likelihood at δ i = 0: Likelihood at δ i > 0: f 1 (y i ) = ( n i y i ) π y i 0 (1 π 0) (n i y i ) f 2 (y i ) = 1 p 0 f π (y i δ i )φ((δ i δ)/σ δ ) σ δ dδ i where f π (y i δ i ) = ( n i y i ) π y i i (1 π i) (n i y i ) Define likelihood function as R-function via integrate optimize with optim Structure motives an EM algorithm Rune H B Christensen (DTU) Models for Replicated Discrimination Tests UseR! 2008, Dortmund 12 / 19

15 An Appropriate Alternative Attenuation Effect 1.0 Marginal link function: 0.9 π m = E δi [E[y i δ i ]] 0.8 = π 0 (1 p) f triangle (δ i )φ((δ i δ)/σ δ )/σ δ dδ π i Marginal estimates are closer to stationary points rather than closer to zero Marginal link function depends on σ 2 δ Original link function δ 0 Marginal link function Stationary point δ Rune H B Christensen (DTU) Models for Replicated Discrimination Tests UseR! 2008, Dortmund 13 / 19

16 Examples Example Triangle Data Model δ se(δ) σ δ p Basic (GLM) Overdisp. GLM Proposed Model % Difference in estimate of δ (attenuation effect) Basic model gives too small se s Clear variation between subjects Large proportion of discriminators Rune H B Christensen (DTU) Models for Replicated Discrimination Tests UseR! 2008, Dortmund 14 / 19

17 Examples Example Triangle Data Random effect estimates as Conditional expectations: δ i = E[δ i y i ] = = δ i f (δ i y i )dδ i δ if (y i δ i )f (δ i ) dδ i f (y i ) One extra integral per individual δ^ = 1.62 σ^δ = δ i Rune H B Christensen (DTU) Models for Replicated Discrimination Tests UseR! 2008, Dortmund 15 / 19

18 Examples Example Triangle Data Class probabilities as Conditional expectations: p i = E[p i y i ] = = = P i (0,1) P i df(p i y i ) P i f (y i P i )f (P i )/f (y i ) pf 2 (y i ) (1 p)f 1 (y i ) + pf 2 (y i ) Random effects are almost normal Expect similar results from GLMM Observed Quantiles (δ i ) Normal Quantiles Rune H B Christensen (DTU) Models for Replicated Discrimination Tests UseR! 2008, Dortmund 16 / 19 1

19 Examples Example: 2-AFC Test Random effect estimates: δ i, p i Clear tail on the left Clear lower bound on δ Discrete nature of data more clear Observed Quantiles (δ i ) Normal Quantiles Rune H B Christensen (DTU) Models for Replicated Discrimination Tests UseR! 2008, Dortmund 17 / 19

20 Summary Summary and Challenges Summary Close connection to psychological theory Model-type apply to a range of discrimination test protocols (eg. triangle, duo-trio, 2-AFC and 3-AFC) A synthesis of Latent Class Mixture Models and GLMMs One-dimensional random effects with two attributes Class probabilities p i The magnitude of discriminal ability δ i Challenges Extension to additional fixed effects (changes area of integration) random effects (multi-dimensional integrals) Variance of random effect estimates Implementation with better computational methods Rune H B Christensen (DTU) Models for Replicated Discrimination Tests UseR! 2008, Dortmund 18 / 19

21 Summary Acknowledgments Thanks to the Program Committee Thanks to Professor Per Bruun Brockhoff Thank you for listening! Rune H B Christensen (DTU) Models for Replicated Discrimination Tests UseR! 2008, Dortmund 19 / 19

Christine Borgen Linander 1,, Rune Haubo Bojesen Christensen 1, Rebecca Evans 2, Graham Cleaver 2, Per Bruun Brockhoff 1. University of Denmark

Christine Borgen Linander 1,, Rune Haubo Bojesen Christensen 1, Rebecca Evans 2, Graham Cleaver 2, Per Bruun Brockhoff 1. University of Denmark Individual differences in replicated multi-product 2-AFC data with and without supplementary difference scoring: Comparing Thurstonian mixed regression models for binary and ordinal data with linear mixed

More information

Main purposes of sensr. The sensr package: Difference and similarity testing. Main functions in sensr

Main purposes of sensr. The sensr package: Difference and similarity testing. Main functions in sensr Main purposes of sensr : Difference and similarity testing Per B Brockhoff DTU Compute Section for Statistics Technical University of Denmark perbb@dtu.dk August 17 2015 Statistical tests of sensory discrimation

More information

Outline. The ordinal package: Analyzing ordinal data. Ordinal scales are commonly used (attribute rating or liking) Ordinal data the wine data

Outline. The ordinal package: Analyzing ordinal data. Ordinal scales are commonly used (attribute rating or liking) Ordinal data the wine data Outline Outline The ordinal package: Analyzing ordinal data Per Bruun Brockhoff DTU Compute Section for Statistics Technical University of Denmark perbb@dtu.dk August 0 015 c Per Bruun Brockhoff (DTU)

More information

On the Triangle Test with Replications

On the Triangle Test with Replications On the Triangle Test with Replications Joachim Kunert and Michael Meyners Fachbereich Statistik, University of Dortmund, D-44221 Dortmund, Germany E-mail: kunert@statistik.uni-dortmund.de E-mail: meyners@statistik.uni-dortmund.de

More information

Statistical methodology for sensory discrimination tests and its implementation in sensr

Statistical methodology for sensory discrimination tests and its implementation in sensr Statistical methodology for sensory discrimination tests and its implementation in sensr Rune Haubo Bojesen Christensen August 24, 2017 Abstract The statistical methodology of sensory discrimination analysis

More information

Advances in Sensory Discrimination Testing: Thurstonian-Derived Models, Covariates, and Consumer Relevance

Advances in Sensory Discrimination Testing: Thurstonian-Derived Models, Covariates, and Consumer Relevance Advances in Sensory Discrimination Testing: Thurstonian-Derived Models, Covariates, and Consumer Relevance John C. Castura, Sara K. King, C. J. Findlay Compusense Inc. Guelph, ON, Canada Derivation of

More information

Analysis of sensory ratings data with cumulative link models

Analysis of sensory ratings data with cumulative link models Downloaded from orbit.dtu.dk on: Oct 02, 2018 Analysis of sensory ratings data with cumulative link models Christensen, Rune Haubo Bojesen; Brockhoff, Per B. Published in: Journal de la Societe Francaise

More information

High-Throughput Sequencing Course

High-Throughput Sequencing Course High-Throughput Sequencing Course DESeq Model for RNA-Seq Biostatistics and Bioinformatics Summer 2017 Outline Review: Standard linear regression model (e.g., to model gene expression as function of an

More information

Generalized Linear. Mixed Models. Methods and Applications. Modern Concepts, Walter W. Stroup. Texts in Statistical Science.

Generalized Linear. Mixed Models. Methods and Applications. Modern Concepts, Walter W. Stroup. Texts in Statistical Science. Texts in Statistical Science Generalized Linear Mixed Models Modern Concepts, Methods and Applications Walter W. Stroup CRC Press Taylor & Francis Croup Boca Raton London New York CRC Press is an imprint

More information

Semiparametric Generalized Linear Models

Semiparametric Generalized Linear Models Semiparametric Generalized Linear Models North American Stata Users Group Meeting Chicago, Illinois Paul Rathouz Department of Health Studies University of Chicago prathouz@uchicago.edu Liping Gao MS Student

More information

R-companion to: Estimation of the Thurstonian model for the 2-AC protocol

R-companion to: Estimation of the Thurstonian model for the 2-AC protocol R-companion to: Estimation of the Thurstonian model for the 2-AC protocol Rune Haubo Bojesen Christensen, Hye-Seong Lee & Per Bruun Brockhoff August 24, 2017 This document describes how the examples in

More information

Contents. Part I: Fundamentals of Bayesian Inference 1

Contents. Part I: Fundamentals of Bayesian Inference 1 Contents Preface xiii Part I: Fundamentals of Bayesian Inference 1 1 Probability and inference 3 1.1 The three steps of Bayesian data analysis 3 1.2 General notation for statistical inference 4 1.3 Bayesian

More information

H-LIKELIHOOD ESTIMATION METHOOD FOR VARYING CLUSTERED BINARY MIXED EFFECTS MODEL

H-LIKELIHOOD ESTIMATION METHOOD FOR VARYING CLUSTERED BINARY MIXED EFFECTS MODEL H-LIKELIHOOD ESTIMATION METHOOD FOR VARYING CLUSTERED BINARY MIXED EFFECTS MODEL Intesar N. El-Saeiti Department of Statistics, Faculty of Science, University of Bengahzi-Libya. entesar.el-saeiti@uob.edu.ly

More information

STAT5044: Regression and Anova

STAT5044: Regression and Anova STAT5044: Regression and Anova Inyoung Kim 1 / 18 Outline 1 Logistic regression for Binary data 2 Poisson regression for Count data 2 / 18 GLM Let Y denote a binary response variable. Each observation

More information

Generalized Linear Mixed-Effects Models. Copyright c 2015 Dan Nettleton (Iowa State University) Statistics / 58

Generalized Linear Mixed-Effects Models. Copyright c 2015 Dan Nettleton (Iowa State University) Statistics / 58 Generalized Linear Mixed-Effects Models Copyright c 2015 Dan Nettleton (Iowa State University) Statistics 510 1 / 58 Reconsideration of the Plant Fungus Example Consider again the experiment designed to

More information

Parametric Modelling of Over-dispersed Count Data. Part III / MMath (Applied Statistics) 1

Parametric Modelling of Over-dispersed Count Data. Part III / MMath (Applied Statistics) 1 Parametric Modelling of Over-dispersed Count Data Part III / MMath (Applied Statistics) 1 Introduction Poisson regression is the de facto approach for handling count data What happens then when Poisson

More information

Introduction to General and Generalized Linear Models

Introduction to General and Generalized Linear Models Introduction to General and Generalized Linear Models Generalized Linear Models - part II Henrik Madsen Poul Thyregod Informatics and Mathematical Modelling Technical University of Denmark DK-2800 Kgs.

More information

Generalized, Linear, and Mixed Models

Generalized, Linear, and Mixed Models Generalized, Linear, and Mixed Models CHARLES E. McCULLOCH SHAYLER.SEARLE Departments of Statistical Science and Biometrics Cornell University A WILEY-INTERSCIENCE PUBLICATION JOHN WILEY & SONS, INC. New

More information

Chapter 22: Log-linear regression for Poisson counts

Chapter 22: Log-linear regression for Poisson counts Chapter 22: Log-linear regression for Poisson counts Exposure to ionizing radiation is recognized as a cancer risk. In the United States, EPA sets guidelines specifying upper limits on the amount of exposure

More information

Generalized Linear Models (GLZ)

Generalized Linear Models (GLZ) Generalized Linear Models (GLZ) Generalized Linear Models (GLZ) are an extension of the linear modeling process that allows models to be fit to data that follow probability distributions other than the

More information

Package HGLMMM for Hierarchical Generalized Linear Models

Package HGLMMM for Hierarchical Generalized Linear Models Package HGLMMM for Hierarchical Generalized Linear Models Marek Molas Emmanuel Lesaffre Erasmus MC Erasmus Universiteit - Rotterdam The Netherlands ERASMUSMC - Biostatistics 20-04-2010 1 / 52 Outline General

More information

Research Projects. Hanxiang Peng. March 4, Department of Mathematical Sciences Indiana University-Purdue University at Indianapolis

Research Projects. Hanxiang Peng. March 4, Department of Mathematical Sciences Indiana University-Purdue University at Indianapolis Hanxiang Department of Mathematical Sciences Indiana University-Purdue University at Indianapolis March 4, 2009 Outline Project I: Free Knot Spline Cox Model Project I: Free Knot Spline Cox Model Consider

More information

Overdispersion Workshop in generalized linear models Uppsala, June 11-12, Outline. Overdispersion

Overdispersion Workshop in generalized linear models Uppsala, June 11-12, Outline. Overdispersion Biostokastikum Overdispersion is not uncommon in practice. In fact, some would maintain that overdispersion is the norm in practice and nominal dispersion the exception McCullagh and Nelder (1989) Overdispersion

More information

Statistical tests for differential expression in count data (1)

Statistical tests for differential expression in count data (1) Statistical tests for differential expression in count data (1) NBIC Advanced RNA-seq course 25-26 August 2011 Academic Medical Center, Amsterdam The analysis of a microarray experiment Pre-process image

More information

Poisson regression: Further topics

Poisson regression: Further topics Poisson regression: Further topics April 21 Overdispersion One of the defining characteristics of Poisson regression is its lack of a scale parameter: E(Y ) = Var(Y ), and no parameter is available to

More information

Introduction to Machine Learning. Maximum Likelihood and Bayesian Inference. Lecturers: Eran Halperin, Yishay Mansour, Lior Wolf

Introduction to Machine Learning. Maximum Likelihood and Bayesian Inference. Lecturers: Eran Halperin, Yishay Mansour, Lior Wolf 1 Introduction to Machine Learning Maximum Likelihood and Bayesian Inference Lecturers: Eran Halperin, Yishay Mansour, Lior Wolf 2013-14 We know that X ~ B(n,p), but we do not know p. We get a random sample

More information

Introduction to Machine Learning. Maximum Likelihood and Bayesian Inference. Lecturers: Eran Halperin, Lior Wolf

Introduction to Machine Learning. Maximum Likelihood and Bayesian Inference. Lecturers: Eran Halperin, Lior Wolf 1 Introduction to Machine Learning Maximum Likelihood and Bayesian Inference Lecturers: Eran Halperin, Lior Wolf 2014-15 We know that X ~ B(n,p), but we do not know p. We get a random sample from X, a

More information

Sensometrics: Thurstonian and Statistical Models

Sensometrics: Thurstonian and Statistical Models Downloaded from orbit.dtu.dk on: Dec 16, 2018 Sensometrics: Thurstonian and Statistical Models Christensen, Rune Haubo Bojesen; Brockhoff, Per B. Publication date: 2012 Document Version Publisher's PDF,

More information

Dispersion modeling for RNAseq differential analysis

Dispersion modeling for RNAseq differential analysis Dispersion modeling for RNAseq differential analysis E. Bonafede 1, F. Picard 2, S. Robin 3, C. Viroli 1 ( 1 ) univ. Bologna, ( 3 ) CNRS/univ. Lyon I, ( 3 ) INRA/AgroParisTech, Paris IBC, Victoria, July

More information

Lattice Data. Tonglin Zhang. Spatial Statistics for Point and Lattice Data (Part III)

Lattice Data. Tonglin Zhang. Spatial Statistics for Point and Lattice Data (Part III) Title: Spatial Statistics for Point Processes and Lattice Data (Part III) Lattice Data Tonglin Zhang Outline Description Research Problems Global Clustering and Local Clusters Permutation Test Spatial

More information

Chapter 1. Modeling Basics

Chapter 1. Modeling Basics Chapter 1. Modeling Basics What is a model? Model equation and probability distribution Types of model effects Writing models in matrix form Summary 1 What is a statistical model? A model is a mathematical

More information

STATS 306B: Unsupervised Learning Spring Lecture 2 April 2

STATS 306B: Unsupervised Learning Spring Lecture 2 April 2 STATS 306B: Unsupervised Learning Spring 2014 Lecture 2 April 2 Lecturer: Lester Mackey Scribe: Junyang Qian, Minzhe Wang 2.1 Recap In the last lecture, we formulated our working definition of unsupervised

More information

SUPPLEMENTARY SIMULATIONS & FIGURES

SUPPLEMENTARY SIMULATIONS & FIGURES Supplementary Material: Supplementary Material for Mixed Effects Models for Resampled Network Statistics Improve Statistical Power to Find Differences in Multi-Subject Functional Connectivity Manjari Narayan,

More information

Notes for week 4 (part 2)

Notes for week 4 (part 2) Notes for week 4 (part 2) Ben Bolker October 3, 2013 Licensed under the Creative Commons attribution-noncommercial license (http: //creativecommons.org/licenses/by-nc/3.0/). Please share & remix noncommercially,

More information

Psychology 405: Psychometric Theory Validity

Psychology 405: Psychometric Theory Validity Preliminaries Psychology 405: Psychometric Theory Validity William Revelle Department of Psychology Northwestern University Evanston, Illinois USA May, 2016 1/17 Preliminaries Outline Preliminaries 2/17

More information

R Package glmm: Likelihood-Based Inference for Generalized Linear Mixed Models

R Package glmm: Likelihood-Based Inference for Generalized Linear Mixed Models R Package glmm: Likelihood-Based Inference for Generalized Linear Mixed Models Christina Knudson, Ph.D. University of St. Thomas user!2017 Reviewing the Linear Model The usual linear model assumptions:

More information

Markov Chain Monte Carlo in Practice

Markov Chain Monte Carlo in Practice Markov Chain Monte Carlo in Practice Edited by W.R. Gilks Medical Research Council Biostatistics Unit Cambridge UK S. Richardson French National Institute for Health and Medical Research Vilejuif France

More information

Nonparametric Bayesian Methods (Gaussian Processes)

Nonparametric Bayesian Methods (Gaussian Processes) [70240413 Statistical Machine Learning, Spring, 2015] Nonparametric Bayesian Methods (Gaussian Processes) Jun Zhu dcszj@mail.tsinghua.edu.cn http://bigml.cs.tsinghua.edu.cn/~jun State Key Lab of Intelligent

More information

Lecture: Gaussian Process Regression. STAT 6474 Instructor: Hongxiao Zhu

Lecture: Gaussian Process Regression. STAT 6474 Instructor: Hongxiao Zhu Lecture: Gaussian Process Regression STAT 6474 Instructor: Hongxiao Zhu Motivation Reference: Marc Deisenroth s tutorial on Robot Learning. 2 Fast Learning for Autonomous Robots with Gaussian Processes

More information

Statistical Distribution Assumptions of General Linear Models

Statistical Distribution Assumptions of General Linear Models Statistical Distribution Assumptions of General Linear Models Applied Multilevel Models for Cross Sectional Data Lecture 4 ICPSR Summer Workshop University of Colorado Boulder Lecture 4: Statistical Distributions

More information

Introduction to General and Generalized Linear Models

Introduction to General and Generalized Linear Models Introduction to General and Generalized Linear Models Mixed effects models - Part II Henrik Madsen Poul Thyregod Informatics and Mathematical Modelling Technical University of Denmark DK-2800 Kgs. Lyngby

More information

CS281 Section 4: Factor Analysis and PCA

CS281 Section 4: Factor Analysis and PCA CS81 Section 4: Factor Analysis and PCA Scott Linderman At this point we have seen a variety of machine learning models, with a particular emphasis on models for supervised learning. In particular, we

More information

Generalized linear models

Generalized linear models Generalized linear models Douglas Bates November 01, 2010 Contents 1 Definition 1 2 Links 2 3 Estimating parameters 5 4 Example 6 5 Model building 8 6 Conclusions 8 7 Summary 9 1 Generalized Linear Models

More information

Exam: 4 hour multiple choice. Agenda. Course Introduction to Statistics. Lecture 1: Introduction to Statistics. Per Bruun Brockhoff

Exam: 4 hour multiple choice. Agenda. Course Introduction to Statistics. Lecture 1: Introduction to Statistics. Per Bruun Brockhoff Course 02402 Lecture 1: Per Bruun Brockhoff DTU Informatics Building 305 - room 110 Danish Technical University 2800 Lyngby Denmark e-mail: pbb@imm.dtu.dk Agenda 1 2 3 4 Per Bruun Brockhoff (pbb@imm.dtu.dk),

More information

In-house germination methods validation studies: analysis

In-house germination methods validation studies: analysis 1 In-house germination methods validation studies: analysis Jean-Louis Laffont - ISTA Statistics Committee Design assumptions for the validation study Based on peer validation guidelines From Table 1 in

More information

CS 340 Fall 2007: Homework 3

CS 340 Fall 2007: Homework 3 CS 34 Fall 27: Homework 3 1 Marginal likelihood for the Beta-Bernoulli model We showed that the marginal likelihood is the ratio of the normalizing constants: p(d) = B(α 1 + N 1, α + N ) B(α 1, α ) = Γ(α

More information

Linear Dynamical Systems

Linear Dynamical Systems Linear Dynamical Systems Sargur N. srihari@cedar.buffalo.edu Machine Learning Course: http://www.cedar.buffalo.edu/~srihari/cse574/index.html Two Models Described by Same Graph Latent variables Observations

More information

Introduction to General and Generalized Linear Models

Introduction to General and Generalized Linear Models Introduction to General and Generalized Linear Models Generalized Linear Models - part III Henrik Madsen Poul Thyregod Informatics and Mathematical Modelling Technical University of Denmark DK-2800 Kgs.

More information

Today. HW 1: due February 4, pm. Aspects of Design CD Chapter 2. Continue with Chapter 2 of ELM. In the News:

Today. HW 1: due February 4, pm. Aspects of Design CD Chapter 2. Continue with Chapter 2 of ELM. In the News: Today HW 1: due February 4, 11.59 pm. Aspects of Design CD Chapter 2 Continue with Chapter 2 of ELM In the News: STA 2201: Applied Statistics II January 14, 2015 1/35 Recap: data on proportions data: y

More information

Latent Variable Models

Latent Variable Models Latent Variable Models Stefano Ermon, Aditya Grover Stanford University Lecture 5 Stefano Ermon, Aditya Grover (AI Lab) Deep Generative Models Lecture 5 1 / 31 Recap of last lecture 1 Autoregressive models:

More information

Modeling Longitudinal Count Data with Excess Zeros and Time-Dependent Covariates: Application to Drug Use

Modeling Longitudinal Count Data with Excess Zeros and Time-Dependent Covariates: Application to Drug Use Modeling Longitudinal Count Data with Excess Zeros and : Application to Drug Use University of Northern Colorado November 17, 2014 Presentation Outline I and Data Issues II Correlated Count Regression

More information

Tento projekt je spolufinancován Evropským sociálním fondem a Státním rozpočtem ČR InoBio CZ.1.07/2.2.00/

Tento projekt je spolufinancován Evropským sociálním fondem a Státním rozpočtem ČR InoBio CZ.1.07/2.2.00/ Tento projekt je spolufinancován Evropským sociálním fondem a Státním rozpočtem ČR InoBio CZ.1.07/2.2.00/28.0018 Statistical Analysis in Ecology using R Linear Models/GLM Ing. Daniel Volařík, Ph.D. 13.

More information

Repeated ordinal measurements: a generalised estimating equation approach

Repeated ordinal measurements: a generalised estimating equation approach Repeated ordinal measurements: a generalised estimating equation approach David Clayton MRC Biostatistics Unit 5, Shaftesbury Road Cambridge CB2 2BW April 7, 1992 Abstract Cumulative logit and related

More information

Review. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 770: Categorical Data Analysis

Review. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 770: Categorical Data Analysis Review Timothy Hanson Department of Statistics, University of South Carolina Stat 770: Categorical Data Analysis 1 / 22 Chapter 1: background Nominal, ordinal, interval data. Distributions: Poisson, binomial,

More information

Latent Variable Models for Binary Data. Suppose that for a given vector of explanatory variables x, the latent

Latent Variable Models for Binary Data. Suppose that for a given vector of explanatory variables x, the latent Latent Variable Models for Binary Data Suppose that for a given vector of explanatory variables x, the latent variable, U, has a continuous cumulative distribution function F (u; x) and that the binary

More information

Reports of the Institute of Biostatistics

Reports of the Institute of Biostatistics Reports of the Institute of Biostatistics No 02 / 2008 Leibniz University of Hannover Natural Sciences Faculty Title: Properties of confidence intervals for the comparison of small binomial proportions

More information

Mixed models in R using the lme4 package Part 5: Generalized linear mixed models

Mixed models in R using the lme4 package Part 5: Generalized linear mixed models Mixed models in R using the lme4 package Part 5: Generalized linear mixed models Douglas Bates Madison January 11, 2011 Contents 1 Definition 1 2 Links 2 3 Example 7 4 Model building 9 5 Conclusions 14

More information

Analytics Software. Beyond deterministic chain ladder reserves. Neil Covington Director of Solutions Management GI

Analytics Software. Beyond deterministic chain ladder reserves. Neil Covington Director of Solutions Management GI Analytics Software Beyond deterministic chain ladder reserves Neil Covington Director of Solutions Management GI Objectives 2 Contents 01 Background 02 Formulaic Stochastic Reserving Methods 03 Bootstrapping

More information

Gaussian Models

Gaussian Models Gaussian Models ddebarr@uw.edu 2016-04-28 Agenda Introduction Gaussian Discriminant Analysis Inference Linear Gaussian Systems The Wishart Distribution Inferring Parameters Introduction Gaussian Density

More information

Generalized Linear Models. Kurt Hornik

Generalized Linear Models. Kurt Hornik Generalized Linear Models Kurt Hornik Motivation Assuming normality, the linear model y = Xβ + e has y = β + ε, ε N(0, σ 2 ) such that y N(μ, σ 2 ), E(y ) = μ = β. Various generalizations, including general

More information

The performance of estimation methods for generalized linear mixed models

The performance of estimation methods for generalized linear mixed models University of Wollongong Research Online University of Wollongong Thesis Collection 1954-2016 University of Wollongong Thesis Collections 2008 The performance of estimation methods for generalized linear

More information

UNIVERSITY OF TORONTO Faculty of Arts and Science

UNIVERSITY OF TORONTO Faculty of Arts and Science UNIVERSITY OF TORONTO Faculty of Arts and Science December 2013 Final Examination STA442H1F/2101HF Methods of Applied Statistics Jerry Brunner Duration - 3 hours Aids: Calculator Model(s): Any calculator

More information

Generalized Linear Models for Non-Normal Data

Generalized Linear Models for Non-Normal Data Generalized Linear Models for Non-Normal Data Today s Class: 3 parts of a generalized model Models for binary outcomes Complications for generalized multivariate or multilevel models SPLH 861: Lecture

More information

Introduction to Generalized Linear Models

Introduction to Generalized Linear Models Introduction to Generalized Linear Models Edps/Psych/Soc 589 Carolyn J. Anderson Department of Educational Psychology c Board of Trustees, University of Illinois Fall 2018 Outline Introduction (motivation

More information

Using statistical methods to analyse environmental extremes.

Using statistical methods to analyse environmental extremes. Using statistical methods to analyse environmental extremes. Emma Eastoe Department of Mathematics and Statistics Lancaster University December 16, 2008 Focus of talk Discuss statistical models used to

More information

Biostat Methods STAT 5820/6910 Handout #9a: Intro. to Meta-Analysis Methods

Biostat Methods STAT 5820/6910 Handout #9a: Intro. to Meta-Analysis Methods Biostat Methods STAT 5820/6910 Handout #9a: Intro. to Meta-Analysis Methods Meta-analysis describes statistical approach to systematically combine results from multiple studies [identified follong an exhaustive

More information

INTRODUCING LINEAR REGRESSION MODELS Response or Dependent variable y

INTRODUCING LINEAR REGRESSION MODELS Response or Dependent variable y INTRODUCING LINEAR REGRESSION MODELS Response or Dependent variable y Predictor or Independent variable x Model with error: for i = 1,..., n, y i = α + βx i + ε i ε i : independent errors (sampling, measurement,

More information

Mixed models in R using the lme4 package Part 5: Generalized linear mixed models

Mixed models in R using the lme4 package Part 5: Generalized linear mixed models Mixed models in R using the lme4 package Part 5: Generalized linear mixed models Douglas Bates 2011-03-16 Contents 1 Generalized Linear Mixed Models Generalized Linear Mixed Models When using linear mixed

More information

Section 7: Local linear regression (loess) and regression discontinuity designs

Section 7: Local linear regression (loess) and regression discontinuity designs Section 7: Local linear regression (loess) and regression discontinuity designs Yotam Shem-Tov Fall 2015 Yotam Shem-Tov STAT 239/ PS 236A October 26, 2015 1 / 57 Motivation We will focus on local linear

More information

Regression models. Generalized linear models in R. Normal regression models are not always appropriate. Generalized linear models. Examples.

Regression models. Generalized linear models in R. Normal regression models are not always appropriate. Generalized linear models. Examples. Regression models Generalized linear models in R Dr Peter K Dunn http://www.usq.edu.au Department of Mathematics and Computing University of Southern Queensland ASC, July 00 The usual linear regression

More information

Outline. Mixed models in R using the lme4 package Part 5: Generalized linear mixed models. Parts of LMMs carried over to GLMMs

Outline. Mixed models in R using the lme4 package Part 5: Generalized linear mixed models. Parts of LMMs carried over to GLMMs Outline Mixed models in R using the lme4 package Part 5: Generalized linear mixed models Douglas Bates University of Wisconsin - Madison and R Development Core Team UseR!2009,

More information

Lecture 1 Introduction to Multi-level Models

Lecture 1 Introduction to Multi-level Models Lecture 1 Introduction to Multi-level Models Course Website: http://www.biostat.jhsph.edu/~ejohnson/multilevel.htm All lecture materials extracted and further developed from the Multilevel Model course

More information

Learning with Noisy Labels. Kate Niehaus Reading group 11-Feb-2014

Learning with Noisy Labels. Kate Niehaus Reading group 11-Feb-2014 Learning with Noisy Labels Kate Niehaus Reading group 11-Feb-2014 Outline Motivations Generative model approach: Lawrence, N. & Scho lkopf, B. Estimating a Kernel Fisher Discriminant in the Presence of

More information

Generalized Linear Models. Last time: Background & motivation for moving beyond linear

Generalized Linear Models. Last time: Background & motivation for moving beyond linear Generalized Linear Models Last time: Background & motivation for moving beyond linear regression - non-normal/non-linear cases, binary, categorical data Today s class: 1. Examples of count and ordered

More information

Statistics 572 Semester Review

Statistics 572 Semester Review Statistics 572 Semester Review Final Exam Information: The final exam is Friday, May 16, 10:05-12:05, in Social Science 6104. The format will be 8 True/False and explains questions (3 pts. each/ 24 pts.

More information

On the Importance of Dispersion Modeling for Claims Reserving: Application of the Double GLM Theory

On the Importance of Dispersion Modeling for Claims Reserving: Application of the Double GLM Theory On the Importance of Dispersion Modeling for Claims Reserving: Application of the Double GLM Theory Danaïl Davidov under the supervision of Jean-Philippe Boucher Département de mathématiques Université

More information

Generalized Linear Models Introduction

Generalized Linear Models Introduction Generalized Linear Models Introduction Statistics 135 Autumn 2005 Copyright c 2005 by Mark E. Irwin Generalized Linear Models For many problems, standard linear regression approaches don t work. Sometimes,

More information

Classification. Chapter Introduction. 6.2 The Bayes classifier

Classification. Chapter Introduction. 6.2 The Bayes classifier Chapter 6 Classification 6.1 Introduction Often encountered in applications is the situation where the response variable Y takes values in a finite set of labels. For example, the response Y could encode

More information

Outline of GLMs. Definitions

Outline of GLMs. Definitions Outline of GLMs Definitions This is a short outline of GLM details, adapted from the book Nonparametric Regression and Generalized Linear Models, by Green and Silverman. The responses Y i have density

More information

Outline. Remedial Measures) Extra Sums of Squares Standardized Version of the Multiple Regression Model

Outline. Remedial Measures) Extra Sums of Squares Standardized Version of the Multiple Regression Model Outline 1 Multiple Linear Regression (Estimation, Inference, Diagnostics and Remedial Measures) 2 Special Topics for Multiple Regression Extra Sums of Squares Standardized Version of the Multiple Regression

More information

General Regression Model

General Regression Model Scott S. Emerson, M.D., Ph.D. Department of Biostatistics, University of Washington, Seattle, WA 98195, USA January 5, 2015 Abstract Regression analysis can be viewed as an extension of two sample statistical

More information

The Impact of Model Misspecification in Clustered and Continuous Growth Modeling

The Impact of Model Misspecification in Clustered and Continuous Growth Modeling The Impact of Model Misspecification in Clustered and Continuous Growth Modeling Daniel J. Bauer Odum Institute for Research in Social Science The University of North Carolina at Chapel Hill Patrick J.

More information

For Bonnie and Jesse (again)

For Bonnie and Jesse (again) SECOND EDITION A P P L I E D R E G R E S S I O N A N A L Y S I S a n d G E N E R A L I Z E D L I N E A R M O D E L S For Bonnie and Jesse (again) SECOND EDITION A P P L I E D R E G R E S S I O N A N A

More information

Non-Gaussian Response Variables

Non-Gaussian Response Variables Non-Gaussian Response Variables What is the Generalized Model Doing? The fixed effects are like the factors in a traditional analysis of variance or linear model The random effects are different A generalized

More information

STA 414/2104: Lecture 8

STA 414/2104: Lecture 8 STA 414/2104: Lecture 8 6-7 March 2017: Continuous Latent Variable Models, Neural networks With thanks to Russ Salakhutdinov, Jimmy Ba and others Outline Continuous latent variable models Background PCA

More information

Thurstonian scales obtained by transformation of beta distributions

Thurstonian scales obtained by transformation of beta distributions Available online at www.sciencedirect.com Food Quality and Preference 19 (2008) 407 411 www.elsevier.com/locate/foodqual Thurstonian scales obtained by transformation of beta distributions L. Carbonell,

More information

Generative classifiers: The Gaussian classifier. Ata Kaban School of Computer Science University of Birmingham

Generative classifiers: The Gaussian classifier. Ata Kaban School of Computer Science University of Birmingham Generative classifiers: The Gaussian classifier Ata Kaban School of Computer Science University of Birmingham Outline We have already seen how Bayes rule can be turned into a classifier In all our examples

More information

Stat 5101 Lecture Notes

Stat 5101 Lecture Notes Stat 5101 Lecture Notes Charles J. Geyer Copyright 1998, 1999, 2000, 2001 by Charles J. Geyer May 7, 2001 ii Stat 5101 (Geyer) Course Notes Contents 1 Random Variables and Change of Variables 1 1.1 Random

More information

The Multilevel Logit Model for Binary Dependent Variables Marco R. Steenbergen

The Multilevel Logit Model for Binary Dependent Variables Marco R. Steenbergen The Multilevel Logit Model for Binary Dependent Variables Marco R. Steenbergen January 23-24, 2012 Page 1 Part I The Single Level Logit Model: A Review Motivating Example Imagine we are interested in voting

More information

The combined model: A tool for simulating correlated counts with overdispersion. George KALEMA Samuel IDDI Geert MOLENBERGHS

The combined model: A tool for simulating correlated counts with overdispersion. George KALEMA Samuel IDDI Geert MOLENBERGHS The combined model: A tool for simulating correlated counts with overdispersion George KALEMA Samuel IDDI Geert MOLENBERGHS Interuniversity Institute for Biostatistics and statistical Bioinformatics George

More information

Class Notes: Week 8. Probit versus Logit Link Functions and Count Data

Class Notes: Week 8. Probit versus Logit Link Functions and Count Data Ronald Heck Class Notes: Week 8 1 Class Notes: Week 8 Probit versus Logit Link Functions and Count Data This week we ll take up a couple of issues. The first is working with a probit link function. While

More information

Dimensionality Reduction and Principle Components

Dimensionality Reduction and Principle Components Dimensionality Reduction and Principle Components Ken Kreutz-Delgado (Nuno Vasconcelos) UCSD ECE Department Winter 2012 Motivation Recall, in Bayesian decision theory we have: World: States Y in {1,...,

More information

Generalized Linear Models

Generalized Linear Models York SPIDA John Fox Notes Generalized Linear Models Copyright 2010 by John Fox Generalized Linear Models 1 1. Topics I The structure of generalized linear models I Poisson and other generalized linear

More information

Mixtures of Negative Binomial distributions for modelling overdispersion in RNA-Seq data

Mixtures of Negative Binomial distributions for modelling overdispersion in RNA-Seq data Mixtures of Negative Binomial distributions for modelling overdispersion in RNA-Seq data Cinzia Viroli 1 joint with E. Bonafede 1, S. Robin 2 & F. Picard 3 1 Department of Statistical Sciences, University

More information

Stat/F&W Ecol/Hort 572 Review Points Ané, Spring 2010

Stat/F&W Ecol/Hort 572 Review Points Ané, Spring 2010 1 Linear models Y = Xβ + ɛ with ɛ N (0, σ 2 e) or Y N (Xβ, σ 2 e) where the model matrix X contains the information on predictors and β includes all coefficients (intercept, slope(s) etc.). 1. Number of

More information

Statistics for Managers Using Microsoft Excel (3 rd Edition)

Statistics for Managers Using Microsoft Excel (3 rd Edition) Statistics for Managers Using Microsoft Excel (3 rd Edition) Chapter 4 Basic Probability and Discrete Probability Distributions 2002 Prentice-Hall, Inc. Chap 4-1 Chapter Topics Basic probability concepts

More information

Supplemental Resource: Brain and Cognitive Sciences Statistics & Visualization for Data Analysis & Inference January (IAP) 2009

Supplemental Resource: Brain and Cognitive Sciences Statistics & Visualization for Data Analysis & Inference January (IAP) 2009 MIT OpenCourseWare http://ocw.mit.edu Supplemental Resource: Brain and Cognitive Sciences Statistics & Visualization for Data Analysis & Inference January (IAP) 2009 For information about citing these

More information

GENERALIZED LINEAR MIXED MODELS AND MEASUREMENT ERROR. Raymond J. Carroll: Texas A&M University

GENERALIZED LINEAR MIXED MODELS AND MEASUREMENT ERROR. Raymond J. Carroll: Texas A&M University GENERALIZED LINEAR MIXED MODELS AND MEASUREMENT ERROR Raymond J. Carroll: Texas A&M University Naisyin Wang: Xihong Lin: Roberto Gutierrez: Texas A&M University University of Michigan Southern Methodist

More information

ST 740: Markov Chain Monte Carlo

ST 740: Markov Chain Monte Carlo ST 740: Markov Chain Monte Carlo Alyson Wilson Department of Statistics North Carolina State University October 14, 2012 A. Wilson (NCSU Stsatistics) MCMC October 14, 2012 1 / 20 Convergence Diagnostics:

More information

Statistícal Methods for Spatial Data Analysis

Statistícal Methods for Spatial Data Analysis Texts in Statistícal Science Statistícal Methods for Spatial Data Analysis V- Oliver Schabenberger Carol A. Gotway PCT CHAPMAN & K Contents Preface xv 1 Introduction 1 1.1 The Need for Spatial Analysis

More information