Christine Borgen Linander 1,, Rune Haubo Bojesen Christensen 1, Rebecca Evans 2, Graham Cleaver 2, Per Bruun Brockhoff 1. University of Denmark

Size: px
Start display at page:

Download "Christine Borgen Linander 1,, Rune Haubo Bojesen Christensen 1, Rebecca Evans 2, Graham Cleaver 2, Per Bruun Brockhoff 1. University of Denmark"

Transcription

1 Individual differences in replicated multi-product 2-AFC data with and without supplementary difference scoring: Comparing Thurstonian mixed regression models for binary and ordinal data with linear mixed models Christine Borgen Linander 1,, Rune Haubo Bojesen Christensen 1, Rebecca Evans 2, Graham Cleaver 2, Per Bruun Brockhoff 1 1 Department of Informatics and Mathematical Modeling, Section for Statistics, Technical University of Denmark 2 Unilever Research and Development Port Sunlight, UK *Contact author: chjo@imm.dtu.dk Christine Borgen Linander Sensometrics 2012 July 13th /30

2 Outline 1 Introduction 2 Results 3 Final remarks Christine Borgen Linander Sensometrics 2012 July 13th /30

3 Introduction Experiment Data: 12 Products - denoted A, B, C,..., L 24 Assessors 8 Attributes 5 different attributes - denoted A, B, C, D, E 3 of those evaluated again after 5 minutes - denoted C.5m, D.5m, E.5m 2 Sessions leading to replications Each Product was compared to a control. Not all combinations of Assessor and Product are present. Hence incomplete data. (Have included Assessors with at least 50% of the observations) For each Attribute a maximum of 24 observations for each Assessor. Christine Borgen Linander Sensometrics 2012 July 13th /30

4 Introduction Experiment Test protocol: 2-AFC protocol With an additional task to score the difference from not different to extremely different with 5 categories The data from the additional task is transformed into a scale from -4 to 4 where: 0 corresponds to no difference negative values are in favor of the control positive values are in favor of the Product Thus possibility to consider the response in 3 ways: Binary with values 0, 1 Quantitative with values -4, 3,..., 3, 4 Ordinal with values -4, 3,..., 3, 4 Christine Borgen Linander Sensometrics 2012 July 13th /30

5 Introduction Aims for the analysis Analytical aims: We want to illustrate that it is possible to fit a Thurstonian mixed model with the binary response We want to have the capability to take account of other sources of variation when analysing data from the 2-AFC protocol To be more specific - we want to model the individual assessors We want to analyse the results from a set of products in one analysis rather than one product comparison at a time Business oriented aim: We want to investigate if we gain valuable extra information adding the additional task to the usual 2-AFC protocol. Christine Borgen Linander Sensometrics 2012 July 13th /30

6 Introduction Models We consider 2 types of models: Naive aggregation model - ignoring the replicated structure Mixed model - including Assessor as a random effect, modelling the replicated structure Where the naive model will be fitted with the binary response. And the mixed model will be fitted with the 3 different responses. Christine Borgen Linander Sensometrics 2012 July 13th /30

7 F test - mixed model - Quantitative response The PanelCheck way by the R-package lmertest Assessor by Product interaction effects F statistic < p value 0.01 < p value < < p value < 0.01 p value < A B C D E C.5m D.5m E.5m Attributes Christine Borgen Linander Sensometrics 2012 July 13th /30

8 F test - mixed model - Quantitative response Assessor main effects F statistic < p value 0.01 < p value < < p value < 0.01 p value < A B C D E C.5m D.5m E.5m Attributes Christine Borgen Linander Sensometrics 2012 July 13th /30

9 F test - mixed model - Quantitative response Product main effects F statistic < p value 0.01 < p value < < p value < 0.01 p value < A B C D E C.5m D.5m E.5m Attributes Christine Borgen Linander Sensometrics 2012 July 13th /30

10 Likelihood Ratio test - mixed model - Binary response Assessor by Product interaction effects Likelihood Ratio statistic < p value 0.01 < p value < < p value < 0.01 p value < A B C D E C.5m D.5m E.5m Attributes Christine Borgen Linander Sensometrics 2012 July 13th /30

11 Likelihood Ratio test - mixed model - Binary response Assessor main effects Likelihood Ratio statistic < p value 0.01 < p value < < p value < 0.01 p value < A B C D E C.5m D.5m E.5m Attributes Christine Borgen Linander Sensometrics 2012 July 13th /30

12 Likelihood Ratio test - mixed model - Binary response Product main effects mixed model Likelihood Ratio statistic < p value 0.01 < p value < < p value < 0.01 p value < A B C D E C.5m D.5m E.5m Attributes Christine Borgen Linander Sensometrics 2012 July 13th /30

13 Likelihood Ratio test - naive model - Binary response Product main effects naive model Likelihood Ratio statistic < p value 0.01 < p value < < p value < 0.01 p value < A B C D E C.5m D.5m E.5m Attributes Christine Borgen Linander Sensometrics 2012 July 13th /30

14 Likelihood Ratio test - comparison of Product - Binary response Comparing test statistics for the mixed and naive model Likelihood Ratio statistic Mixed model Naive model A B C D E C.5m D.5m E.5m Attributes Christine Borgen Linander Sensometrics 2012 July 13th /30

15 Comparing the two models for Product - Binary response Not a big difference in values between the methods. This fits nicely with the Assessor-by-Product interaction being non-significant In general you would expect the values from the mixed model to be less than the values from the naive model For some attributes this is not the case. These are the ones with the highest values of the test of Assessor main effect Looking at the plot of the Assessor main effect you would expect that this was also the case for for the D.5m attribute. But looking at the plot of the Assessor-by-Product interaction this is the attribute with the highest value Christine Borgen Linander Sensometrics 2012 July 13th /30

16 Post hoc - Product differences in terms of d-prime Product estimates for the attribute E.5m (95% confidence interval) A B C D E F G H I J K L Products Christine Borgen Linander Sensometrics 2012 July 13th /30

17 Post hoc - Product differences in terms of d-prime A B C D A D G J A D G J A D G J A D G J Products Products Products Products E C.5m D.5m E.5m A D G J A D G J A D G J A D G J Products Products Products Products Christine Borgen Linander Sensometrics 2012 July 13th /30

18 Post hoc - PCA of product d-prime values Bi plot Principal component 2 (11.4%) L E H A J C K B B E A D E.5m D.5m G C C.5m D I F Principal component 1 (74.4%) Christine Borgen Linander Sensometrics 2012 July 13th /30

19 Post hoc - Assessor performance in terms of d-prime Assessor estimates for the attribute E.5m (95% confidence interval) Christine Borgen Linander Sensometrics 2012 July 13th /30

20 Post hoc - Assessor performance in terms of d-prime A B C D Assessors Assessors Assessors Assessors E C.5m D.5m E.5m Assessors Assessors Assessors Assessors Christine Borgen Linander Sensometrics 2012 July 13th /30

21 Likelihood Ratio test - mixed model - Ordinal response Using the R-package ordinal Assessor by Product interaction effects Likelihood Ratio statistic < p value 0.01 < p value < < p value < 0.01 p value < A B C D E C.5m D.5m E.5m Attributes Christine Borgen Linander Sensometrics 2012 July 13th /30

22 Likelihood Ratio test - mixed model - Ordinal response Assessor main effects Likelihood Ratio statistic < p value 0.01 < p value < < p value < 0.01 p value < A B C D E C.5m D.5m E.5m Attributes Christine Borgen Linander Sensometrics 2012 July 13th /30

23 Likelihood Ratio test - mixed model - Ordinal response Product main effects Likelihood Ratio statistic < p value 0.01 < p value < < p value < 0.01 p value < A B C D E C.5m D.5m E.5m Attributes Christine Borgen Linander Sensometrics 2012 July 13th /30

24 Likelihood Ratio test - comparison Assessor Likelihood Ratio test Assessor main effects Likelihood ratio statistic Binary Ordinal Attributes Christine Borgen Linander Sensometrics 2012 July 13th /30

25 Likelihood Ratio test - comparison Product Likelihood Ratio test Product main effects Likelihood ratio statistic Binary Ordinal Attributes Christine Borgen Linander Sensometrics 2012 July 13th /30

26 Likelihood Ratio test - comparison Assessor-by-Product Likelihood Ratio test Assessor by Product interaction effects Likelihood ratio statistic Binary Ordinal Attributes Christine Borgen Linander Sensometrics 2012 July 13th /30

27 Final remarks Final remarks We can handle incomplete observations We can fit the Thurstonian mixed model with binary data The Thurstonian mixed model provides additional information (compared to the naive aggregation model) d-prime interpretations of Product estimates d-prime interpretations of Assessor estimates These preliminary results indicate that the ordinal information is more sensitive than the binary information Christine Borgen Linander Sensometrics 2012 July 13th /30

28 Future work Final remarks At some point in the future: Investigation of how much binary data is needed to find a significant Assessor-by-Product interaction Come up with a proper Thurstonian interpretation of the ordinal approach General modelling - for instance how to handle order effects Christine Borgen Linander Sensometrics 2012 July 13th /30

29 Final remarks Thank you for your attention! Christine Borgen Linander Sensometrics 2012 July 13th /30

30 References Final remarks PanelCheck: Open source software for sensory profile data, Christensen, R. H. B. (2012). Ordinal - Regression Models for Ordinal Data. R package version Christensen, R. H. B. and P. B. Brockhoff (2012). Analysis of replicated categorical ratings data from sensory experiments. Working paper. Alexandra Kuznetsova, Per Bruun Brockhoff and Rune Haubo Bojesen Christensen (2012). lmertest: Tests for random and fixed effects for linear mixed effect models (lmer objects of lme4 package).. R package version 1.0. Brockhoff, P.B. and Christensen, R.H.B. (2010). Thurstonian models for sensory discrimination tests as generalized linear models. Food Quality and Preference 21(3), Christine Borgen Linander Sensometrics 2012 July 13th /30

Models for Replicated Discrimination Tests: A Synthesis of Latent Class Mixture Models and Generalized Linear Mixed Models

Models for Replicated Discrimination Tests: A Synthesis of Latent Class Mixture Models and Generalized Linear Mixed Models Models for Replicated Discrimination Tests: A Synthesis of Latent Class Mixture Models and Generalized Linear Mixed Models Rune Haubo Bojesen Christensen & Per Bruun Brockhoff DTU Informatics Section for

More information

Outline. The ordinal package: Analyzing ordinal data. Ordinal scales are commonly used (attribute rating or liking) Ordinal data the wine data

Outline. The ordinal package: Analyzing ordinal data. Ordinal scales are commonly used (attribute rating or liking) Ordinal data the wine data Outline Outline The ordinal package: Analyzing ordinal data Per Bruun Brockhoff DTU Compute Section for Statistics Technical University of Denmark perbb@dtu.dk August 0 015 c Per Bruun Brockhoff (DTU)

More information

Main purposes of sensr. The sensr package: Difference and similarity testing. Main functions in sensr

Main purposes of sensr. The sensr package: Difference and similarity testing. Main functions in sensr Main purposes of sensr : Difference and similarity testing Per B Brockhoff DTU Compute Section for Statistics Technical University of Denmark perbb@dtu.dk August 17 2015 Statistical tests of sensory discrimation

More information

R-companion to: Estimation of the Thurstonian model for the 2-AC protocol

R-companion to: Estimation of the Thurstonian model for the 2-AC protocol R-companion to: Estimation of the Thurstonian model for the 2-AC protocol Rune Haubo Bojesen Christensen, Hye-Seong Lee & Per Bruun Brockhoff August 24, 2017 This document describes how the examples in

More information

Advances in Sensory Discrimination Testing: Thurstonian-Derived Models, Covariates, and Consumer Relevance

Advances in Sensory Discrimination Testing: Thurstonian-Derived Models, Covariates, and Consumer Relevance Advances in Sensory Discrimination Testing: Thurstonian-Derived Models, Covariates, and Consumer Relevance John C. Castura, Sara K. King, C. J. Findlay Compusense Inc. Guelph, ON, Canada Derivation of

More information

Statistical methodology for sensory discrimination tests and its implementation in sensr

Statistical methodology for sensory discrimination tests and its implementation in sensr Statistical methodology for sensory discrimination tests and its implementation in sensr Rune Haubo Bojesen Christensen August 24, 2017 Abstract The statistical methodology of sensory discrimination analysis

More information

Analysis of sensory ratings data with cumulative link models

Analysis of sensory ratings data with cumulative link models Downloaded from orbit.dtu.dk on: Oct 02, 2018 Analysis of sensory ratings data with cumulative link models Christensen, Rune Haubo Bojesen; Brockhoff, Per B. Published in: Journal de la Societe Francaise

More information

Experimental Design and Data Analysis for Biologists

Experimental Design and Data Analysis for Biologists Experimental Design and Data Analysis for Biologists Gerry P. Quinn Monash University Michael J. Keough University of Melbourne CAMBRIDGE UNIVERSITY PRESS Contents Preface page xv I I Introduction 1 1.1

More information

Threshold models with fixed and random effects for ordered categorical data

Threshold models with fixed and random effects for ordered categorical data Threshold models with fixed and random effects for ordered categorical data Hans-Peter Piepho Universität Hohenheim, Germany Edith Kalka Universität Kassel, Germany Contents 1. Introduction. Case studies

More information

Introduction to Statistical Analysis using IBM SPSS Statistics (v24)

Introduction to Statistical Analysis using IBM SPSS Statistics (v24) to Statistical Analysis using IBM SPSS Statistics (v24) to Statistical Analysis Using IBM SPSS Statistics is a two day instructor-led classroom course that provides an application-oriented introduction

More information

Thurstonian scales obtained by transformation of beta distributions

Thurstonian scales obtained by transformation of beta distributions Available online at www.sciencedirect.com Food Quality and Preference 19 (2008) 407 411 www.elsevier.com/locate/foodqual Thurstonian scales obtained by transformation of beta distributions L. Carbonell,

More information

Sensory Analysis, R tools to analyze Preference liking in function of descriptive measures.

Sensory Analysis, R tools to analyze Preference liking in function of descriptive measures. Sensory Analysis, R tools to analyze Preference liking in function of descriptive measures. Dhafer Malouche essai.academia.edu/dhafermalouche Center of Political Studies, Institute of Social Research University

More information

Sensometrics: Thurstonian and Statistical Models

Sensometrics: Thurstonian and Statistical Models Downloaded from orbit.dtu.dk on: Dec 16, 2018 Sensometrics: Thurstonian and Statistical Models Christensen, Rune Haubo Bojesen; Brockhoff, Per B. Publication date: 2012 Document Version Publisher's PDF,

More information

Generalization to Multi-Class and Continuous Responses. STA Data Mining I

Generalization to Multi-Class and Continuous Responses. STA Data Mining I Generalization to Multi-Class and Continuous Responses STA 5703 - Data Mining I 1. Categorical Responses (a) Splitting Criterion Outline Goodness-of-split Criterion Chi-square Tests and Twoing Rule (b)

More information

Chapter 8: Regression Models with Qualitative Predictors

Chapter 8: Regression Models with Qualitative Predictors Chapter 8: Regression Models with Qualitative Predictors Some predictors may be binary (e.g., male/female) or otherwise categorical (e.g., small/medium/large). These typically enter the regression model

More information

Classification: Linear Discriminant Analysis

Classification: Linear Discriminant Analysis Classification: Linear Discriminant Analysis Discriminant analysis uses sample information about individuals that are known to belong to one of several populations for the purposes of classification. Based

More information

9/12/17. Types of learning. Modeling data. Supervised learning: Classification. Supervised learning: Regression. Unsupervised learning: Clustering

9/12/17. Types of learning. Modeling data. Supervised learning: Classification. Supervised learning: Regression. Unsupervised learning: Clustering Types of learning Modeling data Supervised: we know input and targets Goal is to learn a model that, given input data, accurately predicts target data Unsupervised: we know the input only and want to make

More information

Classification & Regression. Multicollinearity Intro to Nominal Data

Classification & Regression. Multicollinearity Intro to Nominal Data Multicollinearity Intro to Nominal Let s Start With A Question y = β 0 + β 1 x 1 +β 2 x 2 y = Anxiety Level x 1 = heart rate x 2 = recorded pulse Since we can all agree heart rate and pulse are related,

More information

On the Triangle Test with Replications

On the Triangle Test with Replications On the Triangle Test with Replications Joachim Kunert and Michael Meyners Fachbereich Statistik, University of Dortmund, D-44221 Dortmund, Germany E-mail: kunert@statistik.uni-dortmund.de E-mail: meyners@statistik.uni-dortmund.de

More information

Statistical Methods. Missing Data snijders/sm.htm. Tom A.B. Snijders. November, University of Oxford 1 / 23

Statistical Methods. Missing Data  snijders/sm.htm. Tom A.B. Snijders. November, University of Oxford 1 / 23 1 / 23 Statistical Methods Missing Data http://www.stats.ox.ac.uk/ snijders/sm.htm Tom A.B. Snijders University of Oxford November, 2011 2 / 23 Literature: Joseph L. Schafer and John W. Graham, Missing

More information

Multilevel Statistical Models: 3 rd edition, 2003 Contents

Multilevel Statistical Models: 3 rd edition, 2003 Contents Multilevel Statistical Models: 3 rd edition, 2003 Contents Preface Acknowledgements Notation Two and three level models. A general classification notation and diagram Glossary Chapter 1 An introduction

More information

26:010:557 / 26:620:557 Social Science Research Methods

26:010:557 / 26:620:557 Social Science Research Methods 26:010:557 / 26:620:557 Social Science Research Methods Dr. Peter R. Gillett Associate Professor Department of Accounting & Information Systems Rutgers Business School Newark & New Brunswick 1 Overview

More information

Structural Equation Modeling and Confirmatory Factor Analysis. Types of Variables

Structural Equation Modeling and Confirmatory Factor Analysis. Types of Variables /4/04 Structural Equation Modeling and Confirmatory Factor Analysis Advanced Statistics for Researchers Session 3 Dr. Chris Rakes Website: http://csrakes.yolasite.com Email: Rakes@umbc.edu Twitter: @RakesChris

More information

Review: what is a linear model. Y = β 0 + β 1 X 1 + β 2 X 2 + A model of the following form:

Review: what is a linear model. Y = β 0 + β 1 X 1 + β 2 X 2 + A model of the following form: Outline for today What is a generalized linear model Linear predictors and link functions Example: fit a constant (the proportion) Analysis of deviance table Example: fit dose-response data using logistic

More information

Feature Engineering, Model Evaluations

Feature Engineering, Model Evaluations Feature Engineering, Model Evaluations Giri Iyengar Cornell University gi43@cornell.edu Feb 5, 2018 Giri Iyengar (Cornell Tech) Feature Engineering Feb 5, 2018 1 / 35 Overview 1 ETL 2 Feature Engineering

More information

Regression Adjustment with Artificial Neural Networks

Regression Adjustment with Artificial Neural Networks Regression Adjustment with Artificial Neural Networks Age of Big Data: data comes in a rate and in a variety of types that exceed our ability to analyse it Texts, image, speech, video Real motivation:

More information

BIOMETRICS INFORMATION

BIOMETRICS INFORMATION BIOMETRICS INFORMATION Index of Pamphlet Topics (for pamphlets #1 to #60) as of December, 2000 Adjusted R-square ANCOVA: Analysis of Covariance 13: ANCOVA: Analysis of Covariance ANOVA: Analysis of Variance

More information

Prerequisite: STATS 7 or STATS 8 or AP90 or (STATS 120A and STATS 120B and STATS 120C). AP90 with a minimum score of 3

Prerequisite: STATS 7 or STATS 8 or AP90 or (STATS 120A and STATS 120B and STATS 120C). AP90 with a minimum score of 3 University of California, Irvine 2017-2018 1 Statistics (STATS) Courses STATS 5. Seminar in Data Science. 1 Unit. An introduction to the field of Data Science; intended for entering freshman and transfers.

More information

Basics of Multivariate Modelling and Data Analysis

Basics of Multivariate Modelling and Data Analysis Basics of Multivariate Modelling and Data Analysis Kurt-Erik Häggblom 2. Overview of multivariate techniques 2.1 Different approaches to multivariate data analysis 2.2 Classification of multivariate techniques

More information

Score-Based Tests of Measurement Invariance with Respect to Continuous and Ordinal Variables

Score-Based Tests of Measurement Invariance with Respect to Continuous and Ordinal Variables Score-Based Tests of Measurement Invariance with Respect to Continuous and Ordinal Variables Achim Zeileis, Edgar C. Merkle, Ting Wang http://eeecon.uibk.ac.at/~zeileis/ Overview Motivation Framework Score-based

More information

Article from. Predictive Analytics and Futurism. July 2016 Issue 13

Article from. Predictive Analytics and Futurism. July 2016 Issue 13 Article from Predictive Analytics and Futurism July 2016 Issue 13 Regression and Classification: A Deeper Look By Jeff Heaton Classification and regression are the two most common forms of models fitted

More information

Classification. Classification is similar to regression in that the goal is to use covariates to predict on outcome.

Classification. Classification is similar to regression in that the goal is to use covariates to predict on outcome. Classification Classification is similar to regression in that the goal is to use covariates to predict on outcome. We still have a vector of covariates X. However, the response is binary (or a few classes),

More information

Contrast Coding in R: An Exploration of a Dataset

Contrast Coding in R: An Exploration of a Dataset Contrast Coding in R: An Exploration of a Dataset Rachel Baker Phonatics, Sept. 29, 2009 Thanks to: http://www.ats.ucla.edu/stat/r/library/contrast_coding.htm Roger Levy Coding for Regressions Categorical

More information

Sensometrics: Thurstonian and Statistical Models

Sensometrics: Thurstonian and Statistical Models Sensometrics: Thurstonian and Statistical Models Rune Haubo Bojesen Christensen Kongens Lyngby 2008 IMM-M.Sc-2008-62 Technical University of Denmark Informatics and Mathematical Modelling Building 321,

More information

Methods to analyse sensory profiling data. a comparison

Methods to analyse sensory profiling data. a comparison Methods to analyse sensory profiling data a comparison Michael Meyners Fachbereich Statistik Universität Dortmund D-44221 Dortmund Germany 6 th Sensometrics, Dormund, July 31- August 2, 2002 Data Sensory

More information

Course in Data Science

Course in Data Science Course in Data Science About the Course: In this course you will get an introduction to the main tools and ideas which are required for Data Scientist/Business Analyst/Data Analyst. The course gives an

More information

Whats beyond Concerto: An introduction to the R package catr. Session 4: Overview of polytomous IRT models

Whats beyond Concerto: An introduction to the R package catr. Session 4: Overview of polytomous IRT models Whats beyond Concerto: An introduction to the R package catr Session 4: Overview of polytomous IRT models The Psychometrics Centre, Cambridge, June 10th, 2014 2 Outline: 1. Introduction 2. General notations

More information

Classification 1: Linear regression of indicators, linear discriminant analysis

Classification 1: Linear regression of indicators, linear discriminant analysis Classification 1: Linear regression of indicators, linear discriminant analysis Ryan Tibshirani Data Mining: 36-462/36-662 April 2 2013 Optional reading: ISL 4.1, 4.2, 4.4, ESL 4.1 4.3 1 Classification

More information

Inter Item Correlation Matrix (R )

Inter Item Correlation Matrix (R ) 7 1. I have the ability to influence my child s well-being. 2. Whether my child avoids injury is just a matter of luck. 3. Luck plays a big part in determining how healthy my child is. 4. I can do a lot

More information

Generalized Linear Models (GLZ)

Generalized Linear Models (GLZ) Generalized Linear Models (GLZ) Generalized Linear Models (GLZ) are an extension of the linear modeling process that allows models to be fit to data that follow probability distributions other than the

More information

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS Page 1 MSR = Mean Regression Sum of Squares MSE = Mean Squared Error RSS = Regression Sum of Squares SSE = Sum of Squared Errors/Residuals α = Level

More information

Gaussian discriminant analysis Naive Bayes

Gaussian discriminant analysis Naive Bayes DM825 Introduction to Machine Learning Lecture 7 Gaussian discriminant analysis Marco Chiarandini Department of Mathematics & Computer Science University of Southern Denmark Outline 1. is 2. Multi-variate

More information

CSC411: Final Review. James Lucas & David Madras. December 3, 2018

CSC411: Final Review. James Lucas & David Madras. December 3, 2018 CSC411: Final Review James Lucas & David Madras December 3, 2018 Agenda 1. A brief overview 2. Some sample questions Basic ML Terminology The final exam will be on the entire course; however, it will be

More information

Power Analysis using GPower 3.1

Power Analysis using GPower 3.1 Power Analysis using GPower 3.1 a one-day hands-on practice workshop (session 1) Dr. John Xie Statistics Support Officer, Quantitative Consulting Unit, Research Office, Charles Sturt University, NSW, Australia

More information

Multimodel Ensemble forecasts

Multimodel Ensemble forecasts Multimodel Ensemble forecasts Calibrated methods Michael K. Tippett International Research Institute for Climate and Society The Earth Institute, Columbia University ERFS Climate Predictability Tool Training

More information

SUPPLEMENTARY SIMULATIONS & FIGURES

SUPPLEMENTARY SIMULATIONS & FIGURES Supplementary Material: Supplementary Material for Mixed Effects Models for Resampled Network Statistics Improve Statistical Power to Find Differences in Multi-Subject Functional Connectivity Manjari Narayan,

More information

CDA Chapter 3 part II

CDA Chapter 3 part II CDA Chapter 3 part II Two-way tables with ordered classfications Let u 1 u 2... u I denote scores for the row variable X, and let ν 1 ν 2... ν J denote column Y scores. Consider the hypothesis H 0 : X

More information

MARGINAL HOMOGENEITY MODEL FOR ORDERED CATEGORIES WITH OPEN ENDS IN SQUARE CONTINGENCY TABLES

MARGINAL HOMOGENEITY MODEL FOR ORDERED CATEGORIES WITH OPEN ENDS IN SQUARE CONTINGENCY TABLES REVSTAT Statistical Journal Volume 13, Number 3, November 2015, 233 243 MARGINAL HOMOGENEITY MODEL FOR ORDERED CATEGORIES WITH OPEN ENDS IN SQUARE CONTINGENCY TABLES Authors: Serpil Aktas Department of

More information

Sparse Kernel Machines - SVM

Sparse Kernel Machines - SVM Sparse Kernel Machines - SVM Henrik I. Christensen Robotics & Intelligent Machines @ GT Georgia Institute of Technology, Atlanta, GA 30332-0280 hic@cc.gatech.edu Henrik I. Christensen (RIM@GT) Support

More information

12 Generalized linear models

12 Generalized linear models 12 Generalized linear models In this chapter, we combine regression models with other parametric probability models like the binomial and Poisson distributions. Binary responses In many situations, we

More information

Online Supplementary Material. MetaLP: A Nonparametric Distributed Learning Framework for Small and Big Data

Online Supplementary Material. MetaLP: A Nonparametric Distributed Learning Framework for Small and Big Data Online Supplementary Material MetaLP: A Nonparametric Distributed Learning Framework for Small and Big Data PI : Subhadeep Mukhopadhyay Department of Statistics, Temple University Philadelphia, Pennsylvania,

More information

ECE 5984: Introduction to Machine Learning

ECE 5984: Introduction to Machine Learning ECE 5984: Introduction to Machine Learning Topics: (Finish) Expectation Maximization Principal Component Analysis (PCA) Readings: Barber 15.1-15.4 Dhruv Batra Virginia Tech Administrativia Poster Presentation:

More information

Generalized Linear. Mixed Models. Methods and Applications. Modern Concepts, Walter W. Stroup. Texts in Statistical Science.

Generalized Linear. Mixed Models. Methods and Applications. Modern Concepts, Walter W. Stroup. Texts in Statistical Science. Texts in Statistical Science Generalized Linear Mixed Models Modern Concepts, Methods and Applications Walter W. Stroup CRC Press Taylor & Francis Croup Boca Raton London New York CRC Press is an imprint

More information

Random Forests for Ordinal Response Data: Prediction and Variable Selection

Random Forests for Ordinal Response Data: Prediction and Variable Selection Silke Janitza, Gerhard Tutz, Anne-Laure Boulesteix Random Forests for Ordinal Response Data: Prediction and Variable Selection Technical Report Number 174, 2014 Department of Statistics University of Munich

More information

Multivariate Analysis of Ecological Data using CANOCO

Multivariate Analysis of Ecological Data using CANOCO Multivariate Analysis of Ecological Data using CANOCO JAN LEPS University of South Bohemia, and Czech Academy of Sciences, Czech Republic Universitats- uric! Lanttesbibiiothek Darmstadt Bibliothek Biologie

More information

[Part 2] Model Development for the Prediction of Survival Times using Longitudinal Measurements

[Part 2] Model Development for the Prediction of Survival Times using Longitudinal Measurements [Part 2] Model Development for the Prediction of Survival Times using Longitudinal Measurements Aasthaa Bansal PhD Pharmaceutical Outcomes Research & Policy Program University of Washington 69 Biomarkers

More information

CPSC 540: Machine Learning

CPSC 540: Machine Learning CPSC 540: Machine Learning Undirected Graphical Models Mark Schmidt University of British Columbia Winter 2016 Admin Assignment 3: 2 late days to hand it in today, Thursday is final day. Assignment 4:

More information

Stat/F&W Ecol/Hort 572 Review Points Ané, Spring 2010

Stat/F&W Ecol/Hort 572 Review Points Ané, Spring 2010 1 Linear models Y = Xβ + ɛ with ɛ N (0, σ 2 e) or Y N (Xβ, σ 2 e) where the model matrix X contains the information on predictors and β includes all coefficients (intercept, slope(s) etc.). 1. Number of

More information

A Post-Aggregation Error Record Extraction Based on Naive Bayes for Statistical Survey Enumeration

A Post-Aggregation Error Record Extraction Based on Naive Bayes for Statistical Survey Enumeration A Post-Aggregation Error Record Extraction Based on Naive Bayes for Statistical Survey Enumeration Kiyomi Shirakawa, National Statistics Center, Tokyo, JAPAN e-mail: kshirakawa@nstac.go.jp Abstract At

More information

Machine Learning. B. Unsupervised Learning B.2 Dimensionality Reduction. Lars Schmidt-Thieme, Nicolas Schilling

Machine Learning. B. Unsupervised Learning B.2 Dimensionality Reduction. Lars Schmidt-Thieme, Nicolas Schilling Machine Learning B. Unsupervised Learning B.2 Dimensionality Reduction Lars Schmidt-Thieme, Nicolas Schilling Information Systems and Machine Learning Lab (ISMLL) Institute for Computer Science University

More information

Introduction and Descriptive Statistics p. 1 Introduction to Statistics p. 3 Statistics, Science, and Observations p. 5 Populations and Samples p.

Introduction and Descriptive Statistics p. 1 Introduction to Statistics p. 3 Statistics, Science, and Observations p. 5 Populations and Samples p. Preface p. xi Introduction and Descriptive Statistics p. 1 Introduction to Statistics p. 3 Statistics, Science, and Observations p. 5 Populations and Samples p. 6 The Scientific Method and the Design of

More information

Investigating Models with Two or Three Categories

Investigating Models with Two or Three Categories Ronald H. Heck and Lynn N. Tabata 1 Investigating Models with Two or Three Categories For the past few weeks we have been working with discriminant analysis. Let s now see what the same sort of model might

More information

DISCOVERING STATISTICS USING R

DISCOVERING STATISTICS USING R DISCOVERING STATISTICS USING R ANDY FIELD I JEREMY MILES I ZOE FIELD Los Angeles London New Delhi Singapore j Washington DC CONTENTS Preface How to use this book Acknowledgements Dedication Symbols used

More information

Fundamentals to Biostatistics. Prof. Chandan Chakraborty Associate Professor School of Medical Science & Technology IIT Kharagpur

Fundamentals to Biostatistics. Prof. Chandan Chakraborty Associate Professor School of Medical Science & Technology IIT Kharagpur Fundamentals to Biostatistics Prof. Chandan Chakraborty Associate Professor School of Medical Science & Technology IIT Kharagpur Statistics collection, analysis, interpretation of data development of new

More information

Types of Statistical Tests DR. MIKE MARRAPODI

Types of Statistical Tests DR. MIKE MARRAPODI Types of Statistical Tests DR. MIKE MARRAPODI Tests t tests ANOVA Correlation Regression Multivariate Techniques Non-parametric t tests One sample t test Independent t test Paired sample t test One sample

More information

New Machine Learning Methods for Neuroimaging

New Machine Learning Methods for Neuroimaging New Machine Learning Methods for Neuroimaging Gatsby Computational Neuroscience Unit University College London, UK Dept of Computer Science University of Helsinki, Finland Outline Resting-state networks

More information

Part I. Linear Discriminant Analysis. Discriminant analysis. Discriminant analysis

Part I. Linear Discriminant Analysis. Discriminant analysis. Discriminant analysis Week 5 Based in part on slides from textbook, slides of Susan Holmes Part I Linear Discriminant Analysis October 29, 2012 1 / 1 2 / 1 Nearest centroid rule Suppose we break down our data matrix as by the

More information

Introduction to LMER. Andrew Zieffler

Introduction to LMER. Andrew Zieffler Introduction to LMER Andrew Zieffler Traditional Regression General form of the linear model where y i = 0 + 1 (x 1i )+ 2 (x 2i )+...+ p (x pi )+ i yi is the response for the i th individual (i = 1,...,

More information

Model Based Statistics in Biology. Part V. The Generalized Linear Model. Chapter 18.1 Logistic Regression (Dose - Response)

Model Based Statistics in Biology. Part V. The Generalized Linear Model. Chapter 18.1 Logistic Regression (Dose - Response) Model Based Statistics in Biology. Part V. The Generalized Linear Model. Logistic Regression ( - Response) ReCap. Part I (Chapters 1,2,3,4), Part II (Ch 5, 6, 7) ReCap Part III (Ch 9, 10, 11), Part IV

More information

What Does the F-Ratio Tell Us?

What Does the F-Ratio Tell Us? Planned Comparisons What Does the F-Ratio Tell Us? The F-ratio (called an omnibus or overall F) provides a test of whether or not there a treatment effects in an experiment A significant F-ratio suggests

More information

Gaussian and Linear Discriminant Analysis; Multiclass Classification

Gaussian and Linear Discriminant Analysis; Multiclass Classification Gaussian and Linear Discriminant Analysis; Multiclass Classification Professor Ameet Talwalkar Slide Credit: Professor Fei Sha Professor Ameet Talwalkar CS260 Machine Learning Algorithms October 13, 2015

More information

High-Throughput Sequencing Course

High-Throughput Sequencing Course High-Throughput Sequencing Course DESeq Model for RNA-Seq Biostatistics and Bioinformatics Summer 2017 Outline Review: Standard linear regression model (e.g., to model gene expression as function of an

More information

Proper Platinum Resistance Thermometer Calibration Uncertainty Analysis

Proper Platinum Resistance Thermometer Calibration Uncertainty Analysis Proper Platinum Resistance Thermometer Calibration Uncertainty Analysis Thomas Wiandt Fluke Corporation, Hart Scientific Division NCSLI 2007 Session 8B 1 Introduction Uncertainty quantification is a primary

More information

Group comparison test for independent samples

Group comparison test for independent samples Group comparison test for independent samples The purpose of the Analysis of Variance (ANOVA) is to test for significant differences between means. Supposing that: samples come from normal populations

More information

Generative Models. CS4780/5780 Machine Learning Fall Thorsten Joachims Cornell University

Generative Models. CS4780/5780 Machine Learning Fall Thorsten Joachims Cornell University Generative Models CS4780/5780 Machine Learning Fall 2012 Thorsten Joachims Cornell University Reading: Mitchell, Chapter 6.9-6.10 Duda, Hart & Stork, Pages 20-39 Bayes decision rule Bayes theorem Generative

More information

Introduction to Basic Statistics Version 2

Introduction to Basic Statistics Version 2 Introduction to Basic Statistics Version 2 Pat Hammett, Ph.D. University of Michigan 2014 Instructor Comments: This document contains a brief overview of basic statistics and core terminology/concepts

More information

Outline. Mixed models in R using the lme4 package Part 3: Longitudinal data. Sleep deprivation data. Simple longitudinal data

Outline. Mixed models in R using the lme4 package Part 3: Longitudinal data. Sleep deprivation data. Simple longitudinal data Outline Mixed models in R using the lme4 package Part 3: Longitudinal data Douglas Bates Longitudinal data: sleepstudy A model with random effects for intercept and slope University of Wisconsin - Madison

More information

GIS model & modeling

GIS model & modeling GIS model & modeling Model : a simplified representation of a phenomenon or a system. GIS modeling : the use of GIS in the process of building models with spatial data. Basic requirement in modeling :

More information

284 B1 cao (ii) (b(i) or B1 for showing addition of 71 and 95 or 91 and 75

284 B1 cao (ii) (b(i) or B1 for showing addition of 71 and 95 or 91 and 75 1. 7.8 B1 cao 2. 7 B1 cao 3. 7.84 B1 cao 4. 25 B1 cao 5. (a) 2, 3, 6, 7, 8 2 B2 for 2, 3, 6, 7, 8 (b) 3,8 1 B1 cao 6. (a)(i) 23 2 B1 cao (B1 for any 3 or 4 correct, no extras or 2, 3, 6, 7, 8 seen with

More information

SUPERVISED LEARNING: INTRODUCTION TO CLASSIFICATION

SUPERVISED LEARNING: INTRODUCTION TO CLASSIFICATION SUPERVISED LEARNING: INTRODUCTION TO CLASSIFICATION 1 Outline Basic terminology Features Training and validation Model selection Error and loss measures Statistical comparison Evaluation measures 2 Terminology

More information

Missing values imputation for mixed data based on principal component methods

Missing values imputation for mixed data based on principal component methods Missing values imputation for mixed data based on principal component methods Vincent Audigier, François Husson & Julie Josse Agrocampus Rennes Compstat' 2012, Limassol (Cyprus), 28-08-2012 1 / 21 A real

More information

Multivariate Analysis of Ecological Data

Multivariate Analysis of Ecological Data Multivariate Analysis of Ecological Data MICHAEL GREENACRE Professor of Statistics at the Pompeu Fabra University in Barcelona, Spain RAUL PRIMICERIO Associate Professor of Ecology, Evolutionary Biology

More information

Chapter 13 Section D. F versus Q: Different Approaches to Controlling Type I Errors with Multiple Comparisons

Chapter 13 Section D. F versus Q: Different Approaches to Controlling Type I Errors with Multiple Comparisons Explaining Psychological Statistics (2 nd Ed.) by Barry H. Cohen Chapter 13 Section D F versus Q: Different Approaches to Controlling Type I Errors with Multiple Comparisons In section B of this chapter,

More information

Freeman (2005) - Graphic Techniques for Exploring Social Network Data

Freeman (2005) - Graphic Techniques for Exploring Social Network Data Freeman (2005) - Graphic Techniques for Exploring Social Network Data The analysis of social network data has two main goals: 1. Identify cohesive groups 2. Identify social positions Moreno (1932) was

More information

Chapter 8 Statistical Quality Control, 7th Edition by Douglas C. Montgomery. Copyright (c) 2013 John Wiley & Sons, Inc.

Chapter 8 Statistical Quality Control, 7th Edition by Douglas C. Montgomery. Copyright (c) 2013 John Wiley & Sons, Inc. 1 Learning Objectives Chapter 8 Statistical Quality Control, 7th Edition by Douglas C. Montgomery. 2 Process Capability Natural tolerance limits are defined as follows: Chapter 8 Statistical Quality Control,

More information

On Testing the Missing at Random Assumption

On Testing the Missing at Random Assumption On Testing the Missing at Random Assumption Manfred Jaeger Institut for Datalogi, Aalborg Universitet, Fredrik Bajers Vej 7E, DK-9220 Aalborg Ø jaeger@cs.aau.dk Abstract. Most approaches to learning from

More information

Confidence Bounds for Multiplicative Comparisons

Confidence Bounds for Multiplicative Comparisons Developing and applying advanced research tools for human perceptual measurement IFPress Research Papers Number 902 Confidence Bounds for Multiplicative Comparisons John M. Ennis and Daniel M. Ennis The

More information

Econometrics I G (Part I) Fall 2004

Econometrics I G (Part I) Fall 2004 Econometrics I G31.2100 (Part I) Fall 2004 Instructor: Time: Professor Christopher Flinn 269 Mercer Street, Room 302 Phone: 998 8925 E-mail: christopher.flinn@nyu.edu Homepage: http://www.econ.nyu.edu/user/flinnc

More information

Linear Regression. In this lecture we will study a particular type of regression model: the linear regression model

Linear Regression. In this lecture we will study a particular type of regression model: the linear regression model 1 Linear Regression 2 Linear Regression In this lecture we will study a particular type of regression model: the linear regression model We will first consider the case of the model with one predictor

More information

AN INVESTIGATION OF THE ALIGNMENT METHOD FOR DETECTING MEASUREMENT NON- INVARIANCE ACROSS MANY GROUPS WITH DICHOTOMOUS INDICATORS

AN INVESTIGATION OF THE ALIGNMENT METHOD FOR DETECTING MEASUREMENT NON- INVARIANCE ACROSS MANY GROUPS WITH DICHOTOMOUS INDICATORS 1 AN INVESTIGATION OF THE ALIGNMENT METHOD FOR DETECTING MEASUREMENT NON- INVARIANCE ACROSS MANY GROUPS WITH DICHOTOMOUS INDICATORS Jessica Flake, Erin Strauts, Betsy McCoach, Jane Rogers, Megan Welsh

More information

Conflicts of Interest

Conflicts of Interest Analysis of Dependent Variables: Correlation and Simple Regression Zacariah Labby, PhD, DABR Asst. Prof. (CHS), Dept. of Human Oncology University of Wisconsin Madison Conflicts of Interest None to disclose

More information

Dimension Reduction (PCA, ICA, CCA, FLD,

Dimension Reduction (PCA, ICA, CCA, FLD, Dimension Reduction (PCA, ICA, CCA, FLD, Topic Models) Yi Zhang 10-701, Machine Learning, Spring 2011 April 6 th, 2011 Parts of the PCA slides are from previous 10-701 lectures 1 Outline Dimension reduction

More information

Factor Analysis of Data Matrices

Factor Analysis of Data Matrices Factor Analysis of Data Matrices PAUL HORST University of Washington HOLT, RINEHART AND WINSTON, INC. New York Chicago San Francisco Toronto London Contents Preface PART I. Introductory Background 1. The

More information

Principal component analysis

Principal component analysis Principal component analysis Motivation i for PCA came from major-axis regression. Strong assumption: single homogeneous sample. Free of assumptions when used for exploration. Classical tests of significance

More information

Mixtures of Gaussians with Sparse Structure

Mixtures of Gaussians with Sparse Structure Mixtures of Gaussians with Sparse Structure Costas Boulis 1 Abstract When fitting a mixture of Gaussians to training data there are usually two choices for the type of Gaussians used. Either diagonal or

More information

Supplemental Resource: Brain and Cognitive Sciences Statistics & Visualization for Data Analysis & Inference January (IAP) 2009

Supplemental Resource: Brain and Cognitive Sciences Statistics & Visualization for Data Analysis & Inference January (IAP) 2009 MIT OpenCourseWare http://ocw.mit.edu Supplemental Resource: Brain and Cognitive Sciences Statistics & Visualization for Data Analysis & Inference January (IAP) 2009 For information about citing these

More information

LARGE NUMBERS OF EXPLANATORY VARIABLES. H.S. Battey. WHAO-PSI, St Louis, 9 September 2018

LARGE NUMBERS OF EXPLANATORY VARIABLES. H.S. Battey. WHAO-PSI, St Louis, 9 September 2018 LARGE NUMBERS OF EXPLANATORY VARIABLES HS Battey Department of Mathematics, Imperial College London WHAO-PSI, St Louis, 9 September 2018 Regression, broadly defined Response variable Y i, eg, blood pressure,

More information

Introduction to General and Generalized Linear Models

Introduction to General and Generalized Linear Models Introduction to General and Generalized Linear Models Generalized Linear Models - part III Henrik Madsen Poul Thyregod Informatics and Mathematical Modelling Technical University of Denmark DK-2800 Kgs.

More information

Don t be Fancy. Impute Your Dependent Variables!

Don t be Fancy. Impute Your Dependent Variables! Don t be Fancy. Impute Your Dependent Variables! Kyle M. Lang, Todd D. Little Institute for Measurement, Methodology, Analysis & Policy Texas Tech University Lubbock, TX May 24, 2016 Presented at the 6th

More information

Lecture 2: Linear Algebra Review

Lecture 2: Linear Algebra Review CS 4980/6980: Introduction to Data Science c Spring 2018 Lecture 2: Linear Algebra Review Instructor: Daniel L. Pimentel-Alarcón Scribed by: Anh Nguyen and Kira Jordan This is preliminary work and has

More information