Threshold models with fixed and random effects for ordered categorical data
|
|
- David Poole
- 6 years ago
- Views:
Transcription
1 Threshold models with fixed and random effects for ordered categorical data Hans-Peter Piepho Universität Hohenheim, Germany Edith Kalka Universität Kassel, Germany
2 Contents 1. Introduction. Case studies 3. Numerical issues 4. Simulations 5. Conclusion
3 1. Introduction Analysis of ordered categorical data: ANOVA, LSD Transformation to normality Nonparametric methods Threshold model
4 Percent Consumer Employee Fig. 1.1: Observed frequencies for snack food scores (Best et al., 000).
5 0.16 Consumers π π 1 π π 6 π 3 π 4 π 7 π 8 π θ 1 θ θ 3 θ 4 θ 5 θ 6 θ 7 θ 8 m Latent variable
6 0.16 Employees θ 1 θ θ 3 θ 4 θ 5 θ 6 θ 7 θ 8 m Latent variable
7 Latent variables Latent variable m (quantitative) observed score y (ordered categorical) " increasing the energy of the physical stimulus, or the concentration or amount of food ingredient, should result in an increase in how strong something feels, looks, smells, or tastes. We increase the salt in a soup, and it tastes saltier." (Lawless and Heymann, 1998, p. 08)
8 Multinomial probabilities Threshold values θ 1, θ,... Multinomial probabilities π 1, π, π1 π π... 3 π I π = Φ 1 Φ Φ ( θ η) ( θ η) Φ( θ η) ( θ η) Φ( θ η) = 1 = 3 ( θ I η) Φ( θ η) ( θ η) = Φ Φ 1 I 1 I = 1 I Φ = standard normal c.d.f. η = mean on the latent scale I = number of categories
9 Linear modelling Expected value on latent scale: η k = µ + α k where η k = mean for the k-th panel (k = 1: consumers k = : employees) α k = effect of k-th panel Random variable on latent scale: m jk = η k + e jk e jk ~ N ( 0,1)
10 Percent Consumer Employee Fig. 1.3: Fitted frequencies of scores for snack food scores.
11 Percent Mean= 0 Mean= Var(scores) = 3.65 Var(scores) = Fig. 1.4: Fitted frequencies based on threshold model for snack food scores, assuming means of η = 0 and η = 3 for the latent variable.
12 Percent Sample1 Sample Var(scores)=1.77 Var(scores)= Fig. 1.5: Histogram of observed overall acceptance data (Villanueva et al., 000). Scores display significant variance heterogeneity by an LR-test, assuming normality (p = < ).
13 Percent sample1 sample Fig. 1.6: Fitted histogram of overall acceptance data (Villanueva et al., 000), assuming threshold model with homogeneous variance ( log L = ).
14 Percent sample1 sample σ 1 = 1 σ = Fig. 1.7: Fitted histogram of overall acceptance data (Villanueva et al., 000), assuming threshold model with heterogeneous variance ( log L = ). σ k = standard deviation on latent scale for k-th product.
15 Unequal variance on latent scale invalidates ANOVA of observed scores! Example: Date ganerated by threshold model with three categories (θ 1 = 1 and θ = 1) Parameter Group 1 Group Latent scale: Mean Variance Observed scale: Mean.4.
16 . Case studies Case study 1: Consumer preferences for on farm and industrially processed quarg samples nine point hedonic scale 1 = dislike extremely to 9 = like extremely 15 consumers Seven quarg products
17 An initial model m ij = α i + u j + e ij (.1) where m ij = latent variable for i-th product and j-th assessor α i = expected value of i-th product u j = random effect of j-th assessor; u j ~ N(0, σ u e ij = random deviation of j-th assessor for i-th product; e ij N(0, 1) σ u corr( m ij, m i ' j ) = σ + 1 u )
18 Table.1: The Johnson System for u j. Distribution Model for u j Support ( ) S L u j exp w j S B u j = 0 < u j exp = φ 1+ exp ( w ) j ( w ) ( ) S U u j φ sinh w j ( ) : w N µ, σ j ~ w w j 0 u j φ = < u j
19 Factor-analytic models m ij = α i + λ i u j + e ij λ i = factor loading for i-th product u j = latent value for j-th assessor; u j N(0,1) e ij = error; e ij N(0,1) Model implies heterogeneity: var( m ) λ ij = i + corr( m ij, m i ' j 1 ) = i ( )( λ + 1 λ + 1) i λ λ i i
20 Model selection by AIC Akaike Information Criterion = AIC: AIC = log L + p where log L = maximized log-likelihood p = number of parameters "Smaller is better"
21 Table.: Model fits for quarg data. Model for latent variable Distribution for random AIC effects m ij = α i + e ij ( 0, ) m ij = α i + u j + e ij j ~ N u u σ ( u, µ, σ ) u u j ~ S L w w φ (, µ, ) w σ (, µ, σ ) j ~ S B w φ j ~ S U w w m ij = α i + λ i u j + e ij ~ N( 0,1 ) m ij = α i + λ i u j1 + λ i u j + e ij ~ N( 0, 1) m ij = α i + λ i u j1 + λ i u j + λ i3 u j3 + e ij ~ N( 0,1) φ u j u jk u jk
22 Factor-analytic threshold model (two multiplicative terms): Estimate of Product α i ANOVA of scores: Product Score mean 1.10 bc.0 c d be de 6.11 bc 7.88 a SED b 5.60 b c b c b a LSD Estimates in a column followed by the same letter are not significantly different.
23 Artificial example Four treatments (T1, T, T3, T4) Comparison of T1 and T4 significant Initial display: T1 a T a T3 a misses significance T4 a of T1-T4 comparison Insertion: T1 a b T1 a b T1 b T a b T a b T a b T3 a b T3 a b T3 a b T4 a b T4 a b T4 a
24 7 Observed score mean Estimated effect Fig..1: Plot of observed score means (ANOVA) versus estimated effects under threshold model for seven quarg products.
25 Expected score mean Linear predictor Fig..4: Plot of score mean versus linear predictor under estimated threshold model for quarg data.
26 Case study : Sensory descriptive analysis of on farm and industrial processed quarg samples by a trained panel trained panel of 15 assessors 8 quarg samples (B1-B8) 1 sensory attributes 1 sessions ordinal scale (0 = non- detectable to 5 = very strong)
27 Mixed linear model (per attribute): m ijk = u j + τ i + α k + (ατ) ik + e ijk (.6) where u j τ i α k = effect of j-th assessor = effect of i-th product = effect of k-th session (random) (ατ) ik = interaction of i-th product and k-th session (random)
28 Principal component analysis Two standardizations: σ = I i= 1 ( τ i τ ) ( I 1) (.7a) σ = σ ατ +1 (.7b) singular value decomposition of standardized values
29 PC Axis.5 B5 1.5 e b l B7 g B B8 B3 B4 i B1 f h c k a j d B PC Axis 1 Fig..5: Biplot for PCA of 8 quarg samples and 1 sensory attributes based on standardization (.7a). 83% of variance are explained. B1-B8: samples; letters: attributes)
30 PC Axis.5 B5 1.5 e g B3 B4 c d k b l B i a d B1 j B6-1.5 B7 B8 h PC Axis 1 Fig..6: Biplot for PCA of 8 quarg samples and 1 sensory attributes based on standardization (.7b). 91% of variance are explained. B1-B8: samples; letters: attributes).
31 Heterogeneity of variance We hypothesized that variance was larger for samples B1 through B4 Fit model with heterogeneity in variance of interaction (ατ) ik. σ σ ατ (1) ατ () : variance for products B1 to B4 : variance for products B5 to B8
32 Table.7: Variance components (case study ). *: p-value < Homogeneous Heterogeneous LR-test B1-B8 B1-B4 B5-B8 σ σ σ ατ ατ 1) ατ () Attributes ( Granular ns Separation of whey * Creamy yellow color * Bitter ns Sour ns Flavour of starter cultures * Flavour of yoghurt * Rancid * Other flavour ns Firm * Gritty ns Creamy ns
33 Inter-assessor agreement Assessed separately for each session m ij = u j + τ i + e ij (.8) u j = effect of j-th assessor τ i = effect of i-th product τ i ~ N(0, σ τ ) e ij = random error; standard normal Intra-class correlation: ρ IC = σ τ 1+ σ τ
34 Table.9: Estimates of the intra-class correlation ρ IC and Kendall's coefficient of concordance for quarg data. Attributes Session granular colour sour Kendall's measure of concordance: Intraclass correlation (ρ IC ):
35 Case study 3 Consumer preferences for raspberry beverages Two regions (geographies) 8 beverages Panel 1 (13 assessors): 1 = dislike extremely, 9 = like extremely Panel (7 assessors): 1 =bad, 5 =good Questions: Difference in accuracy among scales Differences in ratings among regions Correlation among attributes
36 Difference in accuracy Linear model: m ij = α i + u jk + e ijk (.9) α i = effect of i-th product u jk = effect of j-th assessor on k-th scale u ~ N(0, σ ) jk e ijk = error e ~ ijk N(0, σ u(k ) e(k ) ) Separate set of thresholds for both scales! Hypotheses of interest: H 01 : H 0 : σ = σ u( 1) u() σ = σ e( 1) e()
37 Table.9: Likelihood statistics and variance component estimates for liking, raspberry beverages. Geography 1: Model restriction log L σ e(1) σ e() σ u(1) σ u() None σ = σ u( 1) u() σ e( 1) = σ e() σ e( 1) = σ e() ; σ u( 1) = σ u()
38 Geography : Model restriction log L σ e(1) σ e() σ u(1) σ u() None σ = σ u( 1) u() σ e( 1) = σ e() σ e( 1) = σ e() ; σ u( 1) = σ u()
39 Differences in ratings among regions m ijh = α i + γ ih + u jh + e ijh (.11) where α i = main effect of i-th product γ ih = deviation of i-th product in h-th geography (h = 1, ) u jh = effect of j-th assessor in h-th geography, u ~ N(0, ) jh σ e ijh = error (standard normal) u Ratings the same? H 0 : γ i1 = γ i for every product i
40 Table.1: Comparison of mixed threshold models for testing H 0 : γ 1i = γ i (consumers behave identically in both regions). Attribute: liking (case study 3). Scale Model for m ijh AIC Test of H 0 : γ 1i = γ i Wald-χ p-value 5 point α i + γ ih + u jh + e ijh α i + γ ih + λ i u jh + e ijh point α i + γ ih + u jh + e ijh < α i + γ ih + λ i u jh + e ijh <0.0001
41 Correlation among attributes liking and appearance geography 1 five-point ordinal scale Bivariate mixed model: m ij1 = α i1 + u j1 + e ij1 (.13) m ij = α i + u j + e ij where α ih = effect of i-th product for h-th attribute u jh = effect of j-th assessor for h-th attribute e ijh = residual corresponding to m ijh
42 Distributional assumptions: , 0 0 ~ u u u u u u u u j j BVN u u σ σ σ ρ σ σ ρ σ 1 1, 0 0 ~ 1 e e ij ij BVN e e ρ ρ where BVN(.,.) = bivariate normal distribution ρ u = between-product correlation ρ e = within-product correlation
43 Bivariate multinomial probabilities of observed scores: π ab with = ( θ η, θ η ρ ) P( y π 1 = a, y = b) = Φ 1a 1 b, e a b r = 1 s= 1 rs< ab (a = 1,, 5; b = 1,, 5) rs (.14) y 1, y = observed scores for liking and appearance Φ = bivariate standard normal c.d.f θ 1a, θ b = thresholds; θ 15 = θ 5 = η 1, η = linear predictors (means) on latent scale Result: ˆ ρ = ( s. e. = 0.011) (within-product analysis) e
44 Case study 4 Recovery of inter-assessor information Sensory panels of 33 assessors Three groups of cheese products Hedonic 9-point scale Tests in incomplete blocks: five or six products per assessor Type of analysis Intra-assessor analysis Assumption about assessor effects fixed Recovery of inter-assessor information random
45 Table.14: Average standard errors of a difference (s.e.d.). Average s.e.d. Use of inter-assessor Hard Fresh cheese Fresh Information cheese with additives cheese yes no
46 3. Numerical issues y = observed scores u = random effects Joint density of u and y: φ yu (y, u) = φ yu (y u)φ u (u) (3.1) where φ yu (y u) = conditional density of y, given u φ u (u) (multinomial) = marginal density of u (normal) Estimation maximize φ y ( y) = φyu ( y v) φu ( v) dv (3.)
47 Numerical integration Approximations of the likelihood poor performance for small samples Gaussian quadrature Computationally demanding Handles only one subject level All methods rely on asymptotic theory!
48 4. Simulations 4.1 No random effects Hypothesis: Show that ANOVA may be problematic with ordinal data Simulate according to: m ijk = α i + u j + e ijk (4.1) where α i = product effect (i = 1,, I) u j = random assessor effect (j = 1,, J), e ijk = error (standard normal distribution) (k = 1,.., R) Three (nine) categories
49 Simulations under: global null hypothesis departure from global null Teste evaluated: Global H 0 : ANOVA F-test LR test for threshold model H 0 : α 1 = α : ANOVA-LSD paired t-test Wald-test for threshold model
50 Table 4.5: Simulation results for threshold model with fixed effects, Type I error at nominal significance level of 5%, Global H 0 false, nine categories (θ 1 = 0.5, θ = 0.4, θ 3 = 0.3, θ 4 = 0.1, θ 5 = 0.1, θ 6 = 0., θ 7 = 0.3, θ 8 = 0.5, σ u = 0). Test of H 0 : α 1 = α. R J I α 1 α α i> ANOVA Threshold Paired LSD Wald t-test
51 4. Pairwise comparison with heteroscedasticity on latent scale Hypothesis: Heteroscedasticity on latent scale of threshold model invalidates ANOVA Simulated according to: m ij = α i + e ij (4.) where α i = product effect (i = 1, ) e ij = error of j-th replicate for i-th product (j = 1,, J i ) σ 1 = standard deviation of e 1j (product 1) σ = standard deviation of e j (product ) Nine categories.
52 Tests evaluated: Simple t-test Satterthwaite t-test (heteroscedastic) Standard threshold model Threshold model with heteroscedastic errors
53 Table 4.6: Simulation results for heteroscedastic threshold model, Type I error at nominal significance level of 5%, α 1 = α = 0. Test of H 0 : α 1 = α. Threshold model θ 1 θ θ 3 θ 4 θ 5 θ 6 θ 7 θ 8 σ t-test σ 1 = σ σ 1 σ J 1 = 50; J = 150:
54 Threshold model θ 1 θ θ 3 θ 4 θ 5 θ 6 θ 7 θ 8 σ t-test σ 1 = σ σ 1 σ J 1 = 150; J = 50:
55 4.3 One random effect Hypothessis: Asymptotics are even more critical for mixed models Simulated according to: m ijk = α i + u j + f ij + e ijk (4.3) where f ij = random assessor product interaction
56 Table 4.7: Simulation results for threshold model with fixed and random effects (4.3), Type I errors at nominal significance level of 5%. Global H 0 true, R = 4 replicates, three categories. Test of H 0 : α 1 = α. $: Wald-Test, : LR-test. Overall ANOVA Threshold Paired model ANOVA θ 1 θ σ u σ f J I global $ α 1 = α $ global α 1 = α $ α 1 = α
57 5. Conclusion ANOVA can fail badly with ordinal data Threshold model well suited to analyse ordinal data Many experimental settings call for mixed models Threshold model can be extended to cover random effects Allows great flexibility in answering relevant research questions Maximum likelihood estimation computationally demanding Asymptotics work for large samples only Tests in particular need to be taken with a gain of salt More simulations needed
" M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2
Notation and Equations for Final Exam Symbol Definition X The variable we measure in a scientific study n The size of the sample N The size of the population M The mean of the sample µ The mean of the
More informationAnalysis of Covariance. The following example illustrates a case where the covariate is affected by the treatments.
Analysis of Covariance In some experiments, the experimental units (subjects) are nonhomogeneous or there is variation in the experimental conditions that are not due to the treatments. For example, a
More informationStructural Equation Modeling and Confirmatory Factor Analysis. Types of Variables
/4/04 Structural Equation Modeling and Confirmatory Factor Analysis Advanced Statistics for Researchers Session 3 Dr. Chris Rakes Website: http://csrakes.yolasite.com Email: Rakes@umbc.edu Twitter: @RakesChris
More informationExperimental Design and Data Analysis for Biologists
Experimental Design and Data Analysis for Biologists Gerry P. Quinn Monash University Michael J. Keough University of Melbourne CAMBRIDGE UNIVERSITY PRESS Contents Preface page xv I I Introduction 1 1.1
More informationOutline. The ordinal package: Analyzing ordinal data. Ordinal scales are commonly used (attribute rating or liking) Ordinal data the wine data
Outline Outline The ordinal package: Analyzing ordinal data Per Bruun Brockhoff DTU Compute Section for Statistics Technical University of Denmark perbb@dtu.dk August 0 015 c Per Bruun Brockhoff (DTU)
More informationIntroduction to Within-Person Analysis and RM ANOVA
Introduction to Within-Person Analysis and RM ANOVA Today s Class: From between-person to within-person ANOVAs for longitudinal data Variance model comparisons using 2 LL CLP 944: Lecture 3 1 The Two Sides
More informationBayes methods for categorical data. April 25, 2017
Bayes methods for categorical data April 25, 2017 Motivation for joint probability models Increasing interest in high-dimensional data in broad applications Focus may be on prediction, variable selection,
More informationModel Estimation Example
Ronald H. Heck 1 EDEP 606: Multivariate Methods (S2013) April 7, 2013 Model Estimation Example As we have moved through the course this semester, we have encountered the concept of model estimation. Discussions
More informationHierarchical Generalized Linear Models. ERSH 8990 REMS Seminar on HLM Last Lecture!
Hierarchical Generalized Linear Models ERSH 8990 REMS Seminar on HLM Last Lecture! Hierarchical Generalized Linear Models Introduction to generalized models Models for binary outcomes Interpreting parameter
More informationTextbook Examples of. SPSS Procedure
Textbook s of IBM SPSS Procedures Each SPSS procedure listed below has its own section in the textbook. These sections include a purpose statement that describes the statistical test, identification of
More informationSections 2.3, 2.4. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 770: Categorical Data Analysis 1 / 21
Sections 2.3, 2.4 Timothy Hanson Department of Statistics, University of South Carolina Stat 770: Categorical Data Analysis 1 / 21 2.3 Partial association in stratified 2 2 tables In describing a relationship
More informationSession 3 The proportional odds model and the Mann-Whitney test
Session 3 The proportional odds model and the Mann-Whitney test 3.1 A unified approach to inference 3.2 Analysis via dichotomisation 3.3 Proportional odds 3.4 Relationship with the Mann-Whitney test Session
More informationExtending the Robust Means Modeling Framework. Alyssa Counsell, Phil Chalmers, Matt Sigal, Rob Cribbie
Extending the Robust Means Modeling Framework Alyssa Counsell, Phil Chalmers, Matt Sigal, Rob Cribbie One-way Independent Subjects Design Model: Y ij = µ + τ j + ε ij, j = 1,, J Y ij = score of the ith
More informationCOMPLETELY RANDOM DESIGN (CRD) -Design can be used when experimental units are essentially homogeneous.
COMPLETELY RANDOM DESIGN (CRD) Description of the Design -Simplest design to use. -Design can be used when experimental units are essentially homogeneous. -Because of the homogeneity requirement, it may
More informationFuncICA for time series pattern discovery
FuncICA for time series pattern discovery Nishant Mehta and Alexander Gray Georgia Institute of Technology The problem Given a set of inherently continuous time series (e.g. EEG) Find a set of patterns
More informationMaximum Likelihood Estimation; Robust Maximum Likelihood; Missing Data with Maximum Likelihood
Maximum Likelihood Estimation; Robust Maximum Likelihood; Missing Data with Maximum Likelihood PRE 906: Structural Equation Modeling Lecture #3 February 4, 2015 PRE 906, SEM: Estimation Today s Class An
More informationRepeated Measures ANOVA Multivariate ANOVA and Their Relationship to Linear Mixed Models
Repeated Measures ANOVA Multivariate ANOVA and Their Relationship to Linear Mixed Models EPSY 905: Multivariate Analysis Spring 2016 Lecture #12 April 20, 2016 EPSY 905: RM ANOVA, MANOVA, and Mixed Models
More informationReview. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 770: Categorical Data Analysis
Review Timothy Hanson Department of Statistics, University of South Carolina Stat 770: Categorical Data Analysis 1 / 22 Chapter 1: background Nominal, ordinal, interval data. Distributions: Poisson, binomial,
More informationVARIANCE COMPONENT ANALYSIS
VARIANCE COMPONENT ANALYSIS T. KRISHNAN Cranes Software International Limited Mahatma Gandhi Road, Bangalore - 560 001 krishnan.t@systat.com 1. Introduction In an experiment to compare the yields of two
More informationChapter 2: Describing Contingency Tables - II
: Describing Contingency Tables - II Dipankar Bandyopadhyay Department of Biostatistics, Virginia Commonwealth University BIOS 625: Categorical Data & GLM [Acknowledgements to Tim Hanson and Haitao Chu]
More informationStat/F&W Ecol/Hort 572 Review Points Ané, Spring 2010
1 Linear models Y = Xβ + ɛ with ɛ N (0, σ 2 e) or Y N (Xβ, σ 2 e) where the model matrix X contains the information on predictors and β includes all coefficients (intercept, slope(s) etc.). 1. Number of
More informationDescribing Contingency tables
Today s topics: Describing Contingency tables 1. Probability structure for contingency tables (distributions, sensitivity/specificity, sampling schemes). 2. Comparing two proportions (relative risk, odds
More informationLinear models and their mathematical foundations: Simple linear regression
Linear models and their mathematical foundations: Simple linear regression Steffen Unkel Department of Medical Statistics University Medical Center Göttingen, Germany Winter term 2018/19 1/21 Introduction
More informationComparison between conditional and marginal maximum likelihood for a class of item response models
(1/24) Comparison between conditional and marginal maximum likelihood for a class of item response models Francesco Bartolucci, University of Perugia (IT) Silvia Bacci, University of Perugia (IT) Claudia
More informationTHE PRINCIPLES AND PRACTICE OF STATISTICS IN BIOLOGICAL RESEARCH. Robert R. SOKAL and F. James ROHLF. State University of New York at Stony Brook
BIOMETRY THE PRINCIPLES AND PRACTICE OF STATISTICS IN BIOLOGICAL RESEARCH THIRD E D I T I O N Robert R. SOKAL and F. James ROHLF State University of New York at Stony Brook W. H. FREEMAN AND COMPANY New
More informationDiscrete Multivariate Statistics
Discrete Multivariate Statistics Univariate Discrete Random variables Let X be a discrete random variable which, in this module, will be assumed to take a finite number of t different values which are
More information36-309/749 Experimental Design for Behavioral and Social Sciences. Dec 1, 2015 Lecture 11: Mixed Models (HLMs)
36-309/749 Experimental Design for Behavioral and Social Sciences Dec 1, 2015 Lecture 11: Mixed Models (HLMs) Independent Errors Assumption An error is the deviation of an individual observed outcome (DV)
More informationMixed Models for Longitudinal Ordinal and Nominal Outcomes
Mixed Models for Longitudinal Ordinal and Nominal Outcomes Don Hedeker Department of Public Health Sciences Biological Sciences Division University of Chicago hedeker@uchicago.edu Hedeker, D. (2008). Multilevel
More informationHypothesis Testing. Hypothesis: conjecture, proposition or statement based on published literature, data, or a theory that may or may not be true
Hypothesis esting Hypothesis: conjecture, proposition or statement based on published literature, data, or a theory that may or may not be true Statistical Hypothesis: conjecture about a population parameter
More informationGeneralized Linear Models for Non-Normal Data
Generalized Linear Models for Non-Normal Data Today s Class: 3 parts of a generalized model Models for binary outcomes Complications for generalized multivariate or multilevel models SPLH 861: Lecture
More informationSections 3.4, 3.5. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 770: Categorical Data Analysis
Sections 3.4, 3.5 Timothy Hanson Department of Statistics, University of South Carolina Stat 770: Categorical Data Analysis 1 / 22 3.4 I J tables with ordinal outcomes Tests that take advantage of ordinal
More informationParametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami
Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami Parametric Assumptions The observations must be independent. Dependent variable should be continuous
More informationSolution to Tutorial 7
1. (a) We first fit the independence model ST3241 Categorical Data Analysis I Semester II, 2012-2013 Solution to Tutorial 7 log µ ij = λ + λ X i + λ Y j, i = 1, 2, j = 1, 2. The parameter estimates are
More informationStatistics 203: Introduction to Regression and Analysis of Variance Course review
Statistics 203: Introduction to Regression and Analysis of Variance Course review Jonathan Taylor - p. 1/?? Today Review / overview of what we learned. - p. 2/?? General themes in regression models Specifying
More informationHypothesis testing:power, test statistic CMS:
Hypothesis testing:power, test statistic The more sensitive the test, the better it can discriminate between the null and the alternative hypothesis, quantitatively, maximal power In order to achieve this
More informationConfidence Intervals, Testing and ANOVA Summary
Confidence Intervals, Testing and ANOVA Summary 1 One Sample Tests 1.1 One Sample z test: Mean (σ known) Let X 1,, X n a r.s. from N(µ, σ) or n > 30. Let The test statistic is H 0 : µ = µ 0. z = x µ 0
More informationIntroduction to Analysis of Variance (ANOVA) Part 2
Introduction to Analysis of Variance (ANOVA) Part 2 Single factor Serpulid recruitment and biofilms Effect of biofilm type on number of recruiting serpulid worms in Port Phillip Bay Response variable:
More informationANALYSIS OF VARIANCE OF BALANCED DAIRY SCIENCE DATA USING SAS
ANALYSIS OF VARIANCE OF BALANCED DAIRY SCIENCE DATA USING SAS Ravinder Malhotra and Vipul Sharma National Dairy Research Institute, Karnal-132001 The most common use of statistics in dairy science is testing
More informationStatistics Toolbox 6. Apply statistical algorithms and probability models
Statistics Toolbox 6 Apply statistical algorithms and probability models Statistics Toolbox provides engineers, scientists, researchers, financial analysts, and statisticians with a comprehensive set of
More informationLogistic Regression. Continued Psy 524 Ainsworth
Logistic Regression Continued Psy 524 Ainsworth Equations Regression Equation Y e = 1 + A+ B X + B X + B X 1 1 2 2 3 3 i A+ B X + B X + B X e 1 1 2 2 3 3 Equations The linear part of the logistic regression
More informationScore test for random changepoint in a mixed model
Score test for random changepoint in a mixed model Corentin Segalas and Hélène Jacqmin-Gadda INSERM U1219, Biostatistics team, Bordeaux GDR Statistiques et Santé October 6, 2017 Biostatistics 1 / 27 Introduction
More informationR in Linguistic Analysis. Wassink 2012 University of Washington Week 6
R in Linguistic Analysis Wassink 2012 University of Washington Week 6 Overview R for phoneticians and lab phonologists Johnson 3 Reading Qs Equivalence of means (t-tests) Multiple Regression Principal
More informationExample. Multiple Regression. Review of ANOVA & Simple Regression /749 Experimental Design for Behavioral and Social Sciences
36-309/749 Experimental Design for Behavioral and Social Sciences Sep. 29, 2015 Lecture 5: Multiple Regression Review of ANOVA & Simple Regression Both Quantitative outcome Independent, Gaussian errors
More informationStandard Errors & Confidence Intervals. N(0, I( β) 1 ), I( β) = [ 2 l(β, φ; y) β i β β= β j
Standard Errors & Confidence Intervals β β asy N(0, I( β) 1 ), where I( β) = [ 2 l(β, φ; y) ] β i β β= β j We can obtain asymptotic 100(1 α)% confidence intervals for β j using: β j ± Z 1 α/2 se( β j )
More informationMARGINAL HOMOGENEITY MODEL FOR ORDERED CATEGORIES WITH OPEN ENDS IN SQUARE CONTINGENCY TABLES
REVSTAT Statistical Journal Volume 13, Number 3, November 2015, 233 243 MARGINAL HOMOGENEITY MODEL FOR ORDERED CATEGORIES WITH OPEN ENDS IN SQUARE CONTINGENCY TABLES Authors: Serpil Aktas Department of
More informationSample size determination for logistic regression: A simulation study
Sample size determination for logistic regression: A simulation study Stephen Bush School of Mathematical Sciences, University of Technology Sydney, PO Box 123 Broadway NSW 2007, Australia Abstract This
More informationNonparametric statistic methods. Waraphon Phimpraphai DVM, PhD Department of Veterinary Public Health
Nonparametric statistic methods Waraphon Phimpraphai DVM, PhD Department of Veterinary Public Health Measurement What are the 4 levels of measurement discussed? 1. Nominal or Classificatory Scale Gender,
More informationGeneralized Linear Models: An Introduction
Applied Statistics With R Generalized Linear Models: An Introduction John Fox WU Wien May/June 2006 2006 by John Fox Generalized Linear Models: An Introduction 1 A synthesis due to Nelder and Wedderburn,
More informationInference for the Regression Coefficient
Inference for the Regression Coefficient Recall, b 0 and b 1 are the estimates of the slope β 1 and intercept β 0 of population regression line. We can shows that b 0 and b 1 are the unbiased estimates
More informationA Study into Mechanisms of Attitudinal Scale Conversion: A Randomized Stochastic Ordering Approach
A Study into Mechanisms of Attitudinal Scale Conversion: A Randomized Stochastic Ordering Approach Zvi Gilula (Hebrew University) Robert McCulloch (Arizona State) Ya acov Ritov (University of Michigan)
More informationUNIVERSITY OF TORONTO Faculty of Arts and Science
UNIVERSITY OF TORONTO Faculty of Arts and Science December 2013 Final Examination STA442H1F/2101HF Methods of Applied Statistics Jerry Brunner Duration - 3 hours Aids: Calculator Model(s): Any calculator
More informationMissing values imputation for mixed data based on principal component methods
Missing values imputation for mixed data based on principal component methods Vincent Audigier, François Husson & Julie Josse Agrocampus Rennes Compstat' 2012, Limassol (Cyprus), 28-08-2012 1 / 21 A real
More informationPAIRED COMPARISONS MODELS AND APPLICATIONS. Regina Dittrich Reinhold Hatzinger Walter Katzenbeisser
PAIRED COMPARISONS MODELS AND APPLICATIONS Regina Dittrich Reinhold Hatzinger Walter Katzenbeisser PAIRED COMPARISONS (Dittrich, Hatzinger, Katzenbeisser) WU Wien 6.11.2003 1 PAIRED COMPARISONS (PC) a
More informationPLSC PRACTICE TEST ONE
PLSC 724 - PRACTICE TEST ONE 1. Discuss briefly the relationship between the shape of the normal curve and the variance. 2. What is the relationship between a statistic and a parameter? 3. How is the α
More informationAnalysing geoadditive regression data: a mixed model approach
Analysing geoadditive regression data: a mixed model approach Institut für Statistik, Ludwig-Maximilians-Universität München Joint work with Ludwig Fahrmeir & Stefan Lang 25.11.2005 Spatio-temporal regression
More informationExam details. Final Review Session. Things to Review
Exam details Final Review Session Short answer, similar to book problems Formulae and tables will be given You CAN use a calculator Date and Time: Dec. 7, 006, 1-1:30 pm Location: Osborne Centre, Unit
More informationST3241 Categorical Data Analysis I Multicategory Logit Models. Logit Models For Nominal Responses
ST3241 Categorical Data Analysis I Multicategory Logit Models Logit Models For Nominal Responses 1 Models For Nominal Responses Y is nominal with J categories. Let {π 1,, π J } denote the response probabilities
More informationBivariate Relationships Between Variables
Bivariate Relationships Between Variables BUS 735: Business Decision Making and Research 1 Goals Specific goals: Detect relationships between variables. Be able to prescribe appropriate statistical methods
More informationDESIGNING EXPERIMENTS AND ANALYZING DATA A Model Comparison Perspective
DESIGNING EXPERIMENTS AND ANALYZING DATA A Model Comparison Perspective Second Edition Scott E. Maxwell Uniuersity of Notre Dame Harold D. Delaney Uniuersity of New Mexico J,t{,.?; LAWRENCE ERLBAUM ASSOCIATES,
More informationData-analysis and Retrieval Ordinal Classification
Data-analysis and Retrieval Ordinal Classification Ad Feelders Universiteit Utrecht Data-analysis and Retrieval 1 / 30 Strongly disagree Ordinal Classification 1 2 3 4 5 0% (0) 10.5% (2) 21.1% (4) 42.1%
More informationthe long tau-path for detecting monotone association in an unspecified subpopulation
the long tau-path for detecting monotone association in an unspecified subpopulation Joe Verducci Current Challenges in Statistical Learning Workshop Banff International Research Station Tuesday, December
More informationReview of CLDP 944: Multilevel Models for Longitudinal Data
Review of CLDP 944: Multilevel Models for Longitudinal Data Topics: Review of general MLM concepts and terminology Model comparisons and significance testing Fixed and random effects of time Significance
More informationCHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007)
FROM: PAGANO, R. R. (007) I. INTRODUCTION: DISTINCTION BETWEEN PARAMETRIC AND NON-PARAMETRIC TESTS Statistical inference tests are often classified as to whether they are parametric or nonparametric Parameter
More informationCategorical Data Analysis Chapter 3
Categorical Data Analysis Chapter 3 The actual coverage probability is usually a bit higher than the nominal level. Confidence intervals for association parameteres Consider the odds ratio in the 2x2 table,
More informationIntroduction to the Analysis of Variance (ANOVA) Computing One-Way Independent Measures (Between Subjects) ANOVAs
Introduction to the Analysis of Variance (ANOVA) Computing One-Way Independent Measures (Between Subjects) ANOVAs The Analysis of Variance (ANOVA) The analysis of variance (ANOVA) is a statistical technique
More informationCentral Limit Theorem ( 5.3)
Central Limit Theorem ( 5.3) Let X 1, X 2,... be a sequence of independent random variables, each having n mean µ and variance σ 2. Then the distribution of the partial sum S n = X i i=1 becomes approximately
More informationStatistical. Psychology
SEVENTH у *i km m it* & П SB Й EDITION Statistical M e t h o d s for Psychology D a v i d C. Howell University of Vermont ; \ WADSWORTH f% CENGAGE Learning* Australia Biaall apan Korea Меяко Singapore
More informationStat 5101 Lecture Notes
Stat 5101 Lecture Notes Charles J. Geyer Copyright 1998, 1999, 2000, 2001 by Charles J. Geyer May 7, 2001 ii Stat 5101 (Geyer) Course Notes Contents 1 Random Variables and Change of Variables 1 1.1 Random
More informationJoint Factor Analysis for Speaker Verification
Joint Factor Analysis for Speaker Verification Mengke HU ASPITRG Group, ECE Department Drexel University mengke.hu@gmail.com October 12, 2012 1/37 Outline 1 Speaker Verification Baseline System Session
More informationsimple if it completely specifies the density of x
3. Hypothesis Testing Pure significance tests Data x = (x 1,..., x n ) from f(x, θ) Hypothesis H 0 : restricts f(x, θ) Are the data consistent with H 0? H 0 is called the null hypothesis simple if it completely
More informationImpact of serial correlation structures on random effect misspecification with the linear mixed model.
Impact of serial correlation structures on random effect misspecification with the linear mixed model. Brandon LeBeau University of Iowa file:///c:/users/bleb/onedrive%20 %20University%20of%20Iowa%201/JournalArticlesInProgress/Diss/Study2/Pres/pres.html#(2)
More information1. How will an increase in the sample size affect the width of the confidence interval?
Study Guide Concept Questions 1. How will an increase in the sample size affect the width of the confidence interval? 2. How will an increase in the sample size affect the power of a statistical test?
More information1 One-way analysis of variance
LIST OF FORMULAS (Version from 21. November 2014) STK2120 1 One-way analysis of variance Assume X ij = µ+α i +ɛ ij ; j = 1, 2,..., J i ; i = 1, 2,..., I ; where ɛ ij -s are independent and N(0, σ 2 ) distributed.
More informationA Diagnostic for Seasonality Based Upon Autoregressive Roots
A Diagnostic for Seasonality Based Upon Autoregressive Roots Tucker McElroy (U.S. Census Bureau) 2018 Seasonal Adjustment Practitioners Workshop April 26, 2018 1 / 33 Disclaimer This presentation is released
More informationThe Multinomial Model
The Multinomial Model STA 312: Fall 2012 Contents 1 Multinomial Coefficients 1 2 Multinomial Distribution 2 3 Estimation 4 4 Hypothesis tests 8 5 Power 17 1 Multinomial Coefficients Multinomial coefficient
More informationChapter 12. Analysis of variance
Serik Sagitov, Chalmers and GU, January 9, 016 Chapter 1. Analysis of variance Chapter 11: I = samples independent samples paired samples Chapter 1: I 3 samples of equal size J one-way layout two-way layout
More informationStatistical Tests for Parameterized Multinomial Models: Power Approximation and Optimization. Edgar Erdfelder University of Mannheim Germany
Statistical Tests for Parameterized Multinomial Models: Power Approximation and Optimization Edgar Erdfelder University of Mannheim Germany The Problem Typically, model applications start with a global
More informationNATIONAL UNIVERSITY OF SINGAPORE EXAMINATION (SOLUTIONS) ST3241 Categorical Data Analysis. (Semester II: )
NATIONAL UNIVERSITY OF SINGAPORE EXAMINATION (SOLUTIONS) Categorical Data Analysis (Semester II: 2010 2011) April/May, 2011 Time Allowed : 2 Hours Matriculation No: Seat No: Grade Table Question 1 2 3
More informationReadings Howitt & Cramer (2014) Overview
Readings Howitt & Cramer (4) Ch 7: Relationships between two or more variables: Diagrams and tables Ch 8: Correlation coefficients: Pearson correlation and Spearman s rho Ch : Statistical significance
More informationCourse topics (tentative) The role of random effects
Course topics (tentative) random effects linear mixed models analysis of variance frequentist likelihood-based inference (MLE and REML) prediction Bayesian inference The role of random effects Rasmus Waagepetersen
More informationHANDBOOK OF APPLICABLE MATHEMATICS
HANDBOOK OF APPLICABLE MATHEMATICS Chief Editor: Walter Ledermann Volume VI: Statistics PART A Edited by Emlyn Lloyd University of Lancaster A Wiley-Interscience Publication JOHN WILEY & SONS Chichester
More informationAnalysis of Variance. Read Chapter 14 and Sections to review one-way ANOVA.
Analysis of Variance Read Chapter 14 and Sections 15.1-15.2 to review one-way ANOVA. Design of an experiment the process of planning an experiment to insure that an appropriate analysis is possible. Some
More informationRelated Concepts: Lecture 9 SEM, Statistical Modeling, AI, and Data Mining. I. Terminology of SEM
Lecture 9 SEM, Statistical Modeling, AI, and Data Mining I. Terminology of SEM Related Concepts: Causal Modeling Path Analysis Structural Equation Modeling Latent variables (Factors measurable, but thru
More informationSTAT 525 Fall Final exam. Tuesday December 14, 2010
STAT 525 Fall 2010 Final exam Tuesday December 14, 2010 Time: 2 hours Name (please print): Show all your work and calculations. Partial credit will be given for work that is partially correct. Points will
More informationLec 3: Model Adequacy Checking
November 16, 2011 Model validation Model validation is a very important step in the model building procedure. (one of the most overlooked) A high R 2 value does not guarantee that the model fits the data
More informationA general mixed model approach for spatio-temporal regression data
A general mixed model approach for spatio-temporal regression data Thomas Kneib, Ludwig Fahrmeir & Stefan Lang Department of Statistics, Ludwig-Maximilians-University Munich 1. Spatio-temporal regression
More informationSequential Procedure for Testing Hypothesis about Mean of Latent Gaussian Process
Applied Mathematical Sciences, Vol. 4, 2010, no. 62, 3083-3093 Sequential Procedure for Testing Hypothesis about Mean of Latent Gaussian Process Julia Bondarenko Helmut-Schmidt University Hamburg University
More informationST3241 Categorical Data Analysis I Generalized Linear Models. Introduction and Some Examples
ST3241 Categorical Data Analysis I Generalized Linear Models Introduction and Some Examples 1 Introduction We have discussed methods for analyzing associations in two-way and three-way tables. Now we will
More informationGeneralized Linear Models (1/29/13)
STA613/CBB540: Statistical methods in computational biology Generalized Linear Models (1/29/13) Lecturer: Barbara Engelhardt Scribe: Yangxiaolu Cao When processing discrete data, two commonly used probability
More information3. (a) (8 points) There is more than one way to correctly express the null hypothesis in matrix form. One way to state the null hypothesis is
Stat 501 Solutions and Comments on Exam 1 Spring 005-4 0-4 1. (a) (5 points) Y ~ N, -1-4 34 (b) (5 points) X (X,X ) = (5,8) ~ N ( 11.5, 0.9375 ) 3 1 (c) (10 points, for each part) (i), (ii), and (v) are
More informationReadings Howitt & Cramer (2014)
Readings Howitt & Cramer (014) Ch 7: Relationships between two or more variables: Diagrams and tables Ch 8: Correlation coefficients: Pearson correlation and Spearman s rho Ch 11: Statistical significance
More informationYu Xie, Institute for Social Research, 426 Thompson Street, University of Michigan, Ann
Association Model, Page 1 Yu Xie, Institute for Social Research, 426 Thompson Street, University of Michigan, Ann Arbor, MI 48106. Email: yuxie@umich.edu. Tel: (734)936-0039. Fax: (734)998-7415. Association
More informationNonparametric Independence Tests
Nonparametric Independence Tests Nathaniel E. Helwig Assistant Professor of Psychology and Statistics University of Minnesota (Twin Cities) Updated 04-Jan-2017 Nathaniel E. Helwig (U of Minnesota) Nonparametric
More informationChristine Borgen Linander 1,, Rune Haubo Bojesen Christensen 1, Rebecca Evans 2, Graham Cleaver 2, Per Bruun Brockhoff 1. University of Denmark
Individual differences in replicated multi-product 2-AFC data with and without supplementary difference scoring: Comparing Thurstonian mixed regression models for binary and ordinal data with linear mixed
More informationPsychology 282 Lecture #4 Outline Inferences in SLR
Psychology 282 Lecture #4 Outline Inferences in SLR Assumptions To this point we have not had to make any distributional assumptions. Principle of least squares requires no assumptions. Can use correlations
More informationLikelihood-based inference with missing data under missing-at-random
Likelihood-based inference with missing data under missing-at-random Jae-kwang Kim Joint work with Shu Yang Department of Statistics, Iowa State University May 4, 014 Outline 1. Introduction. Parametric
More informationStat 217 Final Exam. Name: May 1, 2002
Stat 217 Final Exam Name: May 1, 2002 Problem 1. Three brands of batteries are under study. It is suspected that the lives (in weeks) of the three brands are different. Five batteries of each brand are
More informationTime-Invariant Predictors in Longitudinal Models
Time-Invariant Predictors in Longitudinal Models Today s Class (or 3): Summary of steps in building unconditional models for time What happens to missing predictors Effects of time-invariant predictors
More informationGeneralized Linear. Mixed Models. Methods and Applications. Modern Concepts, Walter W. Stroup. Texts in Statistical Science.
Texts in Statistical Science Generalized Linear Mixed Models Modern Concepts, Methods and Applications Walter W. Stroup CRC Press Taylor & Francis Croup Boca Raton London New York CRC Press is an imprint
More informationResearch Article A Nonparametric Two-Sample Wald Test of Equality of Variances
Advances in Decision Sciences Volume 211, Article ID 74858, 8 pages doi:1.1155/211/74858 Research Article A Nonparametric Two-Sample Wald Test of Equality of Variances David Allingham 1 andj.c.w.rayner
More information