The Use of Quadratic Form Statistics of Residuals to Identify IRT Model Misfit in Marginal Subtables
|
|
- Poppy Cox
- 5 years ago
- Views:
Transcription
1 The Use of Quadratic Form Statistics of Residuals to Identify IRT Model Misfit in Marginal Subtables 1 Yang Liu Alberto Maydeu-Olivares 1 Department of Psychology, The University of North Carolina at Chapel Hill Department of Psychology, University of Barcelona IMPS 01 Lincoln, NE 1/0
2 /0 Acknowledgment Dr. David Thissen Other folks from the Thurstone lab
3 Overview: goodness-of-fit test of IRT model specification parameter estimation overall GOF test reject H 0 fail to reject H 0. remove items change model... 3/0
4 Overview: goodness-of-fit test of IRT model specification parameter estimation overall GOF test reject H 0 fail to reject H 0 marginal subtable GOF test remove items change model... 3/0
5 4/0 Marginal subtable GOF test Consider A, a subset of r items (e.g. r = for pairs, 3 for triplets) Principles of testing marginal fit 1 Do not want to reject well-fitting items r should not be too large, usually r 3 Null hypothesis H 0 : π A = π A (θ A ) π A : probabilities of cells in subtable A θ A : item parameters in A Use test statistics based on residuals ê A := p A π A (ˆθ A ) p A : observed proportion π A (ˆθ A ): model-implied probability ˆθA : some estimator of θ A
6 4/0 Marginal subtable GOF test Consider A, a subset of r items (e.g. r = for pairs, 3 for triplets) Principles of testing marginal fit 1 Do not want to reject well-fitting items r should not be too large, usually r 3 Null hypothesis H 0 : π A = π A (θ A ) π A : probabilities of cells in subtable A θ A : item parameters in A Use test statistics based on residuals ê A := p A π A (ˆθ A ) p A : observed proportion π A (ˆθ A ): model-implied probability ˆθA : some estimator of θ A
7 4/0 Marginal subtable GOF test Consider A, a subset of r items (e.g. r = for pairs, 3 for triplets) Principles of testing marginal fit 1 Do not want to reject well-fitting items r should not be too large, usually r 3 Null hypothesis H 0 : π A = π A (θ A ) π A : probabilities of cells in subtable A θ A : item parameters in A Use test statistics based on residuals ê A := p A π A (ˆθ A ) p A : observed proportion π A (ˆθ A ): model-implied probability ˆθA : some estimator of θ A
8 4/0 Marginal subtable GOF test Consider A, a subset of r items (e.g. r = for pairs, 3 for triplets) Principles of testing marginal fit 1 Do not want to reject well-fitting items r should not be too large, usually r 3 Null hypothesis H 0 : π A = π A (θ A ) π A : probabilities of cells in subtable A θ A : item parameters in A Use test statistics based on residuals ê A := p A π A (ˆθ A ) p A : observed proportion π A (ˆθ A ): model-implied probability ˆθA : some estimator of θ A
9 5/0 Asymptotic tests based on residuals Proposition 1 (Maydeu-Olivares & Liu, 01): Let ˆθ A be the full table MLE corresponding to item subset A, under H 0 : N êa d N (0, ΣA ) where Σ A = Γ A A I 1 [A] A, Γ A = D A π A (θ A )π A (θ A ), D A = Diag(π A (θ A )), A = π A / θ A, I [A] is the block of Fisher information matrix I corresponding to θ A. Standardized residuals (Reiser, 1996; Maydeu-Olivares & Joe, 006) Quadratic form statistics of residuals
10 5/0 Asymptotic tests based on residuals Proposition 1 (Maydeu-Olivares & Liu, 01): Let ˆθ A be the full table MLE corresponding to item subset A, under H 0 : N êa d N (0, ΣA ) where Σ A = Γ A A I 1 [A] A, Γ A = D A π A (θ A )π A (θ A ), D A = Diag(π A (θ A )), A = π A / θ A, I [A] is the block of Fisher information matrix I corresponding to θ A. Standardized residuals (Reiser, 1996; Maydeu-Olivares & Joe, 006) Quadratic form statistics of residuals
11 5/0 Asymptotic tests based on residuals Proposition 1 (Maydeu-Olivares & Liu, 01): Let ˆθ A be the full table MLE corresponding to item subset A, under H 0 : N êa d N (0, ΣA ) where Σ A = Γ A A I 1 [A] A, Γ A = D A π A (θ A )π A (θ A ), D A = Diag(π A (θ A )), A = π A / θ A, I [A] is the block of Fisher information matrix I corresponding to θ A. Standardized residuals (Reiser, 1996; Maydeu-Olivares & Joe, 006) Quadratic form statistics of residuals
12 6/0 Quadratic form statistics of residuals 1 Pearson s X XA = N ê 1 A ˆD A êa W statistic (Reiser, 1996) W A = N ê A ˆΣ + AêA where ˆΣ + A is the Moore-Penrose pseudoinverse of ˆΣ A 3 M n statistic (Maydeu-Olivares & Joe, 005, 006) M A = N ê AÛAê A where ÛA = ˆD 1 A ˆD 1 A ˆ A ( ˆ A ˆD 1 A ˆ A ) 1 ˆ A ˆD 1 A
13 Asymptotic distributions under H 0 Proposition (Cochran, 1934; Box, 1954; Tan, 1977): Let p-dimensional random vector z N (0, Ω), and W is any p p real symmetric matrix, then z Wz p λ i χ 1 i=1 where each chi-squared random variable is distributed independently of every other, and λ i s are eigenvalues of matrix ΩW. 1 W A d χ s where s is the rank of Σ A (use ˆΣ A in practice) M A d χ t where t = dim(π A ) dim(θ A ) 1 3 X A d λ i χ 1, where λ i s are eigenvalues for matrix Σ A D 1 A (use ˆΣ A ˆD 1 A in practice) 7/0
14 Asymptotic distributions under H 0 Proposition (Cochran, 1934; Box, 1954; Tan, 1977): Let p-dimensional random vector z N (0, Ω), and W is any p p real symmetric matrix, then z Wz p λ i χ 1 i=1 where each chi-squared random variable is distributed independently of every other, and λ i s are eigenvalues of matrix ΩW. 1 W A d χ s where s is the rank of Σ A (use ˆΣ A in practice) M A d χ t where t = dim(π A ) dim(θ A ) 1 3 X A d λ i χ 1, where λ i s are eigenvalues for matrix Σ A D 1 A (use ˆΣ A ˆD 1 A in practice) 7/0
15 Asymptotic distributions under H 0 Proposition (Cochran, 1934; Box, 1954; Tan, 1977): Let p-dimensional random vector z N (0, Ω), and W is any p p real symmetric matrix, then z Wz p λ i χ 1 i=1 where each chi-squared random variable is distributed independently of every other, and λ i s are eigenvalues of matrix ΩW. 1 W A d χ s where s is the rank of Σ A (use ˆΣ A in practice) M A d χ t where t = dim(π A ) dim(θ A ) 1 3 X A d λ i χ 1, where λ i s are eigenvalues for matrix Σ A D 1 A (use ˆΣ A ˆD 1 A in practice) 7/0
16 Asymptotic distributions under H 0 Proposition (Cochran, 1934; Box, 1954; Tan, 1977): Let p-dimensional random vector z N (0, Ω), and W is any p p real symmetric matrix, then z Wz p λ i χ 1 i=1 where each chi-squared random variable is distributed independently of every other, and λ i s are eigenvalues of matrix ΩW. 1 W A d χ s where s is the rank of Σ A (use ˆΣ A in practice) M A d χ t where t = dim(π A ) dim(θ A ) 1 3 X A d λ i χ 1, where λ i s are eigenvalues for matrix Σ A D 1 A (use ˆΣ A ˆD 1 A in practice) 7/0
17 8/0 Computing asymptotic p-values 1 W statistic (requires Î) In general we do not know the rank of Σ A use the rank of ˆΣ A set some cutoff for zero EV (e.g ) M n statistic (does not require Î) Model should be identified in the subtable (dim(θ A ) < dim(π A ) 1) When r =, graded model is identified, PL is not 3 X statistic Computing/approximating the tail probabilities (requires Î) Satterthwaite s approximation (Satterthwaite, 1941) Inversion method (Imhof, 1961) Heuristical chi-squared reference distributions (does not require Î) Subtable MLE: d.f. = dim(π A ) dim(θ A ) 1 Chen-Thissen (1997): d.f. of independence model test
18 8/0 Computing asymptotic p-values 1 W statistic (requires Î) In general we do not know the rank of Σ A use the rank of ˆΣ A set some cutoff for zero EV (e.g ) M n statistic (does not require Î) Model should be identified in the subtable (dim(θ A ) < dim(π A ) 1) When r =, graded model is identified, PL is not 3 X statistic Computing/approximating the tail probabilities (requires Î) Satterthwaite s approximation (Satterthwaite, 1941) Inversion method (Imhof, 1961) Heuristical chi-squared reference distributions (does not require Î) Subtable MLE: d.f. = dim(π A ) dim(θ A ) 1 Chen-Thissen (1997): d.f. of independence model test
19 8/0 Computing asymptotic p-values 1 W statistic (requires Î) In general we do not know the rank of Σ A use the rank of ˆΣ A set some cutoff for zero EV (e.g ) M n statistic (does not require Î) Model should be identified in the subtable (dim(θ A ) < dim(π A ) 1) When r =, graded model is identified, PL is not 3 X statistic Computing/approximating the tail probabilities (requires Î) Satterthwaite s approximation (Satterthwaite, 1941) Inversion method (Imhof, 1961) Heuristical chi-squared reference distributions (does not require Î) Subtable MLE: d.f. = dim(π A ) dim(θ A ) 1 Chen-Thissen (1997): d.f. of independence model test
20 8/0 Computing asymptotic p-values 1 W statistic (requires Î) In general we do not know the rank of Σ A use the rank of ˆΣ A set some cutoff for zero EV (e.g ) M n statistic (does not require Î) Model should be identified in the subtable (dim(θ A ) < dim(π A ) 1) When r =, graded model is identified, PL is not 3 X statistic Computing/approximating the tail probabilities (requires Î) Satterthwaite s approximation (Satterthwaite, 1941) Inversion method (Imhof, 1961) Heuristical chi-squared reference distributions (does not require Î) Subtable MLE: d.f. = dim(π A ) dim(θ A ) 1 Chen-Thissen (1997): d.f. of independence model test
21 Simulation A: dichotomous items 9 items, fitted model is always PL, 1000 reps for each subcondition Condition A1: PL model Slopes β = (.7, 1.67, 1.8,.7, 1.67, 1.8) Intercepts α = (1.4, 0, 1.06, 1.4, 0, 1.06, 1.4, 0, 1.06) Condition A: Mixture model LV 1 N (0, 1) + 1 N (3, 1) Condition A3: 3-dimensional (independent cluster) model LVs N 0, β = /0
22 Simulation A: dichotomous items 9 items, fitted model is always PL, 1000 reps for each subcondition Condition A1: PL model Slopes β = (.7, 1.67, 1.8,.7, 1.67, 1.8) Intercepts α = (1.4, 0, 1.06, 1.4, 0, 1.06, 1.4, 0, 1.06) Condition A: Mixture model LV 1 N (0, 1) + 1 N (3, 1) Condition A3: 3-dimensional (independent cluster) model LVs N 0, β = /0
23 Simulation A: dichotomous items 9 items, fitted model is always PL, 1000 reps for each subcondition Condition A1: PL model Slopes β = (.7, 1.67, 1.8,.7, 1.67, 1.8) Intercepts α = (1.4, 0, 1.06, 1.4, 0, 1.06, 1.4, 0, 1.06) Condition A: Mixture model LV 1 N (0, 1) + 1 N (3, 1) Condition A3: 3-dimensional (independent cluster) model LVs N 0, β = /0
24 Simulation A: Statistics Subtable Statistics Reference distribution Pair W (jk) χ s ; s = # EV of ˆΣ (jk) 10 5 (j, k) X(jk) Chen-Thissen χ 1 Satterthwaite s approximation Inversion method Triplet W (jkl) χ s ; s = # EV of ˆΣ (jkl) 10 5 (j, k, l) M (jkl) χ 1 X(jkl) Subtable MLE χ 1 Chen-Thissen χ 4 Satterthwaite s approximation Inversion method 10/0
25 11/0 Result A1: PL (true), PL (fitted) Pair (, 3) Triplet (1,, 3) Rejection rate W(3) X (3) (Chen Thissen) X (3) (Satterthwaite) X (3) (Inversion) Rejection rate W(13) M(13) X (13) (Subtable MLE) X (13) (Chen Thissen) X (13) (Satterthwaite) X (13) (Inversion) α level α level
26 1/0 Result A: Mixture (true), PL (fitted) Pair (, 3) Triplet (1,, 3) Rejection rate W(3) X (3) (Satterthwaite) X (3) (Inversion) Rejection rate W(13) M(13) X (13) (Satterthwaite) X (13) (Inversion) α level α level
27 13/0 Result A3: 3-dim (true), PL (fitted) Pair (, 3) Triplet (1,, 3) Rejection rate W(3) X (3) (Satterthwaite) X (3) (Inversion) Rejection rate W(13) M(13) X (13) (Satterthwaite) X (13) (Inversion) α level α level
28 14/0 Condition B: 3-category items 6 items, fitted model is always graded, 1000 reps for each subcondition Condition B1: Graded model Condition B: Mixture model β = (.7, 1.67, 1.8,.7, 1.67, 1.8) [ α = LV 1 N (0, 1) + 1 N (3, 1) Condition B3: -dimensional (independent cluster) model ([ ] [ ]) LVs N, [ β = ] ]
29 14/0 Condition B: 3-category items 6 items, fitted model is always graded, 1000 reps for each subcondition Condition B1: Graded model Condition B: Mixture model β = (.7, 1.67, 1.8,.7, 1.67, 1.8) [ α = LV 1 N (0, 1) + 1 N (3, 1) Condition B3: -dimensional (independent cluster) model ([ ] [ ]) LVs N, [ β = ] ]
30 14/0 Condition B: 3-category items 6 items, fitted model is always graded, 1000 reps for each subcondition Condition B1: Graded model Condition B: Mixture model β = (.7, 1.67, 1.8,.7, 1.67, 1.8) [ α = LV 1 N (0, 1) + 1 N (3, 1) Condition B3: -dimensional (independent cluster) model ([ ] [ ]) LVs N, [ β = ] ]
31 Condition B: Statistics Subtable Statistics Reference distribution Pairs W (jk) χ s ; s = # EV of ˆΣ (jk) 10 5 (j, k) M (jk) χ X(jk) Subtable MLE χ Chen-Thissen χ 4 Satterthwaite s approximation Inversion method 15/0
32 16/0 Result B1: graded (true), graded (fitted) Pair (1, ) Pair (, 3) Rejection rate W(1) M(1) X (1) (Subtable MLE) X (1) (Chen Thissen) X (1) (Satterthwaite) X (1) (Inversion) Rejection rate W(3) M(3) X (3) (Subtable MLE) X (3) (Chen Thissen) X (3) (Satterthwaite) X (3) (Inversion) α level α level
33 17/0 Result B: mixture (true), graded (fitted) Pair (1, ) Pair (, 3) Rejection rate W(1) M(1) X (1) (Satterthwaite) X (1) (Inversion) Rejection rate W(3) M(3) X (3) (Satterthwaite) X (3) (Inversion) α level α level
34 18/0 Result B3: -dim (true), graded (fitted) Pair (1, ) Pair (, 3) Rejection rate W(1) M(1) X (1) (Satterthwaite) X (1) (Inversion) Rejection rate W(3) M(3) X (3) (Satterthwaite) X (3) (Inversion) α level α level
35 19/0 Concluding remarks When the model is correctly specified Empirical distributions of W and M n closely match their asymptotic reference distributions Heuristical chi-squared reference distributions for X works poorly p-values based on the correct asymptotic distribution of X works well; the Satterthwaite s approximation works as well as the inversion method Among W, M n and X, we suggest the use of W statistics for pairs (and triplets, if necessary) which always have high power Future direction Long tests where exact Fisher information cannot be computed: Use OPG (a.k.a. XPD) estimator of information ÎOPG = ˆ Ŵ ˆ where Ŵ = Diag{p c /πc} C c=1
36 19/0 Concluding remarks When the model is correctly specified Empirical distributions of W and M n closely match their asymptotic reference distributions Heuristical chi-squared reference distributions for X works poorly p-values based on the correct asymptotic distribution of X works well; the Satterthwaite s approximation works as well as the inversion method Among W, M n and X, we suggest the use of W statistics for pairs (and triplets, if necessary) which always have high power Future direction Long tests where exact Fisher information cannot be computed: Use OPG (a.k.a. XPD) estimator of information ÎOPG = ˆ Ŵ ˆ where Ŵ = Diag{p c /πc} C c=1
37 19/0 Concluding remarks When the model is correctly specified Empirical distributions of W and M n closely match their asymptotic reference distributions Heuristical chi-squared reference distributions for X works poorly p-values based on the correct asymptotic distribution of X works well; the Satterthwaite s approximation works as well as the inversion method Among W, M n and X, we suggest the use of W statistics for pairs (and triplets, if necessary) which always have high power Future direction Long tests where exact Fisher information cannot be computed: Use OPG (a.k.a. XPD) estimator of information ÎOPG = ˆ Ŵ ˆ where Ŵ = Diag{p c /πc} C c=1
38 19/0 Concluding remarks When the model is correctly specified Empirical distributions of W and M n closely match their asymptotic reference distributions Heuristical chi-squared reference distributions for X works poorly p-values based on the correct asymptotic distribution of X works well; the Satterthwaite s approximation works as well as the inversion method Among W, M n and X, we suggest the use of W statistics for pairs (and triplets, if necessary) which always have high power Future direction Long tests where exact Fisher information cannot be computed: Use OPG (a.k.a. XPD) estimator of information ÎOPG = ˆ Ŵ ˆ where Ŵ = Diag{p c /πc} C c=1
39 19/0 Concluding remarks When the model is correctly specified Empirical distributions of W and M n closely match their asymptotic reference distributions Heuristical chi-squared reference distributions for X works poorly p-values based on the correct asymptotic distribution of X works well; the Satterthwaite s approximation works as well as the inversion method Among W, M n and X, we suggest the use of W statistics for pairs (and triplets, if necessary) which always have high power Future direction Long tests where exact Fisher information cannot be computed: Use OPG (a.k.a. XPD) estimator of information ÎOPG = ˆ Ŵ ˆ where Ŵ = Diag{p c /πc} C c=1
40 Thanks! 0/0
Local Dependence Diagnostics in IRT Modeling of Binary Data
Local Dependence Diagnostics in IRT Modeling of Binary Data Educational and Psychological Measurement 73(2) 254 274 Ó The Author(s) 2012 Reprints and permission: sagepub.com/journalspermissions.nav DOI:
More informationA Note on a Tucker-Lewis-Index for Item Response Theory Models. Taehun Lee, Li Cai University of California, Los Angeles
A Note on a Tucker-Lewis-Index for Item Response Theory Models Taehun Lee, Li Cai University of California, Los Angeles 1 Item Response Theory (IRT) Models IRT is a family of statistical models used to
More informationPsychology 282 Lecture #4 Outline Inferences in SLR
Psychology 282 Lecture #4 Outline Inferences in SLR Assumptions To this point we have not had to make any distributional assumptions. Principle of least squares requires no assumptions. Can use correlations
More informationCentral Limit Theorem ( 5.3)
Central Limit Theorem ( 5.3) Let X 1, X 2,... be a sequence of independent random variables, each having n mean µ and variance σ 2. Then the distribution of the partial sum S n = X i i=1 becomes approximately
More informationCan you tell the relationship between students SAT scores and their college grades?
Correlation One Challenge Can you tell the relationship between students SAT scores and their college grades? A: The higher SAT scores are, the better GPA may be. B: The higher SAT scores are, the lower
More informationSTAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots. March 8, 2015
STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots March 8, 2015 The duality between CI and hypothesis testing The duality between CI and hypothesis
More informationLecture 3. Inference about multivariate normal distribution
Lecture 3. Inference about multivariate normal distribution 3.1 Point and Interval Estimation Let X 1,..., X n be i.i.d. N p (µ, Σ). We are interested in evaluation of the maximum likelihood estimates
More informationAssessment of fit of item response theory models used in large scale educational survey assessments
DOI 10.1186/s40536-016-0025-3 RESEARCH Open Access Assessment of fit of item response theory models used in large scale educational survey assessments Peter W. van Rijn 1, Sandip Sinharay 2*, Shelby J.
More informationItem Response Theory (IRT) Analysis of Item Sets
University of Connecticut DigitalCommons@UConn NERA Conference Proceedings 2011 Northeastern Educational Research Association (NERA) Annual Conference Fall 10-21-2011 Item Response Theory (IRT) Analysis
More informationAn Equivalency Test for Model Fit. Craig S. Wells. University of Massachusetts Amherst. James. A. Wollack. Ronald C. Serlin
Equivalency Test for Model Fit 1 Running head: EQUIVALENCY TEST FOR MODEL FIT An Equivalency Test for Model Fit Craig S. Wells University of Massachusetts Amherst James. A. Wollack Ronald C. Serlin University
More informationHYPOTHESIS TESTING: THE CHI-SQUARE STATISTIC
1 HYPOTHESIS TESTING: THE CHI-SQUARE STATISTIC 7 steps of Hypothesis Testing 1. State the hypotheses 2. Identify level of significant 3. Identify the critical values 4. Calculate test statistics 5. Compare
More informationTHE PEARSON CORRELATION COEFFICIENT
CORRELATION Two variables are said to have a relation if knowing the value of one variable gives you information about the likely value of the second variable this is known as a bivariate relation There
More informationCategorical Data Analysis Chapter 3
Categorical Data Analysis Chapter 3 The actual coverage probability is usually a bit higher than the nominal level. Confidence intervals for association parameteres Consider the odds ratio in the 2x2 table,
More informationThe regression model with one fixed regressor cont d
The regression model with one fixed regressor cont d 3150/4150 Lecture 4 Ragnar Nymoen 27 January 2012 The model with transformed variables Regression with transformed variables I References HGL Ch 2.8
More informationMS&E 226: Small Data
MS&E 226: Small Data Lecture 15: Examples of hypothesis tests (v5) Ramesh Johari ramesh.johari@stanford.edu 1 / 32 The recipe 2 / 32 The hypothesis testing recipe In this lecture we repeatedly apply the
More informationA3. Statistical Inference Hypothesis Testing for General Population Parameters
Appendix / A3. Statistical Inference / General Parameters- A3. Statistical Inference Hypothesis Testing for General Population Parameters POPULATION H 0 : θ = θ 0 θ is a generic parameter of interest (e.g.,
More informationTUTORIAL 8 SOLUTIONS #
TUTORIAL 8 SOLUTIONS #9.11.21 Suppose that a single observation X is taken from a uniform density on [0,θ], and consider testing H 0 : θ = 1 versus H 1 : θ =2. (a) Find a test that has significance level
More informationSome New Aspects of Dose-Response Models with Applications to Multistage Models Having Parameters on the Boundary
Some New Aspects of Dose-Response Models with Applications to Multistage Models Having Parameters on the Boundary Bimal Sinha Department of Mathematics & Statistics University of Maryland, Baltimore County,
More informationAnswer Key for STAT 200B HW No. 8
Answer Key for STAT 200B HW No. 8 May 8, 2007 Problem 3.42 p. 708 The values of Ȳ for x 00, 0, 20, 30 are 5/40, 0, 20/50, and, respectively. From Corollary 3.5 it follows that MLE exists i G is identiable
More informationCorrelation. A statistics method to measure the relationship between two variables. Three characteristics
Correlation Correlation A statistics method to measure the relationship between two variables Three characteristics Direction of the relationship Form of the relationship Strength/Consistency Direction
More informationA Marginal Maximum Likelihood Procedure for an IRT Model with Single-Peaked Response Functions
A Marginal Maximum Likelihood Procedure for an IRT Model with Single-Peaked Response Functions Cees A.W. Glas Oksana B. Korobko University of Twente, the Netherlands OMD Progress Report 07-01. Cees A.W.
More informationComponents of the Pearson-Fisher Chi-squared Statistic
JOURNAL OF APPLIED MATHEMATICS AND DECISION SCIENCES, 6(4), 241 254 Copyright c 2002, Lawrence Erlbaum Associates, Inc. Components of the Pearson-Fisher Chi-squared Statistic G.D. RAYNER National Australia
More informationML Testing (Likelihood Ratio Testing) for non-gaussian models
ML Testing (Likelihood Ratio Testing) for non-gaussian models Surya Tokdar ML test in a slightly different form Model X f (x θ), θ Θ. Hypothesist H 0 : θ Θ 0 Good set: B c (x) = {θ : l x (θ) max θ Θ l
More information" M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2
Notation and Equations for Final Exam Symbol Definition X The variable we measure in a scientific study n The size of the sample N The size of the population M The mean of the sample µ The mean of the
More informationSRMR in Mplus. Tihomir Asparouhov and Bengt Muthén. May 2, 2018
SRMR in Mplus Tihomir Asparouhov and Bengt Muthén May 2, 2018 1 Introduction In this note we describe the Mplus implementation of the SRMR standardized root mean squared residual) fit index for the models
More informationStatistics 135 Fall 2008 Final Exam
Name: SID: Statistics 135 Fall 2008 Final Exam Show your work. The number of points each question is worth is shown at the beginning of the question. There are 10 problems. 1. [2] The normal equations
More informationStatistical Inference
Statistical Inference Classical and Bayesian Methods Revision Class for Midterm Exam AMS-UCSC Th Feb 9, 2012 Winter 2012. Session 1 (Revision Class) AMS-132/206 Th Feb 9, 2012 1 / 23 Topics Topics We will
More informationTopic 21 Goodness of Fit
Topic 21 Goodness of Fit Contingency Tables 1 / 11 Introduction Two-way Table Smoking Habits The Hypothesis The Test Statistic Degrees of Freedom Outline 2 / 11 Introduction Contingency tables, also known
More informationThe Impact of Model Misspecification in Clustered and Continuous Growth Modeling
The Impact of Model Misspecification in Clustered and Continuous Growth Modeling Daniel J. Bauer Odum Institute for Research in Social Science The University of North Carolina at Chapel Hill Patrick J.
More informationPart 1.) We know that the probability of any specific x only given p ij = p i p j is just multinomial(n, p) where p k1 k 2
Problem.) I will break this into two parts: () Proving w (m) = p( x (m) X i = x i, X j = x j, p ij = p i p j ). In other words, the probability of a specific table in T x given the row and column counts
More informationSTT 843 Key to Homework 1 Spring 2018
STT 843 Key to Homework Spring 208 Due date: Feb 4, 208 42 (a Because σ = 2, σ 22 = and ρ 2 = 05, we have σ 2 = ρ 2 σ σ22 = 2/2 Then, the mean and covariance of the bivariate normal is µ = ( 0 2 and Σ
More informationNominal Data. Parametric Statistics. Nonparametric Statistics. Parametric vs Nonparametric Tests. Greg C Elvers
Nominal Data Greg C Elvers 1 Parametric Statistics The inferential statistics that we have discussed, such as t and ANOVA, are parametric statistics A parametric statistic is a statistic that makes certain
More informationInstitute of Actuaries of India
Institute of Actuaries of India Subject CT3 Probability & Mathematical Statistics May 2011 Examinations INDICATIVE SOLUTION Introduction The indicative solution has been written by the Examiners with the
More informationThe goodness-of-fit test Having discussed how to make comparisons between two proportions, we now consider comparisons of multiple proportions.
The goodness-of-fit test Having discussed how to make comparisons between two proportions, we now consider comparisons of multiple proportions. A common problem of this type is concerned with determining
More informationLecture 2: Linear Models. Bruce Walsh lecture notes Seattle SISG -Mixed Model Course version 23 June 2011
Lecture 2: Linear Models Bruce Walsh lecture notes Seattle SISG -Mixed Model Course version 23 June 2011 1 Quick Review of the Major Points The general linear model can be written as y = X! + e y = vector
More informationSession 3 The proportional odds model and the Mann-Whitney test
Session 3 The proportional odds model and the Mann-Whitney test 3.1 A unified approach to inference 3.2 Analysis via dichotomisation 3.3 Proportional odds 3.4 Relationship with the Mann-Whitney test Session
More informationI i=1 1 I(J 1) j=1 (Y ij Ȳi ) 2. j=1 (Y j Ȳ )2 ] = 2n( is the two-sample t-test statistic.
Serik Sagitov, Chalmers and GU, February, 08 Solutions chapter Matlab commands: x = data matrix boxplot(x) anova(x) anova(x) Problem.3 Consider one-way ANOVA test statistic For I = and = n, put F = MS
More informationChapter 4: Constrained estimators and tests in the multiple linear regression model (Part III)
Chapter 4: Constrained estimators and tests in the multiple linear regression model (Part III) Florian Pelgrin HEC September-December 2010 Florian Pelgrin (HEC) Constrained estimators September-December
More informationStatistical Tests for Parameterized Multinomial Models: Power Approximation and Optimization. Edgar Erdfelder University of Mannheim Germany
Statistical Tests for Parameterized Multinomial Models: Power Approximation and Optimization Edgar Erdfelder University of Mannheim Germany The Problem Typically, model applications start with a global
More informationWhats beyond Concerto: An introduction to the R package catr. Session 4: Overview of polytomous IRT models
Whats beyond Concerto: An introduction to the R package catr Session 4: Overview of polytomous IRT models The Psychometrics Centre, Cambridge, June 10th, 2014 2 Outline: 1. Introduction 2. General notations
More informationHypothesis Testing hypothesis testing approach
Hypothesis Testing In this case, we d be trying to form an inference about that neighborhood: Do people there shop more often those people who are members of the larger population To ascertain this, we
More informationLikelihood Ratio tests
Likelihood Ratio tests For general composite hypotheses optimality theory is not usually successful in producing an optimal test. instead we look for heuristics to guide our choices. The simplest approach
More informationProbability Theory and Statistics. Peter Jochumzen
Probability Theory and Statistics Peter Jochumzen April 18, 2016 Contents 1 Probability Theory And Statistics 3 1.1 Experiment, Outcome and Event................................ 3 1.2 Probability............................................
More informationOptimal exact tests for complex alternative hypotheses on cross tabulated data
Optimal exact tests for complex alternative hypotheses on cross tabulated data Daniel Yekutieli Statistics and OR Tel Aviv University CDA course 29 July 2017 Yekutieli (TAU) Optimal exact tests for complex
More informationRater agreement - ordinal ratings. Karl Bang Christensen Dept. of Biostatistics, Univ. of Copenhagen NORDSTAT,
Rater agreement - ordinal ratings Karl Bang Christensen Dept. of Biostatistics, Univ. of Copenhagen NORDSTAT, 2012 http://biostat.ku.dk/~kach/ 1 Rater agreement - ordinal ratings Methods for analyzing
More informationEcon 325: Introduction to Empirical Economics
Econ 325: Introduction to Empirical Economics Lecture 6 Sampling and Sampling Distributions Ch. 6-1 Populations and Samples A Population is the set of all items or individuals of interest Examples: All
More informationA Markov chain Monte Carlo approach to confirmatory item factor analysis. Michael C. Edwards The Ohio State University
A Markov chain Monte Carlo approach to confirmatory item factor analysis Michael C. Edwards The Ohio State University An MCMC approach to CIFA Overview Motivating examples Intro to Item Response Theory
More informationParametric Modelling of Over-dispersed Count Data. Part III / MMath (Applied Statistics) 1
Parametric Modelling of Over-dispersed Count Data Part III / MMath (Applied Statistics) 1 Introduction Poisson regression is the de facto approach for handling count data What happens then when Poisson
More informationLikelihood-based inference with missing data under missing-at-random
Likelihood-based inference with missing data under missing-at-random Jae-kwang Kim Joint work with Shu Yang Department of Statistics, Iowa State University May 4, 014 Outline 1. Introduction. Parametric
More informationAsymptotic Statistics-VI. Changliang Zou
Asymptotic Statistics-VI Changliang Zou Kolmogorov-Smirnov distance Example (Kolmogorov-Smirnov confidence intervals) We know given α (0, 1), there is a well-defined d = d α,n such that, for any continuous
More informationInter-Rater Agreement
Engineering Statistics (EGC 630) Dec., 008 http://core.ecu.edu/psyc/wuenschk/spss.htm Degree of agreement/disagreement among raters Inter-Rater Agreement Psychologists commonly measure various characteristics
More informationPREDICTING THE DISTRIBUTION OF A GOODNESS-OF-FIT STATISTIC APPROPRIATE FOR USE WITH PERFORMANCE-BASED ASSESSMENTS. Mary A. Hansen
PREDICTING THE DISTRIBUTION OF A GOODNESS-OF-FIT STATISTIC APPROPRIATE FOR USE WITH PERFORMANCE-BASED ASSESSMENTS by Mary A. Hansen B.S., Mathematics and Computer Science, California University of PA,
More informationReview. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 770: Categorical Data Analysis
Review Timothy Hanson Department of Statistics, University of South Carolina Stat 770: Categorical Data Analysis 1 / 22 Chapter 1: background Nominal, ordinal, interval data. Distributions: Poisson, binomial,
More informationChapter 10. Chapter 10. Multinomial Experiments and. Multinomial Experiments and Contingency Tables. Contingency Tables.
Chapter 10 Multinomial Experiments and Contingency Tables 1 Chapter 10 Multinomial Experiments and Contingency Tables 10-1 1 Overview 10-2 2 Multinomial Experiments: of-fitfit 10-3 3 Contingency Tables:
More information1 Degree distributions and data
1 Degree distributions and data A great deal of effort is often spent trying to identify what functional form best describes the degree distribution of a network, particularly the upper tail of that distribution.
More informationSTAT 135 Lab 11 Tests for Categorical Data (Fisher s Exact test, χ 2 tests for Homogeneity and Independence) and Linear Regression
STAT 135 Lab 11 Tests for Categorical Data (Fisher s Exact test, χ 2 tests for Homogeneity and Independence) and Linear Regression Rebecca Barter April 20, 2015 Fisher s Exact Test Fisher s Exact Test
More informationChapte The McGraw-Hill Companies, Inc. All rights reserved.
er15 Chapte Chi-Square Tests d Chi-Square Tests for -Fit Uniform Goodness- Poisson Goodness- Goodness- ECDF Tests (Optional) Contingency Tables A contingency table is a cross-tabulation of n paired observations
More informationFinal Exam. 1. (6 points) True/False. Please read the statements carefully, as no partial credit will be given.
1. (6 points) True/False. Please read the statements carefully, as no partial credit will be given. (a) If X and Y are independent, Corr(X, Y ) = 0. (b) (c) (d) (e) A consistent estimator must be asymptotically
More informationStatistics for Business and Economics
Statistics for Business and Economics Chapter 6 Sampling and Sampling Distributions Ch. 6-1 6.1 Tools of Business Statistics n Descriptive statistics n Collecting, presenting, and describing data n Inferential
More informationCHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007)
FROM: PAGANO, R. R. (007) I. INTRODUCTION: DISTINCTION BETWEEN PARAMETRIC AND NON-PARAMETRIC TESTS Statistical inference tests are often classified as to whether they are parametric or nonparametric Parameter
More information8. Hypothesis Testing
FE661 - Statistical Methods for Financial Engineering 8. Hypothesis Testing Jitkomut Songsiri introduction Wald test likelihood-based tests significance test for linear regression 8-1 Introduction elements
More informationConfidence Intervals, Testing and ANOVA Summary
Confidence Intervals, Testing and ANOVA Summary 1 One Sample Tests 1.1 One Sample z test: Mean (σ known) Let X 1,, X n a r.s. from N(µ, σ) or n > 30. Let The test statistic is H 0 : µ = µ 0. z = x µ 0
More informationRANDOM INTERCEPT ITEM FACTOR ANALYSIS. IE Working Paper MK8-102-I 02 / 04 / Alberto Maydeu Olivares
RANDOM INTERCEPT ITEM FACTOR ANALYSIS IE Working Paper MK8-102-I 02 / 04 / 2003 Alberto Maydeu Olivares Instituto de Empresa Marketing Dept. C / María de Molina 11-15, 28006 Madrid España Alberto.Maydeu@ie.edu
More informationEstimating ability for two samples
Estimating ability for two samples William Revelle David M. Condon Northwestern University Abstract Using IRT to estimate ability is easy, but how accurate are the estimate and what about multiple samples?
More informationSome Curiosities Arising in Objective Bayesian Analysis
. Some Curiosities Arising in Objective Bayesian Analysis Jim Berger Duke University Statistical and Applied Mathematical Institute Yale University May 15, 2009 1 Three vignettes related to John s work
More informationGENERALIZED FIDUCIAL INFERENCE FOR GRADED RESPONSE MODELS. Yang Liu
GENERALIZED FIDUCIAL INFERENCE FOR GRADED RESPONSE MODELS Yang Liu A dissertation submitted to the faculty of the University of North Carolina at Chapel Hill in partial fulfillment of the requirements
More informationCanonical Correlation Analysis of Longitudinal Data
Biometrics Section JSM 2008 Canonical Correlation Analysis of Longitudinal Data Jayesh Srivastava Dayanand N Naik Abstract Studying the relationship between two sets of variables is an important multivariate
More informationCross-Validation with Confidence
Cross-Validation with Confidence Jing Lei Department of Statistics, Carnegie Mellon University UMN Statistics Seminar, Mar 30, 2017 Overview Parameter est. Model selection Point est. MLE, M-est.,... Cross-validation
More informationStatistics 3858 : Contingency Tables
Statistics 3858 : Contingency Tables 1 Introduction Before proceeding with this topic the student should review generalized likelihood ratios ΛX) for multinomial distributions, its relation to Pearson
More informationBIO5312 Biostatistics Lecture 13: Maximum Likelihood Estimation
BIO5312 Biostatistics Lecture 13: Maximum Likelihood Estimation Yujin Chung November 29th, 2016 Fall 2016 Yujin Chung Lec13: MLE Fall 2016 1/24 Previous Parametric tests Mean comparisons (normality assumption)
More informationAkaike Information Criterion
Akaike Information Criterion Shuhua Hu Center for Research in Scientific Computation North Carolina State University Raleigh, NC February 7, 2012-1- background Background Model statistical model: Y j =
More informationArea1 Scaled Score (NAPLEX) .535 ** **.000 N. Sig. (2-tailed)
Institutional Assessment Report Texas Southern University College of Pharmacy and Health Sciences "An Analysis of 2013 NAPLEX, P4-Comp. Exams and P3 courses The following analysis illustrates relationships
More informationCorrelation and Linear Regression
Correlation and Linear Regression Correlation: Relationships between Variables So far, nearly all of our discussion of inferential statistics has focused on testing for differences between group means
More informationMIT Spring 2015
Assessing Goodness Of Fit MIT 8.443 Dr. Kempthorne Spring 205 Outline 2 Poisson Distribution Counts of events that occur at constant rate Counts in disjoint intervals/regions are independent If intervals/regions
More informationCh 2: Simple Linear Regression
Ch 2: Simple Linear Regression 1. Simple Linear Regression Model A simple regression model with a single regressor x is y = β 0 + β 1 x + ɛ, where we assume that the error ɛ is independent random component
More informationFall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.
1. Let P be a probability measure on a collection of sets A. (a) For each n N, let H n be a set in A such that H n H n+1. Show that P (H n ) monotonically converges to P ( k=1 H k) as n. (b) For each n
More informationLecture 7: Hypothesis Testing and ANOVA
Lecture 7: Hypothesis Testing and ANOVA Goals Overview of key elements of hypothesis testing Review of common one and two sample tests Introduction to ANOVA Hypothesis Testing The intent of hypothesis
More informationVisual interpretation with normal approximation
Visual interpretation with normal approximation H 0 is true: H 1 is true: p =0.06 25 33 Reject H 0 α =0.05 (Type I error rate) Fail to reject H 0 β =0.6468 (Type II error rate) 30 Accept H 1 Visual interpretation
More informationPaper Review: Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties by Jianqing Fan and Runze Li (2001)
Paper Review: Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties by Jianqing Fan and Runze Li (2001) Presented by Yang Zhao March 5, 2010 1 / 36 Outlines 2 / 36 Motivation
More informationStatistical Analysis for QBIC Genetics Adapted by Ellen G. Dow 2017
Statistical Analysis for QBIC Genetics Adapted by Ellen G. Dow 2017 I. χ 2 or chi-square test Objectives: Compare how close an experimentally derived value agrees with an expected value. One method to
More informationApplication of Variance Homogeneity Tests Under Violation of Normality Assumption
Application of Variance Homogeneity Tests Under Violation of Normality Assumption Alisa A. Gorbunova, Boris Yu. Lemeshko Novosibirsk State Technical University Novosibirsk, Russia e-mail: gorbunova.alisa@gmail.com
More informationStudy Ch. 13.1, # 1 4 all Study Ch. 13.2, # 9 15, 25, 27, 31 [# 11 17, ~27, 29, ~33]
GOALS: 1. Learn the properties of the χ 2 Distribution. 2. Understand how the shape of the χ 2 Distribution changes as the df increases. 3. Be able to find p values. 4. Recognize that χ 2 tests are right
More informationBlack White Total Observed Expected χ 2 = (f observed f expected ) 2 f expected (83 126) 2 ( )2 126
Psychology 60 Fall 2013 Practice Final Actual Exam: This Wednesday. Good luck! Name: To view the solutions, check the link at the end of the document. This practice final should supplement your studying;
More information:the actual population proportion are equal to the hypothesized sample proportions 2. H a
AP Statistics Chapter 14 Chi- Square Distribution Procedures I. Chi- Square Distribution ( χ 2 ) The chi- square test is used when comparing categorical data or multiple proportions. a. Family of only
More informationStat 5102 Final Exam May 14, 2015
Stat 5102 Final Exam May 14, 2015 Name Student ID The exam is closed book and closed notes. You may use three 8 1 11 2 sheets of paper with formulas, etc. You may also use the handouts on brand name distributions
More informationarxiv: v1 [math.st] 22 Jun 2018
Hypothesis testing near singularities and boundaries arxiv:1806.08458v1 [math.st] Jun 018 Jonathan D. Mitchell, Elizabeth S. Allman, and John A. Rhodes Department of Mathematics & Statistics University
More informationApplied Mathematics Research Report 07-08
Estimate-based Goodness-of-Fit Test for Large Sparse Multinomial Distributions by Sung-Ho Kim, Heymi Choi, and Sangjin Lee Applied Mathematics Research Report 0-0 November, 00 DEPARTMENT OF MATHEMATICAL
More informationMaster s Written Examination - Solution
Master s Written Examination - Solution Spring 204 Problem Stat 40 Suppose X and X 2 have the joint pdf f X,X 2 (x, x 2 ) = 2e (x +x 2 ), 0 < x < x 2
More informationChapter 7. Hypothesis Testing
Chapter 7. Hypothesis Testing Joonpyo Kim June 24, 2017 Joonpyo Kim Ch7 June 24, 2017 1 / 63 Basic Concepts of Testing Suppose that our interest centers on a random variable X which has density function
More informationMultivariate Regression
Multivariate Regression The so-called supervised learning problem is the following: we want to approximate the random variable Y with an appropriate function of the random variables X 1,..., X p with the
More information4.5.1 The use of 2 log Λ when θ is scalar
4.5. ASYMPTOTIC FORM OF THE G.L.R.T. 97 4.5.1 The use of 2 log Λ when θ is scalar Suppose we wish to test the hypothesis NH : θ = θ where θ is a given value against the alternative AH : θ θ on the basis
More informationA Very Brief Summary of Statistical Inference, and Examples
A Very Brief Summary of Statistical Inference, and Examples Trinity Term 2009 Prof. Gesine Reinert Our standard situation is that we have data x = x 1, x 2,..., x n, which we view as realisations of random
More informationLing 289 Contingency Table Statistics
Ling 289 Contingency Table Statistics Roger Levy and Christopher Manning This is a summary of the material that we ve covered on contingency tables. Contingency tables: introduction Odds ratios Counting,
More informationProbability and Statistics Notes
Probability and Statistics Notes Chapter Seven Jesse Crawford Department of Mathematics Tarleton State University Spring 2011 (Tarleton State University) Chapter Seven Notes Spring 2011 1 / 42 Outline
More informationECON 4160, Autumn term Lecture 1
ECON 4160, Autumn term 2017. Lecture 1 a) Maximum Likelihood based inference. b) The bivariate normal model Ragnar Nymoen University of Oslo 24 August 2017 1 / 54 Principles of inference I Ordinary least
More informationSTATS 200: Introduction to Statistical Inference. Lecture 29: Course review
STATS 200: Introduction to Statistical Inference Lecture 29: Course review Course review We started in Lecture 1 with a fundamental assumption: Data is a realization of a random process. The goal throughout
More informationReview of Statistics 101
Review of Statistics 101 We review some important themes from the course 1. Introduction Statistics- Set of methods for collecting/analyzing data (the art and science of learning from data). Provides methods
More informationMATH 240. Chapter 8 Outlines of Hypothesis Tests
MATH 4 Chapter 8 Outlines of Hypothesis Tests Test for Population Proportion p Specify the null and alternative hypotheses, ie, choose one of the three, where p is some specified number: () H : p H : p
More information2.1.3 The Testing Problem and Neave s Step Method
we can guarantee (1) that the (unknown) true parameter vector θ t Θ is an interior point of Θ, and (2) that ρ θt (R) > 0 for any R 2 Q. These are two of Birch s regularity conditions that were critical
More informationFormulas and Tables by Mario F. Triola
Copyright 010 Pearson Education, Inc. Ch. 3: Descriptive Statistics x f # x x f Mean 1x - x s - 1 n 1 x - 1 x s 1n - 1 s B variance s Ch. 4: Probability Mean (frequency table) Standard deviation P1A or
More information