The Use of Quadratic Form Statistics of Residuals to Identify IRT Model Misfit in Marginal Subtables

Size: px
Start display at page:

Download "The Use of Quadratic Form Statistics of Residuals to Identify IRT Model Misfit in Marginal Subtables"

Transcription

1 The Use of Quadratic Form Statistics of Residuals to Identify IRT Model Misfit in Marginal Subtables 1 Yang Liu Alberto Maydeu-Olivares 1 Department of Psychology, The University of North Carolina at Chapel Hill Department of Psychology, University of Barcelona IMPS 01 Lincoln, NE 1/0

2 /0 Acknowledgment Dr. David Thissen Other folks from the Thurstone lab

3 Overview: goodness-of-fit test of IRT model specification parameter estimation overall GOF test reject H 0 fail to reject H 0. remove items change model... 3/0

4 Overview: goodness-of-fit test of IRT model specification parameter estimation overall GOF test reject H 0 fail to reject H 0 marginal subtable GOF test remove items change model... 3/0

5 4/0 Marginal subtable GOF test Consider A, a subset of r items (e.g. r = for pairs, 3 for triplets) Principles of testing marginal fit 1 Do not want to reject well-fitting items r should not be too large, usually r 3 Null hypothesis H 0 : π A = π A (θ A ) π A : probabilities of cells in subtable A θ A : item parameters in A Use test statistics based on residuals ê A := p A π A (ˆθ A ) p A : observed proportion π A (ˆθ A ): model-implied probability ˆθA : some estimator of θ A

6 4/0 Marginal subtable GOF test Consider A, a subset of r items (e.g. r = for pairs, 3 for triplets) Principles of testing marginal fit 1 Do not want to reject well-fitting items r should not be too large, usually r 3 Null hypothesis H 0 : π A = π A (θ A ) π A : probabilities of cells in subtable A θ A : item parameters in A Use test statistics based on residuals ê A := p A π A (ˆθ A ) p A : observed proportion π A (ˆθ A ): model-implied probability ˆθA : some estimator of θ A

7 4/0 Marginal subtable GOF test Consider A, a subset of r items (e.g. r = for pairs, 3 for triplets) Principles of testing marginal fit 1 Do not want to reject well-fitting items r should not be too large, usually r 3 Null hypothesis H 0 : π A = π A (θ A ) π A : probabilities of cells in subtable A θ A : item parameters in A Use test statistics based on residuals ê A := p A π A (ˆθ A ) p A : observed proportion π A (ˆθ A ): model-implied probability ˆθA : some estimator of θ A

8 4/0 Marginal subtable GOF test Consider A, a subset of r items (e.g. r = for pairs, 3 for triplets) Principles of testing marginal fit 1 Do not want to reject well-fitting items r should not be too large, usually r 3 Null hypothesis H 0 : π A = π A (θ A ) π A : probabilities of cells in subtable A θ A : item parameters in A Use test statistics based on residuals ê A := p A π A (ˆθ A ) p A : observed proportion π A (ˆθ A ): model-implied probability ˆθA : some estimator of θ A

9 5/0 Asymptotic tests based on residuals Proposition 1 (Maydeu-Olivares & Liu, 01): Let ˆθ A be the full table MLE corresponding to item subset A, under H 0 : N êa d N (0, ΣA ) where Σ A = Γ A A I 1 [A] A, Γ A = D A π A (θ A )π A (θ A ), D A = Diag(π A (θ A )), A = π A / θ A, I [A] is the block of Fisher information matrix I corresponding to θ A. Standardized residuals (Reiser, 1996; Maydeu-Olivares & Joe, 006) Quadratic form statistics of residuals

10 5/0 Asymptotic tests based on residuals Proposition 1 (Maydeu-Olivares & Liu, 01): Let ˆθ A be the full table MLE corresponding to item subset A, under H 0 : N êa d N (0, ΣA ) where Σ A = Γ A A I 1 [A] A, Γ A = D A π A (θ A )π A (θ A ), D A = Diag(π A (θ A )), A = π A / θ A, I [A] is the block of Fisher information matrix I corresponding to θ A. Standardized residuals (Reiser, 1996; Maydeu-Olivares & Joe, 006) Quadratic form statistics of residuals

11 5/0 Asymptotic tests based on residuals Proposition 1 (Maydeu-Olivares & Liu, 01): Let ˆθ A be the full table MLE corresponding to item subset A, under H 0 : N êa d N (0, ΣA ) where Σ A = Γ A A I 1 [A] A, Γ A = D A π A (θ A )π A (θ A ), D A = Diag(π A (θ A )), A = π A / θ A, I [A] is the block of Fisher information matrix I corresponding to θ A. Standardized residuals (Reiser, 1996; Maydeu-Olivares & Joe, 006) Quadratic form statistics of residuals

12 6/0 Quadratic form statistics of residuals 1 Pearson s X XA = N ê 1 A ˆD A êa W statistic (Reiser, 1996) W A = N ê A ˆΣ + AêA where ˆΣ + A is the Moore-Penrose pseudoinverse of ˆΣ A 3 M n statistic (Maydeu-Olivares & Joe, 005, 006) M A = N ê AÛAê A where ÛA = ˆD 1 A ˆD 1 A ˆ A ( ˆ A ˆD 1 A ˆ A ) 1 ˆ A ˆD 1 A

13 Asymptotic distributions under H 0 Proposition (Cochran, 1934; Box, 1954; Tan, 1977): Let p-dimensional random vector z N (0, Ω), and W is any p p real symmetric matrix, then z Wz p λ i χ 1 i=1 where each chi-squared random variable is distributed independently of every other, and λ i s are eigenvalues of matrix ΩW. 1 W A d χ s where s is the rank of Σ A (use ˆΣ A in practice) M A d χ t where t = dim(π A ) dim(θ A ) 1 3 X A d λ i χ 1, where λ i s are eigenvalues for matrix Σ A D 1 A (use ˆΣ A ˆD 1 A in practice) 7/0

14 Asymptotic distributions under H 0 Proposition (Cochran, 1934; Box, 1954; Tan, 1977): Let p-dimensional random vector z N (0, Ω), and W is any p p real symmetric matrix, then z Wz p λ i χ 1 i=1 where each chi-squared random variable is distributed independently of every other, and λ i s are eigenvalues of matrix ΩW. 1 W A d χ s where s is the rank of Σ A (use ˆΣ A in practice) M A d χ t where t = dim(π A ) dim(θ A ) 1 3 X A d λ i χ 1, where λ i s are eigenvalues for matrix Σ A D 1 A (use ˆΣ A ˆD 1 A in practice) 7/0

15 Asymptotic distributions under H 0 Proposition (Cochran, 1934; Box, 1954; Tan, 1977): Let p-dimensional random vector z N (0, Ω), and W is any p p real symmetric matrix, then z Wz p λ i χ 1 i=1 where each chi-squared random variable is distributed independently of every other, and λ i s are eigenvalues of matrix ΩW. 1 W A d χ s where s is the rank of Σ A (use ˆΣ A in practice) M A d χ t where t = dim(π A ) dim(θ A ) 1 3 X A d λ i χ 1, where λ i s are eigenvalues for matrix Σ A D 1 A (use ˆΣ A ˆD 1 A in practice) 7/0

16 Asymptotic distributions under H 0 Proposition (Cochran, 1934; Box, 1954; Tan, 1977): Let p-dimensional random vector z N (0, Ω), and W is any p p real symmetric matrix, then z Wz p λ i χ 1 i=1 where each chi-squared random variable is distributed independently of every other, and λ i s are eigenvalues of matrix ΩW. 1 W A d χ s where s is the rank of Σ A (use ˆΣ A in practice) M A d χ t where t = dim(π A ) dim(θ A ) 1 3 X A d λ i χ 1, where λ i s are eigenvalues for matrix Σ A D 1 A (use ˆΣ A ˆD 1 A in practice) 7/0

17 8/0 Computing asymptotic p-values 1 W statistic (requires Î) In general we do not know the rank of Σ A use the rank of ˆΣ A set some cutoff for zero EV (e.g ) M n statistic (does not require Î) Model should be identified in the subtable (dim(θ A ) < dim(π A ) 1) When r =, graded model is identified, PL is not 3 X statistic Computing/approximating the tail probabilities (requires Î) Satterthwaite s approximation (Satterthwaite, 1941) Inversion method (Imhof, 1961) Heuristical chi-squared reference distributions (does not require Î) Subtable MLE: d.f. = dim(π A ) dim(θ A ) 1 Chen-Thissen (1997): d.f. of independence model test

18 8/0 Computing asymptotic p-values 1 W statistic (requires Î) In general we do not know the rank of Σ A use the rank of ˆΣ A set some cutoff for zero EV (e.g ) M n statistic (does not require Î) Model should be identified in the subtable (dim(θ A ) < dim(π A ) 1) When r =, graded model is identified, PL is not 3 X statistic Computing/approximating the tail probabilities (requires Î) Satterthwaite s approximation (Satterthwaite, 1941) Inversion method (Imhof, 1961) Heuristical chi-squared reference distributions (does not require Î) Subtable MLE: d.f. = dim(π A ) dim(θ A ) 1 Chen-Thissen (1997): d.f. of independence model test

19 8/0 Computing asymptotic p-values 1 W statistic (requires Î) In general we do not know the rank of Σ A use the rank of ˆΣ A set some cutoff for zero EV (e.g ) M n statistic (does not require Î) Model should be identified in the subtable (dim(θ A ) < dim(π A ) 1) When r =, graded model is identified, PL is not 3 X statistic Computing/approximating the tail probabilities (requires Î) Satterthwaite s approximation (Satterthwaite, 1941) Inversion method (Imhof, 1961) Heuristical chi-squared reference distributions (does not require Î) Subtable MLE: d.f. = dim(π A ) dim(θ A ) 1 Chen-Thissen (1997): d.f. of independence model test

20 8/0 Computing asymptotic p-values 1 W statistic (requires Î) In general we do not know the rank of Σ A use the rank of ˆΣ A set some cutoff for zero EV (e.g ) M n statistic (does not require Î) Model should be identified in the subtable (dim(θ A ) < dim(π A ) 1) When r =, graded model is identified, PL is not 3 X statistic Computing/approximating the tail probabilities (requires Î) Satterthwaite s approximation (Satterthwaite, 1941) Inversion method (Imhof, 1961) Heuristical chi-squared reference distributions (does not require Î) Subtable MLE: d.f. = dim(π A ) dim(θ A ) 1 Chen-Thissen (1997): d.f. of independence model test

21 Simulation A: dichotomous items 9 items, fitted model is always PL, 1000 reps for each subcondition Condition A1: PL model Slopes β = (.7, 1.67, 1.8,.7, 1.67, 1.8) Intercepts α = (1.4, 0, 1.06, 1.4, 0, 1.06, 1.4, 0, 1.06) Condition A: Mixture model LV 1 N (0, 1) + 1 N (3, 1) Condition A3: 3-dimensional (independent cluster) model LVs N 0, β = /0

22 Simulation A: dichotomous items 9 items, fitted model is always PL, 1000 reps for each subcondition Condition A1: PL model Slopes β = (.7, 1.67, 1.8,.7, 1.67, 1.8) Intercepts α = (1.4, 0, 1.06, 1.4, 0, 1.06, 1.4, 0, 1.06) Condition A: Mixture model LV 1 N (0, 1) + 1 N (3, 1) Condition A3: 3-dimensional (independent cluster) model LVs N 0, β = /0

23 Simulation A: dichotomous items 9 items, fitted model is always PL, 1000 reps for each subcondition Condition A1: PL model Slopes β = (.7, 1.67, 1.8,.7, 1.67, 1.8) Intercepts α = (1.4, 0, 1.06, 1.4, 0, 1.06, 1.4, 0, 1.06) Condition A: Mixture model LV 1 N (0, 1) + 1 N (3, 1) Condition A3: 3-dimensional (independent cluster) model LVs N 0, β = /0

24 Simulation A: Statistics Subtable Statistics Reference distribution Pair W (jk) χ s ; s = # EV of ˆΣ (jk) 10 5 (j, k) X(jk) Chen-Thissen χ 1 Satterthwaite s approximation Inversion method Triplet W (jkl) χ s ; s = # EV of ˆΣ (jkl) 10 5 (j, k, l) M (jkl) χ 1 X(jkl) Subtable MLE χ 1 Chen-Thissen χ 4 Satterthwaite s approximation Inversion method 10/0

25 11/0 Result A1: PL (true), PL (fitted) Pair (, 3) Triplet (1,, 3) Rejection rate W(3) X (3) (Chen Thissen) X (3) (Satterthwaite) X (3) (Inversion) Rejection rate W(13) M(13) X (13) (Subtable MLE) X (13) (Chen Thissen) X (13) (Satterthwaite) X (13) (Inversion) α level α level

26 1/0 Result A: Mixture (true), PL (fitted) Pair (, 3) Triplet (1,, 3) Rejection rate W(3) X (3) (Satterthwaite) X (3) (Inversion) Rejection rate W(13) M(13) X (13) (Satterthwaite) X (13) (Inversion) α level α level

27 13/0 Result A3: 3-dim (true), PL (fitted) Pair (, 3) Triplet (1,, 3) Rejection rate W(3) X (3) (Satterthwaite) X (3) (Inversion) Rejection rate W(13) M(13) X (13) (Satterthwaite) X (13) (Inversion) α level α level

28 14/0 Condition B: 3-category items 6 items, fitted model is always graded, 1000 reps for each subcondition Condition B1: Graded model Condition B: Mixture model β = (.7, 1.67, 1.8,.7, 1.67, 1.8) [ α = LV 1 N (0, 1) + 1 N (3, 1) Condition B3: -dimensional (independent cluster) model ([ ] [ ]) LVs N, [ β = ] ]

29 14/0 Condition B: 3-category items 6 items, fitted model is always graded, 1000 reps for each subcondition Condition B1: Graded model Condition B: Mixture model β = (.7, 1.67, 1.8,.7, 1.67, 1.8) [ α = LV 1 N (0, 1) + 1 N (3, 1) Condition B3: -dimensional (independent cluster) model ([ ] [ ]) LVs N, [ β = ] ]

30 14/0 Condition B: 3-category items 6 items, fitted model is always graded, 1000 reps for each subcondition Condition B1: Graded model Condition B: Mixture model β = (.7, 1.67, 1.8,.7, 1.67, 1.8) [ α = LV 1 N (0, 1) + 1 N (3, 1) Condition B3: -dimensional (independent cluster) model ([ ] [ ]) LVs N, [ β = ] ]

31 Condition B: Statistics Subtable Statistics Reference distribution Pairs W (jk) χ s ; s = # EV of ˆΣ (jk) 10 5 (j, k) M (jk) χ X(jk) Subtable MLE χ Chen-Thissen χ 4 Satterthwaite s approximation Inversion method 15/0

32 16/0 Result B1: graded (true), graded (fitted) Pair (1, ) Pair (, 3) Rejection rate W(1) M(1) X (1) (Subtable MLE) X (1) (Chen Thissen) X (1) (Satterthwaite) X (1) (Inversion) Rejection rate W(3) M(3) X (3) (Subtable MLE) X (3) (Chen Thissen) X (3) (Satterthwaite) X (3) (Inversion) α level α level

33 17/0 Result B: mixture (true), graded (fitted) Pair (1, ) Pair (, 3) Rejection rate W(1) M(1) X (1) (Satterthwaite) X (1) (Inversion) Rejection rate W(3) M(3) X (3) (Satterthwaite) X (3) (Inversion) α level α level

34 18/0 Result B3: -dim (true), graded (fitted) Pair (1, ) Pair (, 3) Rejection rate W(1) M(1) X (1) (Satterthwaite) X (1) (Inversion) Rejection rate W(3) M(3) X (3) (Satterthwaite) X (3) (Inversion) α level α level

35 19/0 Concluding remarks When the model is correctly specified Empirical distributions of W and M n closely match their asymptotic reference distributions Heuristical chi-squared reference distributions for X works poorly p-values based on the correct asymptotic distribution of X works well; the Satterthwaite s approximation works as well as the inversion method Among W, M n and X, we suggest the use of W statistics for pairs (and triplets, if necessary) which always have high power Future direction Long tests where exact Fisher information cannot be computed: Use OPG (a.k.a. XPD) estimator of information ÎOPG = ˆ Ŵ ˆ where Ŵ = Diag{p c /πc} C c=1

36 19/0 Concluding remarks When the model is correctly specified Empirical distributions of W and M n closely match their asymptotic reference distributions Heuristical chi-squared reference distributions for X works poorly p-values based on the correct asymptotic distribution of X works well; the Satterthwaite s approximation works as well as the inversion method Among W, M n and X, we suggest the use of W statistics for pairs (and triplets, if necessary) which always have high power Future direction Long tests where exact Fisher information cannot be computed: Use OPG (a.k.a. XPD) estimator of information ÎOPG = ˆ Ŵ ˆ where Ŵ = Diag{p c /πc} C c=1

37 19/0 Concluding remarks When the model is correctly specified Empirical distributions of W and M n closely match their asymptotic reference distributions Heuristical chi-squared reference distributions for X works poorly p-values based on the correct asymptotic distribution of X works well; the Satterthwaite s approximation works as well as the inversion method Among W, M n and X, we suggest the use of W statistics for pairs (and triplets, if necessary) which always have high power Future direction Long tests where exact Fisher information cannot be computed: Use OPG (a.k.a. XPD) estimator of information ÎOPG = ˆ Ŵ ˆ where Ŵ = Diag{p c /πc} C c=1

38 19/0 Concluding remarks When the model is correctly specified Empirical distributions of W and M n closely match their asymptotic reference distributions Heuristical chi-squared reference distributions for X works poorly p-values based on the correct asymptotic distribution of X works well; the Satterthwaite s approximation works as well as the inversion method Among W, M n and X, we suggest the use of W statistics for pairs (and triplets, if necessary) which always have high power Future direction Long tests where exact Fisher information cannot be computed: Use OPG (a.k.a. XPD) estimator of information ÎOPG = ˆ Ŵ ˆ where Ŵ = Diag{p c /πc} C c=1

39 19/0 Concluding remarks When the model is correctly specified Empirical distributions of W and M n closely match their asymptotic reference distributions Heuristical chi-squared reference distributions for X works poorly p-values based on the correct asymptotic distribution of X works well; the Satterthwaite s approximation works as well as the inversion method Among W, M n and X, we suggest the use of W statistics for pairs (and triplets, if necessary) which always have high power Future direction Long tests where exact Fisher information cannot be computed: Use OPG (a.k.a. XPD) estimator of information ÎOPG = ˆ Ŵ ˆ where Ŵ = Diag{p c /πc} C c=1

40 Thanks! 0/0

Local Dependence Diagnostics in IRT Modeling of Binary Data

Local Dependence Diagnostics in IRT Modeling of Binary Data Local Dependence Diagnostics in IRT Modeling of Binary Data Educational and Psychological Measurement 73(2) 254 274 Ó The Author(s) 2012 Reprints and permission: sagepub.com/journalspermissions.nav DOI:

More information

A Note on a Tucker-Lewis-Index for Item Response Theory Models. Taehun Lee, Li Cai University of California, Los Angeles

A Note on a Tucker-Lewis-Index for Item Response Theory Models. Taehun Lee, Li Cai University of California, Los Angeles A Note on a Tucker-Lewis-Index for Item Response Theory Models Taehun Lee, Li Cai University of California, Los Angeles 1 Item Response Theory (IRT) Models IRT is a family of statistical models used to

More information

Psychology 282 Lecture #4 Outline Inferences in SLR

Psychology 282 Lecture #4 Outline Inferences in SLR Psychology 282 Lecture #4 Outline Inferences in SLR Assumptions To this point we have not had to make any distributional assumptions. Principle of least squares requires no assumptions. Can use correlations

More information

Central Limit Theorem ( 5.3)

Central Limit Theorem ( 5.3) Central Limit Theorem ( 5.3) Let X 1, X 2,... be a sequence of independent random variables, each having n mean µ and variance σ 2. Then the distribution of the partial sum S n = X i i=1 becomes approximately

More information

Can you tell the relationship between students SAT scores and their college grades?

Can you tell the relationship between students SAT scores and their college grades? Correlation One Challenge Can you tell the relationship between students SAT scores and their college grades? A: The higher SAT scores are, the better GPA may be. B: The higher SAT scores are, the lower

More information

STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots. March 8, 2015

STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots. March 8, 2015 STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots March 8, 2015 The duality between CI and hypothesis testing The duality between CI and hypothesis

More information

Lecture 3. Inference about multivariate normal distribution

Lecture 3. Inference about multivariate normal distribution Lecture 3. Inference about multivariate normal distribution 3.1 Point and Interval Estimation Let X 1,..., X n be i.i.d. N p (µ, Σ). We are interested in evaluation of the maximum likelihood estimates

More information

Assessment of fit of item response theory models used in large scale educational survey assessments

Assessment of fit of item response theory models used in large scale educational survey assessments DOI 10.1186/s40536-016-0025-3 RESEARCH Open Access Assessment of fit of item response theory models used in large scale educational survey assessments Peter W. van Rijn 1, Sandip Sinharay 2*, Shelby J.

More information

Item Response Theory (IRT) Analysis of Item Sets

Item Response Theory (IRT) Analysis of Item Sets University of Connecticut DigitalCommons@UConn NERA Conference Proceedings 2011 Northeastern Educational Research Association (NERA) Annual Conference Fall 10-21-2011 Item Response Theory (IRT) Analysis

More information

An Equivalency Test for Model Fit. Craig S. Wells. University of Massachusetts Amherst. James. A. Wollack. Ronald C. Serlin

An Equivalency Test for Model Fit. Craig S. Wells. University of Massachusetts Amherst. James. A. Wollack. Ronald C. Serlin Equivalency Test for Model Fit 1 Running head: EQUIVALENCY TEST FOR MODEL FIT An Equivalency Test for Model Fit Craig S. Wells University of Massachusetts Amherst James. A. Wollack Ronald C. Serlin University

More information

HYPOTHESIS TESTING: THE CHI-SQUARE STATISTIC

HYPOTHESIS TESTING: THE CHI-SQUARE STATISTIC 1 HYPOTHESIS TESTING: THE CHI-SQUARE STATISTIC 7 steps of Hypothesis Testing 1. State the hypotheses 2. Identify level of significant 3. Identify the critical values 4. Calculate test statistics 5. Compare

More information

THE PEARSON CORRELATION COEFFICIENT

THE PEARSON CORRELATION COEFFICIENT CORRELATION Two variables are said to have a relation if knowing the value of one variable gives you information about the likely value of the second variable this is known as a bivariate relation There

More information

Categorical Data Analysis Chapter 3

Categorical Data Analysis Chapter 3 Categorical Data Analysis Chapter 3 The actual coverage probability is usually a bit higher than the nominal level. Confidence intervals for association parameteres Consider the odds ratio in the 2x2 table,

More information

The regression model with one fixed regressor cont d

The regression model with one fixed regressor cont d The regression model with one fixed regressor cont d 3150/4150 Lecture 4 Ragnar Nymoen 27 January 2012 The model with transformed variables Regression with transformed variables I References HGL Ch 2.8

More information

MS&E 226: Small Data

MS&E 226: Small Data MS&E 226: Small Data Lecture 15: Examples of hypothesis tests (v5) Ramesh Johari ramesh.johari@stanford.edu 1 / 32 The recipe 2 / 32 The hypothesis testing recipe In this lecture we repeatedly apply the

More information

A3. Statistical Inference Hypothesis Testing for General Population Parameters

A3. Statistical Inference Hypothesis Testing for General Population Parameters Appendix / A3. Statistical Inference / General Parameters- A3. Statistical Inference Hypothesis Testing for General Population Parameters POPULATION H 0 : θ = θ 0 θ is a generic parameter of interest (e.g.,

More information

TUTORIAL 8 SOLUTIONS #

TUTORIAL 8 SOLUTIONS # TUTORIAL 8 SOLUTIONS #9.11.21 Suppose that a single observation X is taken from a uniform density on [0,θ], and consider testing H 0 : θ = 1 versus H 1 : θ =2. (a) Find a test that has significance level

More information

Some New Aspects of Dose-Response Models with Applications to Multistage Models Having Parameters on the Boundary

Some New Aspects of Dose-Response Models with Applications to Multistage Models Having Parameters on the Boundary Some New Aspects of Dose-Response Models with Applications to Multistage Models Having Parameters on the Boundary Bimal Sinha Department of Mathematics & Statistics University of Maryland, Baltimore County,

More information

Answer Key for STAT 200B HW No. 8

Answer Key for STAT 200B HW No. 8 Answer Key for STAT 200B HW No. 8 May 8, 2007 Problem 3.42 p. 708 The values of Ȳ for x 00, 0, 20, 30 are 5/40, 0, 20/50, and, respectively. From Corollary 3.5 it follows that MLE exists i G is identiable

More information

Correlation. A statistics method to measure the relationship between two variables. Three characteristics

Correlation. A statistics method to measure the relationship between two variables. Three characteristics Correlation Correlation A statistics method to measure the relationship between two variables Three characteristics Direction of the relationship Form of the relationship Strength/Consistency Direction

More information

A Marginal Maximum Likelihood Procedure for an IRT Model with Single-Peaked Response Functions

A Marginal Maximum Likelihood Procedure for an IRT Model with Single-Peaked Response Functions A Marginal Maximum Likelihood Procedure for an IRT Model with Single-Peaked Response Functions Cees A.W. Glas Oksana B. Korobko University of Twente, the Netherlands OMD Progress Report 07-01. Cees A.W.

More information

Components of the Pearson-Fisher Chi-squared Statistic

Components of the Pearson-Fisher Chi-squared Statistic JOURNAL OF APPLIED MATHEMATICS AND DECISION SCIENCES, 6(4), 241 254 Copyright c 2002, Lawrence Erlbaum Associates, Inc. Components of the Pearson-Fisher Chi-squared Statistic G.D. RAYNER National Australia

More information

ML Testing (Likelihood Ratio Testing) for non-gaussian models

ML Testing (Likelihood Ratio Testing) for non-gaussian models ML Testing (Likelihood Ratio Testing) for non-gaussian models Surya Tokdar ML test in a slightly different form Model X f (x θ), θ Θ. Hypothesist H 0 : θ Θ 0 Good set: B c (x) = {θ : l x (θ) max θ Θ l

More information

" M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2

 M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2 Notation and Equations for Final Exam Symbol Definition X The variable we measure in a scientific study n The size of the sample N The size of the population M The mean of the sample µ The mean of the

More information

SRMR in Mplus. Tihomir Asparouhov and Bengt Muthén. May 2, 2018

SRMR in Mplus. Tihomir Asparouhov and Bengt Muthén. May 2, 2018 SRMR in Mplus Tihomir Asparouhov and Bengt Muthén May 2, 2018 1 Introduction In this note we describe the Mplus implementation of the SRMR standardized root mean squared residual) fit index for the models

More information

Statistics 135 Fall 2008 Final Exam

Statistics 135 Fall 2008 Final Exam Name: SID: Statistics 135 Fall 2008 Final Exam Show your work. The number of points each question is worth is shown at the beginning of the question. There are 10 problems. 1. [2] The normal equations

More information

Statistical Inference

Statistical Inference Statistical Inference Classical and Bayesian Methods Revision Class for Midterm Exam AMS-UCSC Th Feb 9, 2012 Winter 2012. Session 1 (Revision Class) AMS-132/206 Th Feb 9, 2012 1 / 23 Topics Topics We will

More information

Topic 21 Goodness of Fit

Topic 21 Goodness of Fit Topic 21 Goodness of Fit Contingency Tables 1 / 11 Introduction Two-way Table Smoking Habits The Hypothesis The Test Statistic Degrees of Freedom Outline 2 / 11 Introduction Contingency tables, also known

More information

The Impact of Model Misspecification in Clustered and Continuous Growth Modeling

The Impact of Model Misspecification in Clustered and Continuous Growth Modeling The Impact of Model Misspecification in Clustered and Continuous Growth Modeling Daniel J. Bauer Odum Institute for Research in Social Science The University of North Carolina at Chapel Hill Patrick J.

More information

Part 1.) We know that the probability of any specific x only given p ij = p i p j is just multinomial(n, p) where p k1 k 2

Part 1.) We know that the probability of any specific x only given p ij = p i p j is just multinomial(n, p) where p k1 k 2 Problem.) I will break this into two parts: () Proving w (m) = p( x (m) X i = x i, X j = x j, p ij = p i p j ). In other words, the probability of a specific table in T x given the row and column counts

More information

STT 843 Key to Homework 1 Spring 2018

STT 843 Key to Homework 1 Spring 2018 STT 843 Key to Homework Spring 208 Due date: Feb 4, 208 42 (a Because σ = 2, σ 22 = and ρ 2 = 05, we have σ 2 = ρ 2 σ σ22 = 2/2 Then, the mean and covariance of the bivariate normal is µ = ( 0 2 and Σ

More information

Nominal Data. Parametric Statistics. Nonparametric Statistics. Parametric vs Nonparametric Tests. Greg C Elvers

Nominal Data. Parametric Statistics. Nonparametric Statistics. Parametric vs Nonparametric Tests. Greg C Elvers Nominal Data Greg C Elvers 1 Parametric Statistics The inferential statistics that we have discussed, such as t and ANOVA, are parametric statistics A parametric statistic is a statistic that makes certain

More information

Institute of Actuaries of India

Institute of Actuaries of India Institute of Actuaries of India Subject CT3 Probability & Mathematical Statistics May 2011 Examinations INDICATIVE SOLUTION Introduction The indicative solution has been written by the Examiners with the

More information

The goodness-of-fit test Having discussed how to make comparisons between two proportions, we now consider comparisons of multiple proportions.

The goodness-of-fit test Having discussed how to make comparisons between two proportions, we now consider comparisons of multiple proportions. The goodness-of-fit test Having discussed how to make comparisons between two proportions, we now consider comparisons of multiple proportions. A common problem of this type is concerned with determining

More information

Lecture 2: Linear Models. Bruce Walsh lecture notes Seattle SISG -Mixed Model Course version 23 June 2011

Lecture 2: Linear Models. Bruce Walsh lecture notes Seattle SISG -Mixed Model Course version 23 June 2011 Lecture 2: Linear Models Bruce Walsh lecture notes Seattle SISG -Mixed Model Course version 23 June 2011 1 Quick Review of the Major Points The general linear model can be written as y = X! + e y = vector

More information

Session 3 The proportional odds model and the Mann-Whitney test

Session 3 The proportional odds model and the Mann-Whitney test Session 3 The proportional odds model and the Mann-Whitney test 3.1 A unified approach to inference 3.2 Analysis via dichotomisation 3.3 Proportional odds 3.4 Relationship with the Mann-Whitney test Session

More information

I i=1 1 I(J 1) j=1 (Y ij Ȳi ) 2. j=1 (Y j Ȳ )2 ] = 2n( is the two-sample t-test statistic.

I i=1 1 I(J 1) j=1 (Y ij Ȳi ) 2. j=1 (Y j Ȳ )2 ] = 2n( is the two-sample t-test statistic. Serik Sagitov, Chalmers and GU, February, 08 Solutions chapter Matlab commands: x = data matrix boxplot(x) anova(x) anova(x) Problem.3 Consider one-way ANOVA test statistic For I = and = n, put F = MS

More information

Chapter 4: Constrained estimators and tests in the multiple linear regression model (Part III)

Chapter 4: Constrained estimators and tests in the multiple linear regression model (Part III) Chapter 4: Constrained estimators and tests in the multiple linear regression model (Part III) Florian Pelgrin HEC September-December 2010 Florian Pelgrin (HEC) Constrained estimators September-December

More information

Statistical Tests for Parameterized Multinomial Models: Power Approximation and Optimization. Edgar Erdfelder University of Mannheim Germany

Statistical Tests for Parameterized Multinomial Models: Power Approximation and Optimization. Edgar Erdfelder University of Mannheim Germany Statistical Tests for Parameterized Multinomial Models: Power Approximation and Optimization Edgar Erdfelder University of Mannheim Germany The Problem Typically, model applications start with a global

More information

Whats beyond Concerto: An introduction to the R package catr. Session 4: Overview of polytomous IRT models

Whats beyond Concerto: An introduction to the R package catr. Session 4: Overview of polytomous IRT models Whats beyond Concerto: An introduction to the R package catr Session 4: Overview of polytomous IRT models The Psychometrics Centre, Cambridge, June 10th, 2014 2 Outline: 1. Introduction 2. General notations

More information

Hypothesis Testing hypothesis testing approach

Hypothesis Testing hypothesis testing approach Hypothesis Testing In this case, we d be trying to form an inference about that neighborhood: Do people there shop more often those people who are members of the larger population To ascertain this, we

More information

Likelihood Ratio tests

Likelihood Ratio tests Likelihood Ratio tests For general composite hypotheses optimality theory is not usually successful in producing an optimal test. instead we look for heuristics to guide our choices. The simplest approach

More information

Probability Theory and Statistics. Peter Jochumzen

Probability Theory and Statistics. Peter Jochumzen Probability Theory and Statistics Peter Jochumzen April 18, 2016 Contents 1 Probability Theory And Statistics 3 1.1 Experiment, Outcome and Event................................ 3 1.2 Probability............................................

More information

Optimal exact tests for complex alternative hypotheses on cross tabulated data

Optimal exact tests for complex alternative hypotheses on cross tabulated data Optimal exact tests for complex alternative hypotheses on cross tabulated data Daniel Yekutieli Statistics and OR Tel Aviv University CDA course 29 July 2017 Yekutieli (TAU) Optimal exact tests for complex

More information

Rater agreement - ordinal ratings. Karl Bang Christensen Dept. of Biostatistics, Univ. of Copenhagen NORDSTAT,

Rater agreement - ordinal ratings. Karl Bang Christensen Dept. of Biostatistics, Univ. of Copenhagen NORDSTAT, Rater agreement - ordinal ratings Karl Bang Christensen Dept. of Biostatistics, Univ. of Copenhagen NORDSTAT, 2012 http://biostat.ku.dk/~kach/ 1 Rater agreement - ordinal ratings Methods for analyzing

More information

Econ 325: Introduction to Empirical Economics

Econ 325: Introduction to Empirical Economics Econ 325: Introduction to Empirical Economics Lecture 6 Sampling and Sampling Distributions Ch. 6-1 Populations and Samples A Population is the set of all items or individuals of interest Examples: All

More information

A Markov chain Monte Carlo approach to confirmatory item factor analysis. Michael C. Edwards The Ohio State University

A Markov chain Monte Carlo approach to confirmatory item factor analysis. Michael C. Edwards The Ohio State University A Markov chain Monte Carlo approach to confirmatory item factor analysis Michael C. Edwards The Ohio State University An MCMC approach to CIFA Overview Motivating examples Intro to Item Response Theory

More information

Parametric Modelling of Over-dispersed Count Data. Part III / MMath (Applied Statistics) 1

Parametric Modelling of Over-dispersed Count Data. Part III / MMath (Applied Statistics) 1 Parametric Modelling of Over-dispersed Count Data Part III / MMath (Applied Statistics) 1 Introduction Poisson regression is the de facto approach for handling count data What happens then when Poisson

More information

Likelihood-based inference with missing data under missing-at-random

Likelihood-based inference with missing data under missing-at-random Likelihood-based inference with missing data under missing-at-random Jae-kwang Kim Joint work with Shu Yang Department of Statistics, Iowa State University May 4, 014 Outline 1. Introduction. Parametric

More information

Asymptotic Statistics-VI. Changliang Zou

Asymptotic Statistics-VI. Changliang Zou Asymptotic Statistics-VI Changliang Zou Kolmogorov-Smirnov distance Example (Kolmogorov-Smirnov confidence intervals) We know given α (0, 1), there is a well-defined d = d α,n such that, for any continuous

More information

Inter-Rater Agreement

Inter-Rater Agreement Engineering Statistics (EGC 630) Dec., 008 http://core.ecu.edu/psyc/wuenschk/spss.htm Degree of agreement/disagreement among raters Inter-Rater Agreement Psychologists commonly measure various characteristics

More information

PREDICTING THE DISTRIBUTION OF A GOODNESS-OF-FIT STATISTIC APPROPRIATE FOR USE WITH PERFORMANCE-BASED ASSESSMENTS. Mary A. Hansen

PREDICTING THE DISTRIBUTION OF A GOODNESS-OF-FIT STATISTIC APPROPRIATE FOR USE WITH PERFORMANCE-BASED ASSESSMENTS. Mary A. Hansen PREDICTING THE DISTRIBUTION OF A GOODNESS-OF-FIT STATISTIC APPROPRIATE FOR USE WITH PERFORMANCE-BASED ASSESSMENTS by Mary A. Hansen B.S., Mathematics and Computer Science, California University of PA,

More information

Review. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 770: Categorical Data Analysis

Review. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 770: Categorical Data Analysis Review Timothy Hanson Department of Statistics, University of South Carolina Stat 770: Categorical Data Analysis 1 / 22 Chapter 1: background Nominal, ordinal, interval data. Distributions: Poisson, binomial,

More information

Chapter 10. Chapter 10. Multinomial Experiments and. Multinomial Experiments and Contingency Tables. Contingency Tables.

Chapter 10. Chapter 10. Multinomial Experiments and. Multinomial Experiments and Contingency Tables. Contingency Tables. Chapter 10 Multinomial Experiments and Contingency Tables 1 Chapter 10 Multinomial Experiments and Contingency Tables 10-1 1 Overview 10-2 2 Multinomial Experiments: of-fitfit 10-3 3 Contingency Tables:

More information

1 Degree distributions and data

1 Degree distributions and data 1 Degree distributions and data A great deal of effort is often spent trying to identify what functional form best describes the degree distribution of a network, particularly the upper tail of that distribution.

More information

STAT 135 Lab 11 Tests for Categorical Data (Fisher s Exact test, χ 2 tests for Homogeneity and Independence) and Linear Regression

STAT 135 Lab 11 Tests for Categorical Data (Fisher s Exact test, χ 2 tests for Homogeneity and Independence) and Linear Regression STAT 135 Lab 11 Tests for Categorical Data (Fisher s Exact test, χ 2 tests for Homogeneity and Independence) and Linear Regression Rebecca Barter April 20, 2015 Fisher s Exact Test Fisher s Exact Test

More information

Chapte The McGraw-Hill Companies, Inc. All rights reserved.

Chapte The McGraw-Hill Companies, Inc. All rights reserved. er15 Chapte Chi-Square Tests d Chi-Square Tests for -Fit Uniform Goodness- Poisson Goodness- Goodness- ECDF Tests (Optional) Contingency Tables A contingency table is a cross-tabulation of n paired observations

More information

Final Exam. 1. (6 points) True/False. Please read the statements carefully, as no partial credit will be given.

Final Exam. 1. (6 points) True/False. Please read the statements carefully, as no partial credit will be given. 1. (6 points) True/False. Please read the statements carefully, as no partial credit will be given. (a) If X and Y are independent, Corr(X, Y ) = 0. (b) (c) (d) (e) A consistent estimator must be asymptotically

More information

Statistics for Business and Economics

Statistics for Business and Economics Statistics for Business and Economics Chapter 6 Sampling and Sampling Distributions Ch. 6-1 6.1 Tools of Business Statistics n Descriptive statistics n Collecting, presenting, and describing data n Inferential

More information

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007)

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007) FROM: PAGANO, R. R. (007) I. INTRODUCTION: DISTINCTION BETWEEN PARAMETRIC AND NON-PARAMETRIC TESTS Statistical inference tests are often classified as to whether they are parametric or nonparametric Parameter

More information

8. Hypothesis Testing

8. Hypothesis Testing FE661 - Statistical Methods for Financial Engineering 8. Hypothesis Testing Jitkomut Songsiri introduction Wald test likelihood-based tests significance test for linear regression 8-1 Introduction elements

More information

Confidence Intervals, Testing and ANOVA Summary

Confidence Intervals, Testing and ANOVA Summary Confidence Intervals, Testing and ANOVA Summary 1 One Sample Tests 1.1 One Sample z test: Mean (σ known) Let X 1,, X n a r.s. from N(µ, σ) or n > 30. Let The test statistic is H 0 : µ = µ 0. z = x µ 0

More information

RANDOM INTERCEPT ITEM FACTOR ANALYSIS. IE Working Paper MK8-102-I 02 / 04 / Alberto Maydeu Olivares

RANDOM INTERCEPT ITEM FACTOR ANALYSIS. IE Working Paper MK8-102-I 02 / 04 / Alberto Maydeu Olivares RANDOM INTERCEPT ITEM FACTOR ANALYSIS IE Working Paper MK8-102-I 02 / 04 / 2003 Alberto Maydeu Olivares Instituto de Empresa Marketing Dept. C / María de Molina 11-15, 28006 Madrid España Alberto.Maydeu@ie.edu

More information

Estimating ability for two samples

Estimating ability for two samples Estimating ability for two samples William Revelle David M. Condon Northwestern University Abstract Using IRT to estimate ability is easy, but how accurate are the estimate and what about multiple samples?

More information

Some Curiosities Arising in Objective Bayesian Analysis

Some Curiosities Arising in Objective Bayesian Analysis . Some Curiosities Arising in Objective Bayesian Analysis Jim Berger Duke University Statistical and Applied Mathematical Institute Yale University May 15, 2009 1 Three vignettes related to John s work

More information

GENERALIZED FIDUCIAL INFERENCE FOR GRADED RESPONSE MODELS. Yang Liu

GENERALIZED FIDUCIAL INFERENCE FOR GRADED RESPONSE MODELS. Yang Liu GENERALIZED FIDUCIAL INFERENCE FOR GRADED RESPONSE MODELS Yang Liu A dissertation submitted to the faculty of the University of North Carolina at Chapel Hill in partial fulfillment of the requirements

More information

Canonical Correlation Analysis of Longitudinal Data

Canonical Correlation Analysis of Longitudinal Data Biometrics Section JSM 2008 Canonical Correlation Analysis of Longitudinal Data Jayesh Srivastava Dayanand N Naik Abstract Studying the relationship between two sets of variables is an important multivariate

More information

Cross-Validation with Confidence

Cross-Validation with Confidence Cross-Validation with Confidence Jing Lei Department of Statistics, Carnegie Mellon University UMN Statistics Seminar, Mar 30, 2017 Overview Parameter est. Model selection Point est. MLE, M-est.,... Cross-validation

More information

Statistics 3858 : Contingency Tables

Statistics 3858 : Contingency Tables Statistics 3858 : Contingency Tables 1 Introduction Before proceeding with this topic the student should review generalized likelihood ratios ΛX) for multinomial distributions, its relation to Pearson

More information

BIO5312 Biostatistics Lecture 13: Maximum Likelihood Estimation

BIO5312 Biostatistics Lecture 13: Maximum Likelihood Estimation BIO5312 Biostatistics Lecture 13: Maximum Likelihood Estimation Yujin Chung November 29th, 2016 Fall 2016 Yujin Chung Lec13: MLE Fall 2016 1/24 Previous Parametric tests Mean comparisons (normality assumption)

More information

Akaike Information Criterion

Akaike Information Criterion Akaike Information Criterion Shuhua Hu Center for Research in Scientific Computation North Carolina State University Raleigh, NC February 7, 2012-1- background Background Model statistical model: Y j =

More information

Area1 Scaled Score (NAPLEX) .535 ** **.000 N. Sig. (2-tailed)

Area1 Scaled Score (NAPLEX) .535 ** **.000 N. Sig. (2-tailed) Institutional Assessment Report Texas Southern University College of Pharmacy and Health Sciences "An Analysis of 2013 NAPLEX, P4-Comp. Exams and P3 courses The following analysis illustrates relationships

More information

Correlation and Linear Regression

Correlation and Linear Regression Correlation and Linear Regression Correlation: Relationships between Variables So far, nearly all of our discussion of inferential statistics has focused on testing for differences between group means

More information

MIT Spring 2015

MIT Spring 2015 Assessing Goodness Of Fit MIT 8.443 Dr. Kempthorne Spring 205 Outline 2 Poisson Distribution Counts of events that occur at constant rate Counts in disjoint intervals/regions are independent If intervals/regions

More information

Ch 2: Simple Linear Regression

Ch 2: Simple Linear Regression Ch 2: Simple Linear Regression 1. Simple Linear Regression Model A simple regression model with a single regressor x is y = β 0 + β 1 x + ɛ, where we assume that the error ɛ is independent random component

More information

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A. 1. Let P be a probability measure on a collection of sets A. (a) For each n N, let H n be a set in A such that H n H n+1. Show that P (H n ) monotonically converges to P ( k=1 H k) as n. (b) For each n

More information

Lecture 7: Hypothesis Testing and ANOVA

Lecture 7: Hypothesis Testing and ANOVA Lecture 7: Hypothesis Testing and ANOVA Goals Overview of key elements of hypothesis testing Review of common one and two sample tests Introduction to ANOVA Hypothesis Testing The intent of hypothesis

More information

Visual interpretation with normal approximation

Visual interpretation with normal approximation Visual interpretation with normal approximation H 0 is true: H 1 is true: p =0.06 25 33 Reject H 0 α =0.05 (Type I error rate) Fail to reject H 0 β =0.6468 (Type II error rate) 30 Accept H 1 Visual interpretation

More information

Paper Review: Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties by Jianqing Fan and Runze Li (2001)

Paper Review: Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties by Jianqing Fan and Runze Li (2001) Paper Review: Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties by Jianqing Fan and Runze Li (2001) Presented by Yang Zhao March 5, 2010 1 / 36 Outlines 2 / 36 Motivation

More information

Statistical Analysis for QBIC Genetics Adapted by Ellen G. Dow 2017

Statistical Analysis for QBIC Genetics Adapted by Ellen G. Dow 2017 Statistical Analysis for QBIC Genetics Adapted by Ellen G. Dow 2017 I. χ 2 or chi-square test Objectives: Compare how close an experimentally derived value agrees with an expected value. One method to

More information

Application of Variance Homogeneity Tests Under Violation of Normality Assumption

Application of Variance Homogeneity Tests Under Violation of Normality Assumption Application of Variance Homogeneity Tests Under Violation of Normality Assumption Alisa A. Gorbunova, Boris Yu. Lemeshko Novosibirsk State Technical University Novosibirsk, Russia e-mail: gorbunova.alisa@gmail.com

More information

Study Ch. 13.1, # 1 4 all Study Ch. 13.2, # 9 15, 25, 27, 31 [# 11 17, ~27, 29, ~33]

Study Ch. 13.1, # 1 4 all Study Ch. 13.2, # 9 15, 25, 27, 31 [# 11 17, ~27, 29, ~33] GOALS: 1. Learn the properties of the χ 2 Distribution. 2. Understand how the shape of the χ 2 Distribution changes as the df increases. 3. Be able to find p values. 4. Recognize that χ 2 tests are right

More information

Black White Total Observed Expected χ 2 = (f observed f expected ) 2 f expected (83 126) 2 ( )2 126

Black White Total Observed Expected χ 2 = (f observed f expected ) 2 f expected (83 126) 2 ( )2 126 Psychology 60 Fall 2013 Practice Final Actual Exam: This Wednesday. Good luck! Name: To view the solutions, check the link at the end of the document. This practice final should supplement your studying;

More information

:the actual population proportion are equal to the hypothesized sample proportions 2. H a

:the actual population proportion are equal to the hypothesized sample proportions 2. H a AP Statistics Chapter 14 Chi- Square Distribution Procedures I. Chi- Square Distribution ( χ 2 ) The chi- square test is used when comparing categorical data or multiple proportions. a. Family of only

More information

Stat 5102 Final Exam May 14, 2015

Stat 5102 Final Exam May 14, 2015 Stat 5102 Final Exam May 14, 2015 Name Student ID The exam is closed book and closed notes. You may use three 8 1 11 2 sheets of paper with formulas, etc. You may also use the handouts on brand name distributions

More information

arxiv: v1 [math.st] 22 Jun 2018

arxiv: v1 [math.st] 22 Jun 2018 Hypothesis testing near singularities and boundaries arxiv:1806.08458v1 [math.st] Jun 018 Jonathan D. Mitchell, Elizabeth S. Allman, and John A. Rhodes Department of Mathematics & Statistics University

More information

Applied Mathematics Research Report 07-08

Applied Mathematics Research Report 07-08 Estimate-based Goodness-of-Fit Test for Large Sparse Multinomial Distributions by Sung-Ho Kim, Heymi Choi, and Sangjin Lee Applied Mathematics Research Report 0-0 November, 00 DEPARTMENT OF MATHEMATICAL

More information

Master s Written Examination - Solution

Master s Written Examination - Solution Master s Written Examination - Solution Spring 204 Problem Stat 40 Suppose X and X 2 have the joint pdf f X,X 2 (x, x 2 ) = 2e (x +x 2 ), 0 < x < x 2

More information

Chapter 7. Hypothesis Testing

Chapter 7. Hypothesis Testing Chapter 7. Hypothesis Testing Joonpyo Kim June 24, 2017 Joonpyo Kim Ch7 June 24, 2017 1 / 63 Basic Concepts of Testing Suppose that our interest centers on a random variable X which has density function

More information

Multivariate Regression

Multivariate Regression Multivariate Regression The so-called supervised learning problem is the following: we want to approximate the random variable Y with an appropriate function of the random variables X 1,..., X p with the

More information

4.5.1 The use of 2 log Λ when θ is scalar

4.5.1 The use of 2 log Λ when θ is scalar 4.5. ASYMPTOTIC FORM OF THE G.L.R.T. 97 4.5.1 The use of 2 log Λ when θ is scalar Suppose we wish to test the hypothesis NH : θ = θ where θ is a given value against the alternative AH : θ θ on the basis

More information

A Very Brief Summary of Statistical Inference, and Examples

A Very Brief Summary of Statistical Inference, and Examples A Very Brief Summary of Statistical Inference, and Examples Trinity Term 2009 Prof. Gesine Reinert Our standard situation is that we have data x = x 1, x 2,..., x n, which we view as realisations of random

More information

Ling 289 Contingency Table Statistics

Ling 289 Contingency Table Statistics Ling 289 Contingency Table Statistics Roger Levy and Christopher Manning This is a summary of the material that we ve covered on contingency tables. Contingency tables: introduction Odds ratios Counting,

More information

Probability and Statistics Notes

Probability and Statistics Notes Probability and Statistics Notes Chapter Seven Jesse Crawford Department of Mathematics Tarleton State University Spring 2011 (Tarleton State University) Chapter Seven Notes Spring 2011 1 / 42 Outline

More information

ECON 4160, Autumn term Lecture 1

ECON 4160, Autumn term Lecture 1 ECON 4160, Autumn term 2017. Lecture 1 a) Maximum Likelihood based inference. b) The bivariate normal model Ragnar Nymoen University of Oslo 24 August 2017 1 / 54 Principles of inference I Ordinary least

More information

STATS 200: Introduction to Statistical Inference. Lecture 29: Course review

STATS 200: Introduction to Statistical Inference. Lecture 29: Course review STATS 200: Introduction to Statistical Inference Lecture 29: Course review Course review We started in Lecture 1 with a fundamental assumption: Data is a realization of a random process. The goal throughout

More information

Review of Statistics 101

Review of Statistics 101 Review of Statistics 101 We review some important themes from the course 1. Introduction Statistics- Set of methods for collecting/analyzing data (the art and science of learning from data). Provides methods

More information

MATH 240. Chapter 8 Outlines of Hypothesis Tests

MATH 240. Chapter 8 Outlines of Hypothesis Tests MATH 4 Chapter 8 Outlines of Hypothesis Tests Test for Population Proportion p Specify the null and alternative hypotheses, ie, choose one of the three, where p is some specified number: () H : p H : p

More information

2.1.3 The Testing Problem and Neave s Step Method

2.1.3 The Testing Problem and Neave s Step Method we can guarantee (1) that the (unknown) true parameter vector θ t Θ is an interior point of Θ, and (2) that ρ θt (R) > 0 for any R 2 Q. These are two of Birch s regularity conditions that were critical

More information

Formulas and Tables by Mario F. Triola

Formulas and Tables by Mario F. Triola Copyright 010 Pearson Education, Inc. Ch. 3: Descriptive Statistics x f # x x f Mean 1x - x s - 1 n 1 x - 1 x s 1n - 1 s B variance s Ch. 4: Probability Mean (frequency table) Standard deviation P1A or

More information