Appendix A Installing and Reading Data in R

Size: px
Start display at page:

Download "Appendix A Installing and Reading Data in R"

Transcription

1 Appendix A Installing and Reading Data in R A.1 Installing R R (R Core Team 2016) is a free software environment that runs on Windows, MacOS and Unix platforms. R can be downloaded from the Comprehensive R Archive Network (CRAN) webpage 1 : There are precompiled binary distributions of the base system and contributed packages for different platforms. The following list shows the links where these distributions can be downloaded for different platforms when installing R for the first time: 1. R for Linux ( 2. R for MacOS ( 3. R for Windows ( All of the examples given in this book have been created and tested on Windows using R version To install R, run the executable file (R win.exe) by double clicking on it, which will start the R for Windows setup wizard. A.1.1 R Studio After installing the R software it is possible to install R studio which is a graphical user interface. R studio ( provides the usual R console and has the advantage of providing a history/workspace window with common data set features such as load/save/import. There are also a window with easy access to files 1 It is recommended to use a mirror site near the place where the program is being downloaded. A full list of mirrors can be found at Springer International Publishing AG 2017 J. González, M. Wiberg, Applying Test Equating Methods, Methodology of Educational Measurement and Assessment, DOI /

2 180 Appendix A and list of available packages. The Plots tab include easy export to a PDF file, a GIF image and copy features to facilitate to insert plots into Word documents. A.2 Installing and Loading R Packages Once R has been installed, it is time to open the program and start working. To install contributed R packages that do not come with the initial distribution, one can go to the tab Packages/Install package(s). Choose the package of interest (e.g. equate) and press ok to install the package. To use the package, one can either go to the menu and select Packages/load package or simply type in the R command prompt > library(equate) Using the library command gives access to all functions within that package. When help is needed with an R function or an R package, the user can simply write help() in the R command prompt. For instance, if help is needed on the use of the equate() function, one can write > help(equate) It should be noted that there are a number of books, online help resources, and online books (simply search for R Statistics book ) where information about R can be obtained. The R packages used for the examples shown in this book have been executed under the following versions: equate version SNSequate version kequate version equateirt version ltm version mirt version A.3 Working Directory and Accessing Data It is convenient to choose a working directory and to store everything connected to a program in such a folder. This can be done in two ways, either by right clicking on the R icon on the desktop and setting which directory R always should start in, or from inside R. In the R command prompt, the current working directory can be identified by writing > getwd()

3 Appendix A 181 and to set a working directory we can write > setwd("c:/users/documents/myrproject") A.4 Loading Data of Different File Formats In Chap. 2 we described how the data sets used in this book can be loaded. If the user has data with different file formats, it is possible to use other functions such as read.table(). Assume that the ADM data described in Chap. 2 are in another file format than.rda. The following examples show how data with different formats can be read into R. The data sets used in the examples can be download from the book s webpage. First, if we want to read in a text file (.txt), that is separated with commas and has headers, we can write the following > ADMt<-read.table(file="ADMt.txt",sep="\t",header=TRUE) If the data are in an Excel file, an easy way to read the data into R is to save the file as either a.dat file or a.csv file. The following commands can be used after saving the files as either of those file types > ADMc1 <-read.csv("admc.csv",sep=";",header=true) > ADMc2 <-read.table("admc.csv",sep=";",header=true) > ADMd <-read.table("admd.dat",header=t) The same kind of data can be read directly from the Excel file (.xls) if the R package XLConnect is installed. In this example Sheet1 in the Excel file is named ADMsheet. Reading the data into R can be done by writing > library(xlconnect) > ADMe <- loadworkbook("adme.xls") > ADMee <- readworksheet(adme, sheet="admsheet") Finally, we can read in an SPSS file (.sav) using the R package foreign > library(foreign) > ADMs <- read.spss("adms.sav",to.data.frame=true) We can use any of these files with the R package equate to obtain the sum scores for the verbal section of the admissions test as described in Chap. 2. Each line of the following code yields the same result as the object "verb.y" in Sect > ADMtSum <- apply(admt[,41:120],1,sum) > ADMc1Sum <- apply(admc1[,41:120],1,sum) > ADMc2Sum <- apply(admc2[,41:120],1,sum) > ADMdSum <- apply(admd[,41:120],1,sum) > ADMeSum <- apply(admee[,41:120],1,sum) > ADMsSum <- apply(adms[,41:120],1,sum) One way to assure that data are read correctly into R is to visualize the data set. To perform a visual inspection, we can, for example, write > View(ADMt)

4 182 Appendix A Reference R Core Team (2016). R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing.

5 Appendix B Additional Material In this appendix we give more details on definitions, derivations, and other relevant material covered throughout the chapters of the book. B.1 Design Functions As seen in Sect , score data can be modeled separately (EG design) or jointly (SG, CB, NEAT, or NEC designs). The design function DF is used to map the score probabilities (either univariate or bivariate) into r and s. Thus, different design functions (DF) are used for different equating designs (von Davier et al. 2004). For instance, under the EG design, because two independent vectors of scores are obtained, the DF is just the identity function, i.e., r D DF.r; s/ D s IJ 0 0 I K r s (B.1) The situation is different, however, when bivariate distributions are considered (e.g., under the SG, CB, NEAT, and NEC designs). Define P as the J K matrix with entries p jk D Pr.X D x j ; Y D y k /. Further, define the vectorized version of P as v.p/ D 0 p 1 : : p K 1 C A ; (B.2) Springer International Publishing AG 2017 J. González, M. Wiberg, Applying Test Equating Methods, Methodology of Educational Measurement and Assessment, DOI /

6 184 Appendix B where p k is the k-th column of P. In the SG design, the DF is defined as r D DF.P/ D s M v.p/; N where M D.I J I J / is a J KJ matrix and 0 1 t J B 0t J 1 0t J N : : : C : A (B.3) (B.4) 0 t J 0t J 1t J is a K KJ matrix. The CB design can be obtained by using two independent SG designs. In this case, the DF is defined as r D DF.P.12/ ; P.21/ / D s wx M.1 w X /M.1 w Y /N w Y N v.p.12/ / v.p.21/ / (B.5) where the weights w X and w Y satisfies 0 w X ; w Y 1. DFs are different under the NEAT design depending on whether CE or PSE is used. For CE, the DF is defined as r P Bt P t Q A D DF.P; Q/ D DFP.P/ D B DF s Q MP N P 0 0 NQ M Q 1 C v.p/ A v.q/ (B.6) The DF for NEAT PSE is given by 0 r D DF.P; Q/ s P l w C.1 w/ P P k q kl j p jl.1 w/ C w P j p jl P P l k q kl p l q l 1 A (B.7) where p l D.p 1l ; p 2l ;:::;p Jl / t and q l D.q 1l ; q 2l ;:::;q Kl / t. Finally, the DF for the NEC design (Wiberg and Bränberg 2015) is given by 0 r D DF.P; Q/ s P P m m w C.1 w/t Qm t Pm p m.1 w/ C wt Pm t Qm q m 1 A ; (B.8) where p m is the mth column of P and q m is the mth column of Q and the summation is over all the m possible combinations of the values of the covariates, in other words a summation over all columns in the matrices.

7 Appendix B 185 B.2 C Matrices The polynomial log-linear models described in Sect can be written in matrix form as follows log.r/ D C u C B tˇ If Oˇ is the maximum likelihood estimator (MLE) of ˇ, the estimator of r j is Or j DOr j.ˇ/, i.e. the MLE or fitted value of r j. It is assumed that the estimators, Or and Os, are obtained separately (independently) so that Cov.Or; Os/ D Or;Os D 0: The following theorem establishes a computationally easy way to calculate Or;Os. Theorem If Or is the MLE of a log-linear model for r, the estimated covariance matrix Or D Cov.r/ can be obtained as where C r is a J T r matrix Or D C r C t r ; C r D N 1=2 D p rq: The diagonal matrix, D p r has elements p Or j.also,q is a JT r orthogonal matrix that comes from the following QR factorization ŒD p r p OrOr t B t D QR ; where Q is a J T r matrix with orthogonal columns and R is an T r T r upper triangular matrix. A proof and more details can be seen in Holland and Thayer (1989) and von Davier et al. (2004). B.3 Calculation of the SEE To calculate the SEE, start by letting R and S be the vectors of the pre-smoothed score distributions obtained under any of the data collection designs. If we assume that they are estimated independently, we can write the covariances as (see Sect. B.2) Cov! OR CR C D t R 0 OS 0 C S C t D CC t ; S (B.9)

8 186 Appendix B where CR 0 C D 0 C S : (B.10) The pre-smoothed score distributions, obtained in kernel equating (step 1) can then be transformed into r and s through the DF given in Sect. B.1 for the different designs. The Jacobian of the DF is defined as J : (B.11) Further, the Jacobian of the equating transformation in Eq. (4.10) is defined by J : Note, if OR and OS are approximately normal distributed with mean R and S, respectively, and variances given in Eq. (B.9), then SEE Y.x/ D J' J DF C (B.13) where kk is the Euclidian norm of the vector. For more details refer, to von Davier et al. (2004). B.4 Score Distributions Under the NEAT Design To formally derive the score distributions of X and Y on T let us use f XP.x j a/ and f XQ.x j a/ to denote the conditional distributions of X scores on P and Q, respectively. Similarly, f YP.y j a/ and f YQ.y j a/ are the conditional distributions for Y scores on P and Q, respectively. Further, because both samples (and thus both populations) take the anchor test A, we define f AP.a/ and f AQ.a/ as the (marginal) distributions for anchor scores in P and Q, respectively. Following the definition of a synthetic population, we have that f AT.a/ D w P f AP.a/ C w Q f AQ.a/; and thus the score distributions of X and Y on T can be obtained as Z Z f XT.x/ D f XP.x j a/f AP.a/da w P C f XQ.x j a/f AQ.a/da Z f YT.y/ D Z f YP.y j a/f AP.a/da w P C w Q f YQ.y j a/f AQ.a/da w Q (B.14) (B.15)

9 Appendix B 187 Because under a NEAT design test form X is only administered to population P and test form Y is only administered to population Q, the conditional distributions f XQ.x j a/ and f YP.y j a/ cannot be estimated from the collected data. 1 However, note that if we assume that the conditional distributions of X j A are the same in both populations P and Q and analogously for that of Y j A, it can easily be shown that Eqs. (B.14) and (B.15) become Z f XT.x/ D f XP.x j a/f AT.a/da Z f YT.y/ D f YQ.y j a/f AT.a/da which can be estimated from the observed quantities. The CDFs F XT.x/ and F YT.y/ can be obtained by cumulating f XT.x/ and f YT.y/ over values of X and Y, respectively. With this quantities, the equating transformation ' T.x/ D FYT 1.F XT.x// is obtained. A different approach for the NEAT design that does not make use of a synthetic population for equating can be found in San Martín and González (2017). B.5 The Lord-Wingersky Algorithm Let X D P J jd1 X ij be the sum score of a test taker with ability. If we assume that test takers with a given ability correctly answer each of J items of a test with probability p j (i D 1;:::;J), then the conditional distribution of X for a given is called the compound binomial (Lord and Novick 1968) and is defined as 0 1 Pr.X D x j j / D f.x j / D X A ; (B.16) P xj Dx p x j j q1 x j j jd1 where p j D.;! j / (as defined in Sect. 5.1), and q j D 1 p j. The direct use of Eq. (B.16) for the calculation of score probabilities is computationally demanding, especially as the number of items in the test increases. An alternative to the direct calculation of score probabilities using the compound binomial distribution has traditionally been the use of a recursion formula given by Lord and Wingersky (1984). The formula reads as follows f r.x j / Df r 1.x j /q r ; x D 0 (B.17) 1 Technically, the score probability distributions are not identifiable (see, San Martín and González 2017).

10 188 Appendix B Dq r f r 1.x j /C p r f r 1.x 1 j /; 0 < x < r (B.18) Df r 1.x 1 j /p r ; x D r (B.19) where f r.x j / is the distribution of sum scores over the first r items for test takers with ability, and p r D p j is the probability defined from the chosen IRT model. Thus, having item parameter estimates so that Op j D.; O! j /, the above formula can be used to obtain the observed score distribution for test takers of a given ability. Alternatives to the Lord-Wingersky algorithm can be found in González et al. (2016), and an extension of this algorithm for the case of polytomous score data can be found in Thissen et al. (1995). B.6 Other Justifications for Local Equating There are other motivations and justifications of the local equating method that have been discussed in the literature. For instance, local equating can be justified from a matching and conditioning point of view as described in Wiberg and van der Linden (2011). Matching on a variable in observed-score equating results in different pairs of distributions of the observed scores X and Y for each value of the matching variable. This means that the focus is shifted from the marginal distributions of X and Y to their conditional distributions given the values of the matching variables. By conditioning on information about the test takers ability the equating becomes less dependent on the ability distribution of the population of test takers. If we could condition on the test takers true abilities the equating would be independent of population. Matching and conditioning was used in the equating literature before local equating was developed but only in the context of adjusting a population distribution so that the population could be used with a single equating transformation (Cook and Petersen 1987; Dorans 1990;Liou etal.2001; Livingston et al. 1990; Wright and Dorans 1993). The idea of one-size-fits all in the history of test theory has been used for standard error of measurement. Nowadays the single standard error of measurement is often replaced by the conditional standard deviation of the observed scores, given the ability ŒVar.Xj/ 1=2 : (B.20) Because local equating is based on conditional distributions of observed scores, the same argument can be made for using local equating as can be made for using the standard error of measurement (van der Linden 2011, 2013). Using different standard errors for different test takers is nowadays well accepted. In line with this, one should accept that different equating transformations might be used for different test takers in observed score equating (van der Linden 2006a,b; van der Linden and Wiberg 2010).

11 Appendix B 189 Local equating essentially relies on the same two basic assumptions that classical test theory and IRT rely on as stated in van der Linden (2006b): 1. For a fixed test taker, the observed score is random across replications of the test. 2. The observed score has a different distribution for each individual test taker. The first assumption implies that each test taker has an observed score distribution. An equating transformation is needed to map the scale of the X test to the scale of the Y test such that the distributions of the score X and the transformed version of score Y are identical. The true equating transformation can accomplish this. The second assumption implies that there exist different true equating transformations for different test takers. These two assumptions together imply that instead of one equating transformation one should use a family of true equating transformations (van der Linden 2006b). B.7 Epanechnikov Kernel Density Estimate and Derivatives Taking the derivative in Eq. (7.2), which we repeat here for completeness, F hx D X J r j 3R jx.x/ R 3 jx.x/ C 2 4 C X K r j ; (B.21) we obtain F 0 h X D f hx D 1 a X h X X J 3r j 1 R 2 jx.x/ 4 (B.22) The first derivative of f hx which we denote as f.1/ h X f.1/ h X.x/ D 1 a X h J D 1 a X h X D 1 a X h X X J 1 X h X.x/ D.a X h X / 2 f.1/ 3r j.1 R 2 jx.x//! 4 3rj 1 X 4 a X h X J 3r j R jx.x/ 2 J 1 a X h X is 3r j R 2 jx.x/! 3r j R jx.x/ : (B.23) 2 4

12 190 Appendix B B.8 The Double Smoothing Bandwidth Selection Method in Kernel Equating Let J be the total number of possible scores in test form X. Then, for l D 1;:::2J 1 the components of Eq. (7.5) are defined as Or l ( Or lc1 if l is odd, D 2 (B.24) Of h X.xl /; if l is even. and Of h X.x l / D JX jd1 x OaX x j.1 Oa X / O X 1 Of gx.x j / ; h X Oa X h X Oa X (B.25) where.z/ is the standard normal density function, and with Of gx D JX jd1 g x Oa x X Or j x j.1 Oa g X X / O X g X Oa g x X 1 g X Oa g X X ; (B.26) q Oa g x X D O X 2=. O X 2 C g2 X /: (B.27) In practice, the DS method is implemented in kernel equating (Häggström and Wiberg 2014) by carrying out the following steps. 1. Start with a very smooth estimate of the density function by using a large subjectively chosen bandwidth g X and estimate f gx at the score values and the values halfway between the score values. This means x D xl D Œx 1 ; x 1 C 0:5; x 2 ;:::x J 0:5; x J T, l D 1;:::2J Because the obtained smooth estimate Of gx will not perfectly interpolate the estimated score probabilities; the Or:s, the first estimate can be improved by estimating f hx at x using Of gx at the actual score values x. Thus we obtain a DS estimate Of h X 3. Select the bandwidth of h X that minimizes the sum of the squared difference between Or l and the lth DS estimate Of h X.xl / (see Eq. (7.5)).

13 Appendix B 191 B.9 The DBPP Model We assume that for every z 2 Z, F z has a density function, with respect to Lebesgue measure of the form 1X f.z/./ D w jˇ jdk j.z/e; k dk j.z/ec1 ; jd1 (B.28) where w j D v j Qi<j Œ1 v i, v 1 ;v 2 ;::: are i.i.d. random variables with common distribution Beta.1; / with >0, k is a discrete random variable with distribution indexed by a finite-dimensional parameter, j.z/ D h z.r j.z//, r 1 ; r 2 ;::: are independent and identically distributed real-valued stochastic processes with law indexed by the parameter, and h z is a known Œ0; 1 -valued bijective continuous functions defined on R. Note that Eq. (B.28) is a version of the single weights DBPP, i.e., covariate dependence is introduced only in the atom processes f j.z/g by means of the link function h. For further reference, we denote this model as where H Dfh z W z 2 Z g. ff z W z 2 Z g wdbpp. ;; ;H /; B.10 Measures of Statistical Assessment When Equating Test Scores Let O'.x/ denote an estimator of '.x/. The definitions of bias, MSE, RMSE and SE are well known and are shown here explicitly for the case of equating transformations. Bias. O'/ D E f'. O' '/ MSE. O'/ D E f' Œ. O' '/ 2 q RMSE. O'/ D E f' Œ. O' '/ 2 SE. O'/ D p Var. O'/ (B.29) (B.30) (B.31) (B.32) Note that SE can be obtained from MSE and bias because the former can be decomposed as the square of the latter plus variance. Because the expectations are not generally available in closed form, randomly generated score data are used to calculate them in practice using Monte Carlo simulation. Thus, in order to practically evaluate the statistical measures in Eqs. (B.29), (B.30), (B.31), and (B.32), the Monte Carlo method is used with replicated data generated from a known

14 192 Appendix B probability model. For each assessment measure, the true and estimated equated values are compared for each test score. Assume that we have a specific test score x i, i D 0;:::;n, and n is the number of possible score values. The measures for an equated value '.x/ over 1000 replications are defined as bias. O'.x i // D 1 X1000. O'.l/.x i / '.x i //; (B.33) 1000 ld1 MSE. O'.x i // D 1 X1000. O'.l/.x i / '.x i // 2 ; (B.34) 1000 ld1 v u RMSE. O'.x i // D t 1 X1000. O' 1000.l/.x i / '.x i // 2 ; (B.35) ld1 SE. O'.x// D p Var. O'.x// (B.36) where O'.l/.x i / is the estimated equated score for the l-th replication. References Cook, L. L., & Petersen, N. S. (1987). Problems related to the use of conventional and item response theory equating methods in less than optimal circumstances. Applied Psychological Measurement, 11(3), Dorans, N. J. (1990). Equating methods and sampling designs. Applied Measurement in Education, 3(1), González, J., Wiberg, M., & von Davier, A. A. (2016). A note on the Poisson s binomial distribution in item response theory. Applied Psychological Measurement, 40(4), Häggström, J., & Wiberg, M. (2014). Optimal bandwidth selection in observed-score kernel equating. Journal of Educational Measurement, 51(2), Holland, P., & Thayer, D. (1989). The kernel method of equating score distributions. Technical report, Educational Testing Service, Princeton. Liou, M., Cheng, P. E., & Li, M.-Y. (2001). Estimating comparable scores using surrogate variables. Applied Psychological Measurement, 25(2), Livingston, S. A., Dorans, N. J., & Wright, N. K. (1990). What combination of sampling and equating methods works best? Applied Measurement in Education, 3(1), Lord, F., & Novick, M. (1968). Statistical theories of mental test scores. Reading, MA: Addison-Wesley. Lord, F., & Wingersky, M. (1984). Comparison of IRT true-score and equipercentile observed-score equatings. Applied Psychological Measurement, 8(4), San Martín, E., & González, J. (2017). Analyzing the anchor test design in equating from an identification perspective. Manuscript submitted for publication. Thissen, D., Pommerich, M., Billeaud, K., & Williams, V. S. L. (1995). Item response theory for scores on tests including polytomous items with ordered responses. Applied Psychological Measurement, 19(1),

15 Appendix B 193 van der Linden, W. J. (2006a). Equating error in observed-score equating. Applied Psychological Measurement, 30(5), van der Linden, W. J. (2006b). Equating scores from adaptive to linear tests. Applied Psychological Measurement, 30(6), van der Linden, W. J. (2011). Local observed-score equating. In A. von Davier (Ed.), Statistical models for test equating, scaling, and linking (pp ). New York: Springer. van der Linden, W. J. (2013). Some conceptual issues in observed-score equating. Journal of Educational Measurement, 50(3), van der Linden, W. J., & Wiberg, M. (2010). Local observed-score equating with anchor-test designs. Applied Psychological Measurement, 34(8), von Davier, A. A., Holland, P., & Thayer, D. (2004). The kernel method of test equating.newyork: Springer. Wiberg, M., & Bränberg, K. (2015). Kernel equating under the non-equivalent groups with covariates design. Applied Psychological Measurement, 39(5), Wiberg, M., & van der Linden, W. J. (2011). Local linear observed-score equating. Journal of Educational Measurement, 48, Wright, N. K., & Dorans, N. J. (1993). Using the selection variable for matching or equating. ETS Research Report Series, 1993(1), 1 22.

16 Index A Adaptive kernel, 94, 158 Akaike Information Criterion (AIC), 38, 86, 87 Alternative kernels, 16, 157 Anchor test, 10 12, 20, 21, 30, 31, 48 51, 64, 78, 85, 138, 140, 141, 146, 149, 154, 186 B Bandwidth selection, 16, 93, 109, 161 Bayesian nonparametric equating, 168 Bias, 10, 65, 68 70, 129, 139, 162, 173, 176, 192 Bootstrap, 53, 65 67, 69, 165 Braun Holland, 51 C Calibration, 113, CB design, 11 Chained equipercentile equating, 51 Chained linear equating, 50 Classical Test Theory (CTT), 48, 50, 120, 140, 189 Common item, 10, 12, 30, 114, 115, 117, 128, 130, 131 Conditional means, 140 Continuization, 8, 13, 73, 74, 92, 94, 158 Counterbalanced design, 11 D Data collection design, 11 Delta method, 103 Design function, 73, 90, 103, 183, 184, 186 Difference that matters (DTM), 173 Difficulty parameter, 111, 126, 129 Discrimination parameter, 112, 114, 164, 167 Double smoothing, 162 E EG design, 11 Epanechnikov kernel, 157 Equating design, 10, 12, 14, 74, 90, 113, 139, 144, 183 Equating requirements, 9, 139 Equipercentile equating, 8, 13, 43, 54, 57, 58, 64, 121, 147 Equity, 9, 138 Equivalent groups design, 11 Estimation of score probabilities, 90 F Freeman-Tukey residuals, 86, 87 Frequency estimation, 50 G Gaussian kernel, 93, 94, 100, 105, 109, 157, 158, 161, 163, 174 Springer International Publishing AG 2017 J. González, M. Wiberg, Applying Test Equating Methods, Methodology of Educational Measurement and Assessment, DOI /

17 196 Index H Haebara method, 115, 116, 118, 119, 165 I IRT, xii, 3, 5, 6, 16, 100, 109, , , , 128, 130, 133, 134, , 151, 152, 157, 163, 165, 168, 189 IRT kernel equating, 163 IRT observed-score equating, 120, 123, 124, 134 IRT true-score equating, 119, 120 Item characteristic curve (ICC), 112 K Kernel equating, 14, 73 Kurtosis, 54 L Levine observed-score, 48 Levine true-score, 49 Linear equating, 14, 43, 47 Linear interpolation, 9, 14 Linking, 10, 111, , 118, 119, 125, 134 Local equating, 137 Local equipercentile equating, 144 Local IRT observed-score equating, 145 Local linear equating, 139 Local observed-score kernel equating, 146 Log-linear modeling, 4, 9, 33, 34, 36 38, 62, 66, 69, 73 76, 78, 80, 82, 83, 85 89, 95, 100 M Mean equating, 43, 44, 46, 56, 57 Mean squared error (MSE), 162, 173 Moments, 14, 34 36, 41, 43, 68, 76, 85, 89, 103, 104, 114, 174 MSE, 10, 176, , 113, 140, 146, 149, 151, 165, 168, 183, 184, 187 NEC design, 12, 22, 44, 85, 109, 183, 184 Nominal weights, 48 Non equivalent groups with anchor test design, 12 Non equivalent groups with covariates design, 12 P Percent relative error (PRE), 103, 104, 106, 173, 174 Percentile, 8, 43, 50 Polytomous IRT kernel equating, 163 Presmoothing, 19, 33, 36, 38, 41, 42, 73, 74, 76, 78, 82, 83, 85, 86, 109 R Rasch model, 124, 126, 129, 130 Root mean squared error (RMSE), 65, 68 70, 139, 173, 176, 192 Rule-based bandwidth selection, 161 S Sample size, 22, 33, 54 Score scale, 2, 7, 19, 22, 25, 44, 56, 112, 169 SG design, 11, 12, 20 22, 27, 28, 30, 33, 34, 37, 44 46, 52, 53, 57, 76, 82, 83, 100, 109, 113, 140, 144, 183, 184 Single group design, 11 Skewness, 54 Standard error (SE), 10, 173, 176 Standard error of equating (SEE), 9, 73, 102, 103, 160 Standard error of equating differences, 103 Standard error, SE, 192 Stocking and Lord method, 115, 116, 118, 126, 165 Synthetic population, 47, 187 N NEAT design, 12, 20 22, 30, 33, 37, 44, 45, 47, 52, 53, 60, 62, 63, 68, 85, 86, 100, T True equating transformation, 138 Tucker equating, 47

Haiwen (Henry) Chen and Paul Holland 1 ETS, Princeton, New Jersey

Haiwen (Henry) Chen and Paul Holland 1 ETS, Princeton, New Jersey Research Report Construction of Chained True Score Equipercentile Equatings Under the Kernel Equating (KE) Framework and Their Relationship to Levine True Score Equating Haiwen (Henry) Chen Paul Holland

More information

Choice of Anchor Test in Equating

Choice of Anchor Test in Equating Research Report Choice of Anchor Test in Equating Sandip Sinharay Paul Holland Research & Development November 2006 RR-06-35 Choice of Anchor Test in Equating Sandip Sinharay and Paul Holland ETS, Princeton,

More information

Research on Standard Errors of Equating Differences

Research on Standard Errors of Equating Differences Research Report Research on Standard Errors of Equating Differences Tim Moses Wenmin Zhang November 2010 ETS RR-10-25 Listening. Learning. Leading. Research on Standard Errors of Equating Differences Tim

More information

GODFREY, KELLY ELIZABETH, Ph.D. A Comparison of Kernel Equating and IRT True Score Equating Methods. (2007) Directed by Dr. Terry A. Ackerman. 181 pp.

GODFREY, KELLY ELIZABETH, Ph.D. A Comparison of Kernel Equating and IRT True Score Equating Methods. (2007) Directed by Dr. Terry A. Ackerman. 181 pp. GODFREY, KELLY ELIZABETH, Ph.D. A Comparison of Kernel Equating and IRT True Score Equating Methods. (7) Directed by Dr. Terry A. Ackerman. 8 pp. This two-part study investigates ) the impact of loglinear

More information

Testing the Untestable Assumptions of the Chain and Poststratification Equating Methods for the NEAT Design

Testing the Untestable Assumptions of the Chain and Poststratification Equating Methods for the NEAT Design Research Report Testing the Untestable Assumptions of the Chain and Poststratification Equating Methods for the NEAT Design Paul W. Holland Alina A. von Davier Sandip Sinharay Ning Han Research & Development

More information

Chapter 1 A Statistical Perspective on Equating Test Scores

Chapter 1 A Statistical Perspective on Equating Test Scores Chapter 1 A Statistical Perspective on Equating Test Scores Alina A. von Davier The fact that statistical methods of inference play so slight a role... reflect[s] the lack of influence modern statistical

More information

Chained Versus Post-Stratification Equating in a Linear Context: An Evaluation Using Empirical Data

Chained Versus Post-Stratification Equating in a Linear Context: An Evaluation Using Empirical Data Research Report Chained Versus Post-Stratification Equating in a Linear Context: An Evaluation Using Empirical Data Gautam Puhan February 2 ETS RR--6 Listening. Learning. Leading. Chained Versus Post-Stratification

More information

Effectiveness of the hybrid Levine equipercentile and modified frequency estimation equating methods under the common-item nonequivalent groups design

Effectiveness of the hybrid Levine equipercentile and modified frequency estimation equating methods under the common-item nonequivalent groups design University of Iowa Iowa Research Online Theses and Dissertations 27 Effectiveness of the hybrid Levine equipercentile and modified frequency estimation equating methods under the common-item nonequivalent

More information

A Comparison of Bivariate Smoothing Methods in Common-Item Equipercentile Equating

A Comparison of Bivariate Smoothing Methods in Common-Item Equipercentile Equating A Comparison of Bivariate Smoothing Methods in Common-Item Equipercentile Equating Bradley A. Hanson American College Testing The effectiveness of smoothing the bivariate distributions of common and noncommon

More information

Equating of Subscores and Weighted Averages Under the NEAT Design

Equating of Subscores and Weighted Averages Under the NEAT Design Research Report ETS RR 11-01 Equating of Subscores and Weighted Averages Under the NEAT Design Sandip Sinharay Shelby Haberman January 2011 Equating of Subscores and Weighted Averages Under the NEAT Design

More information

PIRLS 2016 Achievement Scaling Methodology 1

PIRLS 2016 Achievement Scaling Methodology 1 CHAPTER 11 PIRLS 2016 Achievement Scaling Methodology 1 The PIRLS approach to scaling the achievement data, based on item response theory (IRT) scaling with marginal estimation, was developed originally

More information

Statistical Equating Methods

Statistical Equating Methods Statistical Equating Methods Anthony Albano January 7, 2011 Abstract The R package equate (Albano, 2011) contains functions for non-irt equating under random groups and nonequivalent groups designs. This

More information

AMS 132: Discussion Section 2

AMS 132: Discussion Section 2 Prof. David Draper Department of Applied Mathematics and Statistics University of California, Santa Cruz AMS 132: Discussion Section 2 All computer operations in this course will be described for the Windows

More information

A Unified Approach to Linear Equating for the Non-Equivalent Groups Design

A Unified Approach to Linear Equating for the Non-Equivalent Groups Design Research Report A Unified Approach to Linear Equating for the Non-Equivalent Groups Design Alina A. von Davier Nan Kong Research & Development November 003 RR-03-31 A Unified Approach to Linear Equating

More information

Use of Continuous Exponential Families to Link Forms via Anchor Tests

Use of Continuous Exponential Families to Link Forms via Anchor Tests Research Report ETS RR 11-11 Use of Continuous Exponential Families to Link Forms via Anchor Tests Shelby J. Haberman Duanli Yan April 11 Use of Continuous Exponential Families to Link Forms via Anchor

More information

Package equate. February 15, 2013

Package equate. February 15, 2013 Package equate February 15, 2013 Version 1.1-4 Date 2011-8-23 Title Statistical Methods for Test Score Equating Author Anthony Albano Maintainer Anthony Albano

More information

equate: An R Package for Observed-Score Linking and Equating

equate: An R Package for Observed-Score Linking and Equating equate: An R Package for Observed-Score Linking and Equating Anthony D. Albano University of Nebraska-Lincoln Abstract The R package equate (Albano 2016) contains functions for observed-score linking and

More information

Center for Advanced Studies in Measurement and Assessment. CASMA Research Report

Center for Advanced Studies in Measurement and Assessment. CASMA Research Report Center for Advanced Studies in Measurement and Assessment CASMA Research Report Number 31 Assessing Equating Results Based on First-order and Second-order Equity Eunjung Lee, Won-Chan Lee, Robert L. Brennan

More information

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A. 1. Let P be a probability measure on a collection of sets A. (a) For each n N, let H n be a set in A such that H n H n+1. Show that P (H n ) monotonically converges to P ( k=1 H k) as n. (b) For each n

More information

Center for Advanced Studies in Measurement and Assessment. CASMA Research Report

Center for Advanced Studies in Measurement and Assessment. CASMA Research Report Center for Advanced Studies in Measurement and Assessment CASMA Research Report Number 37 Effects of the Number of Common Items on Equating Precision and Estimates of the Lower Bound to the Number of Common

More information

Equating Tests Under The Nominal Response Model Frank B. Baker

Equating Tests Under The Nominal Response Model Frank B. Baker Equating Tests Under The Nominal Response Model Frank B. Baker University of Wisconsin Under item response theory, test equating involves finding the coefficients of a linear transformation of the metric

More information

Observed-Score "Equatings"

Observed-Score Equatings Comparison of IRT True-Score and Equipercentile Observed-Score "Equatings" Frederic M. Lord and Marilyn S. Wingersky Educational Testing Service Two methods of equating tests are compared, one using true

More information

Population Invariance of Score Linking: Theory and Applications to Advanced Placement Program Examinations

Population Invariance of Score Linking: Theory and Applications to Advanced Placement Program Examinations Research Report Population Invariance of Score Linking: Theory and Applications to Advanced Placement Program Examinations Neil J. Dorans, Editor Research & Development October 2003 RR-03-27 Population

More information

Methods for the Comparability of Scores

Methods for the Comparability of Scores Assessment and Development of new Statistical Methods for the Comparability of Scores 1 Pontificia Universidad Católica de Chile 2 Laboratorio Interdisciplinario de Estadística Social Advisor: Jorge González

More information

Ability Metric Transformations

Ability Metric Transformations Ability Metric Transformations Involved in Vertical Equating Under Item Response Theory Frank B. Baker University of Wisconsin Madison The metric transformations of the ability scales involved in three

More information

Simulating Uniform- and Triangular- Based Double Power Method Distributions

Simulating Uniform- and Triangular- Based Double Power Method Distributions Journal of Statistical and Econometric Methods, vol.6, no.1, 2017, 1-44 ISSN: 1792-6602 (print), 1792-6939 (online) Scienpress Ltd, 2017 Simulating Uniform- and Triangular- Based Double Power Method Distributions

More information

Computerized Adaptive Testing With Equated Number-Correct Scoring

Computerized Adaptive Testing With Equated Number-Correct Scoring Computerized Adaptive Testing With Equated Number-Correct Scoring Wim J. van der Linden University of Twente A constrained computerized adaptive testing (CAT) algorithm is presented that can be used to

More information

HOW TO USE MIKANA. 1. Decompress the zip file MATLAB.zip. This will create the directory MIKANA.

HOW TO USE MIKANA. 1. Decompress the zip file MATLAB.zip. This will create the directory MIKANA. HOW TO USE MIKANA MIKANA (Method to Infer Kinetics And Network Architecture) is a novel computational method to infer reaction mechanisms and estimate the kinetic parameters of biochemical pathways from

More information

Lesson 7: Item response theory models (part 2)

Lesson 7: Item response theory models (part 2) Lesson 7: Item response theory models (part 2) Patrícia Martinková Department of Statistical Modelling Institute of Computer Science, Czech Academy of Sciences Institute for Research and Development of

More information

equate: An R Package for Observed-Score Linking and Equating

equate: An R Package for Observed-Score Linking and Equating equate: An R Package for Observed-Score Linking and Equating Anthony D. Albano University of Nebraska-Lincoln Abstract The R package equate (Albano 2014) contains functions for observed-score linking and

More information

Package equateirt. June 8, 2018

Package equateirt. June 8, 2018 Type Package Title IRT Equating Methods Imports statmod, stats, utils, mirt Suggests knitr, ltm, rmarkdown, sna Version 2.0-5 Author Package equateirt June 8, 2018 Maintainer

More information

Walkthrough for Illustrations. Illustration 1

Walkthrough for Illustrations. Illustration 1 Tay, L., Meade, A. W., & Cao, M. (in press). An overview and practical guide to IRT measurement equivalence analysis. Organizational Research Methods. doi: 10.1177/1094428114553062 Walkthrough for Illustrations

More information

On the Use of Nonparametric ICC Estimation Techniques For Checking Parametric Model Fit

On the Use of Nonparametric ICC Estimation Techniques For Checking Parametric Model Fit On the Use of Nonparametric ICC Estimation Techniques For Checking Parametric Model Fit March 27, 2004 Young-Sun Lee Teachers College, Columbia University James A.Wollack University of Wisconsin Madison

More information

Raffaela Wolf, MS, MA. Bachelor of Science, University of Maine, Master of Science, Robert Morris University, 2008

Raffaela Wolf, MS, MA. Bachelor of Science, University of Maine, Master of Science, Robert Morris University, 2008 Assessing the Impact of Characteristics of the Test, Common-items, and Examinees on the Preservation of Equity Properties in Mixed-format Test Equating by Raffaela Wolf, MS, MA Bachelor of Science, University

More information

Center for Advanced Studies in Measurement and Assessment. CASMA Research Report

Center for Advanced Studies in Measurement and Assessment. CASMA Research Report Center for Advanced Studies in Measurement and Assessment CASMA Research Report Number 23 Comparison of Three IRT Linking Procedures in the Random Groups Equating Design Won-Chan Lee Jae-Chun Ban February

More information

Biology 559R: Introduction to Phylogenetic Comparative Methods Topics for this week:

Biology 559R: Introduction to Phylogenetic Comparative Methods Topics for this week: Biology 559R: Introduction to Phylogenetic Comparative Methods Topics for this week: Course general information About the course Course objectives Comparative methods: An overview R as language: uses and

More information

Use of e-rater in Scoring of the TOEFL ibt Writing Test

Use of e-rater in Scoring of the TOEFL ibt Writing Test Research Report ETS RR 11-25 Use of e-rater in Scoring of the TOEFL ibt Writing Test Shelby J. Haberman June 2011 Use of e-rater in Scoring of the TOEFL ibt Writing Test Shelby J. Haberman ETS, Princeton,

More information

Load-Strength Interference

Load-Strength Interference Load-Strength Interference Loads vary, strengths vary, and reliability usually declines for mechanical systems, electronic systems, and electrical systems. The cause of failures is a load-strength interference

More information

Item Response Theory for Scores on Tests Including Polytomous Items with Ordered Responses

Item Response Theory for Scores on Tests Including Polytomous Items with Ordered Responses Item Response Theory for Scores on Tests Including Polytomous Items with Ordered Responses David Thissen, University of North Carolina at Chapel Hill Mary Pommerich, American College Testing Kathleen Billeaud,

More information

Effect of Repeaters on Score Equating in a Large Scale Licensure Test. Sooyeon Kim Michael E. Walker ETS, Princeton, NJ

Effect of Repeaters on Score Equating in a Large Scale Licensure Test. Sooyeon Kim Michael E. Walker ETS, Princeton, NJ Effect of Repeaters on Score Equating in a Large Scale Licensure Test Sooyeon Kim Michael E. Walker ETS, Princeton, NJ Paper presented at the annual meeting of the American Educational Research Association

More information

MLMED. User Guide. Nicholas J. Rockwood The Ohio State University Beta Version May, 2017

MLMED. User Guide. Nicholas J. Rockwood The Ohio State University Beta Version May, 2017 MLMED User Guide Nicholas J. Rockwood The Ohio State University rockwood.19@osu.edu Beta Version May, 2017 MLmed is a computational macro for SPSS that simplifies the fitting of multilevel mediation and

More information

One-Way Repeated Measures Contrasts

One-Way Repeated Measures Contrasts Chapter 44 One-Way Repeated easures Contrasts Introduction This module calculates the power of a test of a contrast among the means in a one-way repeated measures design using either the multivariate test

More information

Monte Carlo Simulations for Rasch Model Tests

Monte Carlo Simulations for Rasch Model Tests Monte Carlo Simulations for Rasch Model Tests Patrick Mair Vienna University of Economics Thomas Ledl University of Vienna Abstract: Sources of deviation from model fit in Rasch models can be lack of unidimensionality,

More information

Monte Carlo Studies. The response in a Monte Carlo study is a random variable.

Monte Carlo Studies. The response in a Monte Carlo study is a random variable. Monte Carlo Studies The response in a Monte Carlo study is a random variable. The response in a Monte Carlo study has a variance that comes from the variance of the stochastic elements in the data-generating

More information

component risk analysis

component risk analysis 273: Urban Systems Modeling Lec. 3 component risk analysis instructor: Matteo Pozzi 273: Urban Systems Modeling Lec. 3 component reliability outline risk analysis for components uncertain demand and uncertain

More information

User Guide for Hermir version 0.9: Toolbox for Synthesis of Multivariate Stationary Gaussian and non-gaussian Series

User Guide for Hermir version 0.9: Toolbox for Synthesis of Multivariate Stationary Gaussian and non-gaussian Series User Guide for Hermir version 0.9: Toolbox for Synthesis of Multivariate Stationary Gaussian and non-gaussian Series Hannes Helgason, Vladas Pipiras, and Patrice Abry June 2, 2011 Contents 1 Organization

More information

DISCRETE RANDOM VARIABLES EXCEL LAB #3

DISCRETE RANDOM VARIABLES EXCEL LAB #3 DISCRETE RANDOM VARIABLES EXCEL LAB #3 ECON/BUSN 180: Quantitative Methods for Economics and Business Department of Economics and Business Lake Forest College Lake Forest, IL 60045 Copyright, 2011 Overview

More information

Appendix F. Computational Statistics Toolbox. The Computational Statistics Toolbox can be downloaded from:

Appendix F. Computational Statistics Toolbox. The Computational Statistics Toolbox can be downloaded from: Appendix F Computational Statistics Toolbox The Computational Statistics Toolbox can be downloaded from: http://www.infinityassociates.com http://lib.stat.cmu.edu. Please review the readme file for installation

More information

Algorithms and Code JMASM24: Numerical Computing for Third-Order Power Method Polynomials (Excel)

Algorithms and Code JMASM24: Numerical Computing for Third-Order Power Method Polynomials (Excel) Journal of Modern Applied Statistical Methods November, 2006, Vol. 5, No.2, 567-574 Copyright 2006 JMASM, Inc. 1538-9472/06/$95.00 Algorithms and Code JMASM24: Numerical Computing for Third-Order Power

More information

Athena Visual Software, Inc. 1

Athena Visual Software, Inc. 1 Athena Visual Studio Visual Kinetics Tutorial VisualKinetics is an integrated tool within the Athena Visual Studio software environment, which allows scientists and engineers to simulate the dynamic behavior

More information

Equating Subscores Using Total Scaled Scores as an Anchor

Equating Subscores Using Total Scaled Scores as an Anchor Research Report ETS RR 11-07 Equating Subscores Using Total Scaled Scores as an Anchor Gautam Puhan Longjuan Liang March 2011 Equating Subscores Using Total Scaled Scores as an Anchor Gautam Puhan and

More information

ROBUST SCALE TRNASFORMATION METHODS IN IRT TRUE SCORE EQUATING UNDER COMMON-ITEM NONEQUIVALENT GROUPS DESIGN

ROBUST SCALE TRNASFORMATION METHODS IN IRT TRUE SCORE EQUATING UNDER COMMON-ITEM NONEQUIVALENT GROUPS DESIGN ROBUST SCALE TRNASFORMATION METHODS IN IRT TRUE SCORE EQUATING UNDER COMMON-ITEM NONEQUIVALENT GROUPS DESIGN A Dissertation Presented to the Faculty of the Department of Educational, School and Counseling

More information

Adaptive Fractional Polynomial Modeling in SAS

Adaptive Fractional Polynomial Modeling in SAS Adaptive Fractional Polynomial Modeling in SAS George J. Knafl, PhD Professor University of North Carolina at Chapel Hill School of Nursing Overview of Topics properties of the genreg macro for adaptive

More information

Preliminary statistics

Preliminary statistics 1 Preliminary statistics The solution of a geophysical inverse problem can be obtained by a combination of information from observed data, the theoretical relation between data and earth parameters (models),

More information

Center for Advanced Studies in Measurement and Assessment. CASMA Research Report

Center for Advanced Studies in Measurement and Assessment. CASMA Research Report Center for Advanced Studies in Measurement and Assessment CASMA Research Report Number 41 A Comparative Study of Item Response Theory Item Calibration Methods for the Two Parameter Logistic Model Kyung

More information

A Note on the Choice of an Anchor Test in Equating

A Note on the Choice of an Anchor Test in Equating Research Report ETS RR 12-14 A Note on the Choice of an Anchor Test in Equating Sandip Sinharay Shelby Haberman Paul Holland Charles Lewis September 2012 ETS Research Report Series EIGNOR EXECUTIVE EDITOR

More information

Dynamic System Identification using HDMR-Bayesian Technique

Dynamic System Identification using HDMR-Bayesian Technique Dynamic System Identification using HDMR-Bayesian Technique *Shereena O A 1) and Dr. B N Rao 2) 1), 2) Department of Civil Engineering, IIT Madras, Chennai 600036, Tamil Nadu, India 1) ce14d020@smail.iitm.ac.in

More information

Asymptotic Multivariate Kriging Using Estimated Parameters with Bayesian Prediction Methods for Non-linear Predictands

Asymptotic Multivariate Kriging Using Estimated Parameters with Bayesian Prediction Methods for Non-linear Predictands Asymptotic Multivariate Kriging Using Estimated Parameters with Bayesian Prediction Methods for Non-linear Predictands Elizabeth C. Mannshardt-Shamseldin Advisor: Richard L. Smith Duke University Department

More information

Using statistical equating for standard maintaining in GCSEs and A levels

Using statistical equating for standard maintaining in GCSEs and A levels Using statistical equating for standard maintaining in GCSEs and A levels Tom Bramley & Carmen Vidal Rodeiro Cambridge Assessment Research Report 22 nd January 2014 Author contact details: Tom Bramley

More information

Notes on Random Vectors and Multivariate Normal

Notes on Random Vectors and Multivariate Normal MATH 590 Spring 06 Notes on Random Vectors and Multivariate Normal Properties of Random Vectors If X,, X n are random variables, then X = X,, X n ) is a random vector, with the cumulative distribution

More information

FleXScan User Guide. for version 3.1. Kunihiko Takahashi Tetsuji Yokoyama Toshiro Tango. National Institute of Public Health

FleXScan User Guide. for version 3.1. Kunihiko Takahashi Tetsuji Yokoyama Toshiro Tango. National Institute of Public Health FleXScan User Guide for version 3.1 Kunihiko Takahashi Tetsuji Yokoyama Toshiro Tango National Institute of Public Health October 2010 http://www.niph.go.jp/soshiki/gijutsu/index_e.html User Guide version

More information

F denotes cumulative density. denotes probability density function; (.)

F denotes cumulative density. denotes probability density function; (.) BAYESIAN ANALYSIS: FOREWORDS Notation. System means the real thing and a model is an assumed mathematical form for the system.. he probability model class M contains the set of the all admissible models

More information

Regression: Lecture 2

Regression: Lecture 2 Regression: Lecture 2 Niels Richard Hansen April 26, 2012 Contents 1 Linear regression and least squares estimation 1 1.1 Distributional results................................ 3 2 Non-linear effects and

More information

Local Dependence Diagnostics in IRT Modeling of Binary Data

Local Dependence Diagnostics in IRT Modeling of Binary Data Local Dependence Diagnostics in IRT Modeling of Binary Data Educational and Psychological Measurement 73(2) 254 274 Ó The Author(s) 2012 Reprints and permission: sagepub.com/journalspermissions.nav DOI:

More information

A Quadratic Curve Equating Method to Equate the First Three Moments in Equipercentile Equating

A Quadratic Curve Equating Method to Equate the First Three Moments in Equipercentile Equating A Quadratic Curve Equating Method to Equate the First Three Moments in Equipercentile Equating Tianyou Wang and Michael J. Kolen American College Testing A quadratic curve test equating method for equating

More information

Introduction to Regression

Introduction to Regression Introduction to Regression David E Jones (slides mostly by Chad M Schafer) June 1, 2016 1 / 102 Outline General Concepts of Regression, Bias-Variance Tradeoff Linear Regression Nonparametric Procedures

More information

Computationally Efficient Estimation of Multilevel High-Dimensional Latent Variable Models

Computationally Efficient Estimation of Multilevel High-Dimensional Latent Variable Models Computationally Efficient Estimation of Multilevel High-Dimensional Latent Variable Models Tihomir Asparouhov 1, Bengt Muthen 2 Muthen & Muthen 1 UCLA 2 Abstract Multilevel analysis often leads to modeling

More information

Center for Advanced Studies in Measurement and Assessment. CASMA Research Report

Center for Advanced Studies in Measurement and Assessment. CASMA Research Report Center for Advanced Studies in Measurement and Assessment CASMA Research Report Number 24 in Relation to Measurement Error for Mixed Format Tests Jae-Chun Ban Won-Chan Lee February 2007 The authors are

More information

mlegp: an R package for Gaussian process modeling and sensitivity analysis

mlegp: an R package for Gaussian process modeling and sensitivity analysis mlegp: an R package for Gaussian process modeling and sensitivity analysis Garrett Dancik January 30, 2018 1 mlegp: an overview Gaussian processes (GPs) are commonly used as surrogate statistical models

More information

Estimation of cumulative distribution function with spline functions

Estimation of cumulative distribution function with spline functions INTERNATIONAL JOURNAL OF ECONOMICS AND STATISTICS Volume 5, 017 Estimation of cumulative distribution function with functions Akhlitdin Nizamitdinov, Aladdin Shamilov Abstract The estimation of the cumulative

More information

Contents. Preface to Second Edition Preface to First Edition Abbreviations PART I PRINCIPLES OF STATISTICAL THINKING AND ANALYSIS 1

Contents. Preface to Second Edition Preface to First Edition Abbreviations PART I PRINCIPLES OF STATISTICAL THINKING AND ANALYSIS 1 Contents Preface to Second Edition Preface to First Edition Abbreviations xv xvii xix PART I PRINCIPLES OF STATISTICAL THINKING AND ANALYSIS 1 1 The Role of Statistical Methods in Modern Industry and Services

More information

Multivariate Statistics

Multivariate Statistics Multivariate Statistics Chapter 2: Multivariate distributions and inference Pedro Galeano Departamento de Estadística Universidad Carlos III de Madrid pedro.galeano@uc3m.es Course 2016/2017 Master in Mathematical

More information

SESUG 2011 ABSTRACT INTRODUCTION BACKGROUND ON LOGLINEAR SMOOTHING DESCRIPTION OF AN EXAMPLE. Paper CC-01

SESUG 2011 ABSTRACT INTRODUCTION BACKGROUND ON LOGLINEAR SMOOTHING DESCRIPTION OF AN EXAMPLE. Paper CC-01 Paper CC-01 Smoothing Scaled Score Distributions from a Standardized Test using PROC GENMOD Jonathan Steinberg, Educational Testing Service, Princeton, NJ Tim Moses, Educational Testing Service, Princeton,

More information

MIXED DATA GENERATOR

MIXED DATA GENERATOR MIXED DATA GENERATOR Martin Matějka Jiří Procházka Zdeněk Šulc Abstract Very frequently, simulated data are required for quality evaluation of newly developed coefficients. In some cases, datasets with

More information

MA 575 Linear Models: Cedric E. Ginestet, Boston University Revision: Probability and Linear Algebra Week 1, Lecture 2

MA 575 Linear Models: Cedric E. Ginestet, Boston University Revision: Probability and Linear Algebra Week 1, Lecture 2 MA 575 Linear Models: Cedric E Ginestet, Boston University Revision: Probability and Linear Algebra Week 1, Lecture 2 1 Revision: Probability Theory 11 Random Variables A real-valued random variable is

More information

The bootstrap. Patrick Breheny. December 6. The empirical distribution function The bootstrap

The bootstrap. Patrick Breheny. December 6. The empirical distribution function The bootstrap Patrick Breheny December 6 Patrick Breheny BST 764: Applied Statistical Modeling 1/21 The empirical distribution function Suppose X F, where F (x) = Pr(X x) is a distribution function, and we wish to estimate

More information

Overview. Multidimensional Item Response Theory. Lecture #12 ICPSR Item Response Theory Workshop. Basics of MIRT Assumptions Models Applications

Overview. Multidimensional Item Response Theory. Lecture #12 ICPSR Item Response Theory Workshop. Basics of MIRT Assumptions Models Applications Multidimensional Item Response Theory Lecture #12 ICPSR Item Response Theory Workshop Lecture #12: 1of 33 Overview Basics of MIRT Assumptions Models Applications Guidance about estimating MIRT Lecture

More information

Search for the Dubai in the remap search bar:

Search for the Dubai in the remap search bar: This tutorial is aimed at developing maps for two time periods with in Remap (). In this tutorial we are going to develop a classification water and non-water in Dubai for the year 2000 and the year 2016.

More information

The Use of R Language in the Teaching of Central Limit Theorem

The Use of R Language in the Teaching of Central Limit Theorem The Use of R Language in the Teaching of Central Limit Theorem Cheang Wai Kwong waikwong.cheang@nie.edu.sg National Institute of Education Nanyang Technological University Singapore Abstract: The Central

More information

STAT 3008 Applied Regression Analysis Tutorial 1: Introduction LAI Chun Hei

STAT 3008 Applied Regression Analysis Tutorial 1: Introduction LAI Chun Hei STAT 3008 Applied Regression Analysis Tutorial 1: Introduction LAI Chun Hei Department of Statistics, The Chinese University of Hong Kong 1 Mathematical Background In this courses, some theorems from elementary

More information

Research Collection. Basics of structural reliability and links with structural design codes FBH Herbsttagung November 22nd, 2013.

Research Collection. Basics of structural reliability and links with structural design codes FBH Herbsttagung November 22nd, 2013. Research Collection Presentation Basics of structural reliability and links with structural design codes FBH Herbsttagung November 22nd, 2013 Author(s): Sudret, Bruno Publication Date: 2013 Permanent Link:

More information

Paradoxical Results in Multidimensional Item Response Theory

Paradoxical Results in Multidimensional Item Response Theory UNC, December 6, 2010 Paradoxical Results in Multidimensional Item Response Theory Giles Hooker and Matthew Finkelman UNC, December 6, 2010 1 / 49 Item Response Theory Educational Testing Traditional model

More information

Subject CS1 Actuarial Statistics 1 Core Principles

Subject CS1 Actuarial Statistics 1 Core Principles Institute of Actuaries of India Subject CS1 Actuarial Statistics 1 Core Principles For 2019 Examinations Aim The aim of the Actuarial Statistics 1 subject is to provide a grounding in mathematical and

More information

The Matrix Algebra of Sample Statistics

The Matrix Algebra of Sample Statistics The Matrix Algebra of Sample Statistics James H. Steiger Department of Psychology and Human Development Vanderbilt University James H. Steiger (Vanderbilt University) The Matrix Algebra of Sample Statistics

More information

Journal of Statistical Software

Journal of Statistical Software JSS Journal of Statistical Software August 2014, Volume 59, Issue 7. http://www.jstatsoft.org/ SNSequate: Standard and Nonstandard Statistical Models and Methods for Test Equating Jorge González Pontificia

More information

Confidence Intervals for One-Way Repeated Measures Contrasts

Confidence Intervals for One-Way Repeated Measures Contrasts Chapter 44 Confidence Intervals for One-Way Repeated easures Contrasts Introduction This module calculates the expected width of a confidence interval for a contrast (linear combination) of the means in

More information

Multivariate Distributions

Multivariate Distributions IEOR E4602: Quantitative Risk Management Spring 2016 c 2016 by Martin Haugh Multivariate Distributions We will study multivariate distributions in these notes, focusing 1 in particular on multivariate

More information

A Recursive Formula for the Kaplan-Meier Estimator with Mean Constraints

A Recursive Formula for the Kaplan-Meier Estimator with Mean Constraints Noname manuscript No. (will be inserted by the editor) A Recursive Formula for the Kaplan-Meier Estimator with Mean Constraints Mai Zhou Yifan Yang Received: date / Accepted: date Abstract In this note

More information

Introduction to Regression

Introduction to Regression Introduction to Regression p. 1/97 Introduction to Regression Chad Schafer cschafer@stat.cmu.edu Carnegie Mellon University Introduction to Regression p. 1/97 Acknowledgement Larry Wasserman, All of Nonparametric

More information

Introduction to Smoothing spline ANOVA models (metamodelling)

Introduction to Smoothing spline ANOVA models (metamodelling) Introduction to Smoothing spline ANOVA models (metamodelling) M. Ratto DYNARE Summer School, Paris, June 215. Joint Research Centre www.jrc.ec.europa.eu Serving society Stimulating innovation Supporting

More information

Sparse Nonparametric Density Estimation in High Dimensions Using the Rodeo

Sparse Nonparametric Density Estimation in High Dimensions Using the Rodeo Outline in High Dimensions Using the Rodeo Han Liu 1,2 John Lafferty 2,3 Larry Wasserman 1,2 1 Statistics Department, 2 Machine Learning Department, 3 Computer Science Department, Carnegie Mellon University

More information

Some Approximations of the Logistic Distribution with Application to the Covariance Matrix of Logistic Regression

Some Approximations of the Logistic Distribution with Application to the Covariance Matrix of Logistic Regression Working Paper 2013:9 Department of Statistics Some Approximations of the Logistic Distribution with Application to the Covariance Matrix of Logistic Regression Ronnie Pingel Working Paper 2013:9 June

More information

Gaussian Processes. Le Song. Machine Learning II: Advanced Topics CSE 8803ML, Spring 2012

Gaussian Processes. Le Song. Machine Learning II: Advanced Topics CSE 8803ML, Spring 2012 Gaussian Processes Le Song Machine Learning II: Advanced Topics CSE 8803ML, Spring 01 Pictorial view of embedding distribution Transform the entire distribution to expected features Feature space Feature

More information

Introduction to Probability and Statistics (Continued)

Introduction to Probability and Statistics (Continued) Introduction to Probability and Statistics (Continued) Prof. icholas Zabaras Center for Informatics and Computational Science https://cics.nd.edu/ University of otre Dame otre Dame, Indiana, USA Email:

More information

A Note on the Equivalence Between Observed and Expected Information Functions With Polytomous IRT Models

A Note on the Equivalence Between Observed and Expected Information Functions With Polytomous IRT Models Journal of Educational and Behavioral Statistics 2015, Vol. 40, No. 1, pp. 96 105 DOI: 10.3102/1076998614558122 # 2014 AERA. http://jebs.aera.net A Note on the Equivalence Between Observed and Expected

More information

DEPARTMENT OF MATHEMATICS

DEPARTMENT OF MATHEMATICS DEPARTMENT OF MATHEMATICS Ma322 - Final Exam Spring 2011 May 3,4, 2011 DO NOT TURN THIS PAGE UNTIL YOU ARE INSTRUCTED TO DO SO. Be sure to show all work and justify your answers. There are 8 problems and

More information

Likelihood and Fairness in Multidimensional Item Response Theory

Likelihood and Fairness in Multidimensional Item Response Theory Likelihood and Fairness in Multidimensional Item Response Theory or What I Thought About On My Holidays Giles Hooker and Matthew Finkelman Cornell University, February 27, 2008 Item Response Theory Educational

More information

Random Matrix Eigenvalue Problems in Probabilistic Structural Mechanics

Random Matrix Eigenvalue Problems in Probabilistic Structural Mechanics Random Matrix Eigenvalue Problems in Probabilistic Structural Mechanics S Adhikari Department of Aerospace Engineering, University of Bristol, Bristol, U.K. URL: http://www.aer.bris.ac.uk/contact/academic/adhikari/home.html

More information

Bayesian Nonparametric Rasch Modeling: Methods and Software

Bayesian Nonparametric Rasch Modeling: Methods and Software Bayesian Nonparametric Rasch Modeling: Methods and Software George Karabatsos University of Illinois-Chicago Keynote talk Friday May 2, 2014 (9:15-10am) Ohio River Valley Objective Measurement Seminar

More information

A measure for the reliability of a rating scale based on longitudinal clinical trial data Link Peer-reviewed author version

A measure for the reliability of a rating scale based on longitudinal clinical trial data Link Peer-reviewed author version A measure for the reliability of a rating scale based on longitudinal clinical trial data Link Peer-reviewed author version Made available by Hasselt University Library in Document Server@UHasselt Reference

More information