Item Selection in Computerized Classification Testing. Nathan A. Thompson. Prometric

Size: px
Start display at page:

Download "Item Selection in Computerized Classification Testing. Nathan A. Thompson. Prometric"

Transcription

1 RUNNING HEAD: Item Selecton n CCT Item Selecton n Computerzed Classfcaton Testng Nathan A. Thompson Prometrc Address: Nathan A. Thompson Prometrc 160 Energy Lane St. Paul, MN 55108

2 Abstract Several alternatves for tem selecton algorthms based on tem response theory (IRT) n computerzed classfcaton testng (CCT) have been suggested (Spray & Reckase, 1994; Eggen, 1999; Ln & Spray, 000; Wessman, 004) wth no conclusve evdence on the substantal superorty of a sngle method. It s argued that the lack of szable effect s due to the fact that some of the methods actually assess the same concept through dfferent calculatons, and the smplest method s therefore the most approprate. Moreover, the effcency of tem selecton methods depend on the termnaton crtera that s used, whch s demonstrated through ddactc example and monte carlo smulaton. Item selecton at the cutscore, whch seems conceptually approprate for CCT, s not always the most effcent opton.

3 Item Selecton n Computerzed Classfcaton Testng Computerzed admnstraton of psychoeducatonal tests offers many advantages. A well-documented advantage s the reduced number of tems requred per examnee when varablelength testng s appled. Varable-length testng refers to tests where not every examnee receves the same number of tems; f the purpose of the test has been satsfed after only a small number of tems, the test s concluded. The most common method of varable length testng, when t s used for ablty/trat (θ) estmaton, s computerzed adaptve testng (Wess & Kngsbury, 1984). When the purpose of the test s to assgn an examnee to two or more mutually exclusve categores along the θ contnuum, t s called computerzed classfcaton testng (CCT: Ln & Spray, 000). A major component wthn CCT that realzes the advantage of reduced test length s that of ntellgent tem selecton algorthms. Contrary to a fxed-form conventonal test, tems are selected throughout the test, ether n testlets (Luecht & Nungester, 1998) or after each ndvdual tem (Spray & Reckase, 1994; Ln & Spray, 000). The tem selecton process can be termed ntellgent because the computer attempts to evaluate whch tem remanng n the bank would be the best tem to admnster next, wthn certan constrants. Item selecton methods dffer n how they perform ths evaluaton. Several ntellgent tem selecton optons have been suggested based on tem response theory (IRT), ncludng the maxmzaton of tem nformaton at the current θ estmate (Reckase, 1983), the cutscore (Spray and Reckase, 1994; 1996), across a regon of θ (Eggen, 1999), a logodds rato (Ln & Spray, 000), and across θ (Wessman, 004). However, these tend to fall nto two general types: estmate-based (EB) and cutscore-based (CB). The other major component of a CCT that produces shorter tests s the applcaton of varable-length termnaton crtera. The termnaton crteron s an algorthm that determnes whether the examnee s able to be classfed wthn certan parameters at each pont n the test. When the examnee s able to be classfed, they are assgned a category and the test s termnated. There are three prmary termnaton crtera n CCT. The sequental probablty rato test (SPRT; Wald, 1947; Eggen, 1999) formulates the decson process as a hypothess test that the examnee s θ s equal to a specfed pont above the cutscore or another specfed pont below the cutscore. Ablty confdence ntervals (ACI; Thompson, 006), orgnally termed adaptve mastery testng (Kngsbury & Wess, 1983), termnate the test when a confdence nterval for the examnee s θ s completely above or below the cutscore. Lastly, loss/utlty structures from Bayesan decson theory can be used to assgn classfcatons (Rudner, 00). The decson theoretc approach often uses random tem selecton and classcal test theory, so t s not consdered here. The purpose of ths study s threefold: (1) Demonstrate that tem selecton methods can be generally classfed as EB or CB; () Demonstrate that, wthn the EB and CB paradgms, varous tem selecton methods assess the same concept; (3) Demonstrate that EB selecton s approprate for ACI, and CB selecton s approprate for the SPRT. Item Response Theory The thrd necessary component of a CCT s the adopton of a psychometrc model. Whle CCTs can be developed usng classcal test theory (Frck, 199), methods dscussed heren only make use of tem response theory (IRT; Hambleton & Swamnathan, 1985; Embretson & Rese, 000) for several reasons. Frst, tem banks for large-scale testng programs are often calbrated Item Selecton n CCT 1

4 wth IRT. Second, CCT methods that use classcal test theory assume that examnees can be neatly dvded nto categores such as masters and nonmasters before tem calbraton, whch s not always possble. The ACI crteron for CCT and the tem selecton methods dscussed n ths study requre that tems be calbrated wth IRT because IRT places tems and examnees on the same scale. Lastly, the majorty of research on CCT s based on IRT. Ths study assumed that the data can be effcently modeled wth the three-parameter logstc model. The probablty of an examnee wth a gven θ j correctly respondng to an tem s (Hambleton & Swamnathan, 1985, Eq. 3.3): P ( X exp[ Da ( θ b )] = 1 θ j ) = c + (1 c ), (1) 1+ exp[ Da ( θ b )] where a s the tem dscrmnaton parameter, b s the tem dffculty or locaton parameter, c s the lower asymptote, or pseudoguessng parameter, and D s a scalng constant equal to 1.70 or 1.0. D was assumed to be 1.0 n ths study to make the example calculatons smpler. Item Selecton Crtera Although CCT research dates back to the 1960s (Ferguson, 1969), the frst ntellgent tem selecton method used n CCT was the maxmzaton of Fsher nformaton (FI) at the current estmate of θ (Reckase, 1983; Kngsbury & Wess, 1983). FI s broadly defned as the condtonal slope squared dvded by the condtonal varance, gven the probablty of a correct response to tem P (θ), or (Embretson & Rese, 000, Eq. 7 A.1) I ( θ) = P '( θ) P ( θ )(1 P ( θ)). () The nformaton functon for the three-parameter model s specfcally defned as (Embretson & Rese, 000, Eq. 7 A.) 1 P ( θ ) ( P ( θ ) c ) I ( θ ) = a. (3) P ( θ ) (1 c ) FI, whle a functon of θ, s evaluated at a sngle pont for each tem. Reckase (1983) and Kngsbury & Wess (1983) chose the current θ estmate as ths pont. Spray and Reckase (1994) suggested that tems nstead be selected to maxmze Fsher nformaton at the cutscore, argung that ths makes more conceptual sense snce the goal of the test s only to determne f an examnee s above or below that pont. Eggen (1999) advocated the use of nformaton across a regon of θ. Ths regonal, rather than pont, type of nformaton s known as Kullback-Lebler nformaton (KLI). KLI can be descrbed as the expectaton over observed responses x of the log-lkelhood rato for each tem, or L ( θ ; x ) K ( θ θ1) = Eθ log 1 (4) L ( θ1; x ) Item Selecton n CCT

5 wth θ 1 and θ representng two ponts on θ chosen by the test user that defne the regon that KLI s calculated on and L x 1 x ( ; x ) = P ( θ )[ 1 P ( θ ) θ ] (5) denotng the lkelhood functon for the th tem. The double vertcal bars are used n ths context to emphasze that θ 1 and θ are separated, and not vewed as the condtonal relatonshp ndcated by a sngle vertcal bar. Wth a dchotomous IRT model (Eggen, 1999; Ln & Spray, 000), ths smplfes to P ( θ ) Q ( θ ) K ( θ θ1) = P ( θ )log + Q ( θ )log (6) P ( θ ) Q ( θ ) 1 1 where P (θ ) s the probablty of a correct response at θ and Q (θ ) s the complementary probablty of an ncorrect response. The values θ 1 and θ can be chosen above and below the cutscore or the current estmate. Ln and Spray (000) developed another tem selecton crteron, selectng tems by maxmzng the log of the rato of the tem response probabltes at θ 1 and θ, P ( θ ) R = P ( θ1) X Q ( θ ) Q ( θ1) 1 X. (7) Ln and Spray suggest that θ 1 and θ be chosen above and below the cutscore, but smlar to KLI, t s also possble for them to be specfed at an nterval above and below the current θ estmate. Whle the log calculatons performed by Ln and Spray were more nvolved, they were stll dependent on maxmzng the dfference P(θ ) P(θ 1 ). The most general tem selecton method, mutual nformaton (Wessman, 004), evaluates nformaton across all responses and all values n θ. It s equvalent to the KLI between the dstrbutons of θ and X. Ths s expressed as f ( x, θ ) I( X, Θ) = f ( x, θ )log. (8) x X θ Θ f ( x ) f ( θ ) Because the functon n ths stuaton s the probablty of a response, the numerator of the bracketed rato, the jont functon of the tem response and θ, s the tem response functon (IRF). The functon f(x ) s, n psychometrc notaton, P(X = 1), the classcal dffculty value. The functon f(θ) s the assumed dstrbuton of θ. Wessman proposed that ths expresson be evaluated at dscrete ponts on θ, such as a pont above and below a cutscore. The smulaton n that study nvolved multple cutscores, but f there s only one cutscore, mutual nformaton can be reduced to KLI. Conceptually, mutual nformaton then quantfes the dfference between the IRF and the classcal dffculty across θ at the dscrete ponts specfed. An tem wth very low a results n IRF values that dffer very lttle from the classcal dffculty across θ. An tem wth hgh a results n low IRF values wth low θ and hgh IRF values wth hgh θ, resultng n hgher mutual nformaton. What makes mutual nformaton dfferent from a mere transformaton of the a parameter s the applcaton of an assumed θ dstrbuton. Dscrmnaton values equal, an tem Item Selecton n CCT 3

6 wth a dffculty near the mean of the θ dstrbuton wll have greater mutual nformaton than an tem wth extreme dffculty, because t provdes ts nformaton to a greater proporton of examnees. One drawback to mutual nformaton s the assumpton of a pror. If nothng s known concernng the examnee dstrbuton, a unform pror may be approprate, analogous to the use of maxmum lkelhood θ estmaton rather than Bayesan θ estmaton procedures. If ths were to be appled, mutual nformaton would smply be a functon of a. Ths was the approach used by Wessman (004). Wessman (004) also suggested another new crteron for tem selecton n CCT, Fsher nformaton wth a weght functon of the posteror θ dstrbuton or lkelhood functon. Smlar to other methods, ths was evaluated at the current θ estmate by Wessman, but can also be evaluated at the cutscore. Ths method does not contrbute anythng more than FI because, at a gven pont such as the cutscore or current θ estmate, the FI for all remanng tems n the bank s beng multpled by the same value whatever s the weght for that θ value. Such multplcaton does not change the rankng of the tems n terms of nformaton. EB and CB Selecton, and Equvalence Wthn Category As evdent from the outlne of the evaluatve processes above, each method can be desgned to assess the tem wth regards to the cutscore or to the current θ estmate. For the three methods wth a constraned evaluatve locale (FI, KLI, log-odds rato), the evaluatve process takes place at ether a sngle pont or two endponts of a regon, and ether approach can be specfed to occur at the cutscore or θ estmate. Even mutual nformaton, whch evaluates nformaton across any number of ponts on θ, can be specfed wth ponts relatve to cutscore(s) or a θ estmate. For ths reason, tem selecton methods n CCT can be broadly categorzed as EB or CB. Ths categorzaton s outlned n Table 1. Moreover, the three constraned methods are essentally equvalent, and tend to select the same tem. All three assess the nformaton provded by an tem at the evaluatve locale, cutscore or estmate, and smply dfference n the calculaton used to perform the evaluaton. For example, suppose a CCT s desgned to use CB tem selecton at the cutscore θ c = 0.5. FI wll select the tem wth the hghest nformaton at 0.5, whch s the tem wth the greatest a, and b nearest to 0.5. Ths same tem wll also provde the hghest average nformaton across a regon around 0.5. Addtonally, an tem wth these characterstcs wll also produce the greatest dfference P(θ ) P(θ 1 ) for a θ 1 below the cutscore and θ above the cutscore, whch maxmzes Ln and Spray s (000) crteron. Consder the followng example wth three tems, shown n Table. These three tems have smlar IRT parameters. Item 1 represents an approprate tem for admnstraton at θ c = 0.5; the locaton parameter matches exactly, and the dscrmnaton parameter s moderately hgh. Item s also approprate, but the locaton parameter s not exactly the same, whle tem 3 has the same locaton parameter as tem 1 but slghtly less dscrmnaton. Even though the tem parameter dfferences are qute small, the rankngs of the tems on the three tem selecton crtera are equvalent because they are assessng the same concept. Moreover, the values are proportonate wth each method. It s because of ths fact that very lttle dfference has been found n emprcal studes. Eggen (1999) compared several formulatons of FI and KLI, and found that the best formulatons of each method were approxmately equal n terms of ATL and PCC. Ln and Spray (000) found that the log-odds rato also performed smlarly. Wessman (004) found that mutual nformaton had hgher PCC but lower ATL than FI, but used EB FI rather than CB FI. As descrbed below, ths s an unfar comparson when the SPRT s the termnaton crteron, as t was n the study. Furthermore, the dfference was only Item Selecton n CCT 4

7 evdent for very short tests such as a maxmum test length of 10, n whch case EB FI selecton s neffcent because the test does not have a chance to obtan an accurate θ estmate (Chang & Yng, 1996). Even so, the dfference was only two to three tems n terms of average test length. In the nterests of parsmony, snce the three approaches produce the same rankng of tems, the method wth the least computatonal burden and specfcaton of parameters should be used when desgnng a CCT. Termnaton Crtera and Item Selecton As prevously mentoned, there are three termnaton crtera avalable for CCT, wth the SPRT and ACI the most commonly used n applcaton. ACI classfes an examnee by estmatng θ after each tem n the test and constructng a confdence nterval around the estmate θˆ usng the condtonal standard error of measurement (SEM), f maxmum lkelhood estmaton s appled, or the square root of the Bayesan posteror varance, f Bayesan estmaton s appled. The confdence nterval s represented mathematcally as ˆ θ z ( SEM ) < θ < ˆ θ + z ( SEM ). (9) α α where z α s the normal devate for a 1-α confdence nterval, such as 1.96 for a 95% nterval. If the confdence nterval falls completely above the cutscore, the examnee can be classfed as pass. If the confdence nterval falls completely below the cutscore, the examnee s classfed as fal. If the confdence nterval contans the cutscore, another tem s admnstered. The SPRT structures the decson as a hypothess test wth H 0 : θ = θ 1 and H 1 : θ = θ. The lkelhood of each hypothess s compared n the form of a lkelhood rato: L L( θ = θ u) = = 1 = n L( θ = θ1 u) n = 1 P ( θ ) P ( θ ) 1 X X Q ( θ ) Q ( θ ) 1 X 1 X 1. (10) Ths rato s then compared to two decson ponts A and B. Wald (1947) suggested the followng as approxmatons, assumng a nomnal Type I error rate of α and Type II rate of β: Lower decson pont = B = β/(1 α) (11) Upper decson pont = A = (1 β)/α. (1) If L > A, the examnee s classfed as above the cutscore and the test s termnated. If L < B, the examnee s classfed as below the cutscore. If B < L < A, another tem s admnstered. Note that θˆ s not nvolved. The SPRT was orgnally appled to CCT wth classcal test theory tem parameters (Ferguson, 1969). Reckase (1983) developed a procedure to apply tem response theory to the specfcaton of P and Q. Reckase suggested that two ponts θ 1 and θ be chosen on the θ metrc, where the value of θ 1 represents the lowest level that the test developer s wllng to pass, whle θ represents the hghest θ that the test developer s wllng to fal. The space between the two s called the ndfference regon, and s often specfed by addng and subtractng a user-defned value δ from the cutscore. For example, f the cutscore s 1.0 the test user could defne, accordng Item Selecton n CCT 5

8 to ther nterpretaton of the testng stuaton, a small ndfference regon wth δ = 0.1 (θ 1 = 0.9 and θ = 1.1) or a large ndfference regon wth δ = 1.0 (θ 1 = 0.0 and θ =.0). Ths ntroduces a certan amount of arbtrarness nto the procedure, whch s notable because the sze of δ has a drect effect on the performance of the termnaton crteron. Because an IRT tem response functon s strctly ncreasng, a large value of δ wll lead to a greater dsparty between P (θ 1 ) and P (θ ) f the tem was answered ncorrectly or Q (θ 1 ) and Q (θ ) f the tem was answered ncorrectly. Ths n turn causes the value of the lkelhood rato to depart from 1.0 wth fewer tems. The same reasonng apples to the effcency of tem selecton crtera. Because the SPRT operates wth regards to the cutscore only, whle ACI s constructed wth regards to θˆ, ther respectve nformaton needs dffer. Wth the SPRT, a decson s made more quckly when L s maxmzed or mnmzed by tems that maxmze the dfference P(θ) P(θ 1 ). As demonstrated prevously, ths s equvalent to selectng tems wth the greatest FI at the cutscore. Conversely, ACI makes a decson more quckly when an accurate estmate of θ s obtaned. Ths occurs when the condtonal SEM or Bayesan posteror varance s mnmzed. As the SEM s nversely related to nformaton at the current θ estmate (Embretson & Rese, 000), ACI makes a decson more quckly when nformaton s maxmzed at θˆ rather than the cutscore. Smulaton Study Ths nteracton between tem selecton method and termnaton crteron was evaluated wth a bref monte carlo smulaton. CCTs were smulated for 10,000 examnees that were randomly generated from a N(0,1) dstrbuton. The tem bank conssted of 400 dchotomously scored tems, wth a ~ N(1,0.), b ~ N(0,1), and c = 0.5. Because the SPRT ntroduces the addtonal arbtrarness n the parameter of IR wdth, the smulaton was completed for the two ACI condtons, and then the IR wdth systematcally vared untl an approxmately equvalent PCC was obtaned. Ths allows a more drect comparson between ACI and the SPRT n terms of ATL. The IR wdth for the CB condton was 0.40, and the wdth for the EB condton was The nteracton s evdent n the ATL for each condton, presented n Table 3. For the SPRT, CB tem selecton used 1.44 fewer tems on average. For ACI, CB tem selecton requred 9.66 more tems, on average, whle also havng slghtly less classfcaton accuracy. No tem exposure constrants were employed n ths smulaton, as past research s conclusve that ths smply decreases dfferences between competng methods of other CCT aspects, or has neglgble effects (Spray, Abdel-fattah, Huang, and Lau, 1997; Lau, 1998; Eggen, 1999; Eggen & Straetmans, 000; Ln & Spray, 000; Jao, Wang, & Lau, 004). Ths smulaton followed Ln and Spray (000), who specfcally dd not nclude constrants because they reduce the vsblty of comparsons among other varables. Dscusson Ths smulaton, as well as the prevous examples, demonstrates how a sngle tem selecton method s not the most effcent for all uses. Contrary to ntal percepton, cutoff tem selecton s not always approprate n varable-length CCT. Because of the way that the SPRT and ACI utlze nformaton dfferently n the classfcaton of examnees, the most approprate tem selecton method can vary. Specfcally, CB tem selecton s more effcent when the SPRT s the termnaton crteron, and EB tem selecton s more effcent when ATL s the termnaton crteron. Whle t may be ntally ntutve that CB tem selecton s more approprate for all two-classfcaton CCT because the test s only nterested n f the examnee s above or below the cutoff, ths s not true. Item Selecton n CCT 6

9 No prevous research has made an even comparson wth a complete crossng of tem selecton and termnaton crteron methods, but ths smulaton s supported by two earler smulaton studes. Eggen and Straetmans (000) found that ATL was two to three tems lower for EB selecton than CB selecton, wth ACI as the termnaton crteron. Spray and Reckase (1994) supported CB selecton wth the SPRT but found lttle dfference between CB and EB selecton n terms of ATL wth ACI. However, PCC was not nvestgated n that study. Also note that CB tem selecton entals greater exposure of tems wth locaton parameters near the cutscores. Item exposure s less of an ssue wth EB selecton; only the frst few tems tend to be the same for every examnee when no restrctons are mposed, whereas the entre set s the same for CB selecton. If tem exposure constrants were used n the current study, they would have a greater effect on ATL for CB selecton than EB selecton, for ths reason. Ths would lead to more smlar ATL for CB and EB selecton wth the SPRT, and ncrease the ATL advantage for EB selecton wth ACI. An evaluaton of the extent of ths effect offers a good target for future research, gven the wdespread use of exposure constrants n hghstakes testng. Addtonal ndependent varables also offer opportuntes for future research. Because of the dfferent nformaton needs of ACI and the SPRT, the shape of the tem bank functon must be approprate. Namely, there must be suffcent nformaton near the cutscore for CB selecton wth the SPRT, and there must be suffcent nformaton across θ for EB selecton wth ACI. Effcency would decrease wth an napproprate tem bank. Because tem selecton method has a drect effect on the effcency of the examnaton, the specfcaton of ths aspect of CCT as argued heren has drect sgnfcance for practtoners. Dependng on the remanng parameters of the CCT desgn, applcaton of an tem selecton algorthm that s more approprate for a gven termnaton crteron wll reduce the ATL for examnees, thereby modestly reducng test seat tme and the requred sze of the tem bank. Gven that ths s the ntent of varable-length CCT, proper desgn of CCTs that enhances ths effect s of practcal mportance. Conversely, correct specfcaton of tem selecton method ncreases the accuracy of decsons f test length s held constant. In hgh-stakes testng, accuracy s more mportant than effcency, as an examnee s lkely to be more upset wth a perceved msclassfcaton than wth beng admnstered more tems. Because the utlzaton of nformaton s mportant for maxmzng classfcaton accuracy, t should be gven substantal consderaton when a CCT s desgned. Item Selecton n CCT 7

10 References Chang, H.-H., & Yng, Z. (1996). A global nformaton approach to computerzed adaptve testng.appled Psychologcal Measurement, 0, Eggen, T.J.H.M. (1999). Item selecton n adaptve testng wth the sequental probablty rato test. Appled Psychologcal Measurement, 3, Eggen, T.J.H.M., & Straetmans, G. J. J. M. (000). Computerzed adaptve testng for classfyng examnees nto three categores. Educatonal and Psychologcal Measurement, 60, Embretson, S.E., & Rese, S.P. (000). Item response theory for psychologsts. Mahwah, NJ: Erlbaum. Ferguson, R. L. (1969). The development, mplementaton, and evaluaton of a computer-asssted branched test for a program of ndvdually prescrbed nstructon. Unpublshed doctoral dssertaton, Unversty of Pttsburgh. Frck, T.W. (199). Computerzed adaptve mastery tests as expert systems. Journal of Educatonal Computng Research, 8(), Hambleton, R. K., & Swamnathan, H. (1985). Item response theory: prncples and applcatons. Norwell, MA: Kluwer Academc Publshers. Jao, H., Wang, S., & Lau, C. A. (004). An Investgaton of Two Combnaton Procedures of SPRT for Three-category Classfcaton Decsons n Computerzed Classfcaton Test. Paper presented at the annual meetng of the Amercan Educatonal Research Assocaton, San Antono, Aprl 004. Kngsbury, G.G. & Wess, D.J. (1983). A comparson of IRT-based adaptve mastery testng and a sequental mastery testng procedure. In D.J. Wess (Ed.), New horzons n testng: Latent trat test theory and computerzed adaptve testng (pp ). New York: Academc Press. Lau, C. A. (1998). Robustness of a undmensonal computerzed testng mastery procedure wth multdmensonal testng data. Unpublshed doctoral dssertaton, Unversty of Iowa, Iowa Cty IA. Luecht, R.M., & Nungester, R.J. (1998). Some practcal examples of computer-adaptve sequental testng. Journal of Educatonal Measurement, 35, Ln, C.-J. & Spray, J.A. (000). Effects of tem-selecton crtera on classfcaton testng wth the sequental probablty rato test. (Research Report 000-8). Iowa Cty, IA: ACT, Inc. Reckase, M. D. (1983). A procedure for decson makng usng talored testng. In D. J. Wess (Ed.), New horzons n testng: Latent trat theory and computerzed adaptve testng (pp ). New York: Academc Press. Rudner, L.M. (00, Aprl). An examnaton of decson-theory adaptve testng procedures. Paper presented at the annual meetng of the Amercan Educatonal Research Assocaton, New Orleans, LA. Item Selecton n CCT 8

11 Spray, J. A., Abdel-fattah, A. A., Huang, C., and Lau, C. A. (1997). Undmensonal approxmatons for a computerzed test when the tem pool and latent space are multdmensonal (Research Report 97-5). Iowa Cty, Iowa: ACT, Inc. Spray, J.A., & Reckase, M.D. (1994, Aprl). The selecton of test tems for decson makng wth a computer adaptve test. Paper presented at the natonal meetng of the Natonal Councl on Measurement n Educaton, New Orleans LA. Spray, J.A., & Reckase, M.D.(1996). Comparson of SPRT and sequental Bayes procedures for classfyng examnees nto two categores usng a computerzed test. Journal of Educatonal and Behavoral Statstcs, 1, Thompson, N.A. (006). Varable-length computerzed classfcaton testng wth tem response theory. CLEAR Exam Revew, 17(), Wald, A. (1947). Sequental analyss. New York: Wley. Wess, D. J., & Kngsbury, G. G. (1984). Applcaton of computerzed adaptve testng to educatonal problems. Journal of Educatonal Measurement, 1, Wessman, A. (004). Mutual nformaton tem selecton n multple-category classfcaton CAT. Paper presented at the annual meetng of the Natonal Councl on Measurement n Educaton, San Dego CA. Item Selecton n CCT 9

12 Table 1: Classfcaton of tem selecton methods Approach Pont (FI) Regon (KLI) Estmate-Based Cutscore-Based 1. Maxmum FI at current estmate (Reckase, 1983) 1. Maxmum FI at cutscore(s) (Spray & Reckase, 1994) 1. KLI or MI n regon around current estmate (not used yet). Log-odds rato around current estmate (not used yet) 1. KLI or MI n regon around cutscore(s) (Eggen, 1999; Wessman, 004). Log-odds rato (Ln & Spray, 000) Item Selecton n CCT 10

13 Table : Example calculatons Item a b c P(θ = 0.0) P(θ = 0.5) P(θ = 1.0) FI KLI P(θ = 1.0) - P(θ = 0.0) Item Selecton n CCT 11

14 Table 3: ATL results from smulaton Termnaton Crteron Item Selecton ATL PCC SPRT CB EB ACI CB EB Item Selecton n CCT 1

Kernel Methods and SVMs Extension

Kernel Methods and SVMs Extension Kernel Methods and SVMs Extenson The purpose of ths document s to revew materal covered n Machne Learnng 1 Supervsed Learnng regardng support vector machnes (SVMs). Ths document also provdes a general

More information

Chapter 13: Multiple Regression

Chapter 13: Multiple Regression Chapter 13: Multple Regresson 13.1 Developng the multple-regresson Model The general model can be descrbed as: It smplfes for two ndependent varables: The sample ft parameter b 0, b 1, and b are used to

More information

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U)

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U) Econ 413 Exam 13 H ANSWERS Settet er nndelt 9 deloppgaver, A,B,C, som alle anbefales å telle lkt for å gøre det ltt lettere å stå. Svar er gtt . Unfortunately, there s a prntng error n the hnt of

More information

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore Sesson Outlne Introducton to classfcaton problems and dscrete choce models. Introducton to Logstcs Regresson. Logstc functon and Logt functon. Maxmum Lkelhood Estmator (MLE) for estmaton of LR parameters.

More information

Simulated Power of the Discrete Cramér-von Mises Goodness-of-Fit Tests

Simulated Power of the Discrete Cramér-von Mises Goodness-of-Fit Tests Smulated of the Cramér-von Mses Goodness-of-Ft Tests Steele, M., Chaselng, J. and 3 Hurst, C. School of Mathematcal and Physcal Scences, James Cook Unversty, Australan School of Envronmental Studes, Grffth

More information

Homework 9 STAT 530/J530 November 22 nd, 2005

Homework 9 STAT 530/J530 November 22 nd, 2005 Homework 9 STAT 530/J530 November 22 nd, 2005 Instructor: Bran Habng 1) Dstrbuton Q-Q plot Boxplot Heavy Taled Lght Taled Normal Skewed Rght Department of Statstcs LeConte 203 ch-square dstrbuton, Telephone:

More information

FREQUENCY DISTRIBUTIONS Page 1 of The idea of a frequency distribution for sets of observations will be introduced,

FREQUENCY DISTRIBUTIONS Page 1 of The idea of a frequency distribution for sets of observations will be introduced, FREQUENCY DISTRIBUTIONS Page 1 of 6 I. Introducton 1. The dea of a frequency dstrbuton for sets of observatons wll be ntroduced, together wth some of the mechancs for constructng dstrbutons of data. Then

More information

Boostrapaggregating (Bagging)

Boostrapaggregating (Bagging) Boostrapaggregatng (Baggng) An ensemble meta-algorthm desgned to mprove the stablty and accuracy of machne learnng algorthms Can be used n both regresson and classfcaton Reduces varance and helps to avod

More information

x = , so that calculated

x = , so that calculated Stat 4, secton Sngle Factor ANOVA notes by Tm Plachowsk n chapter 8 we conducted hypothess tests n whch we compared a sngle sample s mean or proporton to some hypotheszed value Chapter 9 expanded ths to

More information

Parametric fractional imputation for missing data analysis. Jae Kwang Kim Survey Working Group Seminar March 29, 2010

Parametric fractional imputation for missing data analysis. Jae Kwang Kim Survey Working Group Seminar March 29, 2010 Parametrc fractonal mputaton for mssng data analyss Jae Kwang Km Survey Workng Group Semnar March 29, 2010 1 Outlne Introducton Proposed method Fractonal mputaton Approxmaton Varance estmaton Multple mputaton

More information

Markov Chain Monte Carlo (MCMC), Gibbs Sampling, Metropolis Algorithms, and Simulated Annealing Bioinformatics Course Supplement

Markov Chain Monte Carlo (MCMC), Gibbs Sampling, Metropolis Algorithms, and Simulated Annealing Bioinformatics Course Supplement Markov Chan Monte Carlo MCMC, Gbbs Samplng, Metropols Algorthms, and Smulated Annealng 2001 Bonformatcs Course Supplement SNU Bontellgence Lab http://bsnuackr/ Outlne! Markov Chan Monte Carlo MCMC! Metropols-Hastngs

More information

Credit Card Pricing and Impact of Adverse Selection

Credit Card Pricing and Impact of Adverse Selection Credt Card Prcng and Impact of Adverse Selecton Bo Huang and Lyn C. Thomas Unversty of Southampton Contents Background Aucton model of credt card solctaton - Errors n probablty of beng Good - Errors n

More information

Chapter 5 Multilevel Models

Chapter 5 Multilevel Models Chapter 5 Multlevel Models 5.1 Cross-sectonal multlevel models 5.1.1 Two-level models 5.1.2 Multple level models 5.1.3 Multple level modelng n other felds 5.2 Longtudnal multlevel models 5.2.1 Two-level

More information

A Bayes Algorithm for the Multitask Pattern Recognition Problem Direct Approach

A Bayes Algorithm for the Multitask Pattern Recognition Problem Direct Approach A Bayes Algorthm for the Multtask Pattern Recognton Problem Drect Approach Edward Puchala Wroclaw Unversty of Technology, Char of Systems and Computer etworks, Wybrzeze Wyspanskego 7, 50-370 Wroclaw, Poland

More information

Negative Binomial Regression

Negative Binomial Regression STATGRAPHICS Rev. 9/16/2013 Negatve Bnomal Regresson Summary... 1 Data Input... 3 Statstcal Model... 3 Analyss Summary... 4 Analyss Optons... 7 Plot of Ftted Model... 8 Observed Versus Predcted... 10 Predctons...

More information

STAT 511 FINAL EXAM NAME Spring 2001

STAT 511 FINAL EXAM NAME Spring 2001 STAT 5 FINAL EXAM NAME Sprng Instructons: Ths s a closed book exam. No notes or books are allowed. ou may use a calculator but you are not allowed to store notes or formulas n the calculator. Please wrte

More information

BOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS. M. Krishna Reddy, B. Naveen Kumar and Y. Ramu

BOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS. M. Krishna Reddy, B. Naveen Kumar and Y. Ramu BOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS M. Krshna Reddy, B. Naveen Kumar and Y. Ramu Department of Statstcs, Osmana Unversty, Hyderabad -500 007, Inda. nanbyrozu@gmal.com, ramu0@gmal.com

More information

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers Psychology 282 Lecture #24 Outlne Regresson Dagnostcs: Outlers In an earler lecture we studed the statstcal assumptons underlyng the regresson model, ncludng the followng ponts: Formal statement of assumptons.

More information

Global Sensitivity. Tuesday 20 th February, 2018

Global Sensitivity. Tuesday 20 th February, 2018 Global Senstvty Tuesday 2 th February, 28 ) Local Senstvty Most senstvty analyses [] are based on local estmates of senstvty, typcally by expandng the response n a Taylor seres about some specfc values

More information

2E Pattern Recognition Solutions to Introduction to Pattern Recognition, Chapter 2: Bayesian pattern classification

2E Pattern Recognition Solutions to Introduction to Pattern Recognition, Chapter 2: Bayesian pattern classification E395 - Pattern Recognton Solutons to Introducton to Pattern Recognton, Chapter : Bayesan pattern classfcaton Preface Ths document s a soluton manual for selected exercses from Introducton to Pattern Recognton

More information

Bayesian predictive Configural Frequency Analysis

Bayesian predictive Configural Frequency Analysis Psychologcal Test and Assessment Modelng, Volume 54, 2012 (3), 285-292 Bayesan predctve Confgural Frequency Analyss Eduardo Gutérrez-Peña 1 Abstract Confgural Frequency Analyss s a method for cell-wse

More information

Qiong (Joan) Wu Harvard Center for Population and Development Studies. INDEPTH-SAGE WORKSHOP April 20, 2010

Qiong (Joan) Wu Harvard Center for Population and Development Studies. INDEPTH-SAGE WORKSHOP April 20, 2010 Qong Joan Wu Harvard Center for Populaton and Development Studes INDEPTH-SAGE WORKSHOP Aprl 20, 2010 1 IRT vs Classcal test theory CTT CTT: focuses test scores observed score = true score + error O=T+E

More information

Uncertainty in measurements of power and energy on power networks

Uncertainty in measurements of power and energy on power networks Uncertanty n measurements of power and energy on power networks E. Manov, N. Kolev Department of Measurement and Instrumentaton, Techncal Unversty Sofa, bul. Klment Ohrdsk No8, bl., 000 Sofa, Bulgara Tel./fax:

More information

/ n ) are compared. The logic is: if the two

/ n ) are compared. The logic is: if the two STAT C141, Sprng 2005 Lecture 13 Two sample tests One sample tests: examples of goodness of ft tests, where we are testng whether our data supports predctons. Two sample tests: called as tests of ndependence

More information

Comparison of Regression Lines

Comparison of Regression Lines STATGRAPHICS Rev. 9/13/2013 Comparson of Regresson Lnes Summary... 1 Data Input... 3 Analyss Summary... 4 Plot of Ftted Model... 6 Condtonal Sums of Squares... 6 Analyss Optons... 7 Forecasts... 8 Confdence

More information

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity LINEAR REGRESSION ANALYSIS MODULE IX Lecture - 30 Multcollnearty Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur 2 Remedes for multcollnearty Varous technques have

More information

A Robust Method for Calculating the Correlation Coefficient

A Robust Method for Calculating the Correlation Coefficient A Robust Method for Calculatng the Correlaton Coeffcent E.B. Nven and C. V. Deutsch Relatonshps between prmary and secondary data are frequently quantfed usng the correlaton coeffcent; however, the tradtonal

More information

Gaussian Mixture Models

Gaussian Mixture Models Lab Gaussan Mxture Models Lab Objectve: Understand the formulaton of Gaussan Mxture Models (GMMs) and how to estmate GMM parameters. You ve already seen GMMs as the observaton dstrbuton n certan contnuous

More information

Chapter 8 Indicator Variables

Chapter 8 Indicator Variables Chapter 8 Indcator Varables In general, e explanatory varables n any regresson analyss are assumed to be quanttatve n nature. For example, e varables lke temperature, dstance, age etc. are quanttatve n

More information

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands Content. Inference on Regresson Parameters a. Fndng Mean, s.d and covarance amongst estmates.. Confdence Intervals and Workng Hotellng Bands 3. Cochran s Theorem 4. General Lnear Testng 5. Measures of

More information

Module 3 LOSSY IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur

Module 3 LOSSY IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur Module 3 LOSSY IMAGE COMPRESSION SYSTEMS Verson ECE IIT, Kharagpur Lesson 6 Theory of Quantzaton Verson ECE IIT, Kharagpur Instructonal Objectves At the end of ths lesson, the students should be able to:

More information

Linear Approximation with Regularization and Moving Least Squares

Linear Approximation with Regularization and Moving Least Squares Lnear Approxmaton wth Regularzaton and Movng Least Squares Igor Grešovn May 007 Revson 4.6 (Revson : March 004). 5 4 3 0.5 3 3.5 4 Contents: Lnear Fttng...4. Weghted Least Squares n Functon Approxmaton...

More information

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA 4 Analyss of Varance (ANOVA) 5 ANOVA 51 Introducton ANOVA ANOVA s a way to estmate and test the means of multple populatons We wll start wth one-way ANOVA If the populatons ncluded n the study are selected

More information

DETERMINATION OF UNCERTAINTY ASSOCIATED WITH QUANTIZATION ERRORS USING THE BAYESIAN APPROACH

DETERMINATION OF UNCERTAINTY ASSOCIATED WITH QUANTIZATION ERRORS USING THE BAYESIAN APPROACH Proceedngs, XVII IMEKO World Congress, June 7, 3, Dubrovn, Croata Proceedngs, XVII IMEKO World Congress, June 7, 3, Dubrovn, Croata TC XVII IMEKO World Congress Metrology n the 3rd Mllennum June 7, 3,

More information

Statistics II Final Exam 26/6/18

Statistics II Final Exam 26/6/18 Statstcs II Fnal Exam 26/6/18 Academc Year 2017/18 Solutons Exam duraton: 2 h 30 mn 1. (3 ponts) A town hall s conductng a study to determne the amount of leftover food produced by the restaurants n the

More information

Computerized Adaptive Testing Item Selection in Computerized Adaptive Learning Systems

Computerized Adaptive Testing Item Selection in Computerized Adaptive Learning Systems Chapter 2 Computerzed Adaptve Testng Item Selecton n Computerzed Adaptve Learnng Systems Theo J.H.M. Eggen Abstract Item selecton methods tradtonally developed for computerzed adaptve testng (CAT) are

More information

x i1 =1 for all i (the constant ).

x i1 =1 for all i (the constant ). Chapter 5 The Multple Regresson Model Consder an economc model where the dependent varable s a functon of K explanatory varables. The economc model has the form: y = f ( x,x,..., ) xk Approxmate ths by

More information

Composite Hypotheses testing

Composite Hypotheses testing Composte ypotheses testng In many hypothess testng problems there are many possble dstrbutons that can occur under each of the hypotheses. The output of the source s a set of parameters (ponts n a parameter

More information

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition)

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition) Count Data Models See Book Chapter 11 2 nd Edton (Chapter 10 1 st Edton) Count data consst of non-negatve nteger values Examples: number of drver route changes per week, the number of trp departure changes

More information

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4) I. Classcal Assumptons Econ7 Appled Econometrcs Topc 3: Classcal Model (Studenmund, Chapter 4) We have defned OLS and studed some algebrac propertes of OLS. In ths topc we wll study statstcal propertes

More information

ESTIMATES OF VARIANCE COMPONENTS IN RANDOM EFFECTS META-ANALYSIS: SENSITIVITY TO VIOLATIONS OF NORMALITY AND VARIANCE HOMOGENEITY

ESTIMATES OF VARIANCE COMPONENTS IN RANDOM EFFECTS META-ANALYSIS: SENSITIVITY TO VIOLATIONS OF NORMALITY AND VARIANCE HOMOGENEITY ESTIMATES OF VARIANCE COMPONENTS IN RANDOM EFFECTS META-ANALYSIS: SENSITIVITY TO VIOLATIONS OF NORMALITY AND VARIANCE HOMOGENEITY Jeffrey D. Kromrey and Krstne Y. Hogarty Department of Educatonal Measurement

More information

STAT 3008 Applied Regression Analysis

STAT 3008 Applied Regression Analysis STAT 3008 Appled Regresson Analyss Tutoral : Smple Lnear Regresson LAI Chun He Department of Statstcs, The Chnese Unversty of Hong Kong 1 Model Assumpton To quantfy the relatonshp between two factors,

More information

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6 Department of Quanttatve Methods & Informaton Systems Tme Seres and Ther Components QMIS 30 Chapter 6 Fall 00 Dr. Mohammad Zanal These sldes were modfed from ther orgnal source for educatonal purpose only.

More information

Comparison of the Population Variance Estimators. of 2-Parameter Exponential Distribution Based on. Multiple Criteria Decision Making Method

Comparison of the Population Variance Estimators. of 2-Parameter Exponential Distribution Based on. Multiple Criteria Decision Making Method Appled Mathematcal Scences, Vol. 7, 0, no. 47, 07-0 HIARI Ltd, www.m-hkar.com Comparson of the Populaton Varance Estmators of -Parameter Exponental Dstrbuton Based on Multple Crtera Decson Makng Method

More information

Basically, if you have a dummy dependent variable you will be estimating a probability.

Basically, if you have a dummy dependent variable you will be estimating a probability. ECON 497: Lecture Notes 13 Page 1 of 1 Metropoltan State Unversty ECON 497: Research and Forecastng Lecture Notes 13 Dummy Dependent Varable Technques Studenmund Chapter 13 Bascally, f you have a dummy

More information

Bayesian Planning of Hit-Miss Inspection Tests

Bayesian Planning of Hit-Miss Inspection Tests Bayesan Plannng of Ht-Mss Inspecton Tests Yew-Meng Koh a and Wllam Q Meeker a a Center for Nondestructve Evaluaton, Department of Statstcs, Iowa State Unversty, Ames, Iowa 5000 Abstract Although some useful

More information

Markov Chain Monte Carlo Lecture 6

Markov Chain Monte Carlo Lecture 6 where (x 1,..., x N ) X N, N s called the populaton sze, f(x) f (x) for at least one {1, 2,..., N}, and those dfferent from f(x) are called the tral dstrbutons n terms of mportance samplng. Dfferent ways

More information

Outline. Bayesian Networks: Maximum Likelihood Estimation and Tree Structure Learning. Our Model and Data. Outline

Outline. Bayesian Networks: Maximum Likelihood Estimation and Tree Structure Learning. Our Model and Data. Outline Outlne Bayesan Networks: Maxmum Lkelhood Estmaton and Tree Structure Learnng Huzhen Yu janey.yu@cs.helsnk.f Dept. Computer Scence, Unv. of Helsnk Probablstc Models, Sprng, 200 Notces: I corrected a number

More information

The Expectation-Maximization Algorithm

The Expectation-Maximization Algorithm The Expectaton-Maxmaton Algorthm Charles Elan elan@cs.ucsd.edu November 16, 2007 Ths chapter explans the EM algorthm at multple levels of generalty. Secton 1 gves the standard hgh-level verson of the algorthm.

More information

P R. Lecture 4. Theory and Applications of Pattern Recognition. Dept. of Electrical and Computer Engineering /

P R. Lecture 4. Theory and Applications of Pattern Recognition. Dept. of Electrical and Computer Engineering / Theory and Applcatons of Pattern Recognton 003, Rob Polkar, Rowan Unversty, Glassboro, NJ Lecture 4 Bayes Classfcaton Rule Dept. of Electrcal and Computer Engneerng 0909.40.0 / 0909.504.04 Theory & Applcatons

More information

Module 9. Lecture 6. Duality in Assignment Problems

Module 9. Lecture 6. Duality in Assignment Problems Module 9 1 Lecture 6 Dualty n Assgnment Problems In ths lecture we attempt to answer few other mportant questons posed n earler lecture for (AP) and see how some of them can be explaned through the concept

More information

A Comparative Study for Estimation Parameters in Panel Data Model

A Comparative Study for Estimation Parameters in Panel Data Model A Comparatve Study for Estmaton Parameters n Panel Data Model Ahmed H. Youssef and Mohamed R. Abonazel hs paper examnes the panel data models when the regresson coeffcents are fxed random and mxed and

More information

Number of cases Number of factors Number of covariates Number of levels of factor i. Value of the dependent variable for case k

Number of cases Number of factors Number of covariates Number of levels of factor i. Value of the dependent variable for case k ANOVA Model and Matrx Computatons Notaton The followng notaton s used throughout ths chapter unless otherwse stated: N F CN Y Z j w W Number of cases Number of factors Number of covarates Number of levels

More information

Calculating CLs Limits. Abstract

Calculating CLs Limits. Abstract DØNote 4492 Calculatng CLs Lmts Harrson B. Prosper Florda State Unversty, Tallahassee, Florda 32306 (Dated: June 8, 2004) Abstract Ths note suggests how the calculaton of lmts based on the CLs method mght

More information

NUMERICAL DIFFERENTIATION

NUMERICAL DIFFERENTIATION NUMERICAL DIFFERENTIATION 1 Introducton Dfferentaton s a method to compute the rate at whch a dependent output y changes wth respect to the change n the ndependent nput x. Ths rate of change s called the

More information

CIE4801 Transportation and spatial modelling Trip distribution

CIE4801 Transportation and spatial modelling Trip distribution CIE4801 ransportaton and spatal modellng rp dstrbuton Rob van Nes, ransport & Plannng 17/4/13 Delft Unversty of echnology Challenge the future Content What s t about hree methods Wth specal attenton for

More information

Non-Mixture Cure Model for Interval Censored Data: Simulation Study ABSTRACT

Non-Mixture Cure Model for Interval Censored Data: Simulation Study ABSTRACT Malaysan Journal of Mathematcal Scences 8(S): 37-44 (2014) Specal Issue: Internatonal Conference on Mathematcal Scences and Statstcs 2013 (ICMSS2013) MALAYSIAN JOURNAL OF MATHEMATICAL SCIENCES Journal

More information

Using the estimated penetrances to determine the range of the underlying genetic model in casecontrol

Using the estimated penetrances to determine the range of the underlying genetic model in casecontrol Georgetown Unversty From the SelectedWorks of Mark J Meyer 8 Usng the estmated penetrances to determne the range of the underlyng genetc model n casecontrol desgn Mark J Meyer Neal Jeffres Gang Zheng Avalable

More information

Here is the rationale: If X and y have a strong positive relationship to one another, then ( x x) will tend to be positive when ( y y)

Here is the rationale: If X and y have a strong positive relationship to one another, then ( x x) will tend to be positive when ( y y) Secton 1.5 Correlaton In the prevous sectons, we looked at regresson and the value r was a measurement of how much of the varaton n y can be attrbuted to the lnear relatonshp between y and x. In ths secton,

More information

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X Statstcs 1: Probablty Theory II 37 3 EPECTATION OF SEVERAL RANDOM VARIABLES As n Probablty Theory I, the nterest n most stuatons les not on the actual dstrbuton of a random vector, but rather on a number

More information

Lecture 12: Classification

Lecture 12: Classification Lecture : Classfcaton g Dscrmnant functons g The optmal Bayes classfer g Quadratc classfers g Eucldean and Mahalanobs metrcs g K Nearest Neghbor Classfers Intellgent Sensor Systems Rcardo Guterrez-Osuna

More information

Polynomial Regression Models

Polynomial Regression Models LINEAR REGRESSION ANALYSIS MODULE XII Lecture - 6 Polynomal Regresson Models Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur Test of sgnfcance To test the sgnfcance

More information

Resource Allocation with a Budget Constraint for Computing Independent Tasks in the Cloud

Resource Allocation with a Budget Constraint for Computing Independent Tasks in the Cloud Resource Allocaton wth a Budget Constrant for Computng Independent Tasks n the Cloud Wemng Sh and Bo Hong School of Electrcal and Computer Engneerng Georga Insttute of Technology, USA 2nd IEEE Internatonal

More information

Economics 130. Lecture 4 Simple Linear Regression Continued

Economics 130. Lecture 4 Simple Linear Regression Continued Economcs 130 Lecture 4 Contnued Readngs for Week 4 Text, Chapter and 3. We contnue wth addressng our second ssue + add n how we evaluate these relatonshps: Where do we get data to do ths analyss? How do

More information

3.1 ML and Empirical Distribution

3.1 ML and Empirical Distribution 67577 Intro. to Machne Learnng Fall semester, 2008/9 Lecture 3: Maxmum Lkelhood/ Maxmum Entropy Dualty Lecturer: Amnon Shashua Scrbe: Amnon Shashua 1 In the prevous lecture we defned the prncple of Maxmum

More information

Statistical Inference. 2.3 Summary Statistics Measures of Center and Spread. parameters ( population characteristics )

Statistical Inference. 2.3 Summary Statistics Measures of Center and Spread. parameters ( population characteristics ) Ismor Fscher, 8//008 Stat 54 / -8.3 Summary Statstcs Measures of Center and Spread Dstrbuton of dscrete contnuous POPULATION Random Varable, numercal True center =??? True spread =???? parameters ( populaton

More information

ANSWERS CHAPTER 9. TIO 9.2: If the values are the same, the difference is 0, therefore the null hypothesis cannot be rejected.

ANSWERS CHAPTER 9. TIO 9.2: If the values are the same, the difference is 0, therefore the null hypothesis cannot be rejected. ANSWERS CHAPTER 9 THINK IT OVER thnk t over TIO 9.: χ 2 k = ( f e ) = 0 e Breakng the equaton down: the test statstc for the ch-squared dstrbuton s equal to the sum over all categores of the expected frequency

More information

Lecture 6: Introduction to Linear Regression

Lecture 6: Introduction to Linear Regression Lecture 6: Introducton to Lnear Regresson An Manchakul amancha@jhsph.edu 24 Aprl 27 Lnear regresson: man dea Lnear regresson can be used to study an outcome as a lnear functon of a predctor Example: 6

More information

Efficient nonresponse weighting adjustment using estimated response probability

Efficient nonresponse weighting adjustment using estimated response probability Effcent nonresponse weghtng adjustment usng estmated response probablty Jae Kwang Km Department of Appled Statstcs, Yonse Unversty, Seoul, 120-749, KOREA Key Words: Regresson estmator, Propensty score,

More information

since [1-( 0+ 1x1i+ 2x2 i)] [ 0+ 1x1i+ assumed to be a reasonable approximation

since [1-( 0+ 1x1i+ 2x2 i)] [ 0+ 1x1i+ assumed to be a reasonable approximation Econ 388 R. Butler 204 revsons Lecture 4 Dummy Dependent Varables I. Lnear Probablty Model: the Regresson model wth a dummy varables as the dependent varable assumpton, mplcaton regular multple regresson

More information

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics ECOOMICS 35*-A Md-Term Exam -- Fall Term 000 Page of 3 pages QUEE'S UIVERSITY AT KIGSTO Department of Economcs ECOOMICS 35* - Secton A Introductory Econometrcs Fall Term 000 MID-TERM EAM ASWERS MG Abbott

More information

2016 Wiley. Study Session 2: Ethical and Professional Standards Application

2016 Wiley. Study Session 2: Ethical and Professional Standards Application 6 Wley Study Sesson : Ethcal and Professonal Standards Applcaton LESSON : CORRECTION ANALYSIS Readng 9: Correlaton and Regresson LOS 9a: Calculate and nterpret a sample covarance and a sample correlaton

More information

Statistical tables are provided Two Hours UNIVERSITY OF MANCHESTER. Date: Wednesday 4 th June 2008 Time: 1400 to 1600

Statistical tables are provided Two Hours UNIVERSITY OF MANCHESTER. Date: Wednesday 4 th June 2008 Time: 1400 to 1600 Statstcal tables are provded Two Hours UNIVERSITY OF MNCHESTER Medcal Statstcs Date: Wednesday 4 th June 008 Tme: 1400 to 1600 MT3807 Electronc calculators may be used provded that they conform to Unversty

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Experment-I MODULE VIII LECTURE - 34 ANALYSIS OF VARIANCE IN RANDOM-EFFECTS MODEL AND MIXED-EFFECTS EFFECTS MODEL Dr Shalabh Department of Mathematcs and Statstcs Indan

More information

Lecture 7: Boltzmann distribution & Thermodynamics of mixing

Lecture 7: Boltzmann distribution & Thermodynamics of mixing Prof. Tbbtt Lecture 7 etworks & Gels Lecture 7: Boltzmann dstrbuton & Thermodynamcs of mxng 1 Suggested readng Prof. Mark W. Tbbtt ETH Zürch 13 März 018 Molecular Drvng Forces Dll and Bromberg: Chapters

More information

Semiparametric geographically weighted generalised linear modelling in GWR 4.0

Semiparametric geographically weighted generalised linear modelling in GWR 4.0 Semparametrc geographcally weghted generalsed lnear modellng n GWR 4.0 T. Nakaya 1, A. S. Fotherngham 2, M. Charlton 2, C. Brunsdon 3 1 Department of Geography, Rtsumekan Unversty, 56-1 Tojn-kta-mach,

More information

Second Order Analysis

Second Order Analysis Second Order Analyss In the prevous classes we looked at a method that determnes the load correspondng to a state of bfurcaton equlbrum of a perfect frame by egenvalye analyss The system was assumed to

More information

Feature Selection: Part 1

Feature Selection: Part 1 CSE 546: Machne Learnng Lecture 5 Feature Selecton: Part 1 Instructor: Sham Kakade 1 Regresson n the hgh dmensonal settng How do we learn when the number of features d s greater than the sample sze n?

More information

CHAPTER 5 NUMERICAL EVALUATION OF DYNAMIC RESPONSE

CHAPTER 5 NUMERICAL EVALUATION OF DYNAMIC RESPONSE CHAPTER 5 NUMERICAL EVALUATION OF DYNAMIC RESPONSE Analytcal soluton s usually not possble when exctaton vares arbtrarly wth tme or f the system s nonlnear. Such problems can be solved by numercal tmesteppng

More information

Lecture 4 Hypothesis Testing

Lecture 4 Hypothesis Testing Lecture 4 Hypothess Testng We may wsh to test pror hypotheses about the coeffcents we estmate. We can use the estmates to test whether the data rejects our hypothess. An example mght be that we wsh to

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Exerments-I MODULE III LECTURE - 2 EXPERIMENTAL DESIGN MODELS Dr. Shalabh Deartment of Mathematcs and Statstcs Indan Insttute of Technology Kanur 2 We consder the models

More information

Motion Perception Under Uncertainty. Hongjing Lu Department of Psychology University of Hong Kong

Motion Perception Under Uncertainty. Hongjing Lu Department of Psychology University of Hong Kong Moton Percepton Under Uncertanty Hongjng Lu Department of Psychology Unversty of Hong Kong Outlne Uncertanty n moton stmulus Correspondence problem Qualtatve fttng usng deal observer models Based on sgnal

More information

On an Extension of Stochastic Approximation EM Algorithm for Incomplete Data Problems. Vahid Tadayon 1

On an Extension of Stochastic Approximation EM Algorithm for Incomplete Data Problems. Vahid Tadayon 1 On an Extenson of Stochastc Approxmaton EM Algorthm for Incomplete Data Problems Vahd Tadayon Abstract: The Stochastc Approxmaton EM (SAEM algorthm, a varant stochastc approxmaton of EM, s a versatle tool

More information

Learning from Data 1 Naive Bayes

Learning from Data 1 Naive Bayes Learnng from Data 1 Nave Bayes Davd Barber dbarber@anc.ed.ac.uk course page : http://anc.ed.ac.uk/ dbarber/lfd1/lfd1.html c Davd Barber 2001, 2002 1 Learnng from Data 1 : c Davd Barber 2001,2002 2 1 Why

More information

EDMS Modern Measurement Theories. Multidimensional IRT Models. (Session 6)

EDMS Modern Measurement Theories. Multidimensional IRT Models. (Session 6) EDMS 74 - Modern Measurement Theores Multdmensonal IRT Models (Sesson 6) Sprng Semester 8 Department of Measurement, Statstcs, and Evaluaton (EDMS) Unversty of Maryland Dr. André A. Rupp, (3) 45 363, ruppandr@umd.edu

More information

Chapter 6. Supplemental Text Material

Chapter 6. Supplemental Text Material Chapter 6. Supplemental Text Materal S6-. actor Effect Estmates are Least Squares Estmates We have gven heurstc or ntutve explanatons of how the estmates of the factor effects are obtaned n the textboo.

More information

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction ECONOMICS 5* -- NOTE (Summary) ECON 5* -- NOTE The Multple Classcal Lnear Regresson Model (CLRM): Specfcaton and Assumptons. Introducton CLRM stands for the Classcal Lnear Regresson Model. The CLRM s also

More information

CIS526: Machine Learning Lecture 3 (Sept 16, 2003) Linear Regression. Preparation help: Xiaoying Huang. x 1 θ 1 output... θ M x M

CIS526: Machine Learning Lecture 3 (Sept 16, 2003) Linear Regression. Preparation help: Xiaoying Huang. x 1 θ 1 output... θ M x M CIS56: achne Learnng Lecture 3 (Sept 6, 003) Preparaton help: Xaoyng Huang Lnear Regresson Lnear regresson can be represented by a functonal form: f(; θ) = θ 0 0 +θ + + θ = θ = 0 ote: 0 s a dummy attrbute

More information

Numerical Heat and Mass Transfer

Numerical Heat and Mass Transfer Master degree n Mechancal Engneerng Numercal Heat and Mass Transfer 06-Fnte-Dfference Method (One-dmensonal, steady state heat conducton) Fausto Arpno f.arpno@uncas.t Introducton Why we use models and

More information

Using T.O.M to Estimate Parameter of distributions that have not Single Exponential Family

Using T.O.M to Estimate Parameter of distributions that have not Single Exponential Family IOSR Journal of Mathematcs IOSR-JM) ISSN: 2278-5728. Volume 3, Issue 3 Sep-Oct. 202), PP 44-48 www.osrjournals.org Usng T.O.M to Estmate Parameter of dstrbutons that have not Sngle Exponental Famly Jubran

More information

Homework Assignment 3 Due in class, Thursday October 15

Homework Assignment 3 Due in class, Thursday October 15 Homework Assgnment 3 Due n class, Thursday October 15 SDS 383C Statstcal Modelng I 1 Rdge regresson and Lasso 1. Get the Prostrate cancer data from http://statweb.stanford.edu/~tbs/elemstatlearn/ datasets/prostate.data.

More information

Stat 642, Lecture notes for 01/27/ d i = 1 t. n i t nj. n j

Stat 642, Lecture notes for 01/27/ d i = 1 t. n i t nj. n j Stat 642, Lecture notes for 01/27/05 18 Rate Standardzaton Contnued: Note that f T n t where T s the cumulatve follow-up tme and n s the number of subjects at rsk at the mdpont or nterval, and d s the

More information

CS 2750 Machine Learning. Lecture 5. Density estimation. CS 2750 Machine Learning. Announcements

CS 2750 Machine Learning. Lecture 5. Density estimation. CS 2750 Machine Learning. Announcements CS 750 Machne Learnng Lecture 5 Densty estmaton Mlos Hauskrecht mlos@cs.ptt.edu 539 Sennott Square CS 750 Machne Learnng Announcements Homework Due on Wednesday before the class Reports: hand n before

More information

UNIVERSITY OF TORONTO Faculty of Arts and Science. December 2005 Examinations STA437H1F/STA1005HF. Duration - 3 hours

UNIVERSITY OF TORONTO Faculty of Arts and Science. December 2005 Examinations STA437H1F/STA1005HF. Duration - 3 hours UNIVERSITY OF TORONTO Faculty of Arts and Scence December 005 Examnatons STA47HF/STA005HF Duraton - hours AIDS ALLOWED: (to be suppled by the student) Non-programmable calculator One handwrtten 8.5'' x

More information

Online Appendix to: Axiomatization and measurement of Quasi-hyperbolic Discounting

Online Appendix to: Axiomatization and measurement of Quasi-hyperbolic Discounting Onlne Appendx to: Axomatzaton and measurement of Quas-hyperbolc Dscountng José Lus Montel Olea Tomasz Strzaleck 1 Sample Selecton As dscussed before our ntal sample conssts of two groups of subjects. Group

More information

Lossy Compression. Compromise accuracy of reconstruction for increased compression.

Lossy Compression. Compromise accuracy of reconstruction for increased compression. Lossy Compresson Compromse accuracy of reconstructon for ncreased compresson. The reconstructon s usually vsbly ndstngushable from the orgnal mage. Typcally, one can get up to 0:1 compresson wth almost

More information

Topic- 11 The Analysis of Variance

Topic- 11 The Analysis of Variance Topc- 11 The Analyss of Varance Expermental Desgn The samplng plan or expermental desgn determnes the way that a sample s selected. In an observatonal study, the expermenter observes data that already

More information

Testing for seasonal unit roots in heterogeneous panels

Testing for seasonal unit roots in heterogeneous panels Testng for seasonal unt roots n heterogeneous panels Jesus Otero * Facultad de Economía Unversdad del Rosaro, Colomba Jeremy Smth Department of Economcs Unversty of arwck Monca Gulett Aston Busness School

More information

Stat260: Bayesian Modeling and Inference Lecture Date: February 22, Reference Priors

Stat260: Bayesian Modeling and Inference Lecture Date: February 22, Reference Priors Stat60: Bayesan Modelng and Inference Lecture Date: February, 00 Reference Prors Lecturer: Mchael I. Jordan Scrbe: Steven Troxler and Wayne Lee In ths lecture, we assume that θ R; n hgher-dmensons, reference

More information

First Year Examination Department of Statistics, University of Florida

First Year Examination Department of Statistics, University of Florida Frst Year Examnaton Department of Statstcs, Unversty of Florda May 7, 010, 8:00 am - 1:00 noon Instructons: 1. You have four hours to answer questons n ths examnaton.. You must show your work to receve

More information