A General Approach to Evaluating Agreement between Two Observers or Methods of Measurement

Size: px
Start display at page:

Download "A General Approach to Evaluating Agreement between Two Observers or Methods of Measurement"

Transcription

1 A eneral Approach to Evaluatng Agreement between wo Observers or Methods of Measurement Mchael Haber Department of Bostatstcs, ollns School of Publc Health Emory Unversty, Atlanta, A and Human X. Barnhart Department of Bostatstcs and Bonformatcs, Duke Clncal esearch Insttute, Duke Unversty, Durham, C Correspondence author: Dr. Mchael J. Haber, Department of Bostatstcs, ollns School of Publc Health, Emory Unversty, Atlanta A 303, U.S.A. el: (404) e-mal: mhaber@sph.emory.edu evsed 3//06 1

2 Abstract We present a general approach to the defnton and estmaton of coeffcents for evaluatng agreement between two fxed methods of measurements or human observers. he measured varable s assumed to be contnuous, but no other dstrbutonal assumptons are made. We ntroduce the term dsagreement functon for the functon of the observatons that s used to quantfy the extent of dsagreement between the two measurements made on the same subject. he proposed agreement coeffcents compare the dsagreement between measurements made by dfferent methods on the same subject to the correspondng dsagreement between measurements made by the same method. We propose agreement coeffcents for two practcal stuatons nvolvng two methods that have a measurement error: (a) comparson of a new method to a gold standard (or a reference method), and (b) comparson of two methods where nether method s consdered a gold standard. We consder three dsagreement functons based on the dfferences between two measurements: (a) the mean squared dfference (MSD), (b) the mean absolute dfference (MAD), and (c) the complement of the coverage probablty (CP), whch s the probablty that the absolute dfference does not exceed a predetermned threshold. We then derve nonparametrc estmates for the varous agreement coeffcents. Our approach s llustrated usng data from a study comparng systolc blood pressure measurements by a human observer and an automatc montor. he performance of the new estmates s assessed va stochastc smulatons..

3 1. Introducton Frequently one s nterested n comparng two methods of measurement that were appled to each of study subjects. In ths paper, the term method may correspond to a measurements devce or to a human observer. Denotng the correspondng measurements X and Y, we wll say that the two methods are n agreement f they produce the same value on each subject. In realty, even under perfect agreement the values of X and Y on the same subject wll usually dffer because of measurement errors and other factors. herefore, t s mportant to defne and estmate coeffcents that quantfy the extent to whch the two methods agree wth each other. We consder two practcal scenaros for the comparson of two methods that have a measurement error. In the frst scenaro, one s nterested n comparng the methods wthout consderng ether of them as a reference (or a gold standard ). hs may be of nterest, for example, when due to logstc consderatons one plans to use method X on some of the subjects and method Y on the remanng subjects. In ths case t s mportant that the two methods produce smlar values when appled to the same subject so that they can be used nterchangeably. In the second scenaro, method X s a reference method that has been used n the past and was found to produce relable measurements, and method Y s a new method that may be less expensve or less nvasve. In ths case one may consder replacng X by Y, and must make sure that the two methods produce smlar measurements on the same subject. We wll refer to these two scenaros as assessng agreement wth or wthout a reference method. Our objectve s to defne coeffcents of agreement for each of the two scenaros such that f the coeffcent s close to one (or exceeds one) we conclude that the methods are n good agreement, whle a value close to zero ndcates poor agreement. Over the years, several coeffcents of agreement have been proposed 3

4 and used, ncludng dfferent versons of the ntraclass correlaton coeffcent (ICC) and the concordance correlaton coeffcent (CCC). A comparatve summary of these coeffcents s presented n a recent paper by Haber and Barnhart (006) 1. In the present paper we propose a general approach that can be used n defnng and estmatng coeffcents of agreement correspondng to dfferent measures of dsagreement. he new coeffcents generalze the ψ coeffcent presented n our earler paper. In order to defne a coeffcent of agreement, we frst have to decde how we quantfy the agreement (or lack thereof) between the two methods. For ths, we defne the concept of a dsagreement functon ( X, Y ). hs functon has to satsfy: a. ( X, Y ) 0 b. ( X, Y ) = ( Y, X ) c. ( X, Y ) ncreases as the dsagreement between X and Y (accordng to a specfc crteron) ncreases. Once we have decded whch dsagreement functon to use, t s necessary to standardze t n order to obtan a coeffcent so that values close to zero ndcate poor agreement and values close to (or above) one ndcate good agreement. In other words, we need to compare the observed value of ( X, Y ) to a reference value that represents ether the best or the worst case scenaro. he ICC s and the CCC consder the worst case scenaro as ndependence, or agreement by chance. herefore these coeffcents compare the observed value of a dsagreement functon to the value expected under ndependence between the two methods, 3 In our opnon 1, ths approach s unjustfed because (a) agreement and correlaton are dfferent concepts, and (b) two observatons made on the same subject are always expected be postvely correlated because both are postvely correlated wth the true value of the measured varable on ths subject. 4

5 Snce we could not fnd an adequate defnton for the worst case scenaro we decded to move to the opposte sde of the agreement scale and defne the best case scenaro as the case where the dsagreement between observatons made by dfferent methods s smlar to the dsagreement between observatons made by the same method. In other words, replacng or nterchangng one method by the other does not substantally ncrease the dsagreement between measurements made on the same subject. herefore, the coeffcents of agreement we propose compare the dsagreement between measurements made by dfferent methods to the dsagreement between replcated measurements made by the same method. We denote by ( X, X ') the dsagreement between two replcated measurements made by the method X, and defne ( Y, Y ') n an analogous way for method Y. Usng the above-mentoned prncple, we defne the coeffcent of agreement n the case that none of the methods s consdered as a reference as: [ ( X, X ') + ( Y, Y ')]/ ψ =. (1) ( X, Y ) he numerator s the average dsagreement between two replcated measurements made by the same method, and the denomnator s the dsagreement between X and Y. he coeffcent of agreement for the case where method X s consdered a reference method s defned as: ( X, X ') ψ =. () ( X, Y ) Here we compare the dsagreement between measurements made by the two methods wth the dsagreement between two replcated measurements made by the reference method. ote that both ψ and ψ are nonnegatve but they may attan values greater than one, ndcatng that the between-methods dsagreement s lower than the wthn-methods dsagreement. he comparson of ( X, Y ) to ( X, X ') when X s consdered a reference method has been used n the past to evaluate ndvdual boequvalence between a standard (reference) and a new formulaton of a drug 4 In that case, the focus was 5

6 on comparng the boavalabltes of the two formulatons. he comparson of ( X, Y ) to the mean of ( X, X ') and ( Y, Y ') where = E( X Y ) was ntroduced n a recent paper by Barnhart et al. 5 to assess the agreement between two or more measurement methods when there s no reference method. hey called the coeffcents (1) and () wth ( X, Y ) = E( X Y ) coeffcents of ndvdual agreement (CIA) as they were based on the dea used n assessng ndvdual boequvalence. It s mportant to realze that n these coeffcents the wthn-method dsagreement s used as a reference to whch we compare the between-methods dsagreement. Hence, before usng the ψ s one must check that the correspondng wthnmethod dsagreement,.e., ( X, X ') for ψ and both ( X, X ') and ( Y, Y ') for ψ, are acceptable. As wth other coeffcents, a common practcal queston s how good s good,.e. what values of the coeffcents are large enough to be consdered as ndcatng good agreement. o allow a better nterpretaton of the new coeffcents we suggest to look at ther recprocals, 1 /ψ and 1 /ψ. hese quanttes represent the relatve ncreases n dsagreement due to replacng X by Y or due to nterchangng X and Y. In order to clam good agreement, we beleve that the dsagreement between the two methods should not exceed the dsagreement between replcated observatons from the same method by more than 5%,.e., ψ 0.8. he dsagreement functons can also be defned at the subject level. Denotng ( X, Y ), ( X, X '), ( Y, Y ') the values of the dsagreement functons for subject, we can defne the followng subject-specfc agreement coeffcents: [ ( X, X ') + ( Y, Y ')]/ ψ, =, (3) ( X, Y ) 6

7 ψ ( X, X '), = ( X, Y ). (4) ote that ( X, Y ) may be zero. Snce n ths case there s good agreement, as measured by the dsagreement functon for ths subject, we defne the dsagreements coeffcents (3) and (4) as one regardless of the values of ( X, X ') and ( Y, Y '). Data: Snce the new coeffcents nvolve the evaluaton of dsagreement between measurements made wth the same method on the same subject, we have to use data wth replcated measurements. When we compare two methods (wthout a reference) we wll assume that at least two measurements are made usng each method on each subject. For the case of comparng a new method to a reference we only need replcated measurements on the reference method. he number of replcatons does not have to be fxed. We denote by X 1,..., X K the K measurements made on subject usng method X. Smlarly, we denote by Y 1,..., Y L the L measurements made on subject usng method Y. We assume that K, L and that these are unmatched replcatons,.e., f one permutes the order of the subscrpts for one of the methods wthout changng the order for the other method, then there s no change n the resultng coeffcents or ther estmates. We do not make any assumptons about the dstrbutons of the measurements. In ths work we wll consder two dsagreement functons related to agreement measures ntroduced by Ln et al. 6, namely the mean squared dfference (MSD), defned as the expectaton of the squared devaton between two measurements, and the complement of the coverage probablty (CP), whch s the probablty that two measurements are wthn a specfc dstance from each other. In addton, we consder a new dsagreement functon, the mean absolute dfference (MAD), as the absolute dfference s more meanngful than the squared dfference. 7

8 . he Mean Squared Devaton (MSD) In ths Secton we defne ( X, Y ) = MSD( X, Y ) = E( X Y ). We frst consder the dsagreements for a partcular subject, : ( X, Y ) = E[( X Y ) )], k ( X, X ') = E[( X X ) )] for k < k' k l k ' ( Y, Y ') = E[( Y Y ) )] for l < l' l l' We then obtan the overall dsagreement functons as ( X, Y ) = E [ ( X, Y )], ( X, X ') = E[ ( X, X ')], ( Y, Y ') = E[ ( Y, Y ')], where E stands for the expectaton over all study subjects. If we assume condtonal ndependence of X and Y gven the subjects characterstcs, then the dsagreement functons can also be wrtten n terms of the condtonal moments of X and Y: ( X, Y ) = E {[ E( X ) E( Y )] + Var( X ) Var( Y )}, k l k + ( X, X ') = E [ Var( X )], ( Y, Y ') = E [ Var( Y )]. k l l he overall agreement coeffcents (1) and () and the subject-specfc coeffcents (3) and (4) are now obtaned by substtutng these dsagreement functons nto the correspondng equatons. he overall coeffcents (1) and () wth = MSD are dentcal to the CIA and CIA coeffcents n Barnhart et al. 5 when there s no reference method and when method X s consdered as reference, respectvely. In addton, ψ MSD s the same as ψ n Haber and Barnhart 1. However, ψ n the earler paper compares MSD ( X, Y ) to ts expected value when E X ) E( Y ) whle the current ψ ( MSD compares MSD ( X, Y ) to the mean MSD of replcated measurements by the same method. hese approaches lead to the same coeffcent for = MSD, but they may result n dfferent coeffcents for other selectons of the dsagreement functon. 8

9 .1 Estmaton We begn wth the subject-specfc coeffcents. For subject, we estmate ˆ ( X, Y ) = Mean, k, l ( X k Yl ) ˆ k k ' k k ' ( X, X ') = Mean < ( X X ) = MSW ( X ), ˆ l l' l l' ( Y, Y ') = Mean < ( Y Y ) = MSW ( Y ), where MSW are the wthn-subject mean squares: = k k MSW ( X ) ( X X ) /( K 1), MSW ( Y ) ( Y Y ) /( L 1). We then substtute these estmates of the = l l Ĝ s nto (3) or (4) to obtan the estmated subject-specfc coeffcents of agreement. Based on the note underneath equaton (4), we set the estmated coeffcents to one whenever ˆ ( X, Y ) = 0. o estmate the overall coeffcents of agreement we estmate the overall dsagreement functon as ˆ ( X, Y ) = Mean [ ˆ ( X, Y )]. Lkewse, ˆ ( X, X ') and ˆ ( Y, Y ') are estmated as the means over all subjects of the correspondng subject-specfc dsagreements. he overall agreement coeffcent s estmated by substtutng the estmated s nto (1) or ().. An example o llustrate the coeffcents ntroduced n ths paper we use a data set from Bland and Altman 7. Systolc blood pressure (SBP) was measured on 85 subjects by two experenced human observers usng a sphygmomanometer and by a semautomatc blood pressure montor. hree replcatons were made n quck successon wth each of the three methods on each subject. We frst assessed the agreement between the two human observers. he agreement was excellent ψ ˆ =1.44, hence we wll focus here on the agreement between the frst human observer, whch we label X, and the montor, whch we label Y. Our estmated 9

10 s were ˆ ( X, X ') = , ˆ ( Y, Y ') = , ˆ ( X, Y ) = Hence, f we compare the two methods wthout consderng ether of them as a reference then the estmated agreement s ψ ˆ = However, t could be more approprate to consder the experenced human observer as a reference and the montor as a new method that s tested aganst ths reference. hen we estmate the agreement as ψ ˆ = Ether way, the agreement s poor. hs s not surprsng as the mean readng of the montor s about 16 ponts hgher than that of the human observer. hese estmates are dentcal to those obtaned by Barnhart et al. 5 usng an AOVA model. We used the bootstrap method to obtan standard errors and confdence ntervals (CI) for these coeffcents. he 95% CI s based on the bootstrap percentles for ψ and ψ were (0.110, 0.306) and (0.067, 0.07), respectvely. We also calculated the CI s based on the normal approxmaton to the dstrbuton of the estmates: (0.078, 0.78) and (0.037,0.183) respectvely. hus, the endponts of the percentle-based CI s exceed those of the normal CI s, whch suggests that the dstrbutons of the estmates are skewed. he CI s based on the normal approxmaton to the dstrbuton of the logarthms of the estmates are (0.105, 0.303) and (0.061, 0.198) respectvely, qute close to the percentage-based CI s. We can learn more about the agreement between X and Y by plottng the subjectspecfc coeffcents of agreement as functons of our best estmates of the subjects true SBP. When we dd not consder ether method as a reference, we, plotted the estmates of ψ aganst ( X + Y ) /, where X and Y are the means over the replcated observatons (Fgure 1a). When the human observer (X) was consdered a reference, we plotted the estmates of ψ, aganst X (Fgure 1b). Both coeffcents tend to ncrease as the SBP ncreases hus, the agreement between the two methods s better for ndvduals wth hgher SBP compared to those wth lower blood pressure. he plots also confrm the skewed dstrbuton of the estmates. 10

11 .3 Smulatons We conducted a smulaton study to nvestgate the bas and precson of the estmates of ψ and ψ for = MSD, usng a smple latent class model to generate the replcated condtonally ndependent measurements by the methods X and Y. he true value,, was assumed normal wth mean μ and standard devaton σ. Let = t denote the true value for subject. We then generated K replcated observatons by method X from ( X t ) ~ ( μ, σ ) and L k X t X t replcated observatons by method Y from ( Y t ) ~ ( μ, σ ). For l Y t Y t smplcty we assumed that the number of replcaton by each method s the same for all subjects, though the number of replcatons by X and Y may dffer. We assumed further that the above condtonal means and standard devatons are lnear functons of t : μ = a bt, μ = c dt, σ = e ft, σ = g ht. X t + Y t + X t + Y t + hen t s easy to show that the true value of the three functons for are: = MSD ( X, Y ) = ( a c) ( X, X ') = e + e + g + [( a c)( b d) + ef + 4efμ + f ( μ + σ ) ( Y, Y ') = g + 4ghμ + h ( μ + σ ). + gh)] μ + [( b d) + f + h ]( μ + σ ) he true values of ψ and ψ are now obtaned from (1) and (). Our purpose was to smulate data that are smlar to the SBP data used n the above example. We consdered the mean (over the three replcatons) readngs of the second human observer (whose observatons were not used n our example) n the Bland and Altman 7 data as the true values () of SBP. he sample moments of these true values wll be used as the moments of ; they are μ =17.3 and σ = We then obtaned the coeffcents n the equatons defnng the condtonal moments of X and Y gven by regressng the observed values of these condtonal moments on the true values. he coeffcents are: a = 1.03, b =1. 01, c = , d = 0. 85, e = 1. 91, f = 0. 03, g = 3. 6, 11

12 h = For ths choce of the parameters n our smulaton model, whch wll be referred to as case 1, the true values of the agreement coeffcents are ψ = 0.66, ψ = We also conducted smulatons wth larger true values of the agreement coeffcents by changng the values of c and d whle leavng the other smulaton parameters unchanged. For case we used c = 13, d = 0.95, whch yeld ψ = , ψ = 0. 50, and for case 3 we used c = 5, d = 0.98, whch yeld ψ = , ψ = (By brngng c closer to 0 and d closer to 1 we ncrease the agreement between Y and the true value. hs ncreases the agreement between X and Y because X agrees very well wth the true value.) hus, the three cases correspond to poor, far and good agreement, respectvely. We used three sample szes, n = two combnatons of ( K, L) 50, 100 and 00. For estmatng, namely (3,3) and (,), For estmatng ψ we used ψ we used four combnatons of ( K, L), namely (3,3), (,), (3,1) and (,1). hs allows to assess the mportance of usng replcated observatons on the new method, Y, snce ψ can be estmated from data wth replcated observatons on the standard method, X, only. able 1 presents the bas and root mean square error (MSE) of the estmates for all the combnatons of ( n, K, L) for the three cases defned earler. Each table entry s based on a set of 00 smulatons. We see that the bas s usually postve but very small. he MSE vares wth the sample sze and the number of replcatons. For n = 50, usng fewer than three replcatons wth ether X or Y may result n mprecse estmates. For n 100 the precson s acceptable even wth the mnmum number of replcatons ( K = L = for for ψ ). ψ and K =, L = 1 1

13 .4 Effect of between-subjects varablty on the agreement coeffcents he latent class model ntroduced earler can also be used to nvestgate the behavor of the agreement coeffcents n varous stuatons. One ssue that s commonly rased n connecton wth coeffcents of agreement s ther dependence on the between-subjects varablty of the quantty beng measured. herefore we explore the dependence of the agreement coeffcents on the varaton of the true SBP n the populaton,.e. on σ. Let us frst consder a specal case of the latent class model where the condtonal ( wthn subject ) varances of the measurements do not depend on the true value t,.e., f = h = 0. In ths case ( X, X ') and ( Y, Y ') do not depend on σ and ( X, Y ) s a monotoncally ncreasng functon of σ. herefore both ψ and ψ are decreasng functons of σ. In the general latent class model presented n Secton.3 all the three s ncrease wth σ. We used the values of the eght coeffcents ( a, b,..., h ) determnng the condtonal moments of X and Y gven t as estmated from the data (Secton.3) to examne the dependence of the agreement coeffcents on σ. For comparson, we also calculated the CCC for each case. he results are presented n Fgure. We see that the new agreement coeffcents decrease as the between-subjects varance ncreases. However, the rate of decrease s modest. On the other hand, the CCC ncreases at a very fast rate wth σ. Also, for σ 0 the CCC s consderably hgher than the new ps coeffcents. We repeated the calculatons of the ps s wth hgher values of f and h, to allow more varablty of the between-subject varance as the subject s true value t changes. However, the rate of decrease n the new agreement coeffcents wth σ dd not change. In our opnon, the decrease of the agreement coeffcents as the between-subject varance ncreases looks reasonable, as t s more dffcult for observers to demonstrate good agreement across a wder range of the subjects true values. 13

14 3. he Mean Absolute Devaton (MAD) In ths secton we consder the dsagreement functon ( X, Y ) = E X Y. he MAD s easer to nterpret as compared to the MSD, as t s more readly related to the actual observatons. In general, one would expect the to the square roots of the correspondng ψ MAD s to be close ψ MSD s. Estmaton of the agreement coeffcents based on the MAD s very smlar to the estmaton of the coeffcents based on the MSD. We begn wth the estmaton of the subject-specfc s: ˆ ( X, Y ) = Mean X Y, k, l k l ˆ ( X, X ') = Mean < X X, k k ' k k ' ˆ ( Y, Y ') = Mean < Y Y. l l' l l' We then calculate the sample means of the estmated s to obtan the overall estmates of the dsagreement functons and of the agreement coeffcents. We can also estmate the subject-specfc coeffcents of agreement. 3.1 SBP example For the SBP data dscussed n Secton., the estmated coeffcents wth = MAD are ψ ˆ = and ψ ˆ = he bootstrap percentle-based CI s are (0.348, 0.517) and (0.8, 0.459), respectvely, whle the bootstrap CI s based on the normal approxmaton are (0.341, 0.51) and (0.78, 0.448), respectvely. he dfferences between the two knds of confdence ntervals for the same parameter are smaller than they were for = MSD. In other words, the dstrbuton of the ψ MAD s s more symmetrc that that of the ψ MAD s. hs s also evdent from the plots of the subject-specfc ψ MAD s (not ncluded) whch show that the dstrbuton of these quanttes s less skewed. 14

15 3. Smulatons We used the same smulaton model and parameters as n Secton.3 he true values of the MAD s were calculated as means of folded normal varables. We now llustrate the calculaton of MAD ( X, Y ). Under our smulaton model the dfference D = X Y has a normal dstrbuton and D has a folded normal dstrbuton. hen MAD ( X, Y ) s the mean of ths dstrbuton: MAD( X, Y ) = E D = / π σ exp( μ / σ ) μ [1 Φ( μ / σ )], D where μ D and σ D are the mean and standard devaton of D and Φ s the standard normal dstrbuton functon 8. he frst two moments of D were obtaned from the frst two moments of ( X, Y ), whch were calculated from the parameters of the smulaton model as follows: E ( X ) = a + bμ, E Y ) = c + dμ Var Var Cov (, ( X ) e + efμ + f μ + ( b + f ) σ D =, ( Y ) g + ghμ + h μ + ( d + h ) σ =, ( X, Y ) bdσ =. he MAD ( X, X ') and MAD ( Y, Y ') were calculated n a smlar way. D D D D able presents the bas and MSE of the estmates of ψ and ψ for = MAD. he bas s usually negatve, whle the bas for the MSD-based estmates was usually postve. Comparng the MSE s of the MSD and MADbased estmates, we see that for case 1, where the true agreement s low, the MSE s of the MAD-based estmates are slghtly hgher than the correspondng MSE s of the MSD-based estmates. On the other hand, for cases and 3 where the true agreement s moderate or hgh, the MAD-based estmates have consderably smaller MSE s as compared to the MSD-based estmates. 15

16 4. Coverage Probabltes (CP) he coverage probablty for a gven value c > 0 was defned by Ln et al. 6 as CP ( c) = P( X Y < c). We defne the CP-based dsagreement functon as ( X, Y, c) = P( X Y c) = 1 CP( c). As before, we wll not make any assumptons on the dstrbutons of X and Y and estmate the overall agreement coeffcents from the subject-specfc estmates. he subject-specfc dsagreement functons are estmated from the proportons of parwse observatons that are separated by c unts or more: ˆ ( X, Y, c) = [# pars ( X k, Yl ) so that X k Yl c ]/ K L, ˆ ( X, X ', c) = [# pars X, ), k < k', so that X X ' c ]/ K ( 1), ( k X k ' k k K and ˆ ( Y, Y ', c) s obtaned n an analogous way. he overall dsagreement functons and agreement coeffcents are calculated from the estmates of the subject-specfc dsagreement functons n the same way as n Sectons and SBP example For the SBP data we used c = 10, whch means that dfferences of less than 10 ponts between two measurements on the same subject are consdered acceptable. Our estmates were ψ ˆ = , wth a bootstrap percentle-based CI (0.357, 0.547), and ψ ˆ = , wth a bootstrap percentle-based CI (0.304, 0.51). he CI s based on the normal assumpton were very close to the percentle-based CI s. he man drawback of the CP-based agreement coeffcents s ther dependence on the choce of c. In the SBP example, usng c = 5 rather than c = 10 ncreases the estmated ψ and ψ to and 0.615, respectvely. 16

17 5. Dscusson We presented a general approach to defnng agreement coeffcents for the comparson of two fxed observers or methods of measurement. he approach s based on the concept of dsagreement functon, whch allows the user to specfy her/hs crteron for assessng the agreement between observatons on the same subject. We used two approaches to standardze the value of the dsagreement functon n such a way that the ensung coeffcent wll be close to or above one when the magntude of the dsagreement s good. When nether of the measurement methods can be consder as a gold standard or a reference, we compare the dsagreement between the methods to the average dsagreement between replcated readngs from the same method. On the other hand, when one of the methods has been n use for a whle and can be consdered a gold standard (reference) then the between-methods dsagreement s compared to the dsagreement between replcated readngs from the reference method. In the absence of any parametrc assumptons, the estmaton of the new coeffcents requres replcated observatons by the same method on the same subject. (When one of the methods s consdered a gold standard then replcated observaton by the new method are not requred). Ideally, the methods or observers should be blnd to ther prevous assessment(s) of the same subject and the true value of the measured varable on a subject should not change between replcated evaluatons. We realze that t some cases t may be dffcult to obtan replcated observatons that satsfy these condtons. he new coeffcents are useful, for example, when the measurements are made by automatc devces on blood samples or x-ray sldes obtaned from each subject. When the methods are human observers t s mportant to make sure that each observer makes her/hs measurements n a random order, so that she/he s unlkely to recall the prevous measurements(s) on a gven subject. In most cases the dsagreement functon can be computed for each subject. hen one can estmate the agreement coeffcent for each subject and plot the estmated 17

18 coeffcents aganst the subjects estmated true values or aganst other subjectspecfc covarates. hese plots may shed more lght on the factors that affect agreement and help n dentfyng outlyng observatons. hey may also be used as a supplement to the tradtonal Bland-Altman plots 9 whch dsplay the dfference between measurements of two methods on the same subject as functon of the subjects estmated true value. In ths paper we dd not make any assumptons regardng the dstrbutons of the measurements. hus, we presented non parametrc estmates only. We also avoded some of the other assumptons commonly made when evaluatng observer agreement. For example, our approach allows the error varances to dffer across subjects, whch s a qute common phenomenon n realty. It s nterestng to compare the MSD-based coeffcent ψ wth the CCC. Both coeffcents have the MSD n the denomnator. he new coeffcent compares the MSD wth ts value under the assumpton that the two methods are nterchangeable, whle the CCC used the expected MSD under ndependence as a yardstck. One mportant dfference between these coeffcents s related to the fast ncrease n the CCC when the between-subject heterogenety ncreases (Fgure ). he new coeffcent, on the other hand, s less senstve to the sample heterogenety and, at least n the examples we explored, decreases when there s more heterogenety. Intutvely one would expect agreement to decrease when the measured varable exhbts more varablty because the ncreased varablty requres the methods to agree across a wder range of values of the measured varable. For further dscusson of agreement and heterogenety the reader s referred to Atknson & avel 10. We consdered three dsagreement functons, based on the MSD, MAD and CP. We saw that the CP-based coeffcents depend on the threshold used to defne good agreement, so we actually have to deal wth a famly of coeffcents correspondng to a gven range of threshold values. Comparng the MSD and 18

19 MAD-based approaches, we prefer the latter as the MAD s expressed n the same unts as the measured varable. In our smulaton studes the MAD-based estmates were more precse than the correspondng MSD-based estmates except when the true agreement was very small. he approach ntroduced n ths artcle can be generalzed n varous drectons. he MSD-based coeffcents were generalzed to stuatons where several new methods are compared to a common gold standard or several methods are compared wthout a gold standard 5. It wll be mportant to develop multplemethods coeffcents based on other dsagreement functons. We consdered only contnuous varables n ths work, and we are currently explorng smlar coeffcents for bnary and categorcal outcome varables. Fnally, throughout ths work we assumed that the replcatons from the two methods are ndependent of each other (unmatched replcatons). We found many examples where ths assumpton does not hold (for example, repeated measurements are performed at a set of fxed tme ponts or under fxed condtons). We plan to adapt our coeffcents to ncorporate matched replcatons and account for the varablty assocated wth the factor that underles the repeated measurements. Acknowledgement hs research was supported by IMH grant 1 01 MH

20 eferences 1. Haber M, Barnhart HX. Coeffcents of agreement for fxed observers. Statstcal Methods n Medcal esearch 006 (to appear). Zegers FE. A famly of chance-corrected assocaton coeffcents for metrc scales. Psychometrka 1986; 51: Ln L. A concordance correlaton coeffcent to evaluate reproducblty. Bometrcs 1989; 45: Schall, Luus H. On populaton and ndvdual boequvalence. Stat. Med. 1993; 1: Barnhart HX, Kosnsk AS, Haber M. Assessng ndvdual agreement. (submtted). 6. Ln L, Hedayat AS, Snha B, Yang M. Statstcal methods n assessng agreement: models, ssues and tools. J. Amer. Stat. Assoc. 00; 97; Bland JM, Altman D. Measurng agreement n method comparson studes. Statstcal Methods n Medcal esearch 1999; 8: ead CB. Folded Dstrbutons. Encyclopeda of Statstcal Scences (edted by S Kotz and L Johnson) Vol 3, pp Wley & Sons, ew York, Bland JM, Altman D. Statstcal methods for assessng agreement between two methods of clncal measurements. Lancet 1986; : Atknson, evll A. Comment on the use of concordance correlaton to assess the agreement between varables. Bometrcs 1997; 53:

21 able 1: Bas and root mean square error (MSE) of estmates of ψ and ψ usng the mean squared devaton (MSD) as the dsagreement functon ψ ψˆ Case 1 = 0.66, ψ = ψˆ ψ Case = 0.670, ψ = 0.50 ψˆ ψˆ ψ ψˆ Case 3 = 0.940, ψ = K L Bas MSE Bas MSE Bas MSE Bas MSE Bas MSE Bas MSE ψˆ

22 able : Bas and root mean square error (MSE) of estmates of ψ and ψ usng the mean absolute devaton (MAD) as the dsagreement functon ψ ψˆ Case 1 = 0.476, ψ = ψˆ ψ ψˆ Case = 0.804, ψ = 0.70 ψˆ ψ ψˆ Case 3 = 0.96, ψ = K L Bas MSE Bas MSE Bas MSE Bas MSE Bas MSE Bas MSE ψˆ

23 Fgure 1a: 3 Ψˆ based on MSD as a functon of the mean of X, Y.5 ˆ Ψ (MSD) Least squares lne coeffcents: α = , β = ; p-value for testng H : β 0 s = mean(x,y) 3

24 3 Fgure 1b: Ψˆ based on MSD as a functon of the mean of X.5 ˆ Ψ (MSD) Least squares lne coeffcents: α = -0.56, β = 0.049; p-value for testng H : β 0 s = mean(x) 4

25 5

Estimation of Coefficients of Individual Agreement (CIA s) for. Quantitative and Binary Data using SAS and R

Estimation of Coefficients of Individual Agreement (CIA s) for. Quantitative and Binary Data using SAS and R Estmaton of Coeffcents of Indvdual Agreement (CIA s) for Quanttatve and Bnary Data usng SAS and Y Pan Department of Bostatstcs and Bonformatcs ollns School of Publc Health Emory Unversty 58 Clfton oad

More information

Simulated Power of the Discrete Cramér-von Mises Goodness-of-Fit Tests

Simulated Power of the Discrete Cramér-von Mises Goodness-of-Fit Tests Smulated of the Cramér-von Mses Goodness-of-Ft Tests Steele, M., Chaselng, J. and 3 Hurst, C. School of Mathematcal and Physcal Scences, James Cook Unversty, Australan School of Envronmental Studes, Grffth

More information

Chapter 13: Multiple Regression

Chapter 13: Multiple Regression Chapter 13: Multple Regresson 13.1 Developng the multple-regresson Model The general model can be descrbed as: It smplfes for two ndependent varables: The sample ft parameter b 0, b 1, and b are used to

More information

Chapter 9: Statistical Inference and the Relationship between Two Variables

Chapter 9: Statistical Inference and the Relationship between Two Variables Chapter 9: Statstcal Inference and the Relatonshp between Two Varables Key Words The Regresson Model The Sample Regresson Equaton The Pearson Correlaton Coeffcent Learnng Outcomes After studyng ths chapter,

More information

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4) I. Classcal Assumptons Econ7 Appled Econometrcs Topc 3: Classcal Model (Studenmund, Chapter 4) We have defned OLS and studed some algebrac propertes of OLS. In ths topc we wll study statstcal propertes

More information

Statistics for Economics & Business

Statistics for Economics & Business Statstcs for Economcs & Busness Smple Lnear Regresson Learnng Objectves In ths chapter, you learn: How to use regresson analyss to predct the value of a dependent varable based on an ndependent varable

More information

x = , so that calculated

x = , so that calculated Stat 4, secton Sngle Factor ANOVA notes by Tm Plachowsk n chapter 8 we conducted hypothess tests n whch we compared a sngle sample s mean or proporton to some hypotheszed value Chapter 9 expanded ths to

More information

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X Statstcs 1: Probablty Theory II 37 3 EPECTATION OF SEVERAL RANDOM VARIABLES As n Probablty Theory I, the nterest n most stuatons les not on the actual dstrbuton of a random vector, but rather on a number

More information

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction ECONOMICS 5* -- NOTE (Summary) ECON 5* -- NOTE The Multple Classcal Lnear Regresson Model (CLRM): Specfcaton and Assumptons. Introducton CLRM stands for the Classcal Lnear Regresson Model. The CLRM s also

More information

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA 4 Analyss of Varance (ANOVA) 5 ANOVA 51 Introducton ANOVA ANOVA s a way to estmate and test the means of multple populatons We wll start wth one-way ANOVA If the populatons ncluded n the study are selected

More information

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands Content. Inference on Regresson Parameters a. Fndng Mean, s.d and covarance amongst estmates.. Confdence Intervals and Workng Hotellng Bands 3. Cochran s Theorem 4. General Lnear Testng 5. Measures of

More information

Effective plots to assess bias and precision in method comparison studies

Effective plots to assess bias and precision in method comparison studies Effectve plots to assess bas and precson n method comparson studes Bern, November, 016 Patrck Taffé, PhD Insttute of Socal and Preventve Medcne () Unversty of Lausanne, Swtzerland Patrck.Taffe@chuv.ch

More information

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers Psychology 282 Lecture #24 Outlne Regresson Dagnostcs: Outlers In an earler lecture we studed the statstcal assumptons underlyng the regresson model, ncludng the followng ponts: Formal statement of assumptons.

More information

Chapter 11: Simple Linear Regression and Correlation

Chapter 11: Simple Linear Regression and Correlation Chapter 11: Smple Lnear Regresson and Correlaton 11-1 Emprcal Models 11-2 Smple Lnear Regresson 11-3 Propertes of the Least Squares Estmators 11-4 Hypothess Test n Smple Lnear Regresson 11-4.1 Use of t-tests

More information

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 13 The Smple Lnear Regresson Model and Correlaton 1999 Prentce-Hall, Inc. Chap. 13-1 Chapter Topcs Types of Regresson Models Determnng the Smple Lnear

More information

Lecture 6: Introduction to Linear Regression

Lecture 6: Introduction to Linear Regression Lecture 6: Introducton to Lnear Regresson An Manchakul amancha@jhsph.edu 24 Aprl 27 Lnear regresson: man dea Lnear regresson can be used to study an outcome as a lnear functon of a predctor Example: 6

More information

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise.

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise. Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where y + = β + β e for =,..., y and are observable varables e s a random error How can an estmaton rule be constructed for the

More information

SIMPLE LINEAR REGRESSION

SIMPLE LINEAR REGRESSION Smple Lnear Regresson and Correlaton Introducton Prevousl, our attenton has been focused on one varable whch we desgnated b x. Frequentl, t s desrable to learn somethng about the relatonshp between two

More information

Here is the rationale: If X and y have a strong positive relationship to one another, then ( x x) will tend to be positive when ( y y)

Here is the rationale: If X and y have a strong positive relationship to one another, then ( x x) will tend to be positive when ( y y) Secton 1.5 Correlaton In the prevous sectons, we looked at regresson and the value r was a measurement of how much of the varaton n y can be attrbuted to the lnear relatonshp between y and x. In ths secton,

More information

Computation of Higher Order Moments from Two Multinomial Overdispersion Likelihood Models

Computation of Higher Order Moments from Two Multinomial Overdispersion Likelihood Models Computaton of Hgher Order Moments from Two Multnomal Overdsperson Lkelhood Models BY J. T. NEWCOMER, N. K. NEERCHAL Department of Mathematcs and Statstcs, Unversty of Maryland, Baltmore County, Baltmore,

More information

Linear Regression Analysis: Terminology and Notation

Linear Regression Analysis: Terminology and Notation ECON 35* -- Secton : Basc Concepts of Regresson Analyss (Page ) Lnear Regresson Analyss: Termnology and Notaton Consder the generc verson of the smple (two-varable) lnear regresson model. It s represented

More information

A Bound for the Relative Bias of the Design Effect

A Bound for the Relative Bias of the Design Effect A Bound for the Relatve Bas of the Desgn Effect Alberto Padlla Banco de Méxco Abstract Desgn effects are typcally used to compute sample szes or standard errors from complex surveys. In ths paper, we show

More information

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U)

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U) Econ 413 Exam 13 H ANSWERS Settet er nndelt 9 deloppgaver, A,B,C, som alle anbefales å telle lkt for å gøre det ltt lettere å stå. Svar er gtt . Unfortunately, there s a prntng error n the hnt of

More information

2016 Wiley. Study Session 2: Ethical and Professional Standards Application

2016 Wiley. Study Session 2: Ethical and Professional Standards Application 6 Wley Study Sesson : Ethcal and Professonal Standards Applcaton LESSON : CORRECTION ANALYSIS Readng 9: Correlaton and Regresson LOS 9a: Calculate and nterpret a sample covarance and a sample correlaton

More information

e i is a random error

e i is a random error Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where + β + β e for,..., and are observable varables e s a random error How can an estmaton rule be constructed for the unknown

More information

Chapter 5 Multilevel Models

Chapter 5 Multilevel Models Chapter 5 Multlevel Models 5.1 Cross-sectonal multlevel models 5.1.1 Two-level models 5.1.2 Multple level models 5.1.3 Multple level modelng n other felds 5.2 Longtudnal multlevel models 5.2.1 Two-level

More information

FREQUENCY DISTRIBUTIONS Page 1 of The idea of a frequency distribution for sets of observations will be introduced,

FREQUENCY DISTRIBUTIONS Page 1 of The idea of a frequency distribution for sets of observations will be introduced, FREQUENCY DISTRIBUTIONS Page 1 of 6 I. Introducton 1. The dea of a frequency dstrbuton for sets of observatons wll be ntroduced, together wth some of the mechancs for constructng dstrbutons of data. Then

More information

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity LINEAR REGRESSION ANALYSIS MODULE IX Lecture - 31 Multcollnearty Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur 6. Rdge regresson The OLSE s the best lnear unbased

More information

Durban Watson for Testing the Lack-of-Fit of Polynomial Regression Models without Replications

Durban Watson for Testing the Lack-of-Fit of Polynomial Regression Models without Replications Durban Watson for Testng the Lack-of-Ft of Polynomal Regresson Models wthout Replcatons Ruba A. Alyaf, Maha A. Omar, Abdullah A. Al-Shha ralyaf@ksu.edu.sa, maomar@ksu.edu.sa, aalshha@ksu.edu.sa Department

More information

Chapter 6. Supplemental Text Material

Chapter 6. Supplemental Text Material Chapter 6. Supplemental Text Materal S6-. actor Effect Estmates are Least Squares Estmates We have gven heurstc or ntutve explanatons of how the estmates of the factor effects are obtaned n the textboo.

More information

j) = 1 (note sigma notation) ii. Continuous random variable (e.g. Normal distribution) 1. density function: f ( x) 0 and f ( x) dx = 1

j) = 1 (note sigma notation) ii. Continuous random variable (e.g. Normal distribution) 1. density function: f ( x) 0 and f ( x) dx = 1 Random varables Measure of central tendences and varablty (means and varances) Jont densty functons and ndependence Measures of assocaton (covarance and correlaton) Interestng result Condtonal dstrbutons

More information

More metrics on cartesian products

More metrics on cartesian products More metrcs on cartesan products If (X, d ) are metrc spaces for 1 n, then n Secton II4 of the lecture notes we defned three metrcs on X whose underlyng topologes are the product topology The purpose of

More information

STAT 3008 Applied Regression Analysis

STAT 3008 Applied Regression Analysis STAT 3008 Appled Regresson Analyss Tutoral : Smple Lnear Regresson LAI Chun He Department of Statstcs, The Chnese Unversty of Hong Kong 1 Model Assumpton To quantfy the relatonshp between two factors,

More information

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding Recall: man dea of lnear regresson Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 8 Lnear regresson can be used to study an

More information

DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR. Introductory Econometrics 1 hour 30 minutes

DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR. Introductory Econometrics 1 hour 30 minutes 25/6 Canddates Only January Examnatons 26 Student Number: Desk Number:...... DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR Department Module Code Module Ttle Exam Duraton

More information

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 008 Recall: man dea of lnear regresson Lnear regresson can be used to study

More information

/ n ) are compared. The logic is: if the two

/ n ) are compared. The logic is: if the two STAT C141, Sprng 2005 Lecture 13 Two sample tests One sample tests: examples of goodness of ft tests, where we are testng whether our data supports predctons. Two sample tests: called as tests of ndependence

More information

Lecture 4 Hypothesis Testing

Lecture 4 Hypothesis Testing Lecture 4 Hypothess Testng We may wsh to test pror hypotheses about the coeffcents we estmate. We can use the estmates to test whether the data rejects our hypothess. An example mght be that we wsh to

More information

Assignment 5. Simulation for Logistics. Monti, N.E. Yunita, T.

Assignment 5. Simulation for Logistics. Monti, N.E. Yunita, T. Assgnment 5 Smulaton for Logstcs Mont, N.E. Yunta, T. November 26, 2007 1. Smulaton Desgn The frst objectve of ths assgnment s to derve a 90% two-sded Confdence Interval (CI) for the average watng tme

More information

Statistical Evaluation of WATFLOOD

Statistical Evaluation of WATFLOOD tatstcal Evaluaton of WATFLD By: Angela MacLean, Dept. of Cvl & Envronmental Engneerng, Unversty of Waterloo, n. ctober, 005 The statstcs program assocated wth WATFLD uses spl.csv fle that s produced wth

More information

Lecture 3 Stat102, Spring 2007

Lecture 3 Stat102, Spring 2007 Lecture 3 Stat0, Sprng 007 Chapter 3. 3.: Introducton to regresson analyss Lnear regresson as a descrptve technque The least-squares equatons Chapter 3.3 Samplng dstrbuton of b 0, b. Contnued n net lecture

More information

Comparison of Regression Lines

Comparison of Regression Lines STATGRAPHICS Rev. 9/13/2013 Comparson of Regresson Lnes Summary... 1 Data Input... 3 Analyss Summary... 4 Plot of Ftted Model... 6 Condtonal Sums of Squares... 6 Analyss Optons... 7 Forecasts... 8 Confdence

More information

Kernel Methods and SVMs Extension

Kernel Methods and SVMs Extension Kernel Methods and SVMs Extenson The purpose of ths document s to revew materal covered n Machne Learnng 1 Supervsed Learnng regardng support vector machnes (SVMs). Ths document also provdes a general

More information

Linear Approximation with Regularization and Moving Least Squares

Linear Approximation with Regularization and Moving Least Squares Lnear Approxmaton wth Regularzaton and Movng Least Squares Igor Grešovn May 007 Revson 4.6 (Revson : March 004). 5 4 3 0.5 3 3.5 4 Contents: Lnear Fttng...4. Weghted Least Squares n Functon Approxmaton...

More information

LECTURE 9 CANONICAL CORRELATION ANALYSIS

LECTURE 9 CANONICAL CORRELATION ANALYSIS LECURE 9 CANONICAL CORRELAION ANALYSIS Introducton he concept of canoncal correlaton arses when we want to quantfy the assocatons between two sets of varables. For example, suppose that the frst set of

More information

Interval Estimation in the Classical Normal Linear Regression Model. 1. Introduction

Interval Estimation in the Classical Normal Linear Regression Model. 1. Introduction ECONOMICS 35* -- NOTE 7 ECON 35* -- NOTE 7 Interval Estmaton n the Classcal Normal Lnear Regresson Model Ths note outlnes the basc elements of nterval estmaton n the Classcal Normal Lnear Regresson Model

More information

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6 Department of Quanttatve Methods & Informaton Systems Tme Seres and Ther Components QMIS 30 Chapter 6 Fall 00 Dr. Mohammad Zanal These sldes were modfed from ther orgnal source for educatonal purpose only.

More information

Limited Dependent Variables

Limited Dependent Variables Lmted Dependent Varables. What f the left-hand sde varable s not a contnuous thng spread from mnus nfnty to plus nfnty? That s, gven a model = f (, β, ε, where a. s bounded below at zero, such as wages

More information

UNIVERSITY OF TORONTO Faculty of Arts and Science. December 2005 Examinations STA437H1F/STA1005HF. Duration - 3 hours

UNIVERSITY OF TORONTO Faculty of Arts and Science. December 2005 Examinations STA437H1F/STA1005HF. Duration - 3 hours UNIVERSITY OF TORONTO Faculty of Arts and Scence December 005 Examnatons STA47HF/STA005HF Duraton - hours AIDS ALLOWED: (to be suppled by the student) Non-programmable calculator One handwrtten 8.5'' x

More information

Basic Business Statistics, 10/e

Basic Business Statistics, 10/e Chapter 13 13-1 Basc Busness Statstcs 11 th Edton Chapter 13 Smple Lnear Regresson Basc Busness Statstcs, 11e 009 Prentce-Hall, Inc. Chap 13-1 Learnng Objectves In ths chapter, you learn: How to use regresson

More information

Statistical Inference. 2.3 Summary Statistics Measures of Center and Spread. parameters ( population characteristics )

Statistical Inference. 2.3 Summary Statistics Measures of Center and Spread. parameters ( population characteristics ) Ismor Fscher, 8//008 Stat 54 / -8.3 Summary Statstcs Measures of Center and Spread Dstrbuton of dscrete contnuous POPULATION Random Varable, numercal True center =??? True spread =???? parameters ( populaton

More information

Homework Assignment 3 Due in class, Thursday October 15

Homework Assignment 3 Due in class, Thursday October 15 Homework Assgnment 3 Due n class, Thursday October 15 SDS 383C Statstcal Modelng I 1 Rdge regresson and Lasso 1. Get the Prostrate cancer data from http://statweb.stanford.edu/~tbs/elemstatlearn/ datasets/prostate.data.

More information

Resource Allocation and Decision Analysis (ECON 8010) Spring 2014 Foundations of Regression Analysis

Resource Allocation and Decision Analysis (ECON 8010) Spring 2014 Foundations of Regression Analysis Resource Allocaton and Decson Analss (ECON 800) Sprng 04 Foundatons of Regresson Analss Readng: Regresson Analss (ECON 800 Coursepak, Page 3) Defntons and Concepts: Regresson Analss statstcal technques

More information

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity LINEAR REGRESSION ANALYSIS MODULE IX Lecture - 30 Multcollnearty Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur 2 Remedes for multcollnearty Varous technques have

More information

Supplementary Notes for Chapter 9 Mixture Thermodynamics

Supplementary Notes for Chapter 9 Mixture Thermodynamics Supplementary Notes for Chapter 9 Mxture Thermodynamcs Key ponts Nne major topcs of Chapter 9 are revewed below: 1. Notaton and operatonal equatons for mxtures 2. PVTN EOSs for mxtures 3. General effects

More information

x i1 =1 for all i (the constant ).

x i1 =1 for all i (the constant ). Chapter 5 The Multple Regresson Model Consder an economc model where the dependent varable s a functon of K explanatory varables. The economc model has the form: y = f ( x,x,..., ) xk Approxmate ths by

More information

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics ECOOMICS 35*-A Md-Term Exam -- Fall Term 000 Page of 3 pages QUEE'S UIVERSITY AT KIGSTO Department of Economcs ECOOMICS 35* - Secton A Introductory Econometrcs Fall Term 000 MID-TERM EAM ASWERS MG Abbott

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Experment-I MODULE VIII LECTURE - 34 ANALYSIS OF VARIANCE IN RANDOM-EFFECTS MODEL AND MIXED-EFFECTS EFFECTS MODEL Dr Shalabh Department of Mathematcs and Statstcs Indan

More information

January Examinations 2015

January Examinations 2015 24/5 Canddates Only January Examnatons 25 DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR STUDENT CANDIDATE NO.. Department Module Code Module Ttle Exam Duraton (n words)

More information

Composite Hypotheses testing

Composite Hypotheses testing Composte ypotheses testng In many hypothess testng problems there are many possble dstrbutons that can occur under each of the hypotheses. The output of the source s a set of parameters (ponts n a parameter

More information

Maximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models

Maximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models ECO 452 -- OE 4: Probt and Logt Models ECO 452 -- OE 4 Maxmum Lkelhood Estmaton of Bnary Dependent Varables Models: Probt and Logt hs note demonstrates how to formulate bnary dependent varables models

More information

Measurement Uncertainties Reference

Measurement Uncertainties Reference Measurement Uncertantes Reference Introducton We all ntutvely now that no epermental measurement can be perfect. It s possble to mae ths dea quanttatve. It can be stated ths way: the result of an ndvdual

More information

Uncertainty in measurements of power and energy on power networks

Uncertainty in measurements of power and energy on power networks Uncertanty n measurements of power and energy on power networks E. Manov, N. Kolev Department of Measurement and Instrumentaton, Techncal Unversty Sofa, bul. Klment Ohrdsk No8, bl., 000 Sofa, Bulgara Tel./fax:

More information

NUMERICAL DIFFERENTIATION

NUMERICAL DIFFERENTIATION NUMERICAL DIFFERENTIATION 1 Introducton Dfferentaton s a method to compute the rate at whch a dependent output y changes wth respect to the change n the ndependent nput x. Ths rate of change s called the

More information

Lecture 16 Statistical Analysis in Biomaterials Research (Part II)

Lecture 16 Statistical Analysis in Biomaterials Research (Part II) 3.051J/0.340J 1 Lecture 16 Statstcal Analyss n Bomaterals Research (Part II) C. F Dstrbuton Allows comparson of varablty of behavor between populatons usng test of hypothess: σ x = σ x amed for Brtsh statstcan

More information

Comparison of the Population Variance Estimators. of 2-Parameter Exponential Distribution Based on. Multiple Criteria Decision Making Method

Comparison of the Population Variance Estimators. of 2-Parameter Exponential Distribution Based on. Multiple Criteria Decision Making Method Appled Mathematcal Scences, Vol. 7, 0, no. 47, 07-0 HIARI Ltd, www.m-hkar.com Comparson of the Populaton Varance Estmators of -Parameter Exponental Dstrbuton Based on Multple Crtera Decson Makng Method

More information

This column is a continuation of our previous column

This column is a continuation of our previous column Comparson of Goodness of Ft Statstcs for Lnear Regresson, Part II The authors contnue ther dscusson of the correlaton coeffcent n developng a calbraton for quanttatve analyss. Jerome Workman Jr. and Howard

More information

Temperature. Chapter Heat Engine

Temperature. Chapter Heat Engine Chapter 3 Temperature In prevous chapters of these notes we ntroduced the Prncple of Maxmum ntropy as a technque for estmatng probablty dstrbutons consstent wth constrants. In Chapter 9 we dscussed the

More information

Week3, Chapter 4. Position and Displacement. Motion in Two Dimensions. Instantaneous Velocity. Average Velocity

Week3, Chapter 4. Position and Displacement. Motion in Two Dimensions. Instantaneous Velocity. Average Velocity Week3, Chapter 4 Moton n Two Dmensons Lecture Quz A partcle confned to moton along the x axs moves wth constant acceleraton from x =.0 m to x = 8.0 m durng a 1-s tme nterval. The velocty of the partcle

More information

Andreas C. Drichoutis Agriculural University of Athens. Abstract

Andreas C. Drichoutis Agriculural University of Athens. Abstract Heteroskedastcty, the sngle crossng property and ordered response models Andreas C. Drchouts Agrculural Unversty of Athens Panagots Lazards Agrculural Unversty of Athens Rodolfo M. Nayga, Jr. Texas AMUnversty

More information

Difference Equations

Difference Equations Dfference Equatons c Jan Vrbk 1 Bascs Suppose a sequence of numbers, say a 0,a 1,a,a 3,... s defned by a certan general relatonshp between, say, three consecutve values of the sequence, e.g. a + +3a +1

More information

Credit Card Pricing and Impact of Adverse Selection

Credit Card Pricing and Impact of Adverse Selection Credt Card Prcng and Impact of Adverse Selecton Bo Huang and Lyn C. Thomas Unversty of Southampton Contents Background Aucton model of credt card solctaton - Errors n probablty of beng Good - Errors n

More information

Negative Binomial Regression

Negative Binomial Regression STATGRAPHICS Rev. 9/16/2013 Negatve Bnomal Regresson Summary... 1 Data Input... 3 Statstcal Model... 3 Analyss Summary... 4 Analyss Optons... 7 Plot of Ftted Model... 8 Observed Versus Predcted... 10 Predctons...

More information

U-Pb Geochronology Practical: Background

U-Pb Geochronology Practical: Background U-Pb Geochronology Practcal: Background Basc Concepts: accuracy: measure of the dfference between an expermental measurement and the true value precson: measure of the reproducblty of the expermental result

More information

Uncertainty and auto-correlation in. Measurement

Uncertainty and auto-correlation in. Measurement Uncertanty and auto-correlaton n arxv:1707.03276v2 [physcs.data-an] 30 Dec 2017 Measurement Markus Schebl Federal Offce of Metrology and Surveyng (BEV), 1160 Venna, Austra E-mal: markus.schebl@bev.gv.at

More information

CHAPTER 8. Exercise Solutions

CHAPTER 8. Exercise Solutions CHAPTER 8 Exercse Solutons 77 Chapter 8, Exercse Solutons, Prncples of Econometrcs, 3e 78 EXERCISE 8. When = N N N ( x x) ( x x) ( x x) = = = N = = = N N N ( x ) ( ) ( ) ( x x ) x x x x x = = = = Chapter

More information

Using T.O.M to Estimate Parameter of distributions that have not Single Exponential Family

Using T.O.M to Estimate Parameter of distributions that have not Single Exponential Family IOSR Journal of Mathematcs IOSR-JM) ISSN: 2278-5728. Volume 3, Issue 3 Sep-Oct. 202), PP 44-48 www.osrjournals.org Usng T.O.M to Estmate Parameter of dstrbutons that have not Sngle Exponental Famly Jubran

More information

Markov Chain Monte Carlo Lecture 6

Markov Chain Monte Carlo Lecture 6 where (x 1,..., x N ) X N, N s called the populaton sze, f(x) f (x) for at least one {1, 2,..., N}, and those dfferent from f(x) are called the tral dstrbutons n terms of mportance samplng. Dfferent ways

More information

Diagnostics in Poisson Regression. Models - Residual Analysis

Diagnostics in Poisson Regression. Models - Residual Analysis Dagnostcs n Posson Regresson Models - Resdual Analyss 1 Outlne Dagnostcs n Posson Regresson Models - Resdual Analyss Example 3: Recall of Stressful Events contnued 2 Resdual Analyss Resduals represent

More information

Chapter 15 - Multiple Regression

Chapter 15 - Multiple Regression Chapter - Multple Regresson Chapter - Multple Regresson Multple Regresson Model The equaton that descrbes how the dependent varable y s related to the ndependent varables x, x,... x p and an error term

More information

Non-Mixture Cure Model for Interval Censored Data: Simulation Study ABSTRACT

Non-Mixture Cure Model for Interval Censored Data: Simulation Study ABSTRACT Malaysan Journal of Mathematcal Scences 8(S): 37-44 (2014) Specal Issue: Internatonal Conference on Mathematcal Scences and Statstcs 2013 (ICMSS2013) MALAYSIAN JOURNAL OF MATHEMATICAL SCIENCES Journal

More information

STAT 405 BIOSTATISTICS (Fall 2016) Handout 15 Introduction to Logistic Regression

STAT 405 BIOSTATISTICS (Fall 2016) Handout 15 Introduction to Logistic Regression STAT 45 BIOSTATISTICS (Fall 26) Handout 5 Introducton to Logstc Regresson Ths handout covers materal found n Secton 3.7 of your text. You may also want to revew regresson technques n Chapter. In ths handout,

More information

Lecture 3: Probability Distributions

Lecture 3: Probability Distributions Lecture 3: Probablty Dstrbutons Random Varables Let us begn by defnng a sample space as a set of outcomes from an experment. We denote ths by S. A random varable s a functon whch maps outcomes nto the

More information

Global Sensitivity. Tuesday 20 th February, 2018

Global Sensitivity. Tuesday 20 th February, 2018 Global Senstvty Tuesday 2 th February, 28 ) Local Senstvty Most senstvty analyses [] are based on local estmates of senstvty, typcally by expandng the response n a Taylor seres about some specfc values

More information

A Robust Method for Calculating the Correlation Coefficient

A Robust Method for Calculating the Correlation Coefficient A Robust Method for Calculatng the Correlaton Coeffcent E.B. Nven and C. V. Deutsch Relatonshps between prmary and secondary data are frequently quantfed usng the correlaton coeffcent; however, the tradtonal

More information

A Note on Test of Homogeneity Against Umbrella Scale Alternative Based on U-Statistics

A Note on Test of Homogeneity Against Umbrella Scale Alternative Based on U-Statistics J Stat Appl Pro No 3 93- () 93 NSP Journal of Statstcs Applcatons & Probablty --- An Internatonal Journal @ NSP Natural Scences Publshng Cor A Note on Test of Homogenety Aganst Umbrella Scale Alternatve

More information

Using the estimated penetrances to determine the range of the underlying genetic model in casecontrol

Using the estimated penetrances to determine the range of the underlying genetic model in casecontrol Georgetown Unversty From the SelectedWorks of Mark J Meyer 8 Usng the estmated penetrances to determne the range of the underlyng genetc model n casecontrol desgn Mark J Meyer Neal Jeffres Gang Zheng Avalable

More information

Maximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models

Maximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models ECO 452 -- OE 4: Probt and Logt Models ECO 452 -- OE 4 Mamum Lkelhood Estmaton of Bnary Dependent Varables Models: Probt and Logt hs note demonstrates how to formulate bnary dependent varables models for

More information

STATISTICS QUESTIONS. Step by Step Solutions.

STATISTICS QUESTIONS. Step by Step Solutions. STATISTICS QUESTIONS Step by Step Solutons www.mathcracker.com 9//016 Problem 1: A researcher s nterested n the effects of famly sze on delnquency for a group of offenders and examnes famles wth one to

More information

Linear regression. Regression Models. Chapter 11 Student Lecture Notes Regression Analysis is the

Linear regression. Regression Models. Chapter 11 Student Lecture Notes Regression Analysis is the Chapter 11 Student Lecture Notes 11-1 Lnear regresson Wenl lu Dept. Health statstcs School of publc health Tanjn medcal unversty 1 Regresson Models 1. Answer What Is the Relatonshp Between the Varables?.

More information

Statistics II Final Exam 26/6/18

Statistics II Final Exam 26/6/18 Statstcs II Fnal Exam 26/6/18 Academc Year 2017/18 Solutons Exam duraton: 2 h 30 mn 1. (3 ponts) A town hall s conductng a study to determne the amount of leftover food produced by the restaurants n the

More information

Turbulence classification of load data by the frequency and severity of wind gusts. Oscar Moñux, DEWI GmbH Kevin Bleibler, DEWI GmbH

Turbulence classification of load data by the frequency and severity of wind gusts. Oscar Moñux, DEWI GmbH Kevin Bleibler, DEWI GmbH Turbulence classfcaton of load data by the frequency and severty of wnd gusts Introducton Oscar Moñux, DEWI GmbH Kevn Blebler, DEWI GmbH Durng the wnd turbne developng process, one of the most mportant

More information

ECONOMETRICS - FINAL EXAM, 3rd YEAR (GECO & GADE)

ECONOMETRICS - FINAL EXAM, 3rd YEAR (GECO & GADE) ECONOMETRICS - FINAL EXAM, 3rd YEAR (GECO & GADE) June 7, 016 15:30 Frst famly name: Name: DNI/ID: Moble: Second famly Name: GECO/GADE: Instructor: E-mal: Queston 1 A B C Blank Queston A B C Blank Queston

More information

Marginal Effects in Probit Models: Interpretation and Testing. 1. Interpreting Probit Coefficients

Marginal Effects in Probit Models: Interpretation and Testing. 1. Interpreting Probit Coefficients ECON 5 -- NOE 15 Margnal Effects n Probt Models: Interpretaton and estng hs note ntroduces you to the two types of margnal effects n probt models: margnal ndex effects, and margnal probablty effects. It

More information

Econ Statistical Properties of the OLS estimator. Sanjaya DeSilva

Econ Statistical Properties of the OLS estimator. Sanjaya DeSilva Econ 39 - Statstcal Propertes of the OLS estmator Sanjaya DeSlva September, 008 1 Overvew Recall that the true regresson model s Y = β 0 + β 1 X + u (1) Applyng the OLS method to a sample of data, we estmate

More information

Chapter 8 Indicator Variables

Chapter 8 Indicator Variables Chapter 8 Indcator Varables In general, e explanatory varables n any regresson analyss are assumed to be quanttatve n nature. For example, e varables lke temperature, dstance, age etc. are quanttatve n

More information

Introduction to Regression

Introduction to Regression Introducton to Regresson Dr Tom Ilvento Department of Food and Resource Economcs Overvew The last part of the course wll focus on Regresson Analyss Ths s one of the more powerful statstcal technques Provdes

More information

Learning Objectives for Chapter 11

Learning Objectives for Chapter 11 Chapter : Lnear Regresson and Correlaton Methods Hldebrand, Ott and Gray Basc Statstcal Ideas for Managers Second Edton Learnng Objectves for Chapter Usng the scatterplot n regresson analyss Usng the method

More information

2E Pattern Recognition Solutions to Introduction to Pattern Recognition, Chapter 2: Bayesian pattern classification

2E Pattern Recognition Solutions to Introduction to Pattern Recognition, Chapter 2: Bayesian pattern classification E395 - Pattern Recognton Solutons to Introducton to Pattern Recognton, Chapter : Bayesan pattern classfcaton Preface Ths document s a soluton manual for selected exercses from Introducton to Pattern Recognton

More information

USE OF DOUBLE SAMPLING SCHEME IN ESTIMATING THE MEAN OF STRATIFIED POPULATION UNDER NON-RESPONSE

USE OF DOUBLE SAMPLING SCHEME IN ESTIMATING THE MEAN OF STRATIFIED POPULATION UNDER NON-RESPONSE STATISTICA, anno LXXV, n. 4, 015 USE OF DOUBLE SAMPLING SCHEME IN ESTIMATING THE MEAN OF STRATIFIED POPULATION UNDER NON-RESPONSE Manoj K. Chaudhary 1 Department of Statstcs, Banaras Hndu Unversty, Varanas,

More information