DIAGNOSTICS AND REMEDIAL MEASURES

Size: px
Start display at page:

Download "DIAGNOSTICS AND REMEDIAL MEASURES"

Transcription

1 DIAGNOSTICS AND REMEDIAL MEASURES Rajender Parsad and Lal Mohan Bhar I.A.S.R.I., Lbrary Aenue, New Delh-0 0 rajender@asr.res.n; lmbhar@asr.res.n The nterpretaton of data based on analyss of arance (ANOVA) s ald only when the followng assumptons are satsfed:. Addte Effects: Treatment effects and block (enronmental) effects are addte.. Independence of errors: Expermental errors are ndependent. 3. Homogenety of Varances: Errors hae common arance. 4. Normal Dstrbuton: Errors follow a normal dstrbuton. Also the statstcal tests t, F, z, etc. are ald under the assumpton of ndependence of errors and normalty of errors. The departures from these assumptons make the nterpretaton based on these statstcal technques nald. Therefore, t s necessary to detect the deatons and apply the approprate remedal measures. The assumpton of ndependence of errors,.e., error of an obseraton s not related to or depends upon that of another. Ths assumpton s usually assured wth the use of proper randomzaton procedure. Howeer, f there s any systematc pattern n the arrangement of treatments from one replcaton to another, errors may be non-ndependent. Ths may be handled by usng nearest neghbour methods n the analyss of expermental data. The assumpton of addte effects can be defned and detected n the followng manner: Addte Effects The effects of two factors, say, treatment and replcaton, are sad to be addte f the effect of one-factor remans constant oer all the leels of other factors. A hypothetcal set of data from a randomzed complete block (RCB) desgn, wth treatments and replcatons, wth addte effects s gen n Table. Table Treatment Replcaton Replcaton Effect I II I - II A B Treatment Effect (A-B) 0 0 Here, the treatment effect s equal to 0 for both replcatons and replcaton effect s 65 for both treatments. When the effect of one factor s not constant at all the leels of other factor, the effects are sad to be non-addte. A common departure from the assumpton of addtty n bologcal experments s one where the effects are multplcate. Two factors are sad to hae multplcate effects f ther effects are addte only when expressed n terms of percentages. Table llustrates a hypothetcal set of data wth multplcate effects.

2 Treatment Table Replcaton Replcaton Effect I II I - II 00(I - II)/II A (.3003) (.0969) (0.04) B (.04) (.0000) (0.04) Treatment Effect (A-B) 40 (0.0969) 5 (0.0969) 00 (A - B)/B 5 5 In ths case, the treatment effect s not constant oer replcatons and the replcaton effect s not constant oer treatments. Howeer, when both treatment effect and replcaton effect are expressed n terms of percentages, an entrely dfferent pattern emerges. For such olatons of assumptons, Logarthmc transformaton s qute sutable. For llustraton, the Logarthmc transformaton of data n Table s gen n brackets. Ths s, howeer a crude method for testng the addtty. Tukey (949) gae a statstcal test for testng the addtty n a RCB desgn. Ths test s known as one degree of freedom test for non-addtty. In ths test, one degree of freedom s solated from error and ths degree of freedom s called as the degree of freedom for non-addtty. In the sequel, we descrbe the procedure n bref. Suppose that an experment has been conducted to compare treatments usng RCB desgn wth r replcatons. Let y j denote the obsered alue of the response arable for th th treatment n j replcaton; =,,, ; j =,,, r. Arrange the data n a r table as gen below: Treatment K j K r Treatment Total Treatment y y K y y K y j y j K K y r T. y.. Deatons from Grand d C Sum of Cross Product y r T. y. d. C M M M K M K M M M M M y y K y j K y r T. y. d. C M M M K M K M M M M M y y K y j K y r T. y. d. C Replcaton R. R K. R K. j R G (Grand. r Total total) Replcaton y. y K. y K. j y. r G GM = r Deaton d. d K. d K. j from Grand d. r III-

3 r where T. = y j ; y. = T. / r ; R. j = y j ; y. j = R. j / ; d. = y. GM = j= r d. j = y. j GM ; C = yj d. j j= Obtan L = = C d. ; D = d. ; D = d = Sum of squares due to non-addtty (SSNA)= The sum of squares due to treatments, replcatons and total sum of squares are gen by r j=. j D L D T. Sum of squares due to treatments (SST) = r = G r R. Sum of squares due to replcatons (SSR) = Total sum of squares (TSS) = = r j y j G r r j= j G r Sum of squares due to Error (SSE) = TSS SST-SSR-SSNA Then the outlne of ANOVA table s Source df SS MS Treatments - SST MST Replcatons r- SSR MSR Non-addtty SSNA MSNA Error (-)(r-)- SSE MSE Total r- TSS The mean squares (MS) are obtaned by ddng sum of squares (SS) by correspondng degrees of freedom (df). The non-addtty s tested by F-statstc wth and (-)(r-)- MSNA degree of freedom calculated alue of F =. MSE Normalty of Errors The assumptons of homogenety of arances and normalty are generally olated together. To test the aldty of normalty of errors for the character under study, one can take help of Normal Probablty Plot, Anderson-Darlng Test, D'Augstno's Test, Shapro - Wlk's Test, Ryan-Joner test, Kolmogro-Smrno test, etc. In general moderate departures from III-3

4 normalty are of lttle concern n the fxed effects ANOVA as F - test s slghtly affected but n case of random effects, t s more seerely mpacted by non-normalty. The sgnfcant deatons of errors from normalty, makes the nferences nald. So before analyzng the data, t s necessary to conert the data to a scale that t follows a normal dstrbuton. In the data from desgned feld experments, we do not drectly use the orgnal data for testng of normalty or homogenety of obseratons because ths s embedded wth the treatment effects and some of other effects lke block, row, column, etc. So there s need to elmnate these effects from the data before testng the assumptons of normalty and homogenety of arances. For elmnatng the treatment effects and other effects we ft the model correspondng to the desgn adopted and estmate the resduals. These resduals are then used for testng the normalty of the obseratons. In other words, we want to test the null hypothess H 0 : errors are normally dstrbuted aganst alternate hypothess H : errors are not normally dstrbuted. For detals on these tests one may refer to D Agostno and Stephens (986). Most of the standard statstcal packages aalable n the market are capable of testng the normalty of the data. In SAS and SPSS commonly used tests are Shapro-Wlk test and Kolmogro-Smrno test. MINITAB uses three tests z. Anderson-Darlng, Ryan-Joner, Kolmogro-Smrno for testng the normalty of data. Homogenety of Error Varances A crude method for detectng the heterogenety of arances s based on scatter plots of means and arance or range of obseratons or errors, resdual s ftted alues, etc. To be clearer, let Y j be the obseraton pertanng to th treatment ( = () ) n the j th replcaton ( ( ) j = ) r. Compute the mean and arance for each treatment across the replcatons (the range can be used n place of arance) as r r = Y. = Yj ; Varance = S = ( Yj Y. ) r r j= j= Draw the scatter plot of mean s arance (or range). If S. 's ( = () ) are equal (constant) or nearly equal, then the arances are homogeneous. Based on these scatter plots, the heterogenety of arances can be classfed nto two types:. Where the arance s functonally related to mean.. Where there s no functonal relatonshp between the arance and the mean. For llustraton some scatter - dagrams of mean and arances (or range) are gen as: Varance Varance (a) Homogeneous arance (b) Heterogeneous arance where arance s proportonal to mean III-4

5 Varance (c) Heterogeneous arance wthout any functonal relatonshp between arance and mean The frst knd of arance heterogenety (fgure b) s usually assocated wth the data whose dstrbuton s non-normal z., negate bnomal, Posson, bnomal, etc. The second knd of arance heterogenety usually occurs n experments, where, due to the nature of treatments tested, some treatments hae errors that are substantally hgher (lower) than others. For example, n aretal trals, where arous types of breedng materal are beng compared, the sze of arance between plots of a partcular arety wll depend on the degree of genetc homogenety of materal beng tested. The arance of F generaton, for example, can be expected to be hgher than that of F generaton because genetc arablty n F s much hgher than that n F. The arances of aretes that are hghly tolerant of or hghly susceptble to, the stress beng tested are expected to be smaller than those of hang moderate degree of tolerance. Also n testng yeld response to a chemcal treatment, such as, fertlzer, nsectcde or herbcde, the non-unform applcaton of chemcal treatments may result n a hgher arablty n the treated plots than that n the untreated plots. The scatter-dagram of means and arances of obseratons for each treatment across the replcatons ges only a prelmnary dea about homogenety of error arances. Statstcally the homogenety of error arances s tested usng Bartlett's test for normally dstrbuted errors and Leene test for non-normal errors. These tests are descrbed n the sequel. Bartlett's Test for Homogenety of Varances Let there are - ndependent samples drawn from same populaton and th sample s of sze r and ( r + r r ) = N. In the present case, the ndependent samples are the resduals of the obseratons pertanng to treatments and th sample sze s the number of replcatons of the treatment. One wants to test the null hypothess H 0 σ = σ =... = σ aganst the alternate hypothess H : at least two of the σ ' s arance for treatment. : are not equal, where σ s the error Let e j denotes the resdual pertanng to the obseraton of treatment from replcaton j, then t can easly be shown that the sum of resduals pertanng to a gen treatment s zero. In III-5

6 ths test S r r = ( ej e. ) = ej r j= r j= s taken as unbased estmate of σ. The procedure noles computng a statstc whose samplng dstrbuton s closely approxmated by the χ dstrbuton wth - degrees of freedom. The test statstc s χ q 0 =.306 c and null hypothess s rejected when χ 0 > χ α,, where χ α, s the upper α percentage pont of χ dstrbuton wth - degrees of freedom. To compute χ 0, follow the steps: Step : Compute mean and arance of all -samples. Step : Obtan pooled arance S p = = ( r ) N Step 3: Compute q = ( N ) log S ( r ) Step 4: Compute Step 5: Compute χ 0. c = + ( ) 3 = S 0 p log0 = ( r ) ( N ) Bartlett's χ test for homogenety of arances s a modfcaton of the normal-theory lkelhood rato test. Whle Bartlett's test has accurate Type I error rates and optmal power when the underlyng dstrbuton of the data s normal, t can be ery naccurate f that dstrbuton s een slghtly non-normal (Box 953). Therefore, Bartlett's test s not recommended for routne use. An approach that leads to tests that are much more robust to the underlyng dstrbuton s to transform the orgnal alues of the dependent arable to dere a dsperson arable and then to perform analyss of arance on ths arable. The sgnfcance leel for the test of homogenety of arance s the p-alue for the ANOVA F-test on the dsperson arable. Commonly used test for testng the homogenety of arance usng a dsperson arable s Leene Test gen by Leene (960). The procedure s descrbed n the sequel. Leene Test for homogenety of Varances The test s based on the arablty of the resduals. The larger the error arance, the larger the arablty of the resduals wll tend to be. To conduct the Leene test, we dde the data nto dfferent groups based on the number of treatments f the error arance s ether ncreasng or S III-6

7 decreasng wth the treatments, the resduals n the one treatment wll tend to be more arable than those n others treatments. The Leene test than conssts smply F statstc based on one way ANOVA used to determne whether the mean of absolute/ Square root deaton from mean are sgnfcantly dfferent or not. The resduals are obtaned from the usual analyss of arance. The test statstc s gen as ( r ) r ( d. d.. ) = = F = F(( ), r ( r ) = ( d d ) where d j j = j= = e e ; j. r dj j= d. = ; r r dj = j= d.. = and r j r j= e s the mean of the resduals of the th treatment. e s the j th resdual for the th plot, Ths test was modfed by Brown and Forsythe (974). In the modfed test, the absolute deaton s taken from the medan nstead of mean n order to make the test more robust. In the present nestgaton, the Bartlett's χ -test has been used for testng the homogenety of error arances when the dstrbuton of errors s normal and Leene test for non-normal errors. Remark : In a block desgn, t can easly be shown that the sum of resduals wthn a gen block s zero. Therefore, the resduals n a block of sze wll be same wth ther sgn reerse n order. As a consequence, q n Bartlett s test and numerator n Leene test statstc becomes zero for the data generated from experments conducted to compare only two treatments n a RCB desgn. Hence, the tests for homogenety of error arances cannot be used for the experments conducted to compare only two treatments n a RCB desgn. Inferences from such experments may be drawn usng Fsher-Behren t-test. Further, Bartlett s test cannot be used for the expermental stuatons where some of the treatments are sngly replcated. Remark : In a RCB desgn, t can easly be shown that the sum of resduals from a partcular treatment s zero. As a consequence, the denomnator of Leene test statstc s zero for the data generated from RCB desgns wth two replcatons. Therefore, Leene test cannot be used for testng the homogenety of error arances for the data generated from RCB desgns wth two replcatons. Remedal Measures In ths secton, we shall dscuss the remedal measures for non-normal and/or heterogeneous data n greater detals. III-7

8 Data transformaton s the most approprate remedal measure, n the stuaton where the arances are heterogeneous and are some functons of means. Wth ths technque, the orgnal data are conerted to a new scale resultng nto a new data set that s expected to satsfy the homogenety of arances. Because a common transformaton scale s appled to all obseratons, the comparate alues between treatments are not altered and comparson between them remans ald. Error parttonng s the remedal measure of heterogenety that usually occurs n experments, where, due to the nature of treatments tested some treatments hae errors that are substantally hgher (lower) than others. Here, we shall concentrate on those stuatons where character under study s non-normal and arances are heterogeneous. Dependng upon the functonal relatonshp between arances and means, sutable transformaton s adopted. The transformed arate should satsfy the followng:. The arances of the transformed arate should be unaffected by changes n the means. Ths s also called the arance stablzng transformaton.. It should be normally dstrbuted. 3. It should be one for whch effects are lnear and addte. 4. The transformed scale should be such for whch an arthmetc aerage from the sample s an effcent estmate of true mean. The followng are the three transformatons, whch are beng used most commonly, n bologcal research. a) Logarthmc Transformaton b) Square root Transformaton c) Arc Sne or Angular Transformaton a) Logarthmc Transformaton Ths transformaton s sutable for the data where the arance s proportonal to square of the mean or the coeffcent of araton (S.D./mean) s constant or where effects are multplcate. These condtons are generally found n the data that are whole numbers and coer a wde range of alues. Ths s usually the case when analyzng growth measurements such as trunk grth, length of extenson growth, weght of tree or number of nsects per plot, number of eggmass per plant or per unt area etc. For such stuatons, t s approprate to analyze log X nstead of actual data, X. When data set 3 noles small alues or zeros, log (X+), log( X + ) or log X + should be used 8 nstead of log X. Ths transformaton would make errors normal, when obseratons follow negate bnomal dstrbuton lke n the case of nsect counts. b) Square-Root Transformaton Ths transformaton s approprate for the data sets where the arance s proportonal to the mean. Here, the data conssts of small whole numbers, for example, data obtaned n countng rare eents, such as the number of nfested plants n a plot, the number of nsects caught n III-8

9 traps, number of weeds per plot, parthenocarpy n some aretes of mango. Ths data set generally follows the Posson dstrbuton and square root transformaton approxmates Posson to normal dstrbuton. For these stuatons, t s better to analyze X than that of X, the actual data. If X s 3 confrmed to small whole numbers then, X + or X + should be used nstead of X. 8 Ths transformaton s also approprate for the percentage data, where, the range s between 0 to 30% or between 70 and 00%. c) Arc Sne Transformaton Ths transformaton s approprate for the data on proportons,.e., data obtaned from a count and the data expressed as decmal fractons and percentages. The dstrbuton of percentages s bnomal and ths transformaton makes the dstrbuton normal. Snce the role of ths transformaton s not properly understood, there s a tendency to transform any percentage usng arc sne transformaton. But only that percentage data that are dered from count data, such as % barren tllers (whch s dered from the rato of the number of non-bearng tllers to the total number of tllers) should be transformed and not the percentage data such as % proten or % carbohydrates, %ntrogen, etc. whch are not dered from count data. For these stuatons, t s better to analyze sn ( X ) than that of X, the actual data. If the alue of X s 0%, t should be substtuted by and the alue of 00% by 4n number of unts upon whch the percentage data s based. 00, where n s the 4n It s nterestng to note here that not all percentage data need to be transformed and een f they do, arc sne transformaton s not the only transformaton possble. The followng rules may be useful n choosng the proper transformaton scale for percentage data dered from count data. Rule : The percentage data lyng wthn the range 30 to 70% s homogeneous and no transformaton s needed. Rule : For percentage data lyng wthn the range of ether 0 to 30% or 70 to 00%, but not both, the square root transformaton should be used. Rule 3: For percentage that do not follow the ranges specfed n Rule or Rule, the Arc Sne transformaton should be used. The other transformatons used are recprocal square root [, when arance s X proportonal to cube of mean], recprocal [ X, when arance s proportonal to fourth power of mean] and tangent hyperbolc transformaton. The transformaton dscussed aboe are a partcular case of the general famly of transformatons known as Box-Cox transformaton. III-9

10 d) Box-Cox Transformaton By now we know that f the relaton between the arance of obseratons and the mean s known then ths nformaton can be utlzed n selectng the form of the transformaton. We now elaborate on ths pont and show how t s possble to estmate the form of the requred transformaton from the data. The transformaton suggested by Box and Cox (964) s a power transformaton of the orgnal data. Let y ut be the obseraton pertanng to the u th plot; then the power transformaton mples that we use y ut s as * λ y ut = y ut. * λ The transformaton parameter λ n y ut = y ut may be estmated smultaneously wth the other model parameters (oerall mean and treatment effects) usng the method of maxmum lkelhood. The procedure conssts of performng, for the arous alues of λ, a standard analyss of arance on λ yut λ 0 (A) λy& ut λ ( λ) yut = y& ut ln yut λ = 0 N nu where y& ut = ln (/ n) ln yut. u= t= y& ut s the geometrc mean of the obseratons. The maxmum lkelhood estmate of λ s the alue for whch the error sum of squares, say SSE (λ), s mnmum. Notce that we cannot select the alue of λ by drectly comparng the error sum of squares from analyss of arance λ on y because for each alue of λ the error sum of squares s measured on a dfferent scale. Equaton (A) rescales the responses so that the error sums of squares are drectly comparable. Ths s a ery general transformaton and the commonly used transformatons follow as partcular cases. The partcular cases for dfferent alues of λ are gen below. λ Transformaton No Transformaton ½ Square Root 0 Log -/ Recprocal Square Root - Recprocal Remark 3: If any one of the obseratons s zero then the geometrc mean s undefned. In the expresson (A), geometrc mean s n denomnator so t s not possble to compute that expresson. For solng ths problem, we add a small quantty to each of the obseratons. III-0

11 Note: It should be emphaszed that transformaton, f needed, must take place rght at the begnnng of the analyss, all fttng of mssng plot alues, all adjustments by coarance etc. beng done wth the transformed arate and not wth the orgnal data. At the end, when the conclusons hae been reached, t s permssble to 're-transform' the results so as to present them n the orgnal unts of measurement, but ths s done only to render them more ntellgble. As a result of ths transformaton followed by back transformaton, the means wll rather be dfferent from those that would hae been obtaned from the orgnal data. A smple example s that wthout transformaton, the mean of the numbers, 4, 9, 6 and 5 s. Suppose a square root transformaton s used to ge,, 3, 4 and 5, the mean s now 3, whch after back- transformaton ges 9. Usually the dfference wll not be so great because data do not usually ary as much as those gen, but logarthmc and square root transformaton always lead to a reducton of the mean, just as angles of equal formaton usually lead to ts mong away from the central alue of 50%. Howeer, n practce, computng treatment means from orgnal data s more frequently used because of ts smplcty, but ths may change the order of rankng of conerted means for comparson. Although transformatons make possble a ald analyss, they can be ery awkward. For example, although a sgnfcant dfference can be worked out n the usual way for means of the transformed data, none can be worked out for the treatment means after back transformaton. Non-parametrc tests n the Analyss of Expermental Data When the data remans non-normal and/or heterogeneous een after transformaton, a recourse s made to non-parametrc test procedures. A lot of attenton s beng pad to deelop non-parametrc tests for analyss of expermental data. Most of these non-parametrc test procedures are based on rank statstc. The rank statstc has been used n deelopment of these tests as the statstc based on ranks s. dstrbuton free. easy to calculate and 3. smple to explan and understand. Another reason for use of rank statstc s due to the well known result that the aerage rank approaches normalty quckly as n (number of obseratons) ncreases, under the rather general condtons, whle the same mght not be true for the orgnal data {see e.g. Conoer and Iman (976, 98)}. The non-parametrc test procedures aalable n lterature coer completely randomzed desgns, randomzed complete block desgns, balanced ncomplete block desgns, desgn for boassays, splt plot desgns, cross-oer desgns and so on. For an excellent and elaborate dscussons on non-parametrc tests n the analyss of expermental data, one may refer to Segel and Castellan Jr. (988), Deshpande, Gore and Shanubhogue (995), Sen (996), and Hollander and Wolfe (999). Kruskal-Walls Test can be used for the analyss of data from completely randomzed desgns. Skllngs and Mack Test helps n analyzng the data from a general block desgn. Fredman Test and Durbn Test are partcular cases of ths test. Fredman Test s used for the analyss of data from randomzed complete block desgns and Durbn test for the analyss of data from balanced ncomplete block desgns. III-

12 Some examples of testng the assumptons of normalty and homogenety of errors and remedal measures are dscussed n the Appendx. Some Useful References Anderson,V.L. and McLean,R.A.(974). Desgn of Experments: A realstc approach. Marcel Dekker Inc., New York. Bartlett,M.S.(947). The use of transformaton. Bometrcs, 3, Brown, M. B. and Forsythe, A.B. (974). Robust tests for the equalty of arances. Journal of the Amercan Statstcal Assocaton, 69, Box, G.E.P., (953). Non-normalty and tests on arances. Bometrka, 40 ( & ), Box,G.E.P. and Cox,D.R.(964). An analyss of transformaton. J. Roy. Stastst. Soc. B, 6, -5. Conoer, W.J. and Iman, R.L. (976). In Some Alternate Procedure Usng Ranks for the Analyss of Expermental Desgns. Comm. Statst.: Theory & Methods, A5(4), Conoer,W.J. and Iman,R.L.(98). Rank transfomatons as a brdge between parametrc and non-parametrc statstcs (wth dscusson). Amercan Statstcan, 35, D Agostno, R.B. and Stephens, M. A.(986). Goodness-of-ft Technques. Marcel Dekkar, Inc., New York. Deshpande, J.V., Gore, A.P. and Shanubhogue, A. (995). Statstcal Analyss of Non-normal Data. Wley Eastern Lmted, New Delh. Dean, A and Voss, D (999). Desgn and Analyss of Experments. Sprnger, New York. Dolby,J.L.(963). A quck method for choosng a transformaton. Technometrcs, 5, Draper,N.R. and Hunter,W.G.(969). Transformatons: Some examples rested. Technometrcs,, Hollander, M. and Wolfe, D.A. (999). Nonparametrc Statstcal Methods, nd Ed, John Wley, New York. Leene, H., (960). Robust Tests for Equalty of Varances. I Olkn, ed., Contrbutons to Probablty and Statstcs, Palo Alto, Calf.: Stanford Unersty Press, Royston, P. (99), Approxmatng the Shapro-Wlk W-Test for non-normalty. Statstcs and Computng,, 7-9. Shapro, S. S. and Wlk, M. B.. (965). An analyss of arances test for normalty (Complete Samples). Bometrka, 5, 59-6 Segel, S. and Catellan, N.J. Jr. (988). Non parametrc statstcs for the Behaoural Scences, nd Ed., McGraw Hll, New York. Tukey, J.W.(949). One degree of freedom for non-addtty. Bometrcs, 5, 3-4. III-

13 Appendx Example : Suppose an entomologst s nterested n determnng whether four dfferent knds of traps caught equalent nsects when appled to same feld. Each of the traps s used sx tmes on the feld and resultng data (number of nsects per hour) are as shown below alongwth mean, arance and range. Treatment Replcaton Varance Range I II III IV V VI Y A B C D A scatter plot of mean and arance and mean ersus range are gen as follows: S 80 Range Varance Both plots ndcate that arances are heterogeneous and arance s proportonal to mean. Obtan the resduals for testng the normalty and homogenety of error terms. The resduals obtaned are gen below: Treatment Replcaton Varance I II III IV V VI S A B C D III-3

14 Normalty of error terms n SPSS: Analyze Descrpte Statstcs Explore (Enter the arable whose normalty s to be tested n Dependent Lst) Clck on the opton Plots Check the opton Normalty Plots wth tests Shapro-Wlk Test Kolmogro-Smrno Test Statstc p-alue Statstc p-alue (SW) (KS) The errors were found to be normally dstrbuted. Therefore, homogenety of error arances was tested usng Bartlett's test. It s descrbed n the sequel. 5 Pooled Varance ( ) ( ) S p = 0 = [ log log log 650.0] q = 0 log log = c = + = χ = Snce χ 0.05,3 = 7. 8, therefore, we reject the null hypothess and conclude that the arances are unequal. The S Y. are 5.77, 5.3, 3.43 and 5.43, ndcatng that arance s proportonal to mean. Therefore, square root transformaton should be used. After applcaton of square root transformaton, the resduals are Treatment Replcaton Varance I II III IV V VI S A B C D Normalty of error terms on the transformed data: Shapro-Wlk Test Kolmogro-Smrno Test Statstc p-alue Statstc p-alue (SW) (KS) III-4

15 The errors reman normally dstrbuted after transformaton. The results of homogenety of error arances usng Bartlett's test are Bartlett's Test (normal dstrbuton): Test statstc = 0.89, p-alue = 0.88 Hence, we conclude that the errors are normally dstrbuted and hae a constant arance after transformaton. The results of analyss of arance wth orgnal and transformed data are gen n the sequel. ANOVA: Orgnal Data Source DF Seq SS Adj. SS Square F (F-calc) p(pr>f) Replcaton Treatment Error Total R-Square Root MSE 9.80% 9.37 Tukey Smultaneous Tests for All Parwse Treatment Comparsons ANOVA: Transformed Data Source DF Seq SS Adj. SS Square F (F-calc) p(pr>f) Replcaton Treatment Error Total R-Square Root MSE 93.99%.89 Tukey Smultaneous Tests for All Parwse Treatment Comparsons Wth transformed data treatments and are sgnfcantly dfferent whereas wth orgnal data, they were not. III-5

16 Example : A aretal tral on Rapeseed-Mustard was conducted at Fazabad wth aretes usng a randomzed complete block desgn wth 3 replcatons. The expermental data (Yeld n kg/ha ) obtaned from the aboe experment s Treatments Replcatons R R R3 MCN MCN MCN MCN MCN MCN MCN MCN MCN MCN MCN The analyss of arance of the orgnal data s gen as ANOVA: Orgnal Data Sources DF SS MS F Prob. >F Replcaton Treatment Error Total R-Square CV RMSE Yld Normalty of error terms was tested, the results are gen as Shapro-Wlk Test Kolmogro-Smrno Test Statstc p-alue Statstc p-alue (SW) (KS) >0.500 Snce the data s normal, therefore, Bartlett s test s used for testng the homogenety of error arances. The results are gen as Bartlett s Test Test Statstc : 0.77 P-Value : The errors were found to be heterogeneous. Therefore, we can conclude that the data s heterogeneous and normal. Therefore, Box-Cox transformaton was used as a remedal measure. In the sequel we descrbe the results of the Box-Cox transformaton. III-6

17 For ths we transform the data by aryng λ from -0 to +0 wth an ncrement of 0.0. The error sum of squares are computed for each alue of λ. The alue of λ wth mnmum error sum of squares s used for transformaton gen n (A). The mnmum alue SSE s obtaned for λ =.38. Therefore, recprocal transformaton was used. The assumptons of normalty and homogenety of errors are agan tested usng the transformed data. Normalty of error terms was tested, the results are gen as Shapro-Wlk Test Kolmogro-Smrno Test Statstc p-alue Statstc p-alue (SW) (KS) >0.500 Snce the data s normal, therefore, Bartlett s Test s used for testng the homogenety of error arances. The results are gen as Bartlett's Test (normal dstrbuton) Test Statstc : 5.75 P-Value : The transformed obseratons were found to be normal and homogeneous Therefore, ANOVA was performed on the transformed data. The results obtaned are: ANOVA: Transformed Data Sources DF SS MS F Prob. >F Replcaton E E Treatment E E Error E E3 Total E5 R-Square CV RMSE Transformed Yld We can see that there s no change n the results of sgnfcance of treatment and replcaton effects. Howeer, the transformed data satsfed the assumptons of ANOVA. III-7

Lecture 6 More on Complete Randomized Block Design (RBD)

Lecture 6 More on Complete Randomized Block Design (RBD) Lecture 6 More on Complete Randomzed Block Desgn (RBD) Multple test Multple test The multple comparsons or multple testng problem occurs when one consders a set of statstcal nferences smultaneously. For

More information

BOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS. M. Krishna Reddy, B. Naveen Kumar and Y. Ramu

BOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS. M. Krishna Reddy, B. Naveen Kumar and Y. Ramu BOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS M. Krshna Reddy, B. Naveen Kumar and Y. Ramu Department of Statstcs, Osmana Unversty, Hyderabad -500 007, Inda. nanbyrozu@gmal.com, ramu0@gmal.com

More information

Chapter 13: Multiple Regression

Chapter 13: Multiple Regression Chapter 13: Multple Regresson 13.1 Developng the multple-regresson Model The general model can be descrbed as: It smplfes for two ndependent varables: The sample ft parameter b 0, b 1, and b are used to

More information

Department of Statistics University of Toronto STA305H1S / 1004 HS Design and Analysis of Experiments Term Test - Winter Solution

Department of Statistics University of Toronto STA305H1S / 1004 HS Design and Analysis of Experiments Term Test - Winter Solution Department of Statstcs Unversty of Toronto STA35HS / HS Desgn and Analyss of Experments Term Test - Wnter - Soluton February, Last Name: Frst Name: Student Number: Instructons: Tme: hours. Ads: a non-programmable

More information

Durban Watson for Testing the Lack-of-Fit of Polynomial Regression Models without Replications

Durban Watson for Testing the Lack-of-Fit of Polynomial Regression Models without Replications Durban Watson for Testng the Lack-of-Ft of Polynomal Regresson Models wthout Replcatons Ruba A. Alyaf, Maha A. Omar, Abdullah A. Al-Shha ralyaf@ksu.edu.sa, maomar@ksu.edu.sa, aalshha@ksu.edu.sa Department

More information

x = , so that calculated

x = , so that calculated Stat 4, secton Sngle Factor ANOVA notes by Tm Plachowsk n chapter 8 we conducted hypothess tests n whch we compared a sngle sample s mean or proporton to some hypotheszed value Chapter 9 expanded ths to

More information

Negative Binomial Regression

Negative Binomial Regression STATGRAPHICS Rev. 9/16/2013 Negatve Bnomal Regresson Summary... 1 Data Input... 3 Statstcal Model... 3 Analyss Summary... 4 Analyss Optons... 7 Plot of Ftted Model... 8 Observed Versus Predcted... 10 Predctons...

More information

Chapter 11: Simple Linear Regression and Correlation

Chapter 11: Simple Linear Regression and Correlation Chapter 11: Smple Lnear Regresson and Correlaton 11-1 Emprcal Models 11-2 Smple Lnear Regresson 11-3 Propertes of the Least Squares Estmators 11-4 Hypothess Test n Smple Lnear Regresson 11-4.1 Use of t-tests

More information

Comparison of Regression Lines

Comparison of Regression Lines STATGRAPHICS Rev. 9/13/2013 Comparson of Regresson Lnes Summary... 1 Data Input... 3 Analyss Summary... 4 Plot of Ftted Model... 6 Condtonal Sums of Squares... 6 Analyss Optons... 7 Forecasts... 8 Confdence

More information

Chapter 11: I = 2 samples independent samples paired samples Chapter 12: I 3 samples of equal size J one-way layout two-way layout

Chapter 11: I = 2 samples independent samples paired samples Chapter 12: I 3 samples of equal size J one-way layout two-way layout Serk Sagtov, Chalmers and GU, February 0, 018 Chapter 1. Analyss of varance Chapter 11: I = samples ndependent samples pared samples Chapter 1: I 3 samples of equal sze one-way layout two-way layout 1

More information

Topic 23 - Randomized Complete Block Designs (RCBD)

Topic 23 - Randomized Complete Block Designs (RCBD) Topc 3 ANOVA (III) 3-1 Topc 3 - Randomzed Complete Block Desgns (RCBD) Defn: A Randomzed Complete Block Desgn s a varant of the completely randomzed desgn (CRD) that we recently learned. In ths desgn,

More information

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA 4 Analyss of Varance (ANOVA) 5 ANOVA 51 Introducton ANOVA ANOVA s a way to estmate and test the means of multple populatons We wll start wth one-way ANOVA If the populatons ncluded n the study are selected

More information

Statistics for Economics & Business

Statistics for Economics & Business Statstcs for Economcs & Busness Smple Lnear Regresson Learnng Objectves In ths chapter, you learn: How to use regresson analyss to predct the value of a dependent varable based on an ndependent varable

More information

Topic- 11 The Analysis of Variance

Topic- 11 The Analysis of Variance Topc- 11 The Analyss of Varance Expermental Desgn The samplng plan or expermental desgn determnes the way that a sample s selected. In an observatonal study, the expermenter observes data that already

More information

Joint Statistical Meetings - Biopharmaceutical Section

Joint Statistical Meetings - Biopharmaceutical Section Iteratve Ch-Square Test for Equvalence of Multple Treatment Groups Te-Hua Ng*, U.S. Food and Drug Admnstraton 1401 Rockvlle Pke, #200S, HFM-217, Rockvlle, MD 20852-1448 Key Words: Equvalence Testng; Actve

More information

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6 Department of Quanttatve Methods & Informaton Systems Tme Seres and Ther Components QMIS 30 Chapter 6 Fall 00 Dr. Mohammad Zanal These sldes were modfed from ther orgnal source for educatonal purpose only.

More information

Chapter 4 Experimental Design and Their Analysis

Chapter 4 Experimental Design and Their Analysis Chapter 4 Expermental Desgn and her Analyss Desgn of experment means how to desgn an experment n the sense that how the obseratons or measurements should be obtaned to answer a query n a ald, effcent and

More information

/ n ) are compared. The logic is: if the two

/ n ) are compared. The logic is: if the two STAT C141, Sprng 2005 Lecture 13 Two sample tests One sample tests: examples of goodness of ft tests, where we are testng whether our data supports predctons. Two sample tests: called as tests of ndependence

More information

Simulated Power of the Discrete Cramér-von Mises Goodness-of-Fit Tests

Simulated Power of the Discrete Cramér-von Mises Goodness-of-Fit Tests Smulated of the Cramér-von Mses Goodness-of-Ft Tests Steele, M., Chaselng, J. and 3 Hurst, C. School of Mathematcal and Physcal Scences, James Cook Unversty, Australan School of Envronmental Studes, Grffth

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Experment-I MODULE VII LECTURE - 3 ANALYSIS OF COVARIANCE Dr Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur Any scentfc experment s performed

More information

Chapter 7 Partially Balanced Incomplete Block Design (PBIBD)

Chapter 7 Partially Balanced Incomplete Block Design (PBIBD) Chapter 7 Partally Balanced Incomplete Block Desgn (PBIBD) The balanced ncomplete block desgns hae seeral adantages. They are connected desgns as well as the block szes are also equal. A restrcton on usng

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Experment-I MODULE VIII LECTURE - 34 ANALYSIS OF VARIANCE IN RANDOM-EFFECTS MODEL AND MIXED-EFFECTS EFFECTS MODEL Dr Shalabh Department of Mathematcs and Statstcs Indan

More information

experimenteel en correlationeel onderzoek

experimenteel en correlationeel onderzoek expermenteel en correlatoneel onderzoek lecture 6: one-way analyss of varance Leary. Introducton to Behavoral Research Methods. pages 246 271 (chapters 10 and 11): conceptual statstcs Moore, McCabe, and

More information

STATISTICS QUESTIONS. Step by Step Solutions.

STATISTICS QUESTIONS. Step by Step Solutions. STATISTICS QUESTIONS Step by Step Solutons www.mathcracker.com 9//016 Problem 1: A researcher s nterested n the effects of famly sze on delnquency for a group of offenders and examnes famles wth one to

More information

# c i. INFERENCE FOR CONTRASTS (Chapter 4) It's unbiased: Recall: A contrast is a linear combination of effects with coefficients summing to zero:

# c i. INFERENCE FOR CONTRASTS (Chapter 4) It's unbiased: Recall: A contrast is a linear combination of effects with coefficients summing to zero: 1 INFERENCE FOR CONTRASTS (Chapter 4 Recall: A contrast s a lnear combnaton of effects wth coeffcents summng to zero: " where " = 0. Specfc types of contrasts of nterest nclude: Dfferences n effects Dfferences

More information

Statistics II Final Exam 26/6/18

Statistics II Final Exam 26/6/18 Statstcs II Fnal Exam 26/6/18 Academc Year 2017/18 Solutons Exam duraton: 2 h 30 mn 1. (3 ponts) A town hall s conductng a study to determne the amount of leftover food produced by the restaurants n the

More information

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 13 The Smple Lnear Regresson Model and Correlaton 1999 Prentce-Hall, Inc. Chap. 13-1 Chapter Topcs Types of Regresson Models Determnng the Smple Lnear

More information

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands Content. Inference on Regresson Parameters a. Fndng Mean, s.d and covarance amongst estmates.. Confdence Intervals and Workng Hotellng Bands 3. Cochran s Theorem 4. General Lnear Testng 5. Measures of

More information

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics ECOOMICS 35*-A Md-Term Exam -- Fall Term 000 Page of 3 pages QUEE'S UIVERSITY AT KIGSTO Department of Economcs ECOOMICS 35* - Secton A Introductory Econometrcs Fall Term 000 MID-TERM EAM ASWERS MG Abbott

More information

7.1. Single classification analysis of variance (ANOVA) Why not use multiple 2-sample 2. When to use ANOVA

7.1. Single classification analysis of variance (ANOVA) Why not use multiple 2-sample 2. When to use ANOVA Sngle classfcaton analyss of varance (ANOVA) When to use ANOVA ANOVA models and parttonng sums of squares ANOVA: hypothess testng ANOVA: assumptons A non-parametrc alternatve: Kruskal-Walls ANOVA Power

More information

Lecture 4 Hypothesis Testing

Lecture 4 Hypothesis Testing Lecture 4 Hypothess Testng We may wsh to test pror hypotheses about the coeffcents we estmate. We can use the estmates to test whether the data rejects our hypothess. An example mght be that we wsh to

More information

ANOVA. The Observations y ij

ANOVA. The Observations y ij ANOVA Stands for ANalyss Of VArance But t s a test of dfferences n means The dea: The Observatons y j Treatment group = 1 = 2 = k y 11 y 21 y k,1 y 12 y 22 y k,2 y 1, n1 y 2, n2 y k, nk means: m 1 m 2

More information

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4) I. Classcal Assumptons Econ7 Appled Econometrcs Topc 3: Classcal Model (Studenmund, Chapter 4) We have defned OLS and studed some algebrac propertes of OLS. In ths topc we wll study statstcal propertes

More information

Analysis of Variance and Design of Experiments-II

Analysis of Variance and Design of Experiments-II Analyss of Varance and Desgn of Experments-II MODULE - III LECTURE - 8 PARTIALLY BALANCED INCOMPLETE BLOCK DESIGN (PBIBD) Dr Shalah Department of Mathematcs & Statstcs Indan Insttute of Technology Kanpur

More information

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U)

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U) Econ 413 Exam 13 H ANSWERS Settet er nndelt 9 deloppgaver, A,B,C, som alle anbefales å telle lkt for å gøre det ltt lettere å stå. Svar er gtt . Unfortunately, there s a prntng error n the hnt of

More information

Chapter 15 - Multiple Regression

Chapter 15 - Multiple Regression Chapter - Multple Regresson Chapter - Multple Regresson Multple Regresson Model The equaton that descrbes how the dependent varable y s related to the ndependent varables x, x,... x p and an error term

More information

F statistic = s2 1 s 2 ( F for Fisher )

F statistic = s2 1 s 2 ( F for Fisher ) Stat 4 ANOVA Analyss of Varance /6/04 Comparng Two varances: F dstrbuton Typcal Data Sets One way analyss of varance : example Notaton for one way ANOVA Comparng Two varances: F dstrbuton We saw that the

More information

Interval Estimation in the Classical Normal Linear Regression Model. 1. Introduction

Interval Estimation in the Classical Normal Linear Regression Model. 1. Introduction ECONOMICS 35* -- NOTE 7 ECON 35* -- NOTE 7 Interval Estmaton n the Classcal Normal Lnear Regresson Model Ths note outlnes the basc elements of nterval estmaton n the Classcal Normal Lnear Regresson Model

More information

Basic Business Statistics, 10/e

Basic Business Statistics, 10/e Chapter 13 13-1 Basc Busness Statstcs 11 th Edton Chapter 13 Smple Lnear Regresson Basc Busness Statstcs, 11e 009 Prentce-Hall, Inc. Chap 13-1 Learnng Objectves In ths chapter, you learn: How to use regresson

More information

Chapter 6. Supplemental Text Material

Chapter 6. Supplemental Text Material Chapter 6. Supplemental Text Materal S6-. actor Effect Estmates are Least Squares Estmates We have gven heurstc or ntutve explanatons of how the estmates of the factor effects are obtaned n the textboo.

More information

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition)

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition) Count Data Models See Book Chapter 11 2 nd Edton (Chapter 10 1 st Edton) Count data consst of non-negatve nteger values Examples: number of drver route changes per week, the number of trp departure changes

More information

Copyright 2017 by Taylor Enterprises, Inc., All Rights Reserved. Adjusted Control Limits for P Charts. Dr. Wayne A. Taylor

Copyright 2017 by Taylor Enterprises, Inc., All Rights Reserved. Adjusted Control Limits for P Charts. Dr. Wayne A. Taylor Taylor Enterprses, Inc. Control Lmts for P Charts Copyrght 2017 by Taylor Enterprses, Inc., All Rghts Reserved. Control Lmts for P Charts Dr. Wayne A. Taylor Abstract: P charts are used for count data

More information

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers Psychology 282 Lecture #24 Outlne Regresson Dagnostcs: Outlers In an earler lecture we studed the statstcal assumptons underlyng the regresson model, ncludng the followng ponts: Formal statement of assumptons.

More information

UCLA STAT 13 Introduction to Statistical Methods for the Life and Health Sciences. Chapter 11 Analysis of Variance - ANOVA. Instructor: Ivo Dinov,

UCLA STAT 13 Introduction to Statistical Methods for the Life and Health Sciences. Chapter 11 Analysis of Variance - ANOVA. Instructor: Ivo Dinov, UCLA STAT 3 ntroducton to Statstcal Methods for the Lfe and Health Scences nstructor: vo Dnov, Asst. Prof. of Statstcs and Neurology Chapter Analyss of Varance - ANOVA Teachng Assstants: Fred Phoa, Anwer

More information

ANSWERS CHAPTER 9. TIO 9.2: If the values are the same, the difference is 0, therefore the null hypothesis cannot be rejected.

ANSWERS CHAPTER 9. TIO 9.2: If the values are the same, the difference is 0, therefore the null hypothesis cannot be rejected. ANSWERS CHAPTER 9 THINK IT OVER thnk t over TIO 9.: χ 2 k = ( f e ) = 0 e Breakng the equaton down: the test statstc for the ch-squared dstrbuton s equal to the sum over all categores of the expected frequency

More information

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction ECONOMICS 5* -- NOTE (Summary) ECON 5* -- NOTE The Multple Classcal Lnear Regresson Model (CLRM): Specfcaton and Assumptons. Introducton CLRM stands for the Classcal Lnear Regresson Model. The CLRM s also

More information

CHAPTER IV RESEARCH FINDING AND DISCUSSIONS

CHAPTER IV RESEARCH FINDING AND DISCUSSIONS CHAPTER IV RESEARCH FINDING AND DISCUSSIONS A. Descrpton of Research Fndng. The Implementaton of Learnng Havng ganed the whole needed data, the researcher then dd analyss whch refers to the statstcal data

More information

PARTIALLY BALANCED INCOMPLETE BLOCK DESIGNS

PARTIALLY BALANCED INCOMPLETE BLOCK DESIGNS PARTIALLY BALANCED INCOMPLETE BLOCK DESIGNS V.K. Sharma I.A.S.R.I., Lbrary Avenue, New Delh-00. Introducton Balanced ncomplete block desgns, though have many optmal propertes, do not ft well to many expermental

More information

Lecture 6: Introduction to Linear Regression

Lecture 6: Introduction to Linear Regression Lecture 6: Introducton to Lnear Regresson An Manchakul amancha@jhsph.edu 24 Aprl 27 Lnear regresson: man dea Lnear regresson can be used to study an outcome as a lnear functon of a predctor Example: 6

More information

Goodness of fit and Wilks theorem

Goodness of fit and Wilks theorem DRAFT 0.0 Glen Cowan 3 June, 2013 Goodness of ft and Wlks theorem Suppose we model data y wth a lkelhood L(µ) that depends on a set of N parameters µ = (µ 1,...,µ N ). Defne the statstc t µ ln L(µ) L(ˆµ),

More information

ANOMALIES OF THE MAGNITUDE OF THE BIAS OF THE MAXIMUM LIKELIHOOD ESTIMATOR OF THE REGRESSION SLOPE

ANOMALIES OF THE MAGNITUDE OF THE BIAS OF THE MAXIMUM LIKELIHOOD ESTIMATOR OF THE REGRESSION SLOPE P a g e ANOMALIES OF THE MAGNITUDE OF THE BIAS OF THE MAXIMUM LIKELIHOOD ESTIMATOR OF THE REGRESSION SLOPE Darmud O Drscoll ¹, Donald E. Ramrez ² ¹ Head of Department of Mathematcs and Computer Studes

More information

Statistics Chapter 4

Statistics Chapter 4 Statstcs Chapter 4 "There are three knds of les: les, damned les, and statstcs." Benjamn Dsrael, 1895 (Brtsh statesman) Gaussan Dstrbuton, 4-1 If a measurement s repeated many tmes a statstcal treatment

More information

Statistics for Business and Economics

Statistics for Business and Economics Statstcs for Busness and Economcs Chapter 11 Smple Regresson Copyrght 010 Pearson Educaton, Inc. Publshng as Prentce Hall Ch. 11-1 11.1 Overvew of Lnear Models n An equaton can be ft to show the best lnear

More information

Answers Problem Set 2 Chem 314A Williamsen Spring 2000

Answers Problem Set 2 Chem 314A Williamsen Spring 2000 Answers Problem Set Chem 314A Wllamsen Sprng 000 1) Gve me the followng crtcal values from the statstcal tables. a) z-statstc,-sded test, 99.7% confdence lmt ±3 b) t-statstc (Case I), 1-sded test, 95%

More information

Topic 10: ANOVA models for random and mixed effects Fixed and Random Models in One-way Classification Experiments

Topic 10: ANOVA models for random and mixed effects Fixed and Random Models in One-way Classification Experiments Topc 10: ANOVA models for random and mxed effects eferences: ST&D Topc 7.5 (15-153), Topc 9.9 (5-7), Topc 15.5 (379-384); rules for expected on ST&D page 381 replaced by Chapter 8 from Montgomery, 1991.

More information

Here is the rationale: If X and y have a strong positive relationship to one another, then ( x x) will tend to be positive when ( y y)

Here is the rationale: If X and y have a strong positive relationship to one another, then ( x x) will tend to be positive when ( y y) Secton 1.5 Correlaton In the prevous sectons, we looked at regresson and the value r was a measurement of how much of the varaton n y can be attrbuted to the lnear relatonshp between y and x. In ths secton,

More information

Economics 130. Lecture 4 Simple Linear Regression Continued

Economics 130. Lecture 4 Simple Linear Regression Continued Economcs 130 Lecture 4 Contnued Readngs for Week 4 Text, Chapter and 3. We contnue wth addressng our second ssue + add n how we evaluate these relatonshps: Where do we get data to do ths analyss? How do

More information

A Note on Test of Homogeneity Against Umbrella Scale Alternative Based on U-Statistics

A Note on Test of Homogeneity Against Umbrella Scale Alternative Based on U-Statistics J Stat Appl Pro No 3 93- () 93 NSP Journal of Statstcs Applcatons & Probablty --- An Internatonal Journal @ NSP Natural Scences Publshng Cor A Note on Test of Homogenety Aganst Umbrella Scale Alternatve

More information

x i1 =1 for all i (the constant ).

x i1 =1 for all i (the constant ). Chapter 5 The Multple Regresson Model Consder an economc model where the dependent varable s a functon of K explanatory varables. The economc model has the form: y = f ( x,x,..., ) xk Approxmate ths by

More information

T E C O L O T E R E S E A R C H, I N C.

T E C O L O T E R E S E A R C H, I N C. T E C O L O T E R E S E A R C H, I N C. B rdg n g En g neern g a nd Econo mcs S nce 1973 THE MINIMUM-UNBIASED-PERCENTAGE ERROR (MUPE) METHOD IN CER DEVELOPMENT Thrd Jont Annual ISPA/SCEA Internatonal Conference

More information

28. SIMPLE LINEAR REGRESSION III

28. SIMPLE LINEAR REGRESSION III 8. SIMPLE LINEAR REGRESSION III Ftted Values and Resduals US Domestc Beers: Calores vs. % Alcohol To each observed x, there corresponds a y-value on the ftted lne, y ˆ = βˆ + βˆ x. The are called ftted

More information

Parametric fractional imputation for missing data analysis. Jae Kwang Kim Survey Working Group Seminar March 29, 2010

Parametric fractional imputation for missing data analysis. Jae Kwang Kim Survey Working Group Seminar March 29, 2010 Parametrc fractonal mputaton for mssng data analyss Jae Kwang Km Survey Workng Group Semnar March 29, 2010 1 Outlne Introducton Proposed method Fractonal mputaton Approxmaton Varance estmaton Multple mputaton

More information

Chapter 14 Simple Linear Regression

Chapter 14 Simple Linear Regression Chapter 4 Smple Lnear Regresson Chapter 4 - Smple Lnear Regresson Manageral decsons often are based on the relatonshp between two or more varables. Regresson analss can be used to develop an equaton showng

More information

Appendix B: Resampling Algorithms

Appendix B: Resampling Algorithms 407 Appendx B: Resamplng Algorthms A common problem of all partcle flters s the degeneracy of weghts, whch conssts of the unbounded ncrease of the varance of the mportance weghts ω [ ] of the partcles

More information

A nonparametric two-sample wald test of equality of variances

A nonparametric two-sample wald test of equality of variances Unversty of Wollongong Research Onlne Centre for Statstcal & Survey Methodology Workng Paper Seres Faculty of Engneerng and Informaton Scences 0 A nonparametrc two-sample wald test of equalty of varances

More information

U-Pb Geochronology Practical: Background

U-Pb Geochronology Practical: Background U-Pb Geochronology Practcal: Background Basc Concepts: accuracy: measure of the dfference between an expermental measurement and the true value precson: measure of the reproducblty of the expermental result

More information

A Robust Method for Calculating the Correlation Coefficient

A Robust Method for Calculating the Correlation Coefficient A Robust Method for Calculatng the Correlaton Coeffcent E.B. Nven and C. V. Deutsch Relatonshps between prmary and secondary data are frequently quantfed usng the correlaton coeffcent; however, the tradtonal

More information

since [1-( 0+ 1x1i+ 2x2 i)] [ 0+ 1x1i+ assumed to be a reasonable approximation

since [1-( 0+ 1x1i+ 2x2 i)] [ 0+ 1x1i+ assumed to be a reasonable approximation Econ 388 R. Butler 204 revsons Lecture 4 Dummy Dependent Varables I. Lnear Probablty Model: the Regresson model wth a dummy varables as the dependent varable assumpton, mplcaton regular multple regresson

More information

Chapter 8 Indicator Variables

Chapter 8 Indicator Variables Chapter 8 Indcator Varables In general, e explanatory varables n any regresson analyss are assumed to be quanttatve n nature. For example, e varables lke temperature, dstance, age etc. are quanttatve n

More information

j) = 1 (note sigma notation) ii. Continuous random variable (e.g. Normal distribution) 1. density function: f ( x) 0 and f ( x) dx = 1

j) = 1 (note sigma notation) ii. Continuous random variable (e.g. Normal distribution) 1. density function: f ( x) 0 and f ( x) dx = 1 Random varables Measure of central tendences and varablty (means and varances) Jont densty functons and ndependence Measures of assocaton (covarance and correlaton) Interestng result Condtonal dstrbutons

More information

LINEAR REGRESSION ANALYSIS. MODULE VIII Lecture Indicator Variables

LINEAR REGRESSION ANALYSIS. MODULE VIII Lecture Indicator Variables LINEAR REGRESSION ANALYSIS MODULE VIII Lecture - 7 Indcator Varables Dr. Shalabh Department of Maematcs and Statstcs Indan Insttute of Technology Kanpur Indcator varables versus quanttatve explanatory

More information

4.3 Poisson Regression

4.3 Poisson Regression of teratvely reweghted least squares regressons (the IRLS algorthm). We do wthout gvng further detals, but nstead focus on the practcal applcaton. > glm(survval~log(weght)+age, famly="bnomal", data=baby)

More information

18. SIMPLE LINEAR REGRESSION III

18. SIMPLE LINEAR REGRESSION III 8. SIMPLE LINEAR REGRESSION III US Domestc Beers: Calores vs. % Alcohol Ftted Values and Resduals To each observed x, there corresponds a y-value on the ftted lne, y ˆ ˆ = α + x. The are called ftted values.

More information

Lecture 3 Stat102, Spring 2007

Lecture 3 Stat102, Spring 2007 Lecture 3 Stat0, Sprng 007 Chapter 3. 3.: Introducton to regresson analyss Lnear regresson as a descrptve technque The least-squares equatons Chapter 3.3 Samplng dstrbuton of b 0, b. Contnued n net lecture

More information

First Year Examination Department of Statistics, University of Florida

First Year Examination Department of Statistics, University of Florida Frst Year Examnaton Department of Statstcs, Unversty of Florda May 7, 010, 8:00 am - 1:00 noon Instructons: 1. You have four hours to answer questons n ths examnaton.. You must show your work to receve

More information

1-FACTOR ANOVA (MOTIVATION) [DEVORE 10.1]

1-FACTOR ANOVA (MOTIVATION) [DEVORE 10.1] 1-FACTOR ANOVA (MOTIVATION) [DEVORE 10.1] Hgh varance between groups Low varance wthn groups s 2 between/s 2 wthn 1 Factor A clearly has a sgnfcant effect!! Low varance between groups Hgh varance wthn

More information

Statistics for Managers Using Microsoft Excel/SPSS Chapter 14 Multiple Regression Models

Statistics for Managers Using Microsoft Excel/SPSS Chapter 14 Multiple Regression Models Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 14 Multple Regresson Models 1999 Prentce-Hall, Inc. Chap. 14-1 Chapter Topcs The Multple Regresson Model Contrbuton of Indvdual Independent Varables

More information

Chapter 5. Solution of System of Linear Equations. Module No. 6. Solution of Inconsistent and Ill Conditioned Systems

Chapter 5. Solution of System of Linear Equations. Module No. 6. Solution of Inconsistent and Ill Conditioned Systems Numercal Analyss by Dr. Anta Pal Assstant Professor Department of Mathematcs Natonal Insttute of Technology Durgapur Durgapur-713209 emal: anta.bue@gmal.com 1 . Chapter 5 Soluton of System of Lnear Equatons

More information

[The following data appear in Wooldridge Q2.3.] The table below contains the ACT score and college GPA for eight college students.

[The following data appear in Wooldridge Q2.3.] The table below contains the ACT score and college GPA for eight college students. PPOL 59-3 Problem Set Exercses n Smple Regresson Due n class /8/7 In ths problem set, you are asked to compute varous statstcs by hand to gve you a better sense of the mechancs of the Pearson correlaton

More information

CHAPTER 8. Exercise Solutions

CHAPTER 8. Exercise Solutions CHAPTER 8 Exercse Solutons 77 Chapter 8, Exercse Solutons, Prncples of Econometrcs, 3e 78 EXERCISE 8. When = N N N ( x x) ( x x) ( x x) = = = N = = = N N N ( x ) ( ) ( ) ( x x ) x x x x x = = = = Chapter

More information

Learning Objectives for Chapter 11

Learning Objectives for Chapter 11 Chapter : Lnear Regresson and Correlaton Methods Hldebrand, Ott and Gray Basc Statstcal Ideas for Managers Second Edton Learnng Objectves for Chapter Usng the scatterplot n regresson analyss Usng the method

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Experments- MODULE LECTURE - 6 EXPERMENTAL DESGN MODELS Dr. Shalabh Department of Mathematcs and Statstcs ndan nsttute of Technology Kanpur Two-way classfcaton wth nteractons

More information

MULTIPLE COMPARISON PROCEDURES

MULTIPLE COMPARISON PROCEDURES MULTIPLE COMPARISON PROCEDURES RAJENDER PARSAD Indan Agrcultural Statstcs Research Insttute Lbrary Avenue, New Delh 110 012 raender@asr.res.n 1. Introducton Analyss of varance s used to test for the real

More information

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity LINEAR REGRESSION ANALYSIS MODULE IX Lecture - 30 Multcollnearty Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur 2 Remedes for multcollnearty Varous technques have

More information

Lecture 10: Dimensionality reduction

Lecture 10: Dimensionality reduction Lecture : Dmensonalt reducton g The curse of dmensonalt g Feature etracton s. feature selecton g Prncpal Components Analss g Lnear Dscrmnant Analss Intellgent Sensor Sstems Rcardo Guterrez-Osuna Wrght

More information

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise.

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise. Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where y + = β + β e for =,..., y and are observable varables e s a random error How can an estmaton rule be constructed for the

More information

STAT 3008 Applied Regression Analysis

STAT 3008 Applied Regression Analysis STAT 3008 Appled Regresson Analyss Tutoral : Smple Lnear Regresson LAI Chun He Department of Statstcs, The Chnese Unversty of Hong Kong 1 Model Assumpton To quantfy the relatonshp between two factors,

More information

SIMPLE LINEAR REGRESSION

SIMPLE LINEAR REGRESSION Smple Lnear Regresson and Correlaton Introducton Prevousl, our attenton has been focused on one varable whch we desgnated b x. Frequentl, t s desrable to learn somethng about the relatonshp between two

More information

Polynomial Regression Models

Polynomial Regression Models LINEAR REGRESSION ANALYSIS MODULE XII Lecture - 6 Polynomal Regresson Models Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur Test of sgnfcance To test the sgnfcance

More information

Using the estimated penetrances to determine the range of the underlying genetic model in casecontrol

Using the estimated penetrances to determine the range of the underlying genetic model in casecontrol Georgetown Unversty From the SelectedWorks of Mark J Meyer 8 Usng the estmated penetrances to determne the range of the underlyng genetc model n casecontrol desgn Mark J Meyer Neal Jeffres Gang Zheng Avalable

More information

Introduction to Regression

Introduction to Regression Introducton to Regresson Dr Tom Ilvento Department of Food and Resource Economcs Overvew The last part of the course wll focus on Regresson Analyss Ths s one of the more powerful statstcal technques Provdes

More information

Introduction to Vapor/Liquid Equilibrium, part 2. Raoult s Law:

Introduction to Vapor/Liquid Equilibrium, part 2. Raoult s Law: CE304, Sprng 2004 Lecture 4 Introducton to Vapor/Lqud Equlbrum, part 2 Raoult s Law: The smplest model that allows us do VLE calculatons s obtaned when we assume that the vapor phase s an deal gas, and

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Exerments-I MODULE III LECTURE - 2 EXPERIMENTAL DESIGN MODELS Dr. Shalabh Deartment of Mathematcs and Statstcs Indan Insttute of Technology Kanur 2 We consder the models

More information

Comparison of the Population Variance Estimators. of 2-Parameter Exponential Distribution Based on. Multiple Criteria Decision Making Method

Comparison of the Population Variance Estimators. of 2-Parameter Exponential Distribution Based on. Multiple Criteria Decision Making Method Appled Mathematcal Scences, Vol. 7, 0, no. 47, 07-0 HIARI Ltd, www.m-hkar.com Comparson of the Populaton Varance Estmators of -Parameter Exponental Dstrbuton Based on Multple Crtera Decson Makng Method

More information

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore Sesson Outlne Introducton to classfcaton problems and dscrete choce models. Introducton to Logstcs Regresson. Logstc functon and Logt functon. Maxmum Lkelhood Estmator (MLE) for estmaton of LR parameters.

More information

Number of cases Number of factors Number of covariates Number of levels of factor i. Value of the dependent variable for case k

Number of cases Number of factors Number of covariates Number of levels of factor i. Value of the dependent variable for case k ANOVA Model and Matrx Computatons Notaton The followng notaton s used throughout ths chapter unless otherwse stated: N F CN Y Z j w W Number of cases Number of factors Number of covarates Number of levels

More information

NANYANG TECHNOLOGICAL UNIVERSITY SEMESTER I EXAMINATION MTH352/MH3510 Regression Analysis

NANYANG TECHNOLOGICAL UNIVERSITY SEMESTER I EXAMINATION MTH352/MH3510 Regression Analysis NANYANG TECHNOLOGICAL UNIVERSITY SEMESTER I EXAMINATION 014-015 MTH35/MH3510 Regresson Analyss December 014 TIME ALLOWED: HOURS INSTRUCTIONS TO CANDIDATES 1. Ths examnaton paper contans FOUR (4) questons

More information

4.1. Lecture 4: Fitting distributions: goodness of fit. Goodness of fit: the underlying principle

4.1. Lecture 4: Fitting distributions: goodness of fit. Goodness of fit: the underlying principle Lecture 4: Fttng dstrbutons: goodness of ft Goodness of ft Testng goodness of ft Testng normalty An mportant note on testng normalty! L4.1 Goodness of ft measures the extent to whch some emprcal dstrbuton

More information

Chapter 12 Analysis of Covariance

Chapter 12 Analysis of Covariance Chapter Analyss of Covarance Any scentfc experment s performed to know somethng that s unknown about a group of treatments and to test certan hypothess about the correspondng treatment effect When varablty

More information

NUMERICAL DIFFERENTIATION

NUMERICAL DIFFERENTIATION NUMERICAL DIFFERENTIATION 1 Introducton Dfferentaton s a method to compute the rate at whch a dependent output y changes wth respect to the change n the ndependent nput x. Ths rate of change s called the

More information