An Investigation of the Rank Transformation in Multple Regression

Size: px
Start display at page:

Download "An Investigation of the Rank Transformation in Multple Regression"

Transcription

1 Southern Illinois University Carbondale From the SelectedWorks of Todd Christopher Headrick December, 2001 An Investigation of the Rank Transformation in Multple Regression Todd C. Headrick, Southern Illinois University Carbondale Ourania Rotou Available at:

2 Computational Statistics & Data Analysis 38 (2001) An investigation of the rank transformation in multiple regression Todd C. Headrick a;, Ourania Rotou b a Department of Measurement and Statistics, Southern Illinois University, 222-J, Wham Bldg, Mail Code 4618 Carbondale, IL , USA b Educational Testing Service (ETS), Princeton, NJ 08541, USA Received 1 August 2000; received in revised form 1 April 2001; accepted 1 April 2001 Abstract Real world data often fail to meet the underlying assumptions of normal statistical theory. The rank transformation (RT) procedure is recommended and used in the context of multiple regression analysis when the assumption of normality is violated. There is no general supporting theory of the RT. In view of this, the current study examined the Type I error and power properties of the RT in terms of multiple regression. The investigation included both additive and nonadditive models. Results indicated that there were severely inated Type I error rates associated with the RT procedure under both normal and nonnormal distributions (e.g., with nominal alpha = 0:05). The RT also exhibited a substantial power loss relative to the usual ordinary least squares regression procedure. It is recommended that the RT be avoided in the context of multiple regression despite its encouragement from SAS and other well respected sources. c 2001 Elsevier Science B.V. All rights reserved. Keywords: Type I error; Power; Rank transformation; Monotonic regression 1. Introduction Micceri (1989) collected over 400 large real-world data sets and tested the assumption of normality for each distribution. Using the Kolmogorov Smirnov test and a signicance level () of 0.01, Micceri (1989) found all the distributions to be signicantly nonnormal. It is well known that when the assumption of normality is violated, nonparametric tests can be substantially more powerful than the usual Corresponding author. address: headrick@siu.edu (T.C. Headrick) /01/$ - see front matter c 2001 Elsevier Science B.V. All rights reserved. PII: S (01)

3 204 T.C. Headrick, O. Rotou / Computational Statistics & Data Analysis 38 (2001) parametric t or F tests. For example, the Mann Whitney test, when juxtaposed to the two independent samples t test, has an impressive asymptotic relative eciency of 3 when the populations have an exponential distribution (see, e.g., Conover, 1999). With respect to regression analysis, an additional concern to the assumption of a normally distributed error term is the assumption of a linear regression function. In view of these concerns, Iman and Conover (1979) introduced a nonparametric regression procedure that only requires the assumption of a monotonic regression function (i.e., linear or nonlinear). This procedure conducts the usual ordinary least squares (OLS) regression analysis on the ranks of the original scores. Thus, what makes the rank transformation (RT) in regression appealing is its simplicity and ease of execution. Specically, the steps for hypotheses testing are (Iman and Conover, 1979): (1) separately replace the original scores of the dependent and k independent variables with their respective rank order, (2) apply the regular OLS regression procedure to the ranks, and (3) refer to the usual table(s) of percentage points to test the model. Iman and Conover (1979) found favorable results for simple and multiple RT regression analyses using two nonnormal data sets from Daniel and Wood (1971). Recently, Conover (1999) submitted that the RT multiple regression procedure results in a robust regression method that is not sensitive to outliers or nonnormal distributions to the extent that the regular regression methods on the data are aected (p. 420). Further, manufacturers of statistical software also promote the application of the RT in multiple regression. For example, the current SAS (1999) procedures guide states: You can investigate regression relationships by using rank transformations with a method described by Iman and Conover (1979) (p. 840). The IMSL (1994) manual also states, Many of the tests described in this chapter may be computed using the routines described in other chapters after rst substituting ranks for the observed values (p. 582). Recent suggestions promoting the use of the RT in other complex designs have also been made (Choi, 1998; Regeth and Stine, 1998). For example, Regeth and Stine (1998) submitted, for two-way designs (involving an interaction), the ANOVA test can be run, using the rank orderings of data points rather than the actual scores (p. 708). O Gorman and Woolson (1993) also reported that the RT performed favorably in the contexts of logistic regression and discriminant analysis. It is also important to point out that the RT has recently been used in applied studies involving complex designs. Some examples include multiple regression (Angermeier and Winston, 1998) and factorial ANOVA (Augner et al., 1998). It should be noted that the aforementioned suggestions and applications of the rank transformation to the general linear model have been made despite studies that have demonstrated numerous limitations of the RT in terms of simple regression (Lee and Yan, 1996) and other complex designs (e.g., Akritas, 1990; Brunner and Neuman, 1986; Sawilowsky et al., 1989; Thompson, 1991, 1993). For example, Lee and Yan (1996) demonstrated that the estimated RT regression slope coecient lacks the asymptotic property of consistency except under trivial conditions.

4 T.C. Headrick, O. Rotou / Computational Statistics & Data Analysis 38 (2001) Purpose of the study There is no general supporting theory of the rank transformation (Hettmansperger and McKean, 1998, p. 268). Most of the original studies promoting the RT procedure are based on simulation results (e.g., Conover and Iman, 1976; Conover and Iman, 1982; Iman, 1974). Further, there is a paucity of research investigating the RT in terms of multiple regression. Therefore, it is the purpose of this study to investigate the validity of the RT in the context of multiple regression using Monte Carlo techniques. Most investigations that demonstrated the RT to exhibit undesirable traits are studies that usually involve nonadditive models and where the ranking is performed irrespective of group membership (e.g., Sawilowsky et al., 1989). With respect to multiple regression, however, the RT procedure involves ranking all variables independently of each other. In view of this, the investigation will focus on the relative Type I error and power properties to violations of normality in terms of both additive and nonadditive models. 3. Methodology The additive and nonadditive models selected for investigation were as follows: and Y i = X 1i + 2 X 2i + 3 X 3i + 4 X 2 1i + 5 X 2 2i + 6 X 2 3i + 7 X 3 1i + 8 X 3 2i + 9 X 3 3i + i (1) Y i = X 1i + 2 X 2i + 3 X 3i + 4 X 1i X 2i + 5 X 1i X 3i + 6 X 2i X 3i + 7 X 1i X 2i X 3i + i : (2) The sample sizes selected were i =1;:::;N= 30 and 50. The following cases were specied for hypotheses testing with respect to (1): 1(a) 1 = 2 = 3 = b (null), 4 = 5 = 6 = b (null), 7 = 8 = 9 = b (null); 1(b) 1 = 2 = 3 = b (power), 4 = 5 = 6 = b (null), 7 = 8 = 9 = b (null); 1(c) 1 = 2 = 3 = b (power), 4 = 5 = 6 = b (power), 7 = 8 = 9 = b (null); and 1(d) 1 = 2 = 3 = b (power), 4 = 5 = 6 = b (power), 7 = 8 = 9 = b (power). With respect to (2), the following cases were specied for hypotheses testing: 2(a) 1 = 2 = 3 = b (null), 4 = 5 = 6 = b (null), 7 = b (null); 2(b) 1 = 2 = 3 = b (power), 4 = 5 = 6 = b (null), 7 = b (null); 2(c) 1 = 2 = 3 = b (power), 4 = 5 = 6 = b (power), 7 = b (null); and 2(d) 1 = 2 = 3 = b (power), 4 = 5 = 6 = b (power), 7 = b (power). The values selected for the slope coecients were b =0:0; 0:5; 1:0; 2:0; and 3:0. In all experimental situations the intercept term ( 0 ) was set equal to 5. The stochastic disturbance terms ( i ) in (1) and (2) were distributed with zero means ( = 0), unit variances ( 2 = 1), and varying degrees of 1, 2, 3, and 4. The distributions selected for simulation were: (a) normal ( 1 =0; 2 =0; 3 = 0, and

5 206 T.C. Headrick, O. Rotou / Computational Statistics & Data Analysis 38 (2001) Table 1 Values of c 0, c 1, c 2, c 3, c 4, and c 5 distributions a that were used in Eq. (3) to simulate the desired (non)normal Distribution c 0 c 1 c 2 c 3 c 4 c a Note. The three distributions are described as follows: (1) normal ( 1 =0; 2 =0; 3 =0; 4 = 0); (2) symmetric and light-tailed ( 1 =0; 2 = 6=5; 3 =0; 4 =48=7); and (3) asymmetric and moderately heavy-tailed ( 1 =2; 2 =6; 3 =24; 4 = 120). 4 = 0); (b) symmetric and light-tailed ( 1 =0; 2 = 6=5; 3 = 0, and 4 =48=7); and (c) asymmetric and moderately heavy-tailed ( 1 =2; 2 =6; 3 = 24, and 4 = 120). The values of 1 (coecient of skew), 2 (coecient of kurtosis), 3, and 4 are the third, fourth, fth, and sixth standardized cumulants of the normal, uniform, and exponential probability density functions. The uniform and exponential densities were selected for heuristic reasons as well as for their frequent occurrence in real-world data sets (e.g., the waiting time between units arriving in accordance with a Poisson process is exponentially distributed). The data generation procedure created the Y i from the right-hand sides of Eqs. (1) and (2). In all experimental situations X 1i, X 2i, X 3i iid N (0; 1). The i were generated using the polynomial transformation derived by Headrick (2000). The transformation is expressed as follows: i = c 0 + c 1 Z i + c 2 Z 2 i + c 3 Z 3 i + c 4 Z 4 i + c 5 Z 5 i where Z iid N (0; 1): (3) The values of c 0 ;:::;c 5 for (3) were obtained by simultaneously solving Eqs. (37) (42) from Headrick (2000, Appendix 1) for a mean of zero, unit variance, and the selected values of 1, 2, 3, and 4 using Mathematica (version 4.0, 1999). The distributions selected and their associated constants (c i ) are listed in Table 1. The F statistics for the specied hypotheses were computed using the tests of R 2 and matrix algebra approach given in Pedhazur (1982). Specically, the F ratio for testing hypotheses 1(b) and 2(b) for linear relationships was: [R 2 Y:X F Reduced(2) = 1;X 2;X 3 =3] [1 R 2 Y:X 1;X 2;X 3 ]=[N 3 1] : The F ratios for testing hypotheses 1(c) for quadratic relationships and 2(c) for bilinear interactions were: F Reduced(1) = [R 2 Y:X 1;X 2;X 3;X 2 1 ;X2 2 ;X2 3 R 2 Y:X 1;X 2;X 3 ]=[6 3] [1 R 2 Y:X 1;X 2;X 3;X ]=[N 6 1] 1 2;X2 2 ;X2 3 and F Reduced(1) = [R2 Y:X 1;X 2;X 3;X 1X 2;X 1X 3;X 2X 3 R 2 Y:X 1;X 2;X 3 ]=[6 3] [1 R 2 : Y:X 1X 2X 3;X 1X 2;X 1X 3;X 2X 3 ]=[N 6 1]

6 T.C. Headrick, O. Rotou / Computational Statistics & Data Analysis 38 (2001) The F ratios for testing hypotheses 1(d) for cubic relationships and 2(d) for the three-way interaction were: and F Full = [R 2 Y:X 1;X 2;X 3;X 2 1 ;X2 2 ;X2 3 ;X3 1 ;X3 2 ;X3 3 R 2 Y:X 1;X 2;X 3;X ]=[9 6] 1 2;X2 2 ;X2 3 [1 R 2 ]=[N 9 1] Y:X 1;X 2;X 3;X1 2;X2 2 ;X2 3 ;X3 1 ;X3 2 ;X3 3 F Full = [R2 Y:X 1;X 2;X 3;X 1X 2;X 1X 3;X 2X 3;X 1X 2X 3 R 2 Y:X 1;X 2;X 3;X 1X 2;X 1X 3;X 2X 3 ]=[7 6] [1 R 2 : Y:X 1;X 2;X 3;X 1X 2;X 1X 3;X 2X 3;X 1X 2X 3 ]=[N 7 1] For both additive and nonadditive models, the F statistics were obtained for both the original scores and their respective ranks for each of the [3(type of distribution) 2(sample size)] [4(level of nonzero b) 3(type of nonnull hypothesis)+1(complete null hypothesis)] experimental situations. Using the F table of percentage points, the proportions for each of the hypotheses rejected were recorded at the 5% level of signicance. Twenty ve thousand repetitions were employed for each situation. The computer used to carry out the simulation was a Pentium III-based PC. All programs and subroutines were developed using Lahey Fortran 77 version 3.0 (1994). The subroutines UNI1 and NORMB1 from RANGEN (Blair, 1987) were used to generate pseudo-random uniform and standard normal deviates. 4. Results To demonstrate the adequacy of the Headrick (2000) procedure, overall average values of (ˆ), 2 (ˆ 2 ), 1 ( 1 ), 2 ( 2 ), 3 ( 3 ), and 4 ( 4 ) for the i in (1) were obtained for both sample sizes. Specically, for each repetition the values of, 2, 1, 2, 3, and 4 were computed. These values were then averaged across the 25; 000(repetitions) 13(hypotheses considered) experimental situations. The overall averages are presented in Table 2. Inspection of Table 2 indicates that the data Table 2 Average values of (ˆ), 2 (ˆ 2 ), 1( 1 ), 2( 2 ), 3( 3 ), and 4( 4 ) from the simulation. The average values are based on sample sizes of N = 30 and 50 a Distribution N ˆ ˆ a Note. The three distributions are described as follows: (1) normal ( 1 =0; 2 =0; 3 =0; 4 = 0); (2) symmetric and light-tailed ( 1 =0; 2 = 6=5; 3 =0; 4 =48=7); and (3) asymmetric and moderately heavy-tailed ( 1 =2; 2 =6; 3 =24; 4 = 120).

7 208 T.C. Headrick, O. Rotou / Computational Statistics & Data Analysis 38 (2001) Table 3 Type I error and power analyses for the additive regression model in Eq. (1). The error terms were standard normal. The hypotheses tested the signicance of the incremental increases in R 2 for linear eects (reduced 2), the predictors raised to the second power (reduced 1), and the predictors raised to the third power (full). The sample size was N = 30. Nominal alpha is = 0:05. F denotes the parametric statistic. FR denotes the rank transform statistic Slope coecient (b) Hypotheses Statistic Model Reduced (2) Reduced (1) Full 0.00 Null, null, null F FR Power, null, null F FR Power, power, null F FR Power, power, power F FR Power, null, null F FR Power, power, null F FR Power, power, power F FR Power, null, null F FR Power, power, null F FR Power, power, power F FR Power, null, null F FR Power, power, null F FR Power, power, power F FR generation procedure produced overall average values of ˆ, ˆ 2, 1, 2, 3, and 4 that were in close agreement to the desired population parameters. Type I error and power results are presented in Tables 3 and 4. The results reported are for the case of N = 30 and where the i iid N (0; 1). The column entries from left to right denote (a) the slope coecients b, (b) the hypotheses tested, (c) the statistics computed on the raw scores (F) and their associated ranks (FR), and (d) the proportion of rejections for the tests under the various levels of the parameters considered. In all experimental situations nominal alpha () was (A complete set of tables is available from the rst author.)

8 T.C. Headrick, O. Rotou / Computational Statistics & Data Analysis 38 (2001) Table 4 Type I error and power analyses for the nonadditive regression model in Eq. (2). The error terms were standard normal. The hypotheses tested the signicance of the incremental increases in R 2 for linear eects (reduced 2), bilinear interactions (reduced 1), and the three-way interaction (full). The sample size was N = 30. Nominal alpha is =0:05. F denotes the parametric statistic. FR denotes the rank transform statistic Slope coecient (b) Hypotheses Statistic Model Reduced (2) Reduced (1) Full 0.00 Null, null, null F FR Power, null, null F FR Power, power, null F FR Power, power, power F FR Power, null, null F FR Power, power, null F FR Power, power, power F FR Power, null, null F FR Power, power, null F FR Power, power, power F FR Power, null, null F FR Power, power, null F FR Power, power, power F FR Under a complete null hypothesis, both the parametric and RT procedures maintained Type I error rates close to nominal alpha. By inspection of Tables 3 and 4, the Type I error rates for these tests were within the closed interval of ± 1:96 (1 )= However, when nonzero values of b were present, Tables 3 and 4 indicate that the RT generated liberal Type I error rates. For example, given b = 2:00, inspection of Table 3 indicates that under true null hypotheses for quadratic and cubic relationships, the Type I error rates for the RT were and Further, inspection of Table 4 indicates when both linear eects and bilinear interactions were nonnull (b =2:00), the Type I error rate was for the RT under a true null hypothesis for three-way interaction.

9 210 T.C. Headrick, O. Rotou / Computational Statistics & Data Analysis 38 (2001) With respect to power analyses, inspection of Tables 3 and 4 indicate a substantial power advantage in favor of the parametric regression procedure when all slope coecients were b =0:50. Specically, the parametric F test was rejecting at rates of and for quadratic and cubic relationships while the RT was rejecting at rates of only and Similarly, under these same conditions (b =0:50), inspection of Table 4 also indicates that the F test was rejecting at rates of and for bilinear and three-way interactions while the RT was rejecting at rates of and The inated Type I error rates and power losses associated with the RT were more substantial under nonnormal conditions. For example, when the error terms of the additive model were (approximately) exponentially distributed, the Type I error rates reached as high as Also, when all relationships were nonnull (b =0:50), the parametric procedure was rejecting at rates of and for quadratic and cubic relationships while the RT was rejecting at rates of only and An analytical comparison As indicated in the previous section, the RT procedure resulted in severe Type I error inations for both the additive and nonadditive models considered. As a result, it is interesting to compare the RT and the usual OLS regression procedures to show how the parametric procedure lacks the property of invariance under monotone transformations. In terms of this study, an example to consider is the three-way interaction term in the nonadditive model. Specically, it is demonstrated below how the nonlinear nature of the RT reverses the absence of a three-way interaction in the original scores when nonzero slope coecients for linear relationships and bilinear interactions are present. Without loss of generality, assume that the stochastic error term in (2) follows a standard normal distribution. Further, let X 1i = Temperature, X 2i = Minutes of practice, and X 3i = Humidity be measured on the following dichotomous scale: score 1 if X 1i, X 2i, and X 3i are at high levels; and score 1 ifx 1i, X 2i, and X 3i are at low levels. Presented in Table 5 is an example of a case where the linear and two-way interaction eects are present while a true null hypothesis holds for the three-way interaction. The algebraic form of this example can be expressed in terms of taking expectations of the left and right-hand sides of (2) as, where E[Y i ]=X 1i + X 2i + X 3i + X 1i X 2i + X 1i X 3i + X 2i X 3i ; (4) E[ i ]=0;E[ 1 ]=E[ 2 ]=E[ 3 ]=E[ 4 ]=E[ 5 ]=E[ 6 ]=1 and E[ 0 ]=E[ 7 ]=0: However, taking expectations of the RT analog to (2) for the example in Table 5 yields a nonzero slope coecient for three-way interaction. Specically, the

10 T.C. Headrick, O. Rotou / Computational Statistics & Data Analysis 38 (2001) Table 5 Expected values of the raw scores E[Y i] and their associated ranks E[R(Y i)] for Eq. (2) with X 1i, X 2i, and X 3i measured on a dichotomous scale. The three-way interaction term is null. The errors ( i) are (standard) normally distributed Case (#) E[Y i] X 1i X 2i X 3i X 1iX 2i X 1iX 3i X 2iX 3i E[ i] E[R(Y i)] RT analog to (4) is expressed as, E[R(Y i )] = X 1i + 2 X 2i + 3 X 3i + 4 X 1i X 2i + 5 X 1i X 3i + 6 X 2i X 3i + 7 X 1i X 2i X 3i : (5) The expected values of the coecients in (5) are: E[ 0 ]=16:5, E[ 1 ]=E[ 2 ]= E[ 3 ]=E[ 4 ]=E[ 5 ]=E[ 6 ]=2:843, and E[ 7 ]= 3:056. These expectations are based on the ranks of the Y i and the original form of the independent variables. (An analogous nonzero expected value of 7 also results if the ranked forms of

11 212 T.C. Headrick, O. Rotou / Computational Statistics & Data Analysis 38 (2001) the independent variables are used.) Thus, the RT procedure falsely indicates a nonnull three-way interaction. To demonstrate how the RT created the nonzero slope coecient for three-way interaction, it is rst necessary to dene a rank in terms of this experiment. Before stating this denition, rst note that the values of the Y i in (2), in terms of this example, and the E[Y i ] in (4) are equally arranged across k =1;:::;p (p = 2); l =1;:::;q (q = 2); and m =1;:::;r (r = 2) levels of the three independent variables. If j =1;:::;n (n = 4) indicates the number of observations for each of the p q r arrangements, then the Y i in (2) and their associated ranks can be represented as elements belonging to the klmth arrangement as follows: Y i = Y klmj (6a) and R(Y i )=R(Y klmj ): (6b) The subscript i = f(k; l; m; j) in (6a) and (6b) denotes the position of the jth value of Y i in the klmth arrangement. Using (6a), the rank of Y klmj can be dened as follows: Denition 1. If y denotes any realization of Y klmj and is a Bernoulli variable scoring (k; l; m; j)=1 for y Y klmj or (k; l; m; j)=0 for y Y klmj, then the rank of y, R(y), can be dened as R(y)=1+ k (k; l; m; j): (7) l m j It follows from Denition 1 that the expected value of R(y) belonging to the klmth arrangement can be expressed as E[R(y)]=1+ k Pr{y Y klmj }: (8) l m j Suppose y is any realization belonging to the k l m th arrangement in this experiment. Because the Y k l m j all have the same expectation within this arrangement, Eq. (8) can be rewritten to express the expected value of R(y) in terms of the expected values of the other klm arrangements. Simplifying (8) accordingly yields, E[R(y k l m )]= (n 1) + n Pr{E[Y k l m ] E[Y klm]}; k k l l m m (9) where the value (1=2)(n 1) is the sum of the Pr{y Y k l m j} for all Y k l m j y belonging to the k l m th arrangement.

12 T.C. Headrick, O. Rotou / Computational Statistics & Data Analysis 38 (2001) Given the standard normality assumption, the expected value of the ranks in the k l m th arrangement can be determined from the following expression: E[R(y) k l m ]= (n 1) + n zklm 1 e w2 =2 dw; k l k l m 2 m klm; (10) where z klm =(E[Y k l m ] E[Y klm])= 2 in the upper limit of the integral of (10). Using (10) and the relationship in (6b), the expected values of the ranks in (5) were determined and are listed in the far right column of Table 5. To determine the expected values of the slope coecients for the three-way interaction terms in (4) and (5), it is only necessary to consider the usual denition of interaction (INT) for a continuous variable treated as categorical. This can be expressed in terms of the original scores as INT klm = E[Y klm ] E[Y kl ] E[Y k m ] E[Y lm ]+E[Y k ]+E[Y l ] + E[Y m ] E[ 0 ]; (11) where the dot indicates averaging over the subscript(s). Consider the rst entries in Table 5 under the column headings E[Y i ] and E[R(Y i )] which are 6 and These values are also associated with E[Y klmj ] and E[R(Y klmj )], where k = l = m = j = 1. Using (11), the interaction for the original scores is determined as INT 111 =6 3(3) + 3(1) 0=0: (12) Similarly, using (11) for the ranked observations, the interaction is INT(RT) 111 =30:50 3(25:028) + 3(19:343) 16:5= 3:056: (13) Any value of the E[Y klmj ] and their rank analogs, E[R(Y klmj )], associated with the product term X 1i X 2i X 3i = 1 will also yield the same values for three-way interaction in (12) and (13). Hence, it follows that the E[ 7 ] = 0 and the E[R( 7 )] = 3:056 for the values of E[Y klmj ] and E[R(Y klmj )] positioned in arrangements where X 1i X 2i X 3i =1. Values of E[Y klmj ] and E[R(Y klmj )] associated with the product term X 1i X 2i X 3i = 1 will yield values of E[ 7 ]=0 and E[R( 7 )]=3:056. This, of course, ensures that the usual constraint for interaction is satised. That is, the sum of the interactions across all p q r arrangements is zero. 6. Discussion and conclusion The derivation of the nonzero expected slope coecient in (13) demonstrates how the RT changed the probability structure of the null hypothesis for three-way interaction in the original scores. Thus, the analytical results from the previous section and the severe Type I error rates from the Monte Carlo portion of this study invalidate the RT as a viable alternative to the regular OLS multiple regression procedure. Further, it makes no dierence whether the independent variables are treated as

13 214 T.C. Headrick, O. Rotou / Computational Statistics & Data Analysis 38 (2001) either continuous (i.e., regression) or categorical (i.e., ANOVA). Furthermore, the Monte Carlo results of this study also show that the RT exhibited increased Type I error rates for the additive model considered. This occurred when all parametric assumptions were valid. The results of this study raise serious doubts with respect to the validity of the Choi (1998) Monte Carlo study. Specically, Choi (1998) concluded The results of this simulation study exhibit that the rank transform [RT] test in a 2 3 factorial design has considerable power advantages over the parametric test in many cases. However, Choi (1998) did not investigate Type I error rates when nonnull main eects were present. One of the prerequisites for comparative power analyses between competing procedures is that the statistical procedures be robust with respect to Type I error when all usual parametric assumptions are valid. This is not the case for the RT procedure in the contexts of either multiple regression or factorial ANOVA beyond a 2 2 layout (Thompson, 1991, 1993). References Akritas, M.G., The rank transform method in some two factor designs. J. Amer. Statist. Assoc. 85, Angermeier, P.L., Winston, M.R., Local vs. regional inuences on local diversity in stream sh communities of Virginia. Ecology 79, Augner, M., Provenza, F.D., Villalba, J.J., A rule of thumb in mammalian herbivores? Anim. Behav. 56, Blair, R.C., RANGEN. IBM, Boca Raton, FL. Brunner, E., Neuman, N., Rank tests in 2 2 designs. Statist. Neulandica 40, Choi, Y.H., A study of the power of the rank transform test in a 2 3 factorial experiment. Comm. Statist. Simulation Comput. 27, Conover, W.J., Practical Nonparametric Statistics, 3rd Edition. Wiley, New York. Conover, W.J., Iman, R.L., On some alternative procedures using ranks for the analysis of experimental designs. Commun. Statist. 5, Conover, W.J., Iman, R.L., Analysis of covariance using the rank transformation. Biometrics 38, Daniel, C., Wood, F.S., Fitting Equations to Data. Wiley, New York. Headrick, T.C., Simulating univariate and multivariate nonnormal distributions. Proceedings of the Statistical Computing Section, American Statistical Association, Washington, DC, pp Hettmansperger, T.P., McKean, J.W., Robust Nonparametric Statistical Methods. Arnold, London. Iman, R.L., A power study of a rank transform for the two way classication model when interaction may be present. Canad. J. Statist. 2, Iman, R.L., Conover, W.J., The use of the rank transform in regression. Technometrics 21, IMSL (International Mathematical and Statistical Libraries), IMSL Library, Reference Manual. Author, Houston, TX. Lahey, Computer Systems, Inc., Personal Fortran, Version 3.0. Author, Incline Village, NV. Lee, C.C., Yan, X., On the consistency of rank transformed regression. Commun. Statist. Theory Methods 25, Micceri, T., The unicorn, the normal curve, and other improbable creatures. Psychol. Bull. 105, O Gorman, T.W., Woolson, R.F., On the ecacy of the rank transformation in stepwise logistic and discriminant analysis. Statist. Medicine 12,

14 T.C. Headrick, O. Rotou / Computational Statistics & Data Analysis 38 (2001) Pedhazur, E.J., Multiple Regression in Behavioral Research, 2nd Edition. Harcourt Brace Jovanovich, Fort Worth, TX. Regeth, R.A., Stine, W.W., Computing means from nonnormal distributions: the bisquare-weighted analysis of variance. Behav. Res. Methods Instrum. Comput. 30, SAS Institute, Inc., SAS User s Guide: Statistics, 8th Edition. Author, Cary, NC. Sawilowsky, S.S., Blair, R.C., Higgins, J.J., An investigation of the Type I error and power properties of the rank transform procedure in factorial ANOVA. J. Educ. Statist. 14, Thompson, G.L., A note on the rank transform for interactions. Biometrika 78, Thompson, G.L., A correction note on the rank transform for interactions. Biometrika 80, 711. Wolfram, S., The Mathematica Book, 4th Edition. Wolfram Media-Cambridge University Press, Cambridge, UK.

AN IMPROVEMENT TO THE ALIGNED RANK STATISTIC

AN IMPROVEMENT TO THE ALIGNED RANK STATISTIC Journal of Applied Statistical Science ISSN 1067-5817 Volume 14, Number 3/4, pp. 225-235 2005 Nova Science Publishers, Inc. AN IMPROVEMENT TO THE ALIGNED RANK STATISTIC FOR TWO-FACTOR ANALYSIS OF VARIANCE

More information

On Polynomial Transformations For Simulating Multivariate Non-normal Distributions

On Polynomial Transformations For Simulating Multivariate Non-normal Distributions Journal of Modern Applied Statistical Methods May, 2004, Vol. 3, No. 1,65-71 Copyright 2004 JMASM, Inc. 1538-9472/04/$95.00 On Polynomial Transformations For Simulating Multivariate Non-normal Distributions

More information

An Alternative to Cronbach s Alpha: A L-Moment Based Measure of Internal-consistency Reliability

An Alternative to Cronbach s Alpha: A L-Moment Based Measure of Internal-consistency Reliability Southern Illinois University Carbondale OpenSIUC Book Chapters Educational Psychology and Special Education 013 An Alternative to Cronbach s Alpha: A L-Moment Based Measure of Internal-consistency Reliability

More information

Conventional And Robust Paired And Independent-Samples t Tests: Type I Error And Power Rates

Conventional And Robust Paired And Independent-Samples t Tests: Type I Error And Power Rates Journal of Modern Applied Statistical Methods Volume Issue Article --3 Conventional And And Independent-Samples t Tests: Type I Error And Power Rates Katherine Fradette University of Manitoba, umfradet@cc.umanitoba.ca

More information

Power Method Distributions through Conventional Moments and L-Moments

Power Method Distributions through Conventional Moments and L-Moments Southern Illinois University Carbondale OpenSIUC Publications Educational Psychology and Special Education 2012 Power Method Distributions through Conventional Moments and L-Moments Flaviu A. Hodis Victoria

More information

MLRV. Multiple Linear Regression Viewpoints. Volume 27 Number 2 Fall Table of Contents

MLRV. Multiple Linear Regression Viewpoints. Volume 27 Number 2 Fall Table of Contents Multiple Linear Regression Viewpoints A Publication sponsored by the American Educational Research Association s Special Interest Group on Multiple Linear Regression: The General Linear Model Volume 7

More information

Numerical Computing and Graphics for the Power Method Transformation Using Mathematica

Numerical Computing and Graphics for the Power Method Transformation Using Mathematica Southern Illinois University Carbondale OpenSIUC Publications Educational Psychology and Special Education 4-2007 Numerical Computing and Graphics for the Power Method Transformation Using Mathematica

More information

Algorithms and Code JMASM24: Numerical Computing for Third-Order Power Method Polynomials (Excel)

Algorithms and Code JMASM24: Numerical Computing for Third-Order Power Method Polynomials (Excel) Journal of Modern Applied Statistical Methods November, 2006, Vol. 5, No.2, 567-574 Copyright 2006 JMASM, Inc. 1538-9472/06/$95.00 Algorithms and Code JMASM24: Numerical Computing for Third-Order Power

More information

Non-parametric confidence intervals for shift effects based on paired ranks

Non-parametric confidence intervals for shift effects based on paired ranks Journal of Statistical Computation and Simulation Vol. 76, No. 9, September 2006, 765 772 Non-parametric confidence intervals for shift effects based on paired ranks ULLRICH MUNZEL* Viatris GmbH & Co.

More information

PROD. TYPE: COM ARTICLE IN PRESS. Computational Statistics & Data Analysis ( )

PROD. TYPE: COM ARTICLE IN PRESS. Computational Statistics & Data Analysis ( ) COMSTA 28 pp: -2 (col.fig.: nil) PROD. TYPE: COM ED: JS PAGN: Usha.N -- SCAN: Bindu Computational Statistics & Data Analysis ( ) www.elsevier.com/locate/csda Transformation approaches for the construction

More information

Testing For Aptitude-Treatment Interactions In Analysis Of Covariance And Randomized Block Designs Under Assumption Violations

Testing For Aptitude-Treatment Interactions In Analysis Of Covariance And Randomized Block Designs Under Assumption Violations Journal of Modern Applied Statistical Methods Volume 4 Issue 2 Article 11 11-1-2005 Testing For Aptitude-Treatment Interactions In Analysis Of Covariance And Randomized Block Designs Under Assumption Violations

More information

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages: Glossary The ISI glossary of statistical terms provides definitions in a number of different languages: http://isi.cbs.nl/glossary/index.htm Adjusted r 2 Adjusted R squared measures the proportion of the

More information

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007)

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007) FROM: PAGANO, R. R. (007) I. INTRODUCTION: DISTINCTION BETWEEN PARAMETRIC AND NON-PARAMETRIC TESTS Statistical inference tests are often classified as to whether they are parametric or nonparametric Parameter

More information

On power and sample size calculations for Wald tests in generalized linear models

On power and sample size calculations for Wald tests in generalized linear models Journal of tatistical lanning and Inference 128 (2005) 43 59 www.elsevier.com/locate/jspi On power and sample size calculations for Wald tests in generalized linear models Gwowen hieh epartment of Management

More information

Research Article A Nonparametric Two-Sample Wald Test of Equality of Variances

Research Article A Nonparametric Two-Sample Wald Test of Equality of Variances Advances in Decision Sciences Volume 211, Article ID 74858, 8 pages doi:1.1155/211/74858 Research Article A Nonparametric Two-Sample Wald Test of Equality of Variances David Allingham 1 andj.c.w.rayner

More information

INFLUENCE OF USING ALTERNATIVE MEANS ON TYPE-I ERROR RATE IN THE COMPARISON OF INDEPENDENT GROUPS ABSTRACT

INFLUENCE OF USING ALTERNATIVE MEANS ON TYPE-I ERROR RATE IN THE COMPARISON OF INDEPENDENT GROUPS ABSTRACT Mirtagioğlu et al., The Journal of Animal & Plant Sciences, 4(): 04, Page: J. 344-349 Anim. Plant Sci. 4():04 ISSN: 08-708 INFLUENCE OF USING ALTERNATIVE MEANS ON TYPE-I ERROR RATE IN THE COMPARISON OF

More information

Simulating Uniform- and Triangular- Based Double Power Method Distributions

Simulating Uniform- and Triangular- Based Double Power Method Distributions Journal of Statistical and Econometric Methods, vol.6, no.1, 2017, 1-44 ISSN: 1792-6602 (print), 1792-6939 (online) Scienpress Ltd, 2017 Simulating Uniform- and Triangular- Based Double Power Method Distributions

More information

Latin Hypercube Sampling with Multidimensional Uniformity

Latin Hypercube Sampling with Multidimensional Uniformity Latin Hypercube Sampling with Multidimensional Uniformity Jared L. Deutsch and Clayton V. Deutsch Complex geostatistical models can only be realized a limited number of times due to large computational

More information

Sample size calculations for logistic and Poisson regression models

Sample size calculations for logistic and Poisson regression models Biometrika (2), 88, 4, pp. 93 99 2 Biometrika Trust Printed in Great Britain Sample size calculations for logistic and Poisson regression models BY GWOWEN SHIEH Department of Management Science, National

More information

THE 'IMPROVED' BROWN AND FORSYTHE TEST FOR MEAN EQUALITY: SOME THINGS CAN'T BE FIXED

THE 'IMPROVED' BROWN AND FORSYTHE TEST FOR MEAN EQUALITY: SOME THINGS CAN'T BE FIXED THE 'IMPROVED' BROWN AND FORSYTHE TEST FOR MEAN EQUALITY: SOME THINGS CAN'T BE FIXED H. J. Keselman Rand R. Wilcox University of Manitoba University of Southern California Winnipeg, Manitoba Los Angeles,

More information

Application of Parametric Homogeneity of Variances Tests under Violation of Classical Assumption

Application of Parametric Homogeneity of Variances Tests under Violation of Classical Assumption Application of Parametric Homogeneity of Variances Tests under Violation of Classical Assumption Alisa A. Gorbunova and Boris Yu. Lemeshko Novosibirsk State Technical University Department of Applied Mathematics,

More information

A MONTE CARLO INVESTIGATION INTO THE PROPERTIES OF A PROPOSED ROBUST ONE-SAMPLE TEST OF LOCATION. Bercedis Peterson

A MONTE CARLO INVESTIGATION INTO THE PROPERTIES OF A PROPOSED ROBUST ONE-SAMPLE TEST OF LOCATION. Bercedis Peterson A MONTE CARLO INVESTIGATION INTO THE PROPERTIES OF A PROPOSED ROBUST ONE-SAMPLE TEST OF LOCATION by Bercedis Peterson Department of Biostatistics University of North Carolina at Chapel Hill Institute of

More information

Prediction of ordinal outcomes when the association between predictors and outcome diers between outcome levels

Prediction of ordinal outcomes when the association between predictors and outcome diers between outcome levels STATISTICS IN MEDICINE Statist. Med. 2005; 24:1357 1369 Published online 26 November 2004 in Wiley InterScience (www.interscience.wiley.com). DOI: 10.1002/sim.2009 Prediction of ordinal outcomes when the

More information

A nonparametric two-sample wald test of equality of variances

A nonparametric two-sample wald test of equality of variances University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 211 A nonparametric two-sample wald test of equality of variances David

More information

Analysis of 2x2 Cross-Over Designs using T-Tests

Analysis of 2x2 Cross-Over Designs using T-Tests Chapter 234 Analysis of 2x2 Cross-Over Designs using T-Tests Introduction This procedure analyzes data from a two-treatment, two-period (2x2) cross-over design. The response is assumed to be a continuous

More information

TESTING FOR NORMALITY IN THE LINEAR REGRESSION MODEL: AN EMPIRICAL LIKELIHOOD RATIO TEST

TESTING FOR NORMALITY IN THE LINEAR REGRESSION MODEL: AN EMPIRICAL LIKELIHOOD RATIO TEST Econometrics Working Paper EWP0402 ISSN 1485-6441 Department of Economics TESTING FOR NORMALITY IN THE LINEAR REGRESSION MODEL: AN EMPIRICAL LIKELIHOOD RATIO TEST Lauren Bin Dong & David E. A. Giles Department

More information

Chapter Fifteen. Frequency Distribution, Cross-Tabulation, and Hypothesis Testing

Chapter Fifteen. Frequency Distribution, Cross-Tabulation, and Hypothesis Testing Chapter Fifteen Frequency Distribution, Cross-Tabulation, and Hypothesis Testing Copyright 2010 Pearson Education, Inc. publishing as Prentice Hall 15-1 Internet Usage Data Table 15.1 Respondent Sex Familiarity

More information

" M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2

 M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2 Notation and Equations for Final Exam Symbol Definition X The variable we measure in a scientific study n The size of the sample N The size of the population M The mean of the sample µ The mean of the

More information

Journal of Biostatistics and Epidemiology

Journal of Biostatistics and Epidemiology Journal of Biostatistics and Epidemiology Original Article Robust correlation coefficient goodness-of-fit test for the Gumbel distribution Abbas Mahdavi 1* 1 Department of Statistics, School of Mathematical

More information

Antonietta Mira. University of Pavia, Italy. Abstract. We propose a test based on Bonferroni's measure of skewness.

Antonietta Mira. University of Pavia, Italy. Abstract. We propose a test based on Bonferroni's measure of skewness. Distribution-free test for symmetry based on Bonferroni's Measure Antonietta Mira University of Pavia, Italy Abstract We propose a test based on Bonferroni's measure of skewness. The test detects the asymmetry

More information

Model Estimation Example

Model Estimation Example Ronald H. Heck 1 EDEP 606: Multivariate Methods (S2013) April 7, 2013 Model Estimation Example As we have moved through the course this semester, we have encountered the concept of model estimation. Discussions

More information

On estimation of the Poisson parameter in zero-modied Poisson models

On estimation of the Poisson parameter in zero-modied Poisson models Computational Statistics & Data Analysis 34 (2000) 441 459 www.elsevier.com/locate/csda On estimation of the Poisson parameter in zero-modied Poisson models Ekkehart Dietz, Dankmar Bohning Department of

More information

My data doesn t look like that..

My data doesn t look like that.. Testing assumptions My data doesn t look like that.. We have made a big deal about testing model assumptions each week. Bill Pine Testing assumptions Testing assumptions We have made a big deal about testing

More information

Least Absolute Value vs. Least Squares Estimation and Inference Procedures in Regression Models with Asymmetric Error Distributions

Least Absolute Value vs. Least Squares Estimation and Inference Procedures in Regression Models with Asymmetric Error Distributions Journal of Modern Applied Statistical Methods Volume 8 Issue 1 Article 13 5-1-2009 Least Absolute Value vs. Least Squares Estimation and Inference Procedures in Regression Models with Asymmetric Error

More information

ON SMALL SAMPLE PROPERTIES OF PERMUTATION TESTS: INDEPENDENCE BETWEEN TWO SAMPLES

ON SMALL SAMPLE PROPERTIES OF PERMUTATION TESTS: INDEPENDENCE BETWEEN TWO SAMPLES ON SMALL SAMPLE PROPERTIES OF PERMUTATION TESTS: INDEPENDENCE BETWEEN TWO SAMPLES Hisashi Tanizaki Graduate School of Economics, Kobe University, Kobe 657-8501, Japan e-mail: tanizaki@kobe-u.ac.jp Abstract:

More information

p(z)

p(z) Chapter Statistics. Introduction This lecture is a quick review of basic statistical concepts; probabilities, mean, variance, covariance, correlation, linear regression, probability density functions and

More information

A Test of Symmetry. Abdul R. Othman. Universiti Sains Malaysia. H. J. Keselman. University of Manitoba. Rand R. Wilcox

A Test of Symmetry. Abdul R. Othman. Universiti Sains Malaysia. H. J. Keselman. University of Manitoba. Rand R. Wilcox Symmetry A Test of Symmetry by Abdul R. Othman Universiti Sains Malaysia H. J. Keselman University of Manitoba Rand R. Wilcox University of Southern California Katherine Fradette University of Manitoba

More information

An Empirical Comparison of Four Data Generating Procedures in Parametric and Nonparametric ANOVA

An Empirical Comparison of Four Data Generating Procedures in Parametric and Nonparametric ANOVA Southern Illinois University Carbondale OpenSIUC Dissertations ses and Dissertations 5-1-2011 An Empirical Comparison of Four Data Generating Procedures in Parametric and Nonparametric ANOVA Anquan Zhang

More information

* Tuesday 17 January :30-16:30 (2 hours) Recored on ESSE3 General introduction to the course.

* Tuesday 17 January :30-16:30 (2 hours) Recored on ESSE3 General introduction to the course. Name of the course Statistical methods and data analysis Audience The course is intended for students of the first or second year of the Graduate School in Materials Engineering. The aim of the course

More information

Dr. Junchao Xia Center of Biophysics and Computational Biology. Fall /1/2016 1/46

Dr. Junchao Xia Center of Biophysics and Computational Biology. Fall /1/2016 1/46 BIO5312 Biostatistics Lecture 10:Regression and Correlation Methods Dr. Junchao Xia Center of Biophysics and Computational Biology Fall 2016 11/1/2016 1/46 Outline In this lecture, we will discuss topics

More information

REVIEW 8/2/2017 陈芳华东师大英语系

REVIEW 8/2/2017 陈芳华东师大英语系 REVIEW Hypothesis testing starts with a null hypothesis and a null distribution. We compare what we have to the null distribution, if the result is too extreme to belong to the null distribution (p

More information

Experimental Design and Data Analysis for Biologists

Experimental Design and Data Analysis for Biologists Experimental Design and Data Analysis for Biologists Gerry P. Quinn Monash University Michael J. Keough University of Melbourne CAMBRIDGE UNIVERSITY PRESS Contents Preface page xv I I Introduction 1 1.1

More information

Prediction Intervals in the Presence of Outliers

Prediction Intervals in the Presence of Outliers Prediction Intervals in the Presence of Outliers David J. Olive Southern Illinois University July 21, 2003 Abstract This paper presents a simple procedure for computing prediction intervals when the data

More information

Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami

Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami Parametric Assumptions The observations must be independent. Dependent variable should be continuous

More information

Aligned Rank Tests As Robust Alternatives For Testing Interactions In Multiple Group Repeated Measures Designs With Heterogeneous Covariances

Aligned Rank Tests As Robust Alternatives For Testing Interactions In Multiple Group Repeated Measures Designs With Heterogeneous Covariances Journal of Modern Applied Statistical Methods Volume 3 Issue 2 Article 17 11-1-2004 Aligned Rank Tests As Robust Alternatives For Testing Interactions In Multiple Group Repeated Measures Designs With Heterogeneous

More information

An Equivalency Test for Model Fit. Craig S. Wells. University of Massachusetts Amherst. James. A. Wollack. Ronald C. Serlin

An Equivalency Test for Model Fit. Craig S. Wells. University of Massachusetts Amherst. James. A. Wollack. Ronald C. Serlin Equivalency Test for Model Fit 1 Running head: EQUIVALENCY TEST FOR MODEL FIT An Equivalency Test for Model Fit Craig S. Wells University of Massachusetts Amherst James. A. Wollack Ronald C. Serlin University

More information

Contents. Preface to Second Edition Preface to First Edition Abbreviations PART I PRINCIPLES OF STATISTICAL THINKING AND ANALYSIS 1

Contents. Preface to Second Edition Preface to First Edition Abbreviations PART I PRINCIPLES OF STATISTICAL THINKING AND ANALYSIS 1 Contents Preface to Second Edition Preface to First Edition Abbreviations xv xvii xix PART I PRINCIPLES OF STATISTICAL THINKING AND ANALYSIS 1 1 The Role of Statistical Methods in Modern Industry and Services

More information

Testing Statistical Hypotheses

Testing Statistical Hypotheses E.L. Lehmann Joseph P. Romano Testing Statistical Hypotheses Third Edition 4y Springer Preface vii I Small-Sample Theory 1 1 The General Decision Problem 3 1.1 Statistical Inference and Statistical Decisions

More information

An L-Moment-Based Analog for the Schmeiser- Deutsch Class of Distributions

An L-Moment-Based Analog for the Schmeiser- Deutsch Class of Distributions University of Texas at Arlington From the SelectedWorks of Mohan Dev Pant Fall August 7, 01 An L-Moment-Based Analog for the Schmeiser- Deutsch Class of Distributions Todd C. Headrick, Southern Illinois

More information

From Practical Data Analysis with JMP, Second Edition. Full book available for purchase here. About This Book... xiii About The Author...

From Practical Data Analysis with JMP, Second Edition. Full book available for purchase here. About This Book... xiii About The Author... From Practical Data Analysis with JMP, Second Edition. Full book available for purchase here. Contents About This Book... xiii About The Author... xxiii Chapter 1 Getting Started: Data Analysis with JMP...

More information

Exact Kolmogorov-Smirnov Test of Normality. for Completely Randomized Designs. Michele M. Altavela and Constance L. Wood. Summary

Exact Kolmogorov-Smirnov Test of Normality. for Completely Randomized Designs. Michele M. Altavela and Constance L. Wood. Summary Exact Kolmogorov-Smirnov Test of Normality for Completely Randomized Designs by Michele M. Altavela and Constance L. Wood Summary Considered are the finite-sample properties of the Kolmogorov- Smirnov

More information

Inferences About the Difference Between Two Means

Inferences About the Difference Between Two Means 7 Inferences About the Difference Between Two Means Chapter Outline 7.1 New Concepts 7.1.1 Independent Versus Dependent Samples 7.1. Hypotheses 7. Inferences About Two Independent Means 7..1 Independent

More information

GMM tests for the Katz family of distributions

GMM tests for the Katz family of distributions Journal of Statistical Planning and Inference 110 (2003) 55 73 www.elsevier.com/locate/jspi GMM tests for the Katz family of distributions Yue Fang Department of Decision Sciences, Lundquist College of

More information

Scaled and adjusted restricted tests in. multi-sample analysis of moment structures. Albert Satorra. Universitat Pompeu Fabra.

Scaled and adjusted restricted tests in. multi-sample analysis of moment structures. Albert Satorra. Universitat Pompeu Fabra. Scaled and adjusted restricted tests in multi-sample analysis of moment structures Albert Satorra Universitat Pompeu Fabra July 15, 1999 The author is grateful to Peter Bentler and Bengt Muthen for their

More information

Bootstrap Simulation Procedure Applied to the Selection of the Multiple Linear Regressions

Bootstrap Simulation Procedure Applied to the Selection of the Multiple Linear Regressions JKAU: Sci., Vol. 21 No. 2, pp: 197-212 (2009 A.D. / 1430 A.H.); DOI: 10.4197 / Sci. 21-2.2 Bootstrap Simulation Procedure Applied to the Selection of the Multiple Linear Regressions Ali Hussein Al-Marshadi

More information

DETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics

DETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics DETAILED CONTENTS About the Author Preface to the Instructor To the Student How to Use SPSS With This Book PART I INTRODUCTION AND DESCRIPTIVE STATISTICS 1. Introduction to Statistics 1.1 Descriptive and

More information

A Monte-Carlo study of asymptotically robust tests for correlation coefficients

A Monte-Carlo study of asymptotically robust tests for correlation coefficients Biometrika (1973), 6, 3, p. 661 551 Printed in Great Britain A Monte-Carlo study of asymptotically robust tests for correlation coefficients BY G. T. DUNCAN AND M. W. J. LAYAKD University of California,

More information

STAT Section 5.8: Block Designs

STAT Section 5.8: Block Designs STAT 518 --- Section 5.8: Block Designs Recall that in paired-data studies, we match up pairs of subjects so that the two subjects in a pair are alike in some sense. Then we randomly assign, say, treatment

More information

Linear Models in Statistics

Linear Models in Statistics Linear Models in Statistics ALVIN C. RENCHER Department of Statistics Brigham Young University Provo, Utah A Wiley-Interscience Publication JOHN WILEY & SONS, INC. New York Chichester Weinheim Brisbane

More information

On some properties of elementary derivations in dimension six

On some properties of elementary derivations in dimension six Journal of Pure and Applied Algebra 56 (200) 69 79 www.elsevier.com/locate/jpaa On some properties of elementary derivations in dimension six Joseph Khoury Department of Mathematics, University of Ottawa,

More information

TESTS FOR MEAN EQUALITY THAT DO NOT REQUIRE HOMOGENEITY OF VARIANCES: DO THEY REALLY WORK?

TESTS FOR MEAN EQUALITY THAT DO NOT REQUIRE HOMOGENEITY OF VARIANCES: DO THEY REALLY WORK? TESTS FOR MEAN EQUALITY THAT DO NOT REQUIRE HOMOGENEITY OF VARIANCES: DO THEY REALLY WORK? H. J. Keselman Rand R. Wilcox University of Manitoba University of Southern California Winnipeg, Manitoba Los

More information

Types of Statistical Tests DR. MIKE MARRAPODI

Types of Statistical Tests DR. MIKE MARRAPODI Types of Statistical Tests DR. MIKE MARRAPODI Tests t tests ANOVA Correlation Regression Multivariate Techniques Non-parametric t tests One sample t test Independent t test Paired sample t test One sample

More information

GENERAL PROBLEMS OF METROLOGY AND MEASUREMENT TECHNIQUE

GENERAL PROBLEMS OF METROLOGY AND MEASUREMENT TECHNIQUE DOI 10.1007/s11018-017-1141-3 Measurement Techniques, Vol. 60, No. 1, April, 2017 GENERAL PROBLEMS OF METROLOGY AND MEASUREMENT TECHNIQUE APPLICATION AND POWER OF PARAMETRIC CRITERIA FOR TESTING THE HOMOGENEITY

More information

Textbook Examples of. SPSS Procedure

Textbook Examples of. SPSS Procedure Textbook s of IBM SPSS Procedures Each SPSS procedure listed below has its own section in the textbook. These sections include a purpose statement that describes the statistical test, identification of

More information

Statistical methods and data analysis

Statistical methods and data analysis Statistical methods and data analysis Teacher Stefano Siboni Aim The aim of the course is to illustrate the basic mathematical tools for the analysis and modelling of experimental data, particularly concerning

More information

Application of Homogeneity Tests: Problems and Solution

Application of Homogeneity Tests: Problems and Solution Application of Homogeneity Tests: Problems and Solution Boris Yu. Lemeshko (B), Irina V. Veretelnikova, Stanislav B. Lemeshko, and Alena Yu. Novikova Novosibirsk State Technical University, Novosibirsk,

More information

SAS/STAT 14.1 User s Guide. Introduction to Nonparametric Analysis

SAS/STAT 14.1 User s Guide. Introduction to Nonparametric Analysis SAS/STAT 14.1 User s Guide Introduction to Nonparametric Analysis This document is an individual chapter from SAS/STAT 14.1 User s Guide. The correct bibliographic citation for this manual is as follows:

More information

MONTE CARLO ANALYSIS OF CHANGE POINT ESTIMATORS

MONTE CARLO ANALYSIS OF CHANGE POINT ESTIMATORS MONTE CARLO ANALYSIS OF CHANGE POINT ESTIMATORS Gregory GUREVICH PhD, Industrial Engineering and Management Department, SCE - Shamoon College Engineering, Beer-Sheva, Israel E-mail: gregoryg@sce.ac.il

More information

EEC 686/785 Modeling & Performance Evaluation of Computer Systems. Lecture 19

EEC 686/785 Modeling & Performance Evaluation of Computer Systems. Lecture 19 EEC 686/785 Modeling & Performance Evaluation of Computer Systems Lecture 19 Department of Electrical and Computer Engineering Cleveland State University wenbing@ieee.org (based on Dr. Raj Jain s lecture

More information

Increasing Power in Paired-Samples Designs. by Correcting the Student t Statistic for Correlation. Donald W. Zimmerman. Carleton University

Increasing Power in Paired-Samples Designs. by Correcting the Student t Statistic for Correlation. Donald W. Zimmerman. Carleton University Power in Paired-Samples Designs Running head: POWER IN PAIRED-SAMPLES DESIGNS Increasing Power in Paired-Samples Designs by Correcting the Student t Statistic for Correlation Donald W. Zimmerman Carleton

More information

Practical Statistics for the Analytical Scientist Table of Contents

Practical Statistics for the Analytical Scientist Table of Contents Practical Statistics for the Analytical Scientist Table of Contents Chapter 1 Introduction - Choosing the Correct Statistics 1.1 Introduction 1.2 Choosing the Right Statistical Procedures 1.2.1 Planning

More information

Linear Regression and Its Applications

Linear Regression and Its Applications Linear Regression and Its Applications Predrag Radivojac October 13, 2014 Given a data set D = {(x i, y i )} n the objective is to learn the relationship between features and the target. We usually start

More information

Exam details. Final Review Session. Things to Review

Exam details. Final Review Session. Things to Review Exam details Final Review Session Short answer, similar to book problems Formulae and tables will be given You CAN use a calculator Date and Time: Dec. 7, 006, 1-1:30 pm Location: Osborne Centre, Unit

More information

dqd: A command for treatment effect estimation under alternative assumptions

dqd: A command for treatment effect estimation under alternative assumptions UC3M Working Papers Economics 14-07 April 2014 ISSN 2340-5031 Departamento de Economía Universidad Carlos III de Madrid Calle Madrid, 126 28903 Getafe (Spain) Fax (34) 916249875 dqd: A command for treatment

More information

Extending the Robust Means Modeling Framework. Alyssa Counsell, Phil Chalmers, Matt Sigal, Rob Cribbie

Extending the Robust Means Modeling Framework. Alyssa Counsell, Phil Chalmers, Matt Sigal, Rob Cribbie Extending the Robust Means Modeling Framework Alyssa Counsell, Phil Chalmers, Matt Sigal, Rob Cribbie One-way Independent Subjects Design Model: Y ij = µ + τ j + ε ij, j = 1,, J Y ij = score of the ith

More information

Weighted tests of homogeneity for testing the number of components in a mixture

Weighted tests of homogeneity for testing the number of components in a mixture Computational Statistics & Data Analysis 41 (2003) 367 378 www.elsevier.com/locate/csda Weighted tests of homogeneity for testing the number of components in a mixture Edward Susko Department of Mathematics

More information

Weighted empirical likelihood estimates and their robustness properties

Weighted empirical likelihood estimates and their robustness properties Computational Statistics & Data Analysis ( ) www.elsevier.com/locate/csda Weighted empirical likelihood estimates and their robustness properties N.L. Glenn a,, Yichuan Zhao b a Department of Statistics,

More information

Modeling Hydrologic Chanae

Modeling Hydrologic Chanae Modeling Hydrologic Chanae Statistical Methods Richard H. McCuen Department of Civil and Environmental Engineering University of Maryland m LEWIS PUBLISHERS A CRC Press Company Boca Raton London New York

More information

Strati cation in Multivariate Modeling

Strati cation in Multivariate Modeling Strati cation in Multivariate Modeling Tihomir Asparouhov Muthen & Muthen Mplus Web Notes: No. 9 Version 2, December 16, 2004 1 The author is thankful to Bengt Muthen for his guidance, to Linda Muthen

More information

Parametric Probability Densities and Distribution Functions for Tukey g-and-h Transformations and their Use for Fitting Data

Parametric Probability Densities and Distribution Functions for Tukey g-and-h Transformations and their Use for Fitting Data Applied Mathematical Sciences, Vol. 2, 2008, no. 9, 449-462 Parametric Probability Densities and Distribution Functions for Tukey g-and-h Transformations and their Use for Fitting Data Todd C. Headrick,

More information

POWER AND TYPE I ERROR RATE COMPARISON OF MULTIVARIATE ANALYSIS OF VARIANCE

POWER AND TYPE I ERROR RATE COMPARISON OF MULTIVARIATE ANALYSIS OF VARIANCE POWER AND TYPE I ERROR RATE COMPARISON OF MULTIVARIATE ANALYSIS OF VARIANCE Supported by Patrick Adebayo 1 and Ahmed Ibrahim 1 Department of Statistics, University of Ilorin, Kwara State, Nigeria Department

More information

Application of Variance Homogeneity Tests Under Violation of Normality Assumption

Application of Variance Homogeneity Tests Under Violation of Normality Assumption Application of Variance Homogeneity Tests Under Violation of Normality Assumption Alisa A. Gorbunova, Boris Yu. Lemeshko Novosibirsk State Technical University Novosibirsk, Russia e-mail: gorbunova.alisa@gmail.com

More information

Comparison of nonparametric analysis of variance methods a Monte Carlo study Part A: Between subjects designs - A Vote for van der Waerden

Comparison of nonparametric analysis of variance methods a Monte Carlo study Part A: Between subjects designs - A Vote for van der Waerden Comparison of nonparametric analysis of variance methods a Monte Carlo study Part A: Between subjects designs - A Vote for van der Waerden Version 5 completely revised and extended (13.7.2017) Haiko Lüpsen

More information

TEST POWER IN COMPARISON DIFFERENCE BETWEEN TWO INDEPENDENT PROPORTIONS

TEST POWER IN COMPARISON DIFFERENCE BETWEEN TWO INDEPENDENT PROPORTIONS TEST POWER IN COMPARISON DIFFERENCE BETWEEN TWO INDEPENDENT PROPORTIONS Mehmet MENDES PhD, Associate Professor, Canakkale Onsekiz Mart University, Agriculture Faculty, Animal Science Department, Biometry

More information

Tentative solutions TMA4255 Applied Statistics 16 May, 2015

Tentative solutions TMA4255 Applied Statistics 16 May, 2015 Norwegian University of Science and Technology Department of Mathematical Sciences Page of 9 Tentative solutions TMA455 Applied Statistics 6 May, 05 Problem Manufacturer of fertilizers a) Are these independent

More information

Power Calculations for Preclinical Studies Using a K-Sample Rank Test and the Lehmann Alternative Hypothesis

Power Calculations for Preclinical Studies Using a K-Sample Rank Test and the Lehmann Alternative Hypothesis Power Calculations for Preclinical Studies Using a K-Sample Rank Test and the Lehmann Alternative Hypothesis Glenn Heller Department of Epidemiology and Biostatistics, Memorial Sloan-Kettering Cancer Center,

More information

Glossary for the Triola Statistics Series

Glossary for the Triola Statistics Series Glossary for the Triola Statistics Series Absolute deviation The measure of variation equal to the sum of the deviations of each value from the mean, divided by the number of values Acceptance sampling

More information

Statistics Toolbox 6. Apply statistical algorithms and probability models

Statistics Toolbox 6. Apply statistical algorithms and probability models Statistics Toolbox 6 Apply statistical algorithms and probability models Statistics Toolbox provides engineers, scientists, researchers, financial analysts, and statisticians with a comprehensive set of

More information

VarCan (version 1): Variation Estimation and Partitioning in Canonical Analysis

VarCan (version 1): Variation Estimation and Partitioning in Canonical Analysis VarCan (version 1): Variation Estimation and Partitioning in Canonical Analysis Pedro R. Peres-Neto March 2005 Department of Biology University of Regina Regina, SK S4S 0A2, Canada E-mail: Pedro.Peres-Neto@uregina.ca

More information

Trendlines Simple Linear Regression Multiple Linear Regression Systematic Model Building Practical Issues

Trendlines Simple Linear Regression Multiple Linear Regression Systematic Model Building Practical Issues Trendlines Simple Linear Regression Multiple Linear Regression Systematic Model Building Practical Issues Overfitting Categorical Variables Interaction Terms Non-linear Terms Linear Logarithmic y = a +

More information

Outline. Simulation of a Single-Server Queueing System. EEC 686/785 Modeling & Performance Evaluation of Computer Systems.

Outline. Simulation of a Single-Server Queueing System. EEC 686/785 Modeling & Performance Evaluation of Computer Systems. EEC 686/785 Modeling & Performance Evaluation of Computer Systems Lecture 19 Outline Simulation of a Single-Server Queueing System Review of midterm # Department of Electrical and Computer Engineering

More information

Christopher Dougherty London School of Economics and Political Science

Christopher Dougherty London School of Economics and Political Science Introduction to Econometrics FIFTH EDITION Christopher Dougherty London School of Economics and Political Science OXFORD UNIVERSITY PRESS Contents INTRODU CTION 1 Why study econometrics? 1 Aim of this

More information

Multiple Comparison Procedures, Trimmed Means and Transformed Statistics. Rhonda K. Kowalchuk Southern Illinois University Carbondale

Multiple Comparison Procedures, Trimmed Means and Transformed Statistics. Rhonda K. Kowalchuk Southern Illinois University Carbondale Multiple Comparison Procedures 1 Multiple Comparison Procedures, Trimmed Means and Transformed Statistics Rhonda K. Kowalchuk Southern Illinois University Carbondale H. J. Keselman University of Manitoba

More information

MARGINAL HOMOGENEITY MODEL FOR ORDERED CATEGORIES WITH OPEN ENDS IN SQUARE CONTINGENCY TABLES

MARGINAL HOMOGENEITY MODEL FOR ORDERED CATEGORIES WITH OPEN ENDS IN SQUARE CONTINGENCY TABLES REVSTAT Statistical Journal Volume 13, Number 3, November 2015, 233 243 MARGINAL HOMOGENEITY MODEL FOR ORDERED CATEGORIES WITH OPEN ENDS IN SQUARE CONTINGENCY TABLES Authors: Serpil Aktas Department of

More information

Mantel-Haenszel Test Statistics. for Correlated Binary Data. Department of Statistics, North Carolina State University. Raleigh, NC

Mantel-Haenszel Test Statistics. for Correlated Binary Data. Department of Statistics, North Carolina State University. Raleigh, NC Mantel-Haenszel Test Statistics for Correlated Binary Data by Jie Zhang and Dennis D. Boos Department of Statistics, North Carolina State University Raleigh, NC 27695-8203 tel: (919) 515-1918 fax: (919)

More information

Contents. Acknowledgments. xix

Contents. Acknowledgments. xix Table of Preface Acknowledgments page xv xix 1 Introduction 1 The Role of the Computer in Data Analysis 1 Statistics: Descriptive and Inferential 2 Variables and Constants 3 The Measurement of Variables

More information

Rejection regions for the bivariate case

Rejection regions for the bivariate case Rejection regions for the bivariate case The rejection region for the T 2 test (and similarly for Z 2 when Σ is known) is the region outside of an ellipse, for which there is a (1-α)% chance that the test

More information

In Chapter 2, some concepts from the robustness literature were introduced. An important concept was the inuence function. In the present chapter, the

In Chapter 2, some concepts from the robustness literature were introduced. An important concept was the inuence function. In the present chapter, the Chapter 3 Robustness Properties of the Student t Based Pseudo Maximum Likelihood Estimator In Chapter 2, some concepts from the robustness literature were introduced. An important concept was the inuence

More information

Analysis of variance and linear contrasts in experimental design with generalized secant hyperbolic distribution

Analysis of variance and linear contrasts in experimental design with generalized secant hyperbolic distribution Journal of Computational and Applied Mathematics 216 (2008) 545 553 www.elsevier.com/locate/cam Analysis of variance and linear contrasts in experimental design with generalized secant hyperbolic distribution

More information

Non-parametric methods

Non-parametric methods Eastern Mediterranean University Faculty of Medicine Biostatistics course Non-parametric methods March 4&7, 2016 Instructor: Dr. Nimet İlke Akçay (ilke.cetin@emu.edu.tr) Learning Objectives 1. Distinguish

More information