Methodology Review: Applications of Distribution Theory in Studies of. Population Validity and Cross Validity. James Algina. University of Florida
|
|
- Barbara Daniels
- 5 years ago
- Views:
Transcription
1 Distribution Theory 1 Methodology eview: Applications of Distribution Theory in Studies of Population Validity and Cross Validity by James Algina University of Florida and H. J. Keselman University of Manitoba
2 Distribution Theory ABSTACT Applications of distribution theory for the squared multiple correlation coefficient and the squared cross-validation coefficient are reviewed and computer programs for these applications are made available. The applications include confidence intervals, hypothesis testing, and sample size selection.
3 Distribution Theory 3 Methodology eview: Applications of Distribution Theory in Studies of Population Validity and Cross Validity aju, Bilgic, Edward, and Fleer (1997) presented a review of methods for estimating the population squared multiple correlation coefficient and the squared cross-validation correlation coefficient. While this review was quite through in terms of estimation, aju et al. did not review distribution theory for these two coefficients. The purpose of this note is to extend the review by addressing applications of distribution theory under the assumption that scores on the dependent and independent variables are drawn from a multivariate normal distribution. The Squared Multiple Correlation Coefficient Fisher (198) derived the distribution of the sample squared multiple correlation coefficient. Since that time work has been devoted to topics such as simplified derivations of the distribution (e.g., Moran, 195), moments of the distribution (Wishart, 1931), approximate methods for evaluating the cumulative distribution function (CDF) of (Gurland & Milton, 197) or (Khatri, 1996, Lee, 1971), and exact methods for evaluating the CDF of & Bargmann, 1991; Ding, 1994). intervals for An important application of the CDF is constructions of confidence (Ding. According to Steiger and Fouladi (1997), the lower limit of a two-sided 1 1 % confidence interval for can be constructed by finding the value of such that the observed value of is the 1 th percentile
4 Distribution Theory 4 of the distribution of ; the upper limit can be constructed by finding the value of such that the observed value of is the th percentile. For some sample sizes n, number of predictors k, values of, and confidence levels, the 1 th percentile or the 1 th and th percentiles do not exist. In the former case the lower limit is set equal to zero and in the latter case both limits are set equal to zero. Steiger and Fouladi (199a) prepared a program that computes confidence intervals for. While the program is very easy to use, instructors who use a commercially available program such as SAS or SPSS in their course may want to use the program from their course to calculate the confidence interval. Accordingly, we have prepared SAS and SPSS programs that compute confidence intervals for. These programs use Lee s (1971) noncentral F approximation to the distribution of. According to Steiger and Fouladi (199b), results from Lee s approach are accurate to the fourth decimal place. See Algina and Olejnik () for a recent description of Lee s approach. The programs are called ci.smcc.sas and ci.smcc.sps and are available for download from url to be added. (All programs developed for this paper are available at this site.) The use of confidence interval for is very important, both as a way of assessing accuracy of estimation and as a way of acknowledging the bias of as an estimator of. For example, suppose a study based on n 3 participants with k 1 predictors yields.5. A 95% confidence interval for
5 Distribution Theory 5 is. to.58, indicating that the data provide relatively little information about the size of and that more of the interval is below.5 than above it. In more extreme examples, the interval can be wholly below the calculated value for. Confidence intervals can also be used to test hypotheses about example, we may be interested in presenting evidence that prescribed value. To test. For exceeds some H : H : 1 (1) we find, the 1 th percentile of the distribution of 1 in which. If exceeds 1, we can reject H. When exceeds, the lower limit of a 1 % confidence interval will be larger 1 than. Thus by constructing a 1 % confidence interval and determining whether is smaller than the lower limit, we have conducted an 1 level test of the hypotheses in equations (1). If, this confidence interval procedure is equivalent to the usual F test of the hypothesis H H : 1 : () Using a confidence interval to test the hypotheses in (1) requires calculation of. SAS (percentile.mrsq.sas) and SPSS 1 (percentile.mrsq.sas) programs are available to carry out this calculation.
6 Distribution Theory 6 An interesting application of the hypotheses in equation (1) is due to Browne (1975b). Consider the estimated multiple regression equation Y a b1x 1 bkx (3) k based on a sample of size n. Browne defined the mean squared error of prediction (Stein, 196) for the regression equation in (3) as d E Y Y where E denotes the expectation operator and the expectation is take over the distribution of the criterion and predictor variables with a and b1 b fixed. k Browne also defined E d, the mean square error of prediction, averaged over all possible realizations of equation (3) for a sample of size n. Browne considered a situation in which m of the k predictors are, a priori, to be included in the regression equation and the researcher wants to know whether to include the additional k m predictors. ather than testing the null hypothesis that the k m predictors do not increase the squared multiple correlation coefficient: H, Browne agued that it is more appropriate to test that adding the : k m predictors reduces the expected mean squared error of prediction H : k m H 1 : k m and shows that these hypotheses are equivalent to
7 Distribution Theory 7 H : km k m n m k m H1 : km, n m (4) where k m km 1 m is the partial multiple correlation coefficient and is estimated by. k m km 1 m The distribution of is the same as that of km based on k m predictors and a sample size of n m. Therefore, the hypothesis in (4) can be tested by constructing a 1 1 % confidence interval for and checking whether km or not the lower limit is greater than k m n m. If it is, then we can conclude that the additional predictors reduce the expected mean square error of prediction. Our confidence interval programs (ci.smcc.sas and ci.smcc.sps) can construct confidence intervals for Gurland and Milton (1971) to test the hypotheses in (4). km. Browne used an approximation due to As an alternative to the hypotheses in (1), we may want to test that smaller than some prescribed value : is H : H :. 1
8 Distribution Theory 8 We construct a 1 % confidence interval and determine whether is larger than the upper limit of the interval. If so we can reject H. Since the distribution of the squared Pearson product moment correlation coefficient is the distribution of for k 1, the procedures described above can also be used to find confidence intervals for the population squared Pearson product moment correlation coefficient and test hypotheses about. Another application of the distribution is to select sample sizes to meet a target for accurate estimation of distribution of to select sample sizes so that. Algina and Olejnik () used the will be estimated with adequate accuracy when the adjusted squared multiple correlation coefficient a is used as an estimator. Specifically, they found the sample size so that prob max, c min 1, c p. a where c and p are defined by the researcher. For example, suppose a researcher has k 8 predictors, believes that is.5, and wants to design a study so that prob max,.1 a min 1, That is, the researcher wants a.95 probability that a is in the interval.4,.5. The required sample size is. SAS (sample.size.adjmrsq.sas) and SPSS (sample.size.adjmrsq.sps) programs that implement this approach are available. In addition, SAS (sample.size.mrsq.sas) and SPSS (sample.size.mrsq.sps) programs are available to find the sample size such that prob max, c min 1, c p.
9 Distribution Theory 9 For example, suppose a researcher is planning a study with k 8 predictors and believes is.5. The researcher wants to be 95 % sure that the estimate of will be within. 5 of. If is used, the required sample size is 78. a is used the required sample size is 778; if An alternative method for selecting sample size is to determine the sample size necessary to meet some target for power to test the hypotheses in equation (). Gastonis and Sampson (1989) have discussed using the distribution of to conduct such analyses. As they have pointed out, when sampling is conducted so that the values observed for the predictors would vary from sample to sample, power analyses using Fisher s distribution for is more appropriate than power analysis assuming values observed for the predictors would be fixed from sample to sample, as is done, for example, in Cohen s (1988) approach to power analysis in multiple regression. We have prepared SAS (power.mrsq.sas) and SPSS programs (power.mrsq.sps) to determine the sample size necessary to meet some target for power given and k. For example, suppose a researcher is planning a study with k 8 predictors and.1. The researcher wants power to be.95 if is.5. Calculations show that a sample size of n 45 is required. Note that using the same values for k and.5 accuracy would be 778 if, the sample size to meet a were used. While it is impossible to equate how demanding a target for power and a target for accuracy are, it is fair to say that the accuracy and power targets used in these two examples are quite demanding. As these examples illustrate, it will often be the case that a larger sample size is required to achieve target accuracy than to achieve target power.
10 Distribution Theory 1 The Squared Cross-Validation Correlation Coefficient The squared cross-validation correlation coefficient is defined as, that is YY as the population Pearson product moment correlation between Y and Y. The coefficient is a random variable, defined over the samples of size n that could have been drawn. Fowler (1986) stated, without proof, that the distribution of can be used to find a 1 1 % confidence interval for. First a 1 1 % confidence interval for is constructed. Then the lower limit and the upper limit of the confidence interval are substituted in Browne s (1975b) expression for the firstorder approximation to the expected value of : n p 3 4 n k k. Because this procedure starts with a 1 1 % confidence interval for and because is a monotone function of, it is clear that Fowler s procedure provides a1 1 % confidence interval for. However, it is less clear that the procedure provides a 1 1 % confidence interval for. Furthermore, although Fowler reported results that seem to substantiate the accuracy of the interval, Fowler does not describe how these results were obtained. In addition, Fowler used an approximate procedure to obtain the 1 1 % confidence interval for and reported on only a few cases. Given the limitation of this work, we conducted a more extensive simulation.
11 Distribution Theory 11 The design of our simulation was a 4 n 5 k 8 (nominal confidence level) crossed factorial design. The levels of the factors were n 3 to 1 in steps of 3, k to 1 in steps of,.5 to.75 in steps of.1, and confidence level =.95 and.99. All conditions were replicated 5 times. For each condition, the percentage of times the 5 intervals contained E n k n k 6 k 1 1 n p 4 n p p 4 3 (5) was determined. Equation (3) is a more accurate approximation to the expected value of than is (Browne, 1975b). In addition, the percentage of times the 5 intervals contained was also determined. We also determined the average value for over the 5 replications, to determine how well E approximated the expected value of. The approximation was very accurate. Thus, determining the adequacy of the probability coverage for E tells us how adequate the coverage probability is for the expected value of. Table 1 reports the results for the most extreme combination of sample size and number of predictors ( n 3 and k 1 ). The results indicate excellent coverage probability for the interval for the expected value of. The coverage probabilities are also good for, with a slight tendency to decline as gets large. This tendency was reduced as n increased or k decreased. It is well known that must be less than or equal to. Therefore a reasonable goal in planning a study to estimate a prediction equation is to have a
12 Distribution Theory 1 suitably high probability that is sufficiently close to. Algina and Keslman () used simulated data to estimate the required sample size for a number of combinations of, k, and c, where c is the maximum tolerable discrepancy between and. However, Park and Dudycha (1974) had earlier shown that Pr 1 Pr F 1, k 1, k 1, where F 1, k 1, is a noncentral F distribution with 1 and k 1 degrees of freedom and noncentrality parameter 1 : n k 1. This result can be used to analytically determine the sample size required for to be sufficiently close to. Because the researcher s goal is to have a suitably high probability that c, (s)he can select the sample size so that c k 1 Pr c 1 Pr F 1, k 1, p. c Park and Dudycha published tables that provide the required sample size for a limited number of combinations of, k, and c. We have prepared SAS (sample.size..sas) and SPSS (sample.size..sps) programs to compute the required sample. For example, suppose a researcher believes that with k 6 predictors is.3 and wants a.95 probability that the discrepancy between and is not larger than.5. That is, the researcher wants to find n such that
13 Distribution Theory 13.3 n Pr Pr F 1,6 1, Calculations yield n 154. Conclusion Depending on the purpose of the study, estimation of the or will be of interest. aju, Bilgic, Edward, and Fleer (1997) presented an excellent review of point estimation methods for these parameters, but did not focus on interval estimation. We have reviewed applications of the distribution theory for and, including confidence intervals, power, and sample size determination. Thus our review complements the review by aju, Bilgic, Edward, and Fleer.
14 Distribution Theory 14 eferences Algina, J., & Keselman, H. J. (). Cross-validation sample sizes. Applied Psychological Measurement, 4, Algina J., & Okejnik, S. (). Determining sample size for accurate estimation of the squared multiple correlation coefficient. Multivariate Behavioral esearch, 35, Browne, M. B. (1975a). Predictive validity of a linear regression equation. British Journal of Mathematical and Statistical Psychology, 8, Browne, M. B. (1975b). A comparison of single sample and cross-validation methods of estimating the mean squared error of prediction in multiple linear regression. British Journal of Mathematical and Statistical Psychology, 8, Cohen, J. (1988). Statistical power analysis for the behavioral sciences ( nd ed). Hillsdale, NJ: Lawrence Erlbaum. Ding, C. G. (1996). On the computation of the distribution of the square of the sample multiple correlation coefficient. Computational Statistics and Data Analysis,, Ding, C. G., & Bargmann,. E. (1991). Algorithm AS 6: evaluation of the distribution of the square of the sample multiple correlation coefficient. Applied Statistics, 4, Fisher,. A. (198). The general sampling distribution of the multiple correlation coefficient. Proceedings of the oyal Society of London, Series A, 11,
15 Distribution Theory 15 Fowler,. L. (1986). Confidence intervals for the cross-validated multiple correlation in predictive regression models. Journal of Applied Psychology, 71, Gastonis, C., & Sampson, A.. (1989). Multiple correlation: Exact power and sample size calculations. Psychological Bulletin, 16, Gurland, J., & Milton,. (197). Further considerations of the distribution of the multiple correlation coefficient. Journal of the oyal Statistical Society, Series B (Methodological), 3, Khatri, C. G. (1966). A note on a large sample distribution of a transformed multiple correlation coefficient. Annals of the Institute of Statistical Mathematics, 18, Lee, Y.S. (1971). Some results on the sampling distribution of the multiple correlation coefficient. Journal of the oyal Statistical Society, Series B (Methodological), 33, Moran, P. A. P. (195). The distribution of the multiple correlation coefficient. Proceedings of the Cambridge Philosophical Society, Mathematical and Physical Sciences, 46, Park, C. N., & Dudycha, A. L. (1974). A cross-validation approach to sample size determination for regression models. Journal of the American Statistical Association, 69, aju, N. S. Bilgic,., Edward, J.E., & Fleer, P.F. (1997). Methodology review: estimation of population validity and cross validity, and the use of equal weights in prediction. Applied Psychological Measurement, 1,
16 Distribution Theory 16 Steiger, J. H., & Fouladi,. (199a). : A computer program for interval estimation, power calculations, sample size estimation, and hypothesis testing in multiple regression. Behavior research methods, instruments, and computers, 4, Steiger, J. H., & Fouladi,., T. (199b). User s Guide:Version 1.1 [on-line]. Steiger, J. H., & Fouladi,. (1997). Noncentrality interval estimation and the evaluation of statistical models. In L. L. Harlow, S. A. Mulaik, & J. H. Steiger (Eds.). What if there were no significance tests? (pp. 1-57). Lawrence Erlbaum Associates. Stein, C. (196). Multiple regression. In I Olkin, S. G. Ghurye, W. Hoeffding, W. G. Madow, & H. B. Mann, (Eds.). Contribution to probability and statistics; Essays in honor of Harold Hotelling. (pp ). Stanford: Stanford University Press. Wishart, J. (1931). The mean and second moment coefficients of the multiple correlation coefficient, in samples from a normal population, Biometrika,,
17 Distribution Theory 17 Footnote 1 In equation (4.16), Park and Dudycha (1974) report that the noncentrality parameter for the F distribution is rather than. However, their derivation indicates the noncentrality is. In addition they prepared their tables using the noncentral t distribution with noncentrality parameter, which also suggests the noncentrality parameter for the F distribution is. Finally we replicated their tables using the F distribution with as the noncentrality parameter.
18 Distribution Theory 18 Table 1 Estimated Confidence Coefficients for 95% Confidence Intervals on n 3 and k 1 Nominal Confidence Level and : E E Note. is the population squared multiple correlation coefficient and k is the number of predictors.
Probabilistic Hindsight: A SAS Macro for Retrospective Statistical Power Analysis
1 Paper 0-6 Probabilistic Hindsight: A SAS Macro for Retrospective Statistical Power Analysis Kristine Y. Hogarty and Jeffrey D. Kromrey Department of Educational Measurement and Research, University of
More informationTHE EFFECTS OF NONNORMAL DISTRIBUTIONS ON CONFIDENCE INTERVALS AROUND THE STANDARDIZED MEAN DIFFERENCE: BOOTSTRAP AND PARAMETRIC CONFIDENCE INTERVALS
EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT 10.1177/0013164404264850 KELLEY THE EFFECTS OF NONNORMAL DISTRIBUTIONS ON CONFIDENCE INTERVALS AROUND THE STANDARDIZED MEAN DIFFERENCE: BOOTSTRAP AND PARAMETRIC
More informationAnother Look at the Confidence Intervals for the Noncentral T Distribution
Journal of Modern Applied Statistical Methods Volume 6 Issue 1 Article 11 5-1-007 Another Look at the Confidence Intervals for the Noncentral T Distribution Bruno Lecoutre Centre National de la Recherche
More informationHarold HOTELLING b. 29 September d. 26 December 1973
Harold HOTELLING b. 29 September 1895 - d. 26 December 1973 Summary. A major developer of the foundations of statistics and an important contributor to mathematical economics, Hotelling introduced the
More informationPower Analysis Introduction to Power Analysis with G*Power 3 Dale Berger 1401
Power Analysis Introduction to Power Analysis with G*Power 3 Dale Berger 1401 G*Power 3 is a wonderful free resource for power analysis. This program provides power analyses for tests that use F, t, chi-square,
More informationP S GWOWEN SHIEH The Psychonomic Society, Inc National Chiao Tung University, Hsinchu, Taiwan
Behavior Research Methods 2010, 42 (3), 824-835 doi:10.3758/brm.42.3.824 Sample size determination for confidence intervals of interaction effects in moderated multiple regression with continuous predictor
More informationPsychology 282 Lecture #4 Outline Inferences in SLR
Psychology 282 Lecture #4 Outline Inferences in SLR Assumptions To this point we have not had to make any distributional assumptions. Principle of least squares requires no assumptions. Can use correlations
More informationAn Approximate Test for Homogeneity of Correlated Correlation Coefficients
Quality & Quantity 37: 99 110, 2003. 2003 Kluwer Academic Publishers. Printed in the Netherlands. 99 Research Note An Approximate Test for Homogeneity of Correlated Correlation Coefficients TRIVELLORE
More informationDETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics
DETAILED CONTENTS About the Author Preface to the Instructor To the Student How to Use SPSS With This Book PART I INTRODUCTION AND DESCRIPTIVE STATISTICS 1. Introduction to Statistics 1.1 Descriptive and
More informationOn Selecting Tests for Equality of Two Normal Mean Vectors
MULTIVARIATE BEHAVIORAL RESEARCH, 41(4), 533 548 Copyright 006, Lawrence Erlbaum Associates, Inc. On Selecting Tests for Equality of Two Normal Mean Vectors K. Krishnamoorthy and Yanping Xia Department
More informationScienceDirect. Who s afraid of the effect size?
Available online at www.sciencedirect.com ScienceDirect Procedia Economics and Finance 0 ( 015 ) 665 669 7th International Conference on Globalization of Higher Education in Economics and Business Administration,
More informationSOME ASPECTS OF MULTIVARIATE BEHRENS-FISHER PROBLEM
SOME ASPECTS OF MULTIVARIATE BEHRENS-FISHER PROBLEM Junyong Park Bimal Sinha Department of Mathematics/Statistics University of Maryland, Baltimore Abstract In this paper we discuss the well known multivariate
More informationTechnical Report that Accompanies Relational and arelational confidence intervals: A comment on Fidler et al. (2004)
Technical Report that Accompanies Relational and arelational confidence intervals: A comment on Fidler et al. (2004) Richard D. Morey and Jeffrey N. Rouder University of Missouri-Columbia Abstract Fidler,
More informationPsychology Seminar Psych 406 Dr. Jeffrey Leitzel
Psychology Seminar Psych 406 Dr. Jeffrey Leitzel Structural Equation Modeling Topic 1: Correlation / Linear Regression Outline/Overview Correlations (r, pr, sr) Linear regression Multiple regression interpreting
More informationContents. Acknowledgments. xix
Table of Preface Acknowledgments page xv xix 1 Introduction 1 The Role of the Computer in Data Analysis 1 Statistics: Descriptive and Inferential 2 Variables and Constants 3 The Measurement of Variables
More informationConventional And Robust Paired And Independent-Samples t Tests: Type I Error And Power Rates
Journal of Modern Applied Statistical Methods Volume Issue Article --3 Conventional And And Independent-Samples t Tests: Type I Error And Power Rates Katherine Fradette University of Manitoba, umfradet@cc.umanitoba.ca
More informationAn Equivalency Test for Model Fit. Craig S. Wells. University of Massachusetts Amherst. James. A. Wollack. Ronald C. Serlin
Equivalency Test for Model Fit 1 Running head: EQUIVALENCY TEST FOR MODEL FIT An Equivalency Test for Model Fit Craig S. Wells University of Massachusetts Amherst James. A. Wollack Ronald C. Serlin University
More information7.2 One-Sample Correlation ( = a) Introduction. Correlation analysis measures the strength and direction of association between
7.2 One-Sample Correlation ( = a) Introduction Correlation analysis measures the strength and direction of association between variables. In this chapter we will test whether the population correlation
More informationG. Shieh. Keywords Behrens Fisher problem. Cohen s d. Confidence interval. Precision. Welch s statistic
Behav Res (013) 45:955 967 DOI 10.3758/s1348-013-030-7 Confidence intervals and sample size calculations for the standardized mean difference effect size between two normal populations under heteroscedasticity
More informationAn Overview of the Performance of Four Alternatives to Hotelling's T Square
fi~hjf~~ G 1992, m-t~, 11o-114 Educational Research Journal 1992, Vol.7, pp. 110-114 An Overview of the Performance of Four Alternatives to Hotelling's T Square LIN Wen-ying The Chinese University of Hong
More informationThe MBESS Package. August 22, 2006
The MBESS Package August 22, 2006 Title Methods for the Behavioral, Educational, and Social Sciences Version 0.0.7 Date 2006-08-21 Author Ken Kelley Maintainer Ken Kelley Depends R
More informationSTAT 512 MidTerm I (2/21/2013) Spring 2013 INSTRUCTIONS
STAT 512 MidTerm I (2/21/2013) Spring 2013 Name: Key INSTRUCTIONS 1. This exam is open book/open notes. All papers (but no electronic devices except for calculators) are allowed. 2. There are 5 pages in
More informationToward Using Confidence Intervals to Compare Correlations (will appear in Psychological Methods)
Running head: COMPARE CORRELATIONS Toward Using Confidence Intervals to Compare Correlations (will appear in Psychological Methods) Guang Yong Zou Department of Epidemiology and Biostatistics Schulich
More informationDr. Franz Faul Department of Psychology Christian-Albrechts-University Olshausenstr Kiel Germany
G*Power 3 Buchner, A. (2010). G*Power: Users Guide Analysis by design. Web Page of Heinrich-Heine- Universität - Institut für experimentelle Psychologie. Availlable at http://www.psycho.uniduesseldorf.de/abteilungen/aap/gpower3.
More informationDifference in two or more average scores in different groups
ANOVAs Analysis of Variance (ANOVA) Difference in two or more average scores in different groups Each participant tested once Same outcome tested in each group Simplest is one-way ANOVA (one variable as
More informationSome Notes on ANOVA for Correlations. James H. Steiger Vanderbilt University
Some Notes on ANOVA for Correlations James H. Steiger Vanderbilt University Over the years, a number of people have asked me about doing analysis of variance on correlations. Consider, for example, a 2
More informationStatistical. Psychology
SEVENTH у *i km m it* & П SB Й EDITION Statistical M e t h o d s for Psychology D a v i d C. Howell University of Vermont ; \ WADSWORTH f% CENGAGE Learning* Australia Biaall apan Korea Меяко Singapore
More informationChapter 14: Repeated-measures designs
Chapter 14: Repeated-measures designs Oliver Twisted Please, Sir, can I have some more sphericity? The following article is adapted from: Field, A. P. (1998). A bluffer s guide to sphericity. Newsletter
More informationConfidence Intervals for Comparison of the Squared Multiple Correlation Coefficients of Nonnested
Western University Scholarship@Western Electronic Thesis and Dissertation Repository February 2012 Confidence Intervals for Comparison of the Squared Multiple Correlation Coefficients of Nonnested Models
More informationApplication of Parametric Homogeneity of Variances Tests under Violation of Classical Assumption
Application of Parametric Homogeneity of Variances Tests under Violation of Classical Assumption Alisa A. Gorbunova and Boris Yu. Lemeshko Novosibirsk State Technical University Department of Applied Mathematics,
More informationy ˆ i = ˆ " T u i ( i th fitted value or i th fit)
1 2 INFERENCE FOR MULTIPLE LINEAR REGRESSION Recall Terminology: p predictors x 1, x 2,, x p Some might be indicator variables for categorical variables) k-1 non-constant terms u 1, u 2,, u k-1 Each u
More informationØ Set of mutually exclusive categories. Ø Classify or categorize subject. Ø No meaningful order to categorization.
Statistical Tools in Evaluation HPS 41 Dr. Joe G. Schmalfeldt Types of Scores Continuous Scores scores with a potentially infinite number of values. Discrete Scores scores limited to a specific number
More informationGraphical Procedures, SAS' PROC MIXED, and Tests of Repeated Measures Effects. H.J. Keselman University of Manitoba
1 Graphical Procedures, SAS' PROC MIXED, and Tests of Repeated Measures Effects by H.J. Keselman University of Manitoba James Algina University of Florida and Rhonda K. Kowalchuk University of Manitoba
More information" M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2
Notation and Equations for Final Exam Symbol Definition X The variable we measure in a scientific study n The size of the sample N The size of the population M The mean of the sample µ The mean of the
More informationNEW APPROXIMATE INFERENTIAL METHODS FOR THE RELIABILITY PARAMETER IN A STRESS-STRENGTH MODEL: THE NORMAL CASE
Communications in Statistics-Theory and Methods 33 (4) 1715-1731 NEW APPROXIMATE INFERENTIAL METODS FOR TE RELIABILITY PARAMETER IN A STRESS-STRENGT MODEL: TE NORMAL CASE uizhen Guo and K. Krishnamoorthy
More informationEconometrics. 4) Statistical inference
30C00200 Econometrics 4) Statistical inference Timo Kuosmanen Professor, Ph.D. http://nomepre.net/index.php/timokuosmanen Today s topics Confidence intervals of parameter estimates Student s t-distribution
More informationJournal of Educational and Behavioral Statistics
Journal of Educational and Behavioral Statistics http://jebs.aera.net Theory of Estimation and Testing of Effect Sizes: Use in Meta-Analysis Helena Chmura Kraemer JOURNAL OF EDUCATIONAL AND BEHAVIORAL
More informationSingle Sample Means. SOCY601 Alan Neustadtl
Single Sample Means SOCY601 Alan Neustadtl The Central Limit Theorem If we have a population measured by a variable with a mean µ and a standard deviation σ, and if all possible random samples of size
More informationDr. Junchao Xia Center of Biophysics and Computational Biology. Fall /1/2016 1/46
BIO5312 Biostatistics Lecture 10:Regression and Correlation Methods Dr. Junchao Xia Center of Biophysics and Computational Biology Fall 2016 11/1/2016 1/46 Outline In this lecture, we will discuss topics
More informationPOWER AND TYPE I ERROR RATE COMPARISON OF MULTIVARIATE ANALYSIS OF VARIANCE
POWER AND TYPE I ERROR RATE COMPARISON OF MULTIVARIATE ANALYSIS OF VARIANCE Supported by Patrick Adebayo 1 and Ahmed Ibrahim 1 Department of Statistics, University of Ilorin, Kwara State, Nigeria Department
More informationPower Analysis for One-Way ANOVA
Chapter 12 Power Analysis for One-Way ANOVA Recall that the power of a statistical test is the probability of rejecting H 0 when H 0 is false, and some alternative hypothesis H 1 is true. We saw earlier
More informationA Monte-Carlo study of asymptotically robust tests for correlation coefficients
Biometrika (1973), 6, 3, p. 661 551 Printed in Great Britain A Monte-Carlo study of asymptotically robust tests for correlation coefficients BY G. T. DUNCAN AND M. W. J. LAYAKD University of California,
More informationLogistic Regression Analysis
Logistic Regression Analysis Predicting whether an event will or will not occur, as well as identifying the variables useful in making the prediction, is important in most academic disciplines as well
More informationAn inferential procedure to use sample data to understand a population Procedures
Hypothesis Test An inferential procedure to use sample data to understand a population Procedures Hypotheses, the alpha value, the critical region (z-scores), statistics, conclusion Two types of errors
More informationModel Estimation Example
Ronald H. Heck 1 EDEP 606: Multivariate Methods (S2013) April 7, 2013 Model Estimation Example As we have moved through the course this semester, we have encountered the concept of model estimation. Discussions
More informationStatistics Handbook. All statistical tables were computed by the author.
Statistics Handbook Contents Page Wilcoxon rank-sum test (Mann-Whitney equivalent) Wilcoxon matched-pairs test 3 Normal Distribution 4 Z-test Related samples t-test 5 Unrelated samples t-test 6 Variance
More information* Tuesday 17 January :30-16:30 (2 hours) Recored on ESSE3 General introduction to the course.
Name of the course Statistical methods and data analysis Audience The course is intended for students of the first or second year of the Graduate School in Materials Engineering. The aim of the course
More informationCHAPTER 2. Types of Effect size indices: An Overview of the Literature
CHAPTER Types of Effect size indices: An Overview of the Literature There are different types of effect size indices as a result of their different interpretations. Huberty (00) names three different types:
More informationStudy Design: Sample Size Calculation & Power Analysis
Study Design: Sample Size Calculation & Power Analysis RCMAR/CHIME/EXPORT April 21, 2008 Honghu Liu, Ph.D. Contents Background Common Designs Examples Computer Software Summary & Discussion Background
More informationReconciling factor-based and composite-based approaches to structural equation modeling
Reconciling factor-based and composite-based approaches to structural equation modeling Edward E. Rigdon (erigdon@gsu.edu) Modern Modeling Methods Conference May 20, 2015 Thesis: Arguments for factor-based
More informationBlack White Total Observed Expected χ 2 = (f observed f expected ) 2 f expected (83 126) 2 ( )2 126
Psychology 60 Fall 2013 Practice Final Actual Exam: This Wednesday. Good luck! Name: To view the solutions, check the link at the end of the document. This practice final should supplement your studying;
More informationPackage MBESS. January 2, 2012
Package MBESS January 2, 2012 Title MBESS Version 3.2.1 Date 2011-5-5 Author Ken Kelley and Keke Lai Maintainer Keke Lai Depends R (>= 2.6.0) Suggests MASS, sem, gsl, boot, nlme MBESS implements
More informationCan you tell the relationship between students SAT scores and their college grades?
Correlation One Challenge Can you tell the relationship between students SAT scores and their college grades? A: The higher SAT scores are, the better GPA may be. B: The higher SAT scores are, the lower
More informationSensitiveness analysis: Sample sizes for t-tests for paired samples
Sensitiveness analysis: Sample sizes for t-tests for paired samples (J.D.Perezgonzalez, 2016, Massey University, New Zealand, doi: 10.13140/RG.2.2.32249.47203) Table 1 shows the sample sizes required for
More informationThis exam contains 5 questions. Each question is worth 10 points. Therefore, this exam is worth 50 points.
GROUND RULES: This exam contains 5 questions. Each question is worth 10 points. Therefore, this exam is worth 50 points. Print your name at the top of this page in the upper right hand corner. This is
More informationMultiple Linear Regression II. Lecture 8. Overview. Readings
Multiple Linear Regression II Lecture 8 Image source:https://commons.wikimedia.org/wiki/file:autobunnskr%c3%a4iz-ro-a201.jpg Survey Research & Design in Psychology James Neill, 2016 Creative Commons Attribution
More informationMultiple Linear Regression II. Lecture 8. Overview. Readings. Summary of MLR I. Summary of MLR I. Summary of MLR I
Multiple Linear Regression II Lecture 8 Image source:https://commons.wikimedia.org/wiki/file:autobunnskr%c3%a4iz-ro-a201.jpg Survey Research & Design in Psychology James Neill, 2016 Creative Commons Attribution
More informationSTAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots. March 8, 2015
STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots March 8, 2015 The duality between CI and hypothesis testing The duality between CI and hypothesis
More informationTHE 'IMPROVED' BROWN AND FORSYTHE TEST FOR MEAN EQUALITY: SOME THINGS CAN'T BE FIXED
THE 'IMPROVED' BROWN AND FORSYTHE TEST FOR MEAN EQUALITY: SOME THINGS CAN'T BE FIXED H. J. Keselman Rand R. Wilcox University of Manitoba University of Southern California Winnipeg, Manitoba Los Angeles,
More informationAn Introduction to Multivariate Statistical Analysis
An Introduction to Multivariate Statistical Analysis Third Edition T. W. ANDERSON Stanford University Department of Statistics Stanford, CA WILEY- INTERSCIENCE A JOHN WILEY & SONS, INC., PUBLICATION Contents
More informationA Study of Statistical Power and Type I Errors in Testing a Factor Analytic. Model for Group Differences in Regression Intercepts
A Study of Statistical Power and Type I Errors in Testing a Factor Analytic Model for Group Differences in Regression Intercepts by Margarita Olivera Aguilar A Thesis Presented in Partial Fulfillment of
More informationHYPOTHESIS TESTING. Hypothesis Testing
MBA 605 Business Analytics Don Conant, PhD. HYPOTHESIS TESTING Hypothesis testing involves making inferences about the nature of the population on the basis of observations of a sample drawn from the population.
More informationOn Polynomial Transformations For Simulating Multivariate Non-normal Distributions
Journal of Modern Applied Statistical Methods May, 2004, Vol. 3, No. 1,65-71 Copyright 2004 JMASM, Inc. 1538-9472/04/$95.00 On Polynomial Transformations For Simulating Multivariate Non-normal Distributions
More informationResearch Design - - Topic 12 MRC Analysis and Two Factor Designs: Completely Randomized and Repeated Measures 2010 R.C. Gardner, Ph.D.
esearch Design - - Topic MC nalysis and Two Factor Designs: Completely andomized and epeated Measures C Gardner, PhD General overview Completely andomized Two Factor Designs Model I Effect Coding egression
More informationINTERVAL ESTIMATION IN THE PRESENCE OF AN OUTLIER
6 INTVAL STIMATION IN TH SNC OF AN OTLI WONG YOK CHN School of Business The niversity of Nottingham Malaysia Campus mail: YokeChen.Wong@nottingham.edu.my OOI AH HIN School of Business Sunway niversity
More informationAnswer keys for Assignment 10: Measurement of study variables (The correct answer is underlined in bold text)
Answer keys for Assignment 10: Measurement of study variables (The correct answer is underlined in bold text) 1. A quick and easy indicator of dispersion is a. Arithmetic mean b. Variance c. Standard deviation
More informationThe Chi-Square and F Distributions
Department of Psychology and Human Development Vanderbilt University Introductory Distribution Theory 1 Introduction 2 Some Basic Properties Basic Chi-Square Distribution Calculations in R Convergence
More informationWooldridge, Introductory Econometrics, 4th ed. Appendix C: Fundamentals of mathematical statistics
Wooldridge, Introductory Econometrics, 4th ed. Appendix C: Fundamentals of mathematical statistics A short review of the principles of mathematical statistics (or, what you should have learned in EC 151).
More informationWhat p values really mean (and why I should care) Francis C. Dane, PhD
What p values really mean (and why I should care) Francis C. Dane, PhD Session Objectives Understand the statistical decision process Appreciate the limitations of interpreting p values Value the use of
More informationRobustness. James H. Steiger. Department of Psychology and Human Development Vanderbilt University. James H. Steiger (Vanderbilt University) 1 / 37
Robustness James H. Steiger Department of Psychology and Human Development Vanderbilt University James H. Steiger (Vanderbilt University) 1 / 37 Robustness 1 Introduction 2 Robust Parameters and Robust
More informationResearch Design - - Topic 19 Multiple regression: Applications 2009 R.C. Gardner, Ph.D.
Research Design - - Topic 19 Multiple regression: Applications 2009 R.C. Gardner, Ph.D. Curve Fitting Mediation analysis Moderation Analysis 1 Curve Fitting The investigation of non-linear functions using
More informationPOWER FUNCTION CHARTS FOR SPECIFICATION OF SAMPLE SIZE IN ANALYSIS OF VARIANCE
PSYCHOMETRIKA--VOL. 23, NO. 3 SEPTEMBER, 1958 POWER FUNCTION CHARTS FOR SPECIFICATION OF SAMPLE SIZE IN ANALYSIS OF VARIANCE LEONARD S. FELDT STATE UNIVERSITY OF IOWA AND 5~OHARRAM W. MAHMOUD EGYPTIAN
More informationCorrelation. A statistics method to measure the relationship between two variables. Three characteristics
Correlation Correlation A statistics method to measure the relationship between two variables Three characteristics Direction of the relationship Form of the relationship Strength/Consistency Direction
More informationSample Size Planning with Applications to Multiple Regression: Power and Accuracy for Omnibus and Targeted Effects
11 Sample Size Planning with Applications to Multiple Regression: Power and Accuracy for Omnibus and Targeted Effects K e n K e l l e y a n d S c o t t E. M a x w e l l ABSTRACT When designing a research
More informationHuman-Centered Fidelity Metrics for Virtual Environment Simulations
Human-Centered Fidelity Metrics for Virtual Environment Simulations Three Numbers from Standard Experimental Design and Analysis: α, power, effect magnitude VR 2005 Tutorial J. Edward Swan II, Mississippi
More informationMultiple Comparison Procedures, Trimmed Means and Transformed Statistics. Rhonda K. Kowalchuk Southern Illinois University Carbondale
Multiple Comparison Procedures 1 Multiple Comparison Procedures, Trimmed Means and Transformed Statistics Rhonda K. Kowalchuk Southern Illinois University Carbondale H. J. Keselman University of Manitoba
More informationStatistics: revision
NST 1B Experimental Psychology Statistics practical 5 Statistics: revision Rudolf Cardinal & Mike Aitken 29 / 30 April 2004 Department of Experimental Psychology University of Cambridge Handouts: Answers
More informationThe exact bootstrap method shown on the example of the mean and variance estimation
Comput Stat (2013) 28:1061 1077 DOI 10.1007/s00180-012-0350-0 ORIGINAL PAPER The exact bootstrap method shown on the example of the mean and variance estimation Joanna Kisielinska Received: 21 May 2011
More informationAdvanced Experimental Design
Advanced Experimental Design Topic Four Hypothesis testing (z and t tests) & Power Agenda Hypothesis testing Sampling distributions/central limit theorem z test (σ known) One sample z & Confidence intervals
More informationRegression With a Categorical Independent Variable: Mean Comparisons
Regression With a Categorical Independent Variable: Mean Lecture 16 March 29, 2005 Applied Regression Analysis Lecture #16-3/29/2005 Slide 1 of 43 Today s Lecture comparisons among means. Today s Lecture
More informationThis gives us an upper and lower bound that capture our population mean.
Confidence Intervals Critical Values Practice Problems 1 Estimation 1.1 Confidence Intervals Definition 1.1 Margin of error. The margin of error of a distribution is the amount of error we predict when
More informationAn Examination of the Robustness of the Empirical Bayes and Other Approaches. for Testing Main and Interaction Effects in Repeated Measures Designs
Empirical Bayes 1 An Examination of the Robustness of the Empirical Bayes and Other Approaches for Testing Main and Interaction Effects in Repeated Measures Designs by H.J. Keselman, Rhonda K. Kowalchuk
More informationDistribution-Free Procedures (Devore Chapter Fifteen)
Distribution-Free Procedures (Devore Chapter Fifteen) MATH-5-01: Probability and Statistics II Spring 018 Contents 1 Nonparametric Hypothesis Tests 1 1.1 The Wilcoxon Rank Sum Test........... 1 1. Normal
More information4.2 The Normal Distribution. that is, a graph of the measurement looks like the familiar symmetrical, bell-shaped
4.2 The Normal Distribution Many physiological and psychological measurements are normality distributed; that is, a graph of the measurement looks like the familiar symmetrical, bell-shaped distribution
More informationChapter Fifteen. Frequency Distribution, Cross-Tabulation, and Hypothesis Testing
Chapter Fifteen Frequency Distribution, Cross-Tabulation, and Hypothesis Testing Copyright 2010 Pearson Education, Inc. publishing as Prentice Hall 15-1 Internet Usage Data Table 15.1 Respondent Sex Familiarity
More informationTypes of Statistical Tests DR. MIKE MARRAPODI
Types of Statistical Tests DR. MIKE MARRAPODI Tests t tests ANOVA Correlation Regression Multivariate Techniques Non-parametric t tests One sample t test Independent t test Paired sample t test One sample
More informationPsicológica ISSN: Universitat de València España
Psicológica ISSN: 0211-2159 psicologica@uv.es Universitat de València España Zimmerman, Donald W.; Zumbo, Bruno D. Hazards in Choosing Between Pooled and Separate- Variances t Tests Psicológica, vol. 30,
More informationCHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007)
FROM: PAGANO, R. R. (007) I. INTRODUCTION: DISTINCTION BETWEEN PARAMETRIC AND NON-PARAMETRIC TESTS Statistical inference tests are often classified as to whether they are parametric or nonparametric Parameter
More informationIncreasing Power in Paired-Samples Designs. by Correcting the Student t Statistic for Correlation. Donald W. Zimmerman. Carleton University
Power in Paired-Samples Designs Running head: POWER IN PAIRED-SAMPLES DESIGNS Increasing Power in Paired-Samples Designs by Correcting the Student t Statistic for Correlation Donald W. Zimmerman Carleton
More informationØ Set of mutually exclusive categories. Ø Classify or categorize subject. Ø No meaningful order to categorization.
Statistical Tools in Evaluation HPS 41 Fall 213 Dr. Joe G. Schmalfeldt Types of Scores Continuous Scores scores with a potentially infinite number of values. Discrete Scores scores limited to a specific
More informationTESTS FOR MEAN EQUALITY THAT DO NOT REQUIRE HOMOGENEITY OF VARIANCES: DO THEY REALLY WORK?
TESTS FOR MEAN EQUALITY THAT DO NOT REQUIRE HOMOGENEITY OF VARIANCES: DO THEY REALLY WORK? H. J. Keselman Rand R. Wilcox University of Manitoba University of Southern California Winnipeg, Manitoba Los
More informationTwo-Sample Inferential Statistics
The t Test for Two Independent Samples 1 Two-Sample Inferential Statistics In an experiment there are two or more conditions One condition is often called the control condition in which the treatment is
More informationSTAT FINAL EXAM
STAT101 2013 FINAL EXAM This exam is 2 hours long. It is closed book but you can use an A-4 size cheat sheet. There are 10 questions. Questions are not of equal weight. You may need a calculator for some
More informationSample size calculations for logistic and Poisson regression models
Biometrika (2), 88, 4, pp. 93 99 2 Biometrika Trust Printed in Great Britain Sample size calculations for logistic and Poisson regression models BY GWOWEN SHIEH Department of Management Science, National
More informationMultivariate Statistical Analysis
Multivariate Statistical Analysis Fall 2011 C. L. Williams, Ph.D. Lecture 9 for Applied Multivariate Analysis Outline Addressing ourliers 1 Addressing ourliers 2 Outliers in Multivariate samples (1) For
More informationMultiple Linear Regression II. Lecture 8. Overview. Readings
Multiple Linear Regression II Lecture 8 Image source:http://commons.wikimedia.org/wiki/file:vidrarias_de_laboratorio.jpg Survey Research & Design in Psychology James Neill, 2015 Creative Commons Attribution
More informationMultiple Linear Regression II. Lecture 8. Overview. Readings. Summary of MLR I. Summary of MLR I. Summary of MLR I
Multiple Linear Regression II Lecture 8 Image source:http://commons.wikimedia.org/wiki/file:vidrarias_de_laboratorio.jpg Survey Research & Design in Psychology James Neill, 2015 Creative Commons Attribution
More informationMultiple Regression. More Hypothesis Testing. More Hypothesis Testing The big question: What we really want to know: What we actually know: We know:
Multiple Regression Ψ320 Ainsworth More Hypothesis Testing What we really want to know: Is the relationship in the population we have selected between X & Y strong enough that we can use the relationship
More informationBusiness Statistics. Lecture 10: Course Review
Business Statistics Lecture 10: Course Review 1 Descriptive Statistics for Continuous Data Numerical Summaries Location: mean, median Spread or variability: variance, standard deviation, range, percentiles,
More informationChapter 5. Introduction to Path Analysis. Overview. Correlation and causation. Specification of path models. Types of path models
Chapter 5 Introduction to Path Analysis Put simply, the basic dilemma in all sciences is that of how much to oversimplify reality. Overview H. M. Blalock Correlation and causation Specification of path
More information