A SAS/AF Application For Sample Size And Power Determination

Size: px
Start display at page:

Download "A SAS/AF Application For Sample Size And Power Determination"

Transcription

1 A SAS/AF Application For Sample Size And Power Determination Fiona Portwood, Software Product Services Ltd. Abstract When planning a study, such as a clinical trial or toxicology experiment, the choice of sample size (i.e. the number of patients or animals to use in the study) is a critical part of the study in order to ensure that the desired conclusions of efficacy or equivalence of the test treatment are successfully demonstrated at the end of the study. Since there are no SAS procedures for performing these calculations, the task of tracking down the necessary background information and the correct statistical formulae can become an obstacle, which may result in researchers being tempted to guess at an appropriate sample size. An application has been developed by the author using SAS/AF software which incorporates the various statistical formulae for sample size selection and power determination behind a user-friendly, point-and-click graphical interface. This application allows the user to estimate the sample size, power, and minimum detectable differences between treatments for various statistical hypotheses, including tests of equivalence and tests of differences between means and proportions. Results are given in both graphical and tabular formats. This paper will give an introduction to the statistical concepts behind sample size and power determination, together with a demonstration of the SAS/AF application. Introduction When planning a study or experiment, for example a clinical trial, a toxicology study or preclinical research experiment, one of the first questions that a statistician is often asked is, How many patients/animals should be used in the study?. The choice of sample size is a critical part of the study in order to ensure that the desired conclusions of efficacy or equivalence of the test treatment are successfully demonstrated at the end of the study using the appropriate statistical methodologies. The task of tracking down the necessary background information and the correct statistical formulae for determining the sample size can be difficult, and thus researchers may be tempted to use their best guess at an appropriate sample size. The number of patients or animals that should be used in a study will depend on a number of factors, including: Purpose of the study (for example, whether you wish to test for a difference or an equivalence between treatments) Study design (for example, cross-over design or parallel group design) Variance of the underlying population Size of the difference between treatments

2 Definitions It is necessary to define the following statistical definitions which will be used frequently in this paper: 1. Null Hypothesis This is a basic assumption or theory about the data which we assume to be true in the absence of any other information. The alternative hypothesis is the converse theory to the null hypothesis. During any study or experiment, we hope to collect enough information to provide evidence against the null hypothesis and for the alternative hypothesis. For example, suppose we wish to carry out a study to test for a difference between a control and a test treatment. The null hypothesis would be that population means for the control and test treatments are equal, and the alternative hypothesis would be that the population means are not equal. 2. Significance Level For tests of difference between treatments, this is the probability of incorrectly rejecting the null hypothesis, i.e. it is the probability of rejecting the null hypothesis of no difference between the treatments, when in reality there is truly no difference. We usually want this probability to be small, and it is commonly set to 0.05 (or 5%). 3. Power This is the probability of correctly rejecting the null hypothesis, i.e. it is the probability of rejecting the null hypothesis when the alternative hypothesis is true. Ideally we would like the power to be as large as possible. Calculation of Sample Size and Power There are various statistical formulae, depending on the null hypothesis of the study, which define the relationship between the sample size, significance level, power, population variance (or estimate of variance) and the size of the detectable difference between treatments. Consider the simplest example where we wish to perform a parallel group study to investigate the difference between a control and a test treatment. Suppose that the control population has a Normal distribution with mean µ 1, variance σ 2, and the test population has a Normal distribution with mean µ 2, but the same variance σ 2. The hypotheses are as follows: Null Hypothesis: H 0 : µ 1 - µ 2 = 0 (no difference between treatments) Alternative Hypothesis: H 1 : µ 1 - µ 2 0 (there is a difference between treatments) Suppose a sample size of n 1 observations is taken from the control population, and a sample size of n 2 observations is taken from the test population. The significance level is 100α% and the probability of incorrectly accepting the null hypothesis is β (so the power is 100(1-β)%).

3 Suppose d = µ 1 - µ 2 is the true difference between the sample means, and the estimated difference is d. Then using a two-sample t-test to compare the sample means, we will incorrectly accept the null hypothesis (with probability β) if the test statistic is less than the critical value at the 100(1-α)/2% level (since this is a 2-tailed test). So Prob( d / (σ (1/n 1 + 1/n 2 ) ) < Z (1-α)/2 d 0 ) = β Prob( Z < Z (1-α)/2 - d /(σ (1/n 1 + 1/n 2 ) ) ) = β where Z ~ N(0,1) d 2 /(σ 2 (1/n 1 + 1/n 2 ) ) = (Z β - Z (1-α)/2 ) 2 = (Z α/2 + Z β ) 2 since Z α/2 = -Z (1-α)/2 Therefore n 1 = (Z α/2 + Z β ) 2 σ 2 (1 + 1/R) / d 2 where R = n 2 /n 1 (ratio of the second to the first sample size). In general, suppose S is a sample estimate of f(µ) and is Normally distributed with mean f(µ) and variance V, and that the two samples have equal variances. Then for a test to measure a difference d with power 100(1-β)% at the 100α% significance level (or 100(1-α)% confidence coefficient for tests of equivalence), we have the relationship: V = d 2 / (g(α,β)) 2 The variance V of the sample S will depend on the sample size, n, and so expressing V as v/n gives: n = (g(α,β)) 2 v / d (1) g(α,β) will depend on the nature of the test, and v will depend on the model parameter f(µ) which is being tested. The tables below illustrate the values of g(α,β) and v respectively for a selection of common tests: Type Of Test g(α,β) 1-tailed difference (Z α + Z β ) 2 1-sided equivalence (Z α + Z β ) 2 2-tailed difference (Z α/2 + Z β ) 2 2-sided equivalence (Z α/2 + Z β/2 ) 2 f(µ) v Single Mean σ 2 Difference of Two Means (1 + 1/R)σ 2 Difference of Two Proportions p 1 (1-p 1 ) + p 2 (1-p 2 )/R Equation (1) can also be rearranged to solve for d and/or β. There is currently no standard SAS procedure for performing these calculations, and so a statistician or researcher might look up these formulae in a statistical text book and then perform some hand calculations, or maybe do some data-step programming. To overcome these difficulties, the author has written an application using SAS/AF which incorporates the various options and statistical formulae behind an easy-to-use graphical user interface.

4 Tests Of Proportions For tests involving proportions, the accuracy of the Normal approximation improves as the sample size increases. However, for accuracy at small sample sizes, Fisher s Exact Test should be used. Computing sample sizes for this method involves an iterative procedure, and the application uses an approximation due to Casagrande, Pike and Smith, 1980 (Ref. [1]). Note that estimates of p 1 and p 2 for the proportions of samples 1 and 2 respectively are needed to estimate the variance of the statistic. In practice, we assume that p 1 = p 2 for equivalence tests, and that p 2 differs from p 1 by an amount d for difference tests. The exact values of p 1 and p 2 make little difference to the results except for values very close to 0 or 1. In general the difference between the proportions p 1 - p 2 is much more important. Estimating Variance Usually the variance is not known in advance, and is therefore estimated from the sample. For large sample sizes, the Normal approximation provides a fair approximation, but for small samples, we should be using the T distribution. However, in order to use the T distribution, we need to know the degrees of freedom (df) in terms of the sample sizes which we are trying to calculate. For example, for a two-tailed test of difference with estimated variance s 2, equation (1) becomes: n = (t df(n(1+1/r)),α/2 + t df(n(1+1/r)),β ) 2 s 2 (1 + 1/R) / d 2 If the sample size n is known, it is as straight forward as before to calculate power or difference given the other values. However, calculating n requires an iterative procedure since the degrees of freedom of the t-values on the right hand side of the equation vary with n. An initial estimate of n is obtained by using the Normal approximation, and then this is used to determine a value for the degrees of freedom, and hence a new value of n. A number of iterations of this sequence is performed until convergence is obtained. For further details, see Dupont & Plummer, 1990 (Ref. [2]). The Application The sample size & power determination application combines the various formulae for sample size estimation behind a point-and-click graphical user interface, which was designed to be as easy to use as possible. By default, the application will calculate sample sizes for a test of differences between two treatment means using a 2-tailed test at the 5% significance level, assuming the sample sizes of each treatment group are equal. For these defaults, the user simply needs to enter an estimate of the population variance. The results are available in both graphical and tabular format. The user selects the format they require together with the attribute (power, minimum detectable difference or sample size) to be used for the curves of the graph or cells of the table. Once the desired options have been selected, the user clicks on the Run button to produce the results.

5 Figure 1: Main Screen with Graphical Output Figure 1 above shows the main screen of the sample size & power determination application with some graphical output for the default test options when the estimated population variance is 9. The graph shows sample size plotted against minimum detectable difference, and the curves show points of constant power. Options are available for printing, editing, or saving the graph to a graphics file. The same information can also be presented in a tabular format as shown in Figure 2 below. This table can be printed or saved to a SAS data set. Figure 2: Main Screen with Tabular Output

6 Figure 3: Sample Size Estimation Options Figure 3 shows the options available for performing sample size estimation. These options are not exhaustive options for sample size estimation by any means, but they give an idea of what is possible using SAS/AF software. The test-type (hypothesis), the significance level, and the ratio of sample sizes in the two samples can all be modified. The Normal or t-test approximation can be selected for use in the sample size calculations, and for tests of proportions, the Casagrande-Pike-Smith approximation is also available. If the t-test approximation is selected, the user is asked to specify a formula for the degrees of freedom in terms of the sample sizes (m & n) of the two groups. For example, for a parallel group study design the degrees of freedom are (m + n - 2), and for a cross-over study design, the degrees of freedom are (m - 2). Options for customising the graphical output are shown in Figure 4. The axis and curve variables can be selected, and the range of the axes, the number of tick marks, and the number and values of the curves plotted can be modified. Figure 5 shows the options available for customising the tabular output. The row, columns and cell variables can be selected, and the user can specify whether either a continuous range of values or specific user-supplied values are used to define the rows and columns of the table.

7 Figure 4: Graphical Output Options Figure 5: Tabular Output Options

8 Conclusion SAS/AF software provides excellent facilities for combining the various formulae for sample size estimation behind an easy-to-use, point-and-click graphical user interface. Use of this application greatly simplifies the task of sample size estimation and power determination. References [1] Casagrande, Pike & Smith, 1980, A Simple Approximation for Calculation Sample Sizes for Comparing Independent Proportions, Biometrics, Vol. 36, pp [2] Dupont & Plummer, 1990, Power and Sample Size Calculations, Controlled Clinical Trials, Vol. 11, pp Further Information For further information, please contact: Fiona Portwood, Software Product Services Ltd., Regent House, The Broadway, Woking, Surrey, GU21 5AP, UK. Tel: + 44 (0) Fax: + 44 (0) software@spsuk.demon.co.uk SAS and SAS/AF are registered trademarks of SAS Institute Inc., Cary, NC, USA.

The SEQDESIGN Procedure

The SEQDESIGN Procedure SAS/STAT 9.2 User s Guide, Second Edition The SEQDESIGN Procedure (Book Excerpt) This document is an individual chapter from the SAS/STAT 9.2 User s Guide, Second Edition. The correct bibliographic citation

More information

Analysis of 2x2 Cross-Over Designs using T-Tests

Analysis of 2x2 Cross-Over Designs using T-Tests Chapter 234 Analysis of 2x2 Cross-Over Designs using T-Tests Introduction This procedure analyzes data from a two-treatment, two-period (2x2) cross-over design. The response is assumed to be a continuous

More information

Using SPSS for One Way Analysis of Variance

Using SPSS for One Way Analysis of Variance Using SPSS for One Way Analysis of Variance This tutorial will show you how to use SPSS version 12 to perform a one-way, between- subjects analysis of variance and related post-hoc tests. This tutorial

More information

Tutorial 4: Power and Sample Size for the Two-sample t-test with Unequal Variances

Tutorial 4: Power and Sample Size for the Two-sample t-test with Unequal Variances Tutorial 4: Power and Sample Size for the Two-sample t-test with Unequal Variances Preface Power is the probability that a study will reject the null hypothesis. The estimated probability is a function

More information

Independent Samples ANOVA

Independent Samples ANOVA Independent Samples ANOVA In this example students were randomly assigned to one of three mnemonics (techniques for improving memory) rehearsal (the control group; simply repeat the words), visual imagery

More information

Hypothesis Tests and Estimation for Population Variances. Copyright 2014 Pearson Education, Inc.

Hypothesis Tests and Estimation for Population Variances. Copyright 2014 Pearson Education, Inc. Hypothesis Tests and Estimation for Population Variances 11-1 Learning Outcomes Outcome 1. Formulate and carry out hypothesis tests for a single population variance. Outcome 2. Develop and interpret confidence

More information

1 Correlation and Inference from Regression

1 Correlation and Inference from Regression 1 Correlation and Inference from Regression Reading: Kennedy (1998) A Guide to Econometrics, Chapters 4 and 6 Maddala, G.S. (1992) Introduction to Econometrics p. 170-177 Moore and McCabe, chapter 12 is

More information

Using Tables and Graphing Calculators in Math 11

Using Tables and Graphing Calculators in Math 11 Using Tables and Graphing Calculators in Math 11 Graphing calculators are not required for Math 11, but they are likely to be helpful, primarily because they allow you to avoid the use of tables in some

More information

The Chi-Square Distributions

The Chi-Square Distributions MATH 03 The Chi-Square Distributions Dr. Neal, Spring 009 The chi-square distributions can be used in statistics to analyze the standard deviation of a normally distributed measurement and to test the

More information

Student s t-distribution. The t-distribution, t-tests, & Measures of Effect Size

Student s t-distribution. The t-distribution, t-tests, & Measures of Effect Size Student s t-distribution The t-distribution, t-tests, & Measures of Effect Size Sampling Distributions Redux Chapter 7 opens with a return to the concept of sampling distributions from chapter 4 Sampling

More information

1 Introduction to Minitab

1 Introduction to Minitab 1 Introduction to Minitab Minitab is a statistical analysis software package. The software is freely available to all students and is downloadable through the Technology Tab at my.calpoly.edu. When you

More information

The Chi-Square Distributions

The Chi-Square Distributions MATH 183 The Chi-Square Distributions Dr. Neal, WKU The chi-square distributions can be used in statistics to analyze the standard deviation σ of a normally distributed measurement and to test the goodness

More information

Bayesian Updating: Discrete Priors: Spring

Bayesian Updating: Discrete Priors: Spring Bayesian Updating: Discrete Priors: 18.05 Spring 2017 http://xkcd.com/1236/ Learning from experience Which treatment would you choose? 1. Treatment 1: cured 100% of patients in a trial. 2. Treatment 2:

More information

Section 9.5. Testing the Difference Between Two Variances. Bluman, Chapter 9 1

Section 9.5. Testing the Difference Between Two Variances. Bluman, Chapter 9 1 Section 9.5 Testing the Difference Between Two Variances Bluman, Chapter 9 1 This the last day the class meets before spring break starts. Please make sure to be present for the test or make appropriate

More information

SAS/STAT 15.1 User s Guide The SEQDESIGN Procedure

SAS/STAT 15.1 User s Guide The SEQDESIGN Procedure SAS/STAT 15.1 User s Guide The SEQDESIGN Procedure This document is an individual chapter from SAS/STAT 15.1 User s Guide. The correct bibliographic citation for this manual is as follows: SAS Institute

More information

MBA 605, Business Analytics Donald D. Conant, Ph.D. Master of Business Administration

MBA 605, Business Analytics Donald D. Conant, Ph.D. Master of Business Administration t-distribution Summary MBA 605, Business Analytics Donald D. Conant, Ph.D. Types of t-tests There are several types of t-test. In this course we discuss three. The single-sample t-test The two-sample t-test

More information

Sampling distribution of t. 2. Sampling distribution of t. 3. Example: Gas mileage investigation. II. Inferential Statistics (8) t =

Sampling distribution of t. 2. Sampling distribution of t. 3. Example: Gas mileage investigation. II. Inferential Statistics (8) t = 2. The distribution of t values that would be obtained if a value of t were calculated for each sample mean for all possible random of a given size from a population _ t ratio: (X - µ hyp ) t s x The result

More information

CHAPTER 10. Regression and Correlation

CHAPTER 10. Regression and Correlation CHAPTER 10 Regression and Correlation In this Chapter we assess the strength of the linear relationship between two continuous variables. If a significant linear relationship is found, the next step would

More information

HSC Chemistry 7.0 User's Guide

HSC Chemistry 7.0 User's Guide HSC Chemistry 7.0 47-1 HSC Chemistry 7.0 User's Guide Sim Flowsheet Module Experimental Mode Pertti Lamberg Outotec Research Oy Information Service P.O. Box 69 FIN - 28101 PORI, FINLAND Fax: +358-20 -

More information

Tutorial 1: Power and Sample Size for the One-sample t-test. Acknowledgements:

Tutorial 1: Power and Sample Size for the One-sample t-test. Acknowledgements: Tutorial 1: Power and Sample Size for the One-sample t-test Anna E. Barón, Keith E. Muller, Sarah M. Kreidler, and Deborah H. Glueck Acknowledgements: The project was supported in large part by the National

More information

Statistics for Managers using Microsoft Excel 6 th Edition

Statistics for Managers using Microsoft Excel 6 th Edition Statistics for Managers using Microsoft Excel 6 th Edition Chapter 13 Simple Linear Regression 13-1 Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value of

More information

Ratio of Polynomials Fit One Variable

Ratio of Polynomials Fit One Variable Chapter 375 Ratio of Polynomials Fit One Variable Introduction This program fits a model that is the ratio of two polynomials of up to fifth order. Examples of this type of model are: and Y = A0 + A1 X

More information

7.2 One-Sample Correlation ( = a) Introduction. Correlation analysis measures the strength and direction of association between

7.2 One-Sample Correlation ( = a) Introduction. Correlation analysis measures the strength and direction of association between 7.2 One-Sample Correlation ( = a) Introduction Correlation analysis measures the strength and direction of association between variables. In this chapter we will test whether the population correlation

More information

NINE CHOICE SERIAL REACTION TIME TASK

NINE CHOICE SERIAL REACTION TIME TASK instrumentation and software for research NINE CHOICE SERIAL REACTION TIME TASK MED-STATE NOTATION PROCEDURE SOF-700RA-8 USER S MANUAL DOC-025 Rev. 1.3 Copyright 2013 All Rights Reserved MED Associates

More information

Regression Analysis. Table Relationship between muscle contractile force (mj) and stimulus intensity (mv).

Regression Analysis. Table Relationship between muscle contractile force (mj) and stimulus intensity (mv). Regression Analysis Two variables may be related in such a way that the magnitude of one, the dependent variable, is assumed to be a function of the magnitude of the second, the independent variable; however,

More information

Urban Canopy Tool User Guide `bo`

Urban Canopy Tool User Guide `bo` Urban Canopy Tool User Guide `bo` ADMS Urban Canopy Tool User Guide Version 2.0 June 2014 Cambridge Environmental Research Consultants Ltd. 3, King s Parade Cambridge CB2 1SJ UK Telephone: +44 (0)1223

More information

Logistic Regression Analysis

Logistic Regression Analysis Logistic Regression Analysis Predicting whether an event will or will not occur, as well as identifying the variables useful in making the prediction, is important in most academic disciplines as well

More information

Logistic Regression Models to Integrate Actuarial and Psychological Risk Factors For predicting 5- and 10-Year Sexual and Violent Recidivism Rates

Logistic Regression Models to Integrate Actuarial and Psychological Risk Factors For predicting 5- and 10-Year Sexual and Violent Recidivism Rates Logistic Regression Models to Integrate Actuarial and Psychological Risk Factors For predicting 5- and 10-Year Sexual and Violent Recidivism Rates WI-ATSA June 2-3, 2016 Overview Brief description of logistic

More information

DISPLAYING THE POISSON REGRESSION ANALYSIS

DISPLAYING THE POISSON REGRESSION ANALYSIS Chapter 17 Poisson Regression Chapter Table of Contents DISPLAYING THE POISSON REGRESSION ANALYSIS...264 ModelInformation...269 SummaryofFit...269 AnalysisofDeviance...269 TypeIII(Wald)Tests...269 MODIFYING

More information

Paper: ST-161. Techniques for Evidence-Based Decision Making Using SAS Ian Stockwell, The Hilltop UMBC, Baltimore, MD

Paper: ST-161. Techniques for Evidence-Based Decision Making Using SAS Ian Stockwell, The Hilltop UMBC, Baltimore, MD Paper: ST-161 Techniques for Evidence-Based Decision Making Using SAS Ian Stockwell, The Hilltop Institute @ UMBC, Baltimore, MD ABSTRACT SAS has many tools that can be used for data analysis. From Freqs

More information

Hypothesis Testing for Var-Cov Components

Hypothesis Testing for Var-Cov Components Hypothesis Testing for Var-Cov Components When the specification of coefficients as fixed, random or non-randomly varying is considered, a null hypothesis of the form is considered, where Additional output

More information

Practice of SAS Logistic Regression on Binary Pharmacodynamic Data Problems and Solutions. Alan J Xiao, Cognigen Corporation, Buffalo NY

Practice of SAS Logistic Regression on Binary Pharmacodynamic Data Problems and Solutions. Alan J Xiao, Cognigen Corporation, Buffalo NY Practice of SAS Logistic Regression on Binary Pharmacodynamic Data Problems and Solutions Alan J Xiao, Cognigen Corporation, Buffalo NY ABSTRACT Logistic regression has been widely applied to population

More information

y response variable x 1, x 2,, x k -- a set of explanatory variables

y response variable x 1, x 2,, x k -- a set of explanatory variables 11. Multiple Regression and Correlation y response variable x 1, x 2,, x k -- a set of explanatory variables In this chapter, all variables are assumed to be quantitative. Chapters 12-14 show how to incorporate

More information

Lab 1 Uniform Motion - Graphing and Analyzing Motion

Lab 1 Uniform Motion - Graphing and Analyzing Motion Lab 1 Uniform Motion - Graphing and Analyzing Motion Objectives: < To observe the distance-time relation for motion at constant velocity. < To make a straight line fit to the distance-time data. < To interpret

More information

Tutorial 3: Power and Sample Size for the Two-sample t-test with Equal Variances. Acknowledgements:

Tutorial 3: Power and Sample Size for the Two-sample t-test with Equal Variances. Acknowledgements: Tutorial 3: Power and Sample Size for the Two-sample t-test with Equal Variances Anna E. Barón, Keith E. Muller, Sarah M. Kreidler, and Deborah H. Glueck Acknowledgements: The project was supported in

More information

Six Sigma Black Belt Study Guides

Six Sigma Black Belt Study Guides Six Sigma Black Belt Study Guides 1 www.pmtutor.org Powered by POeT Solvers Limited. Analyze Correlation and Regression Analysis 2 www.pmtutor.org Powered by POeT Solvers Limited. Variables and relationships

More information

Nonlinear Regression. Summary. Sample StatFolio: nonlinear reg.sgp

Nonlinear Regression. Summary. Sample StatFolio: nonlinear reg.sgp Nonlinear Regression Summary... 1 Analysis Summary... 4 Plot of Fitted Model... 6 Response Surface Plots... 7 Analysis Options... 10 Reports... 11 Correlation Matrix... 12 Observed versus Predicted...

More information

1 A Review of Correlation and Regression

1 A Review of Correlation and Regression 1 A Review of Correlation and Regression SW, Chapter 12 Suppose we select n = 10 persons from the population of college seniors who plan to take the MCAT exam. Each takes the test, is coached, and then

More information

Randomized Complete Block Designs

Randomized Complete Block Designs Randomized Complete Block Designs David Allen University of Kentucky February 23, 2016 1 Randomized Complete Block Design There are many situations where it is impossible to use a completely randomized

More information

16.3 One-Way ANOVA: The Procedure

16.3 One-Way ANOVA: The Procedure 16.3 One-Way ANOVA: The Procedure Tom Lewis Fall Term 2009 Tom Lewis () 16.3 One-Way ANOVA: The Procedure Fall Term 2009 1 / 10 Outline 1 The background 2 Computing formulas 3 The ANOVA Identity 4 Tom

More information

Relating Graph to Matlab

Relating Graph to Matlab There are two related course documents on the web Probability and Statistics Review -should be read by people without statistics background and it is helpful as a review for those with prior statistics

More information

The simple linear regression model discussed in Chapter 13 was written as

The simple linear regression model discussed in Chapter 13 was written as 1519T_c14 03/27/2006 07:28 AM Page 614 Chapter Jose Luis Pelaez Inc/Blend Images/Getty Images, Inc./Getty Images, Inc. 14 Multiple Regression 14.1 Multiple Regression Analysis 14.2 Assumptions of the Multiple

More information

Two Correlated Proportions Non- Inferiority, Superiority, and Equivalence Tests

Two Correlated Proportions Non- Inferiority, Superiority, and Equivalence Tests Chapter 59 Two Correlated Proportions on- Inferiority, Superiority, and Equivalence Tests Introduction This chapter documents three closely related procedures: non-inferiority tests, superiority (by a

More information

How To: Deal with Heteroscedasticity Using STATGRAPHICS Centurion

How To: Deal with Heteroscedasticity Using STATGRAPHICS Centurion How To: Deal with Heteroscedasticity Using STATGRAPHICS Centurion by Dr. Neil W. Polhemus July 28, 2005 Introduction When fitting statistical models, it is usually assumed that the error variance is the

More information

Hotelling s One- Sample T2

Hotelling s One- Sample T2 Chapter 405 Hotelling s One- Sample T2 Introduction The one-sample Hotelling s T2 is the multivariate extension of the common one-sample or paired Student s t-test. In a one-sample t-test, the mean response

More information

Basic Business Statistics 6 th Edition

Basic Business Statistics 6 th Edition Basic Business Statistics 6 th Edition Chapter 12 Simple Linear Regression Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value of a dependent variable based

More information

ABSTRACT. Page 1 of 9

ABSTRACT. Page 1 of 9 Gage R. & R. vs. ANOVA Dilip A. Shah E = mc 3 Solutions 197 Great Oaks Trail # 130 Wadsworth, Ohio 44281-8215 Tel: 330-328-4400 Fax: 330-336-3974 E-mail: emc3solu@aol.com ABSTRACT Quality and metrology

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching.html Suhasini Subba Rao Motivations for the ANOVA We defined the F-distribution, this is mainly used in

More information

Hypothesis Testing. File: /General/MLAB-Text/Papers/hyptest.tex

Hypothesis Testing. File: /General/MLAB-Text/Papers/hyptest.tex File: /General/MLAB-Text/Papers/hyptest.tex Hypothesis Testing Gary D. Knott, Ph.D. Civilized Software, Inc. 12109 Heritage Park Circle Silver Spring, MD 20906 USA Tel. (301) 962-3711 Email: csi@civilized.com

More information

Tutorial 5: Power and Sample Size for One-way Analysis of Variance (ANOVA) with Equal Variances Across Groups. Acknowledgements:

Tutorial 5: Power and Sample Size for One-way Analysis of Variance (ANOVA) with Equal Variances Across Groups. Acknowledgements: Tutorial 5: Power and Sample Size for One-way Analysis of Variance (ANOVA) with Equal Variances Across Groups Anna E. Barón, Keith E. Muller, Sarah M. Kreidler, and Deborah H. Glueck Acknowledgements:

More information

Chapter Seven: Multi-Sample Methods 1/52

Chapter Seven: Multi-Sample Methods 1/52 Chapter Seven: Multi-Sample Methods 1/52 7.1 Introduction 2/52 Introduction The independent samples t test and the independent samples Z test for a difference between proportions are designed to analyze

More information

Electric Fields and Equipotentials

Electric Fields and Equipotentials OBJECTIVE Electric Fields and Equipotentials To study and describe the two-dimensional electric field. To map the location of the equipotential surfaces around charged electrodes. To study the relationship

More information

Integration of SAS and NONMEM for Automation of Population Pharmacokinetic/Pharmacodynamic Modeling on UNIX systems

Integration of SAS and NONMEM for Automation of Population Pharmacokinetic/Pharmacodynamic Modeling on UNIX systems Integration of SAS and NONMEM for Automation of Population Pharmacokinetic/Pharmacodynamic Modeling on UNIX systems Alan J Xiao, Cognigen Corporation, Buffalo NY Jill B Fiedler-Kelly, Cognigen Corporation,

More information

Example name. Subgroups analysis, Regression. Synopsis

Example name. Subgroups analysis, Regression. Synopsis 589 Example name Effect size Analysis type Level BCG Risk ratio Subgroups analysis, Regression Advanced Synopsis This analysis includes studies where patients were randomized to receive either a vaccine

More information

LOOKING FOR RELATIONSHIPS

LOOKING FOR RELATIONSHIPS LOOKING FOR RELATIONSHIPS One of most common types of investigation we do is to look for relationships between variables. Variables may be nominal (categorical), for example looking at the effect of an

More information

The Difference in Proportions Test

The Difference in Proportions Test Overview The Difference in Proportions Test Dr Tom Ilvento Department of Food and Resource Economics A Difference of Proportions test is based on large sample only Same strategy as for the mean We calculate

More information

Tutorial 2: Power and Sample Size for the Paired Sample t-test

Tutorial 2: Power and Sample Size for the Paired Sample t-test Tutorial 2: Power and Sample Size for the Paired Sample t-test Preface Power is the probability that a study will reject the null hypothesis. The estimated probability is a function of sample size, variability,

More information

Business Statistics. Chapter 14 Introduction to Linear Regression and Correlation Analysis QMIS 220. Dr. Mohammad Zainal

Business Statistics. Chapter 14 Introduction to Linear Regression and Correlation Analysis QMIS 220. Dr. Mohammad Zainal Department of Quantitative Methods & Information Systems Business Statistics Chapter 14 Introduction to Linear Regression and Correlation Analysis QMIS 220 Dr. Mohammad Zainal Chapter Goals After completing

More information

Population Variance. Concepts from previous lectures. HUMBEHV 3HB3 one-sample t-tests. Week 8

Population Variance. Concepts from previous lectures. HUMBEHV 3HB3 one-sample t-tests. Week 8 Concepts from previous lectures HUMBEHV 3HB3 one-sample t-tests Week 8 Prof. Patrick Bennett sampling distributions - sampling error - standard error of the mean - degrees-of-freedom Null and alternative/research

More information

How do we compare the relative performance among competing models?

How do we compare the relative performance among competing models? How do we compare the relative performance among competing models? 1 Comparing Data Mining Methods Frequent problem: we want to know which of the two learning techniques is better How to reliably say Model

More information

Location Intelligence Infrastructure Asset Management. Confirm. Confirm Mapping Link to ArcMap Version v18.00b.am

Location Intelligence Infrastructure Asset Management. Confirm. Confirm Mapping Link to ArcMap Version v18.00b.am Location Intelligence Infrastructure Asset Management Confirm Confirm Mapping Link to ArcMap Version v18.00b.am Information in this document is subject to change without notice and does not represent a

More information

Financial Econometrics Review Session Notes 3

Financial Econometrics Review Session Notes 3 Financial Econometrics Review Session Notes 3 Nina Boyarchenko January 22, 2010 Contents 1 k-step ahead forecast and forecast errors 2 1.1 Example 1: stationary series.......................... 2 1.2 Example

More information

Correlation Analysis

Correlation Analysis Simple Regression Correlation Analysis Correlation analysis is used to measure strength of the association (linear relationship) between two variables Correlation is only concerned with strength of the

More information

A GUI FOR EVOLVE ZAMS

A GUI FOR EVOLVE ZAMS A GUI FOR EVOLVE ZAMS D. R. Schlegel Computer Science Department Here the early work on a new user interface for the Evolve ZAMS stellar evolution code is presented. The initial goal of this project is

More information

Chapter 7. Inference for Distributions. Introduction to the Practice of STATISTICS SEVENTH. Moore / McCabe / Craig. Lecture Presentation Slides

Chapter 7. Inference for Distributions. Introduction to the Practice of STATISTICS SEVENTH. Moore / McCabe / Craig. Lecture Presentation Slides Chapter 7 Inference for Distributions Introduction to the Practice of STATISTICS SEVENTH EDITION Moore / McCabe / Craig Lecture Presentation Slides Chapter 7 Inference for Distributions 7.1 Inference for

More information

Item Reliability Analysis

Item Reliability Analysis Item Reliability Analysis Revised: 10/11/2017 Summary... 1 Data Input... 4 Analysis Options... 5 Tables and Graphs... 5 Analysis Summary... 6 Matrix Plot... 8 Alpha Plot... 10 Correlation Matrix... 11

More information

DISTRIBUTIONS USED IN STATISTICAL WORK

DISTRIBUTIONS USED IN STATISTICAL WORK DISTRIBUTIONS USED IN STATISTICAL WORK In one of the classic introductory statistics books used in Education and Psychology (Glass and Stanley, 1970, Prentice-Hall) there was an excellent chapter on different

More information

Factorial Independent Samples ANOVA

Factorial Independent Samples ANOVA Factorial Independent Samples ANOVA Liljenquist, Zhong and Galinsky (2010) found that people were more charitable when they were in a clean smelling room than in a neutral smelling room. Based on that

More information

User Manuel. EurotaxForecast. Version Latest changes ( )

User Manuel. EurotaxForecast. Version Latest changes ( ) User Manuel EurotaxForecast Version 1.23.0771- Latest changes (19.07.2003) Contents Preface 5 Welcome to Eurotax Forecast...5 Using this manual 6 How to use this manual?...6 Program overview 7 General

More information

Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras. Lecture 11 t- Tests

Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras. Lecture 11 t- Tests Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras Lecture 11 t- Tests Welcome to the course on Biostatistics and Design of Experiments.

More information

Esterification in CSTRs in Series with Aspen Plus V8.0

Esterification in CSTRs in Series with Aspen Plus V8.0 Esterification in CSTRs in Series with Aspen Plus V8.0 1. Lesson Objectives Use Aspen Plus to determine whether a given reaction is technically feasible using three continuous stirred tank reactors in

More information

Two-Sample Inferential Statistics

Two-Sample Inferential Statistics The t Test for Two Independent Samples 1 Two-Sample Inferential Statistics In an experiment there are two or more conditions One condition is often called the control condition in which the treatment is

More information

Application of Ghosh, Grizzle and Sen s Nonparametric Methods in. Longitudinal Studies Using SAS PROC GLM

Application of Ghosh, Grizzle and Sen s Nonparametric Methods in. Longitudinal Studies Using SAS PROC GLM Application of Ghosh, Grizzle and Sen s Nonparametric Methods in Longitudinal Studies Using SAS PROC GLM Chan Zeng and Gary O. Zerbe Department of Preventive Medicine and Biometrics University of Colorado

More information

Chapter 23. Inference About Means

Chapter 23. Inference About Means Chapter 23 Inference About Means 1 /57 Homework p554 2, 4, 9, 10, 13, 15, 17, 33, 34 2 /57 Objective Students test null and alternate hypotheses about a population mean. 3 /57 Here We Go Again Now that

More information

Tests for Two Coefficient Alphas

Tests for Two Coefficient Alphas Chapter 80 Tests for Two Coefficient Alphas Introduction Coefficient alpha, or Cronbach s alpha, is a popular measure of the reliability of a scale consisting of k parts. The k parts often represent k

More information

Psychology 282 Lecture #4 Outline Inferences in SLR

Psychology 282 Lecture #4 Outline Inferences in SLR Psychology 282 Lecture #4 Outline Inferences in SLR Assumptions To this point we have not had to make any distributional assumptions. Principle of least squares requires no assumptions. Can use correlations

More information

Statistical Inference: Estimation and Confidence Intervals Hypothesis Testing

Statistical Inference: Estimation and Confidence Intervals Hypothesis Testing Statistical Inference: Estimation and Confidence Intervals Hypothesis Testing 1 In most statistics problems, we assume that the data have been generated from some unknown probability distribution. We desire

More information

Sampling Distributions: Central Limit Theorem

Sampling Distributions: Central Limit Theorem Review for Exam 2 Sampling Distributions: Central Limit Theorem Conceptually, we can break up the theorem into three parts: 1. The mean (µ M ) of a population of sample means (M) is equal to the mean (µ)

More information

Preparing Spatial Data

Preparing Spatial Data 13 CHAPTER 2 Preparing Spatial Data Assessing Your Spatial Data Needs 13 Assessing Your Attribute Data 13 Determining Your Spatial Data Requirements 14 Locating a Source of Spatial Data 14 Performing Common

More information

This is a multiple choice and short answer practice exam. It does not count towards your grade. You may use the tables in your book.

This is a multiple choice and short answer practice exam. It does not count towards your grade. You may use the tables in your book. NAME (Please Print): HONOR PLEDGE (Please Sign): statistics 101 Practice Final Key This is a multiple choice and short answer practice exam. It does not count towards your grade. You may use the tables

More information

Paper Equivalence Tests. Fei Wang and John Amrhein, McDougall Scientific Ltd.

Paper Equivalence Tests. Fei Wang and John Amrhein, McDougall Scientific Ltd. Paper 11683-2016 Equivalence Tests Fei Wang and John Amrhein, McDougall Scientific Ltd. ABSTRACT Motivated by the frequent need for equivalence tests in clinical trials, this paper provides insights into

More information

Mathematical Notation Math Introduction to Applied Statistics

Mathematical Notation Math Introduction to Applied Statistics Mathematical Notation Math 113 - Introduction to Applied Statistics Name : Use Word or WordPerfect to recreate the following documents. Each article is worth 10 points and should be emailed to the instructor

More information

A Scientific Model for Free Fall.

A Scientific Model for Free Fall. A Scientific Model for Free Fall. I. Overview. This lab explores the framework of the scientific method. The phenomenon studied is the free fall of an object released from rest at a height H from the ground.

More information

Taguchi Method and Robust Design: Tutorial and Guideline

Taguchi Method and Robust Design: Tutorial and Guideline Taguchi Method and Robust Design: Tutorial and Guideline CONTENT 1. Introduction 2. Microsoft Excel: graphing 3. Microsoft Excel: Regression 4. Microsoft Excel: Variance analysis 5. Robust Design: An Example

More information

Chapter 16. Simple Linear Regression and Correlation

Chapter 16. Simple Linear Regression and Correlation Chapter 16 Simple Linear Regression and Correlation 16.1 Regression Analysis Our problem objective is to analyze the relationship between interval variables; regression analysis is the first tool we will

More information

PDF-4+ Tools and Searches

PDF-4+ Tools and Searches PDF-4+ Tools and Searches PDF-4+ 2018 The PDF-4+ 2018 database is powered by our integrated search display software. PDF-4+ 2018 boasts 72 search selections coupled with 125 display fields resulting in

More information

Quality Measures Green Light Report Online Management Tool. Self Guided Tutorial

Quality Measures Green Light Report Online Management Tool. Self Guided Tutorial Quality Measures Green Light Report Online Management Tool Self Guided Tutorial 1 Tutorial Contents Overview Access the QM Green Light Report Review the QM Green Light Report Tips for Success Contact PointRight

More information

The entire data set consists of n = 32 widgets, 8 of which were made from each of q = 4 different materials.

The entire data set consists of n = 32 widgets, 8 of which were made from each of q = 4 different materials. One-Way ANOVA Summary The One-Way ANOVA procedure is designed to construct a statistical model describing the impact of a single categorical factor X on a dependent variable Y. Tests are run to determine

More information

Measuring relationships among multiple responses

Measuring relationships among multiple responses Measuring relationships among multiple responses Linear association (correlation, relatedness, shared information) between pair-wise responses is an important property used in almost all multivariate analyses.

More information

Questions 3.83, 6.11, 6.12, 6.17, 6.25, 6.29, 6.33, 6.35, 6.50, 6.51, 6.53, 6.55, 6.59, 6.60, 6.65, 6.69, 6.70, 6.77, 6.79, 6.89, 6.

Questions 3.83, 6.11, 6.12, 6.17, 6.25, 6.29, 6.33, 6.35, 6.50, 6.51, 6.53, 6.55, 6.59, 6.60, 6.65, 6.69, 6.70, 6.77, 6.79, 6.89, 6. Chapter 7 Reading 7.1, 7.2 Questions 3.83, 6.11, 6.12, 6.17, 6.25, 6.29, 6.33, 6.35, 6.50, 6.51, 6.53, 6.55, 6.59, 6.60, 6.65, 6.69, 6.70, 6.77, 6.79, 6.89, 6.112 Introduction In Chapter 5 and 6, we emphasized

More information

Chapter 1 Linear Equations. 1.1 Systems of Linear Equations

Chapter 1 Linear Equations. 1.1 Systems of Linear Equations Chapter Linear Equations. Systems of Linear Equations A linear equation in the n variables x, x 2,..., x n is one that can be expressed in the form a x + a 2 x 2 + + a n x n = b where a, a 2,..., a n and

More information

PDF-4+ Tools and Searches

PDF-4+ Tools and Searches PDF-4+ Tools and Searches PDF-4+ 2019 The PDF-4+ 2019 database is powered by our integrated search display software. PDF-4+ 2019 boasts 74 search selections coupled with 126 display fields resulting in

More information

Any of 27 linear and nonlinear models may be fit. The output parallels that of the Simple Regression procedure.

Any of 27 linear and nonlinear models may be fit. The output parallels that of the Simple Regression procedure. STATGRAPHICS Rev. 9/13/213 Calibration Models Summary... 1 Data Input... 3 Analysis Summary... 5 Analysis Options... 7 Plot of Fitted Model... 9 Predicted Values... 1 Confidence Intervals... 11 Observed

More information

An inferential procedure to use sample data to understand a population Procedures

An inferential procedure to use sample data to understand a population Procedures Hypothesis Test An inferential procedure to use sample data to understand a population Procedures Hypotheses, the alpha value, the critical region (z-scores), statistics, conclusion Two types of errors

More information

One-way ANOVA (Single-Factor CRD)

One-way ANOVA (Single-Factor CRD) One-way ANOVA (Single-Factor CRD) STAT:5201 Week 3: Lecture 3 1 / 23 One-way ANOVA We have already described a completed randomized design (CRD) where treatments are randomly assigned to EUs. There is

More information

Create Satellite Image, Draw Maps

Create Satellite Image, Draw Maps Create Satellite Image, Draw Maps 1. The goal Using Google Earth, we want to create and import a background file into our Adviser program. From there, we will be creating paddock boundaries. The accuracy

More information

df=degrees of freedom = n - 1

df=degrees of freedom = n - 1 One sample t-test test of the mean Assumptions: Independent, random samples Approximately normal distribution (from intro class: σ is unknown, need to calculate and use s (sample standard deviation)) Hypotheses:

More information

Chapter 9 Inferences from Two Samples

Chapter 9 Inferences from Two Samples Chapter 9 Inferences from Two Samples 9-1 Review and Preview 9-2 Two Proportions 9-3 Two Means: Independent Samples 9-4 Two Dependent Samples (Matched Pairs) 9-5 Two Variances or Standard Deviations Review

More information

Non-parametric methods

Non-parametric methods Eastern Mediterranean University Faculty of Medicine Biostatistics course Non-parametric methods March 4&7, 2016 Instructor: Dr. Nimet İlke Akçay (ilke.cetin@emu.edu.tr) Learning Objectives 1. Distinguish

More information

Software BioScout-Calibrator June 2013

Software BioScout-Calibrator June 2013 SARAD GmbH BioScout -Calibrator 1 Manual Software BioScout-Calibrator June 2013 SARAD GmbH Tel.: ++49 (0)351 / 6580712 Wiesbadener Straße 10 FAX: ++49 (0)351 / 6580718 D-01159 Dresden email: support@sarad.de

More information