Analyzing and Interpreting Continuous Data Using JMP

Size: px
Start display at page:

Download "Analyzing and Interpreting Continuous Data Using JMP"

Transcription

1 Analyzing and Interpreting Continuous Data Using JMP A Step-by-Step Guide José G. Ramírez, Ph.D. Brenda S. Ramírez, M.S. Corrections to first printing.

2 The correct bibliographic citation for this manual is as follows: Ramírez, José G., and Brenda S. Ramírez Analyzing and Interpreting Continuous Data Using JMP : A Step-by-Step Guide. Cary, NC: SAS Institute Inc. Analyzing and Interpreting Continuous Data Using JMP : A Step-by-Step Guide Copyright 2009, SAS Institute Inc., Cary, NC, USA ISBN All rights reserved. Produced in the United States of America. For a hard-copy book: No part of this publication may be reproduced, stored in a retrieval system, or transmitted, in any form or by any means, electronic, mechanical, photocopying, or otherwise, without the prior written permission of the publisher, SAS Institute Inc. For a Web download or e-book: Your use of this publication shall be governed by the terms established by the vendor at the time you acquire this publication. U.S. Government Restricted Rights Notice: Use, duplication, or disclosure of this software and related documentation by the U.S. government is subject to the Agreement with SAS Institute and the restrictions set forth in FAR , Commercial Computer Software-Restricted Rights (June 1987). SAS Institute Inc., SAS Campus Drive, Cary, North Carolina st printing, August nd printing, October 2010 SAS Publishing provides a complete selection of books and electronic products to help customers use SAS software to its fullest potential. For more information about our e-books, e-learning products, CDs, and hardcopy books, visit the SAS Publishing Web site at support.sas.com/publishing or call SAS and all other SAS Institute Inc. product or service names are registered trademarks or trademarks of SAS Institute Inc. in the USA and other countries. indicates USA registration. Other brand and product names are registered trademarks or trademarks of their respective companies.

3 Chapter 2: Overview of Statistical Concepts and Ideas 55 Figure 2.17b Individual Measurements Chart for Resistance Data In this section we wanted to provide a quick introduction to some useful graphs and summary statistics. In Chapter 3 we will show how to use them to characterize the measured performance of a material, process, or product, and in subsequent chapters we will continue to use them, as appropriate. Every data analysis and formal tests of significance should include a thorough exploratory data analysis (EDA) using the summary statistics and graphical displays discussed in this section. Apart from helping us identify unusual or outlying observations, EDA can help answer questions of interest, and it is a great springboard for suggesting hypotheses that can be explored with further data analysis techniques. 2.5 Quantifying Uncertainty: Common Probability Distributions In Section 2.3 we discussed the need for considering the different aspects that make for a good study. These include the following: clearly defining the reasons for our study being detailed about our population of interest from which a sample will be selected

4 Chapter 2: Overview of Statistical Concepts and Ideas 63 Figure 2.19a Selecting the Formula Editor and the Normal Distribution Once the Normal Distribution() is in the Formula Editor just click in the red box and type Once you click OK the answer appears as or 0.66% (Figure 2.19b). Of course, you can just type the expression ( ) / inside the red box and JMP will calculate the Z-score before computing the area. Figure 2.19b Normal Distribution for and Probability Result

5 68 Analyzing and Interpreting Continuous Data Using JMP: A Step-by-Step Guide Tolerance Interval to Contain a Given Proportion p of the Sampled Population A tolerance interval to contain a given proportion p of the sampled population, with a given confidence (1-α) is given by this formula: X ± g s (1 α / 2, pn, ) This is similar to the confidence interval for the mean, but the t-quantile is replaced by the function g, which is a function of the degree of confidence, (1 α), the proportion, p, that the interval is going to contain, and the sample size n. In other words, the marginof-error M.O.E. = g s. The function g (1 α/2, p, n) (1 α/2, p, n) is somewhat complicated for hand calculations (although tables are available), so it is better to let JMP evaluate it for us. Prediction and tolerance intervals provide ways to characterize the output performance of a process by giving us a window where future measured performance values will fall, or a window where a large proportion (e.g., 95%, 99%) of the measured performance values will fall. Luckily for us the calculations for these statistical intervals are readily available in JMP through the Distribution Platform. In Chapter 3, we will show you how to calculate these intervals in the context of a problem. However, to give you a feel for them Table 2.6a shows four statistical intervals for the Resistance data. Table 2.6a Statistical Intervals for Resistance Data Interval Type Confidence interval for µ, center of the sampled population Confidence interval for σ, spread around center of the population Prediction interval for one future observation Tolerance interval containing 99% of population Confidence Level Lower Limit (Ohms) Upper Limit (Ohms) 95% % % %

6 Chapter 3: Characterizing Measured Performance 89 the one given by X ± 0.67s =[ ; ] = [ 1.972; 0.007] to see that it is close to the 5-Number summary estimate. Figure 3.1a Basic Descriptive Statistics The process capability index, C pk, is a measure of process capability that standardizes the distance from the mean of the process to the closest specification limit, by three times an estimate of the standard deviation (variation). In other words, the C pk is one-third the distance to the nearest specification (DNS). The farther the process average is from the specifications limits in standard deviation units, the larger the DNS and the C pk, and the more elbow room the process has to cope with upsets. For a given set of specifications, think of the specification window (LSL USL) as the width of a garage door and the process output (its distribution) as a car we are trying to park. A C pk = 1 ( DNS=3) indicates that the width of the car is almost the width of the garage; any deviation from the center will scratch the sides of the car! It is desirable then, to have large values of C pk. A C pk 1.5 is considered good, in the sense that the car has enough room on each side and it will not get scratched even if we deviate a little from the center of the garage. In other words, a C pk 1.5 enables the process enough room to shift before making lots of out-ofspecification material.

7 Chapter 4: Comparing Measured Performance to a Standard 157 standard, or greater than the standard. In statistical jargon these types of comparisons (equal, greater than, or less than) can be described using a null (H 0 ) and an alternative (H 1 ) statistical hypothesis that contain statements about the average performance (population mean, µ) as compared with the standard k. Statistics Note 4.1: Tests of significance start with a null hypothesis (H 0 ) that represents our current beliefs that we assume to be true but cannot be proven, and an alternative hypothesis (H 1 ) that is a statement of what the statistical test of significance is set out to prove by rejecting the null hypothesis. In what follows we will label the null hypothesis Assume, and the alternative hypothesis Prove to remind us of what the test of significance sets out to do. The three ways that we can write the null (H 0 ) and alternative (H 1 ) hypothesis are as follows: 1. The average performance is different from the standard. H 0 : Average Performance (µ) = Standard (k) H 1 : Average Performance (µ) Standard (k) (Assume) (Prove) This is a two-sided alternative since we do not care about the average performance being less or greater than the standard, just that it is different from it. The null hypothesis (H 0 ) assumes that the average performance (population mean, µ) is equal to our standard value, k. On the other hand, the alternative hypothesis (H 1 ) states that our average performance is not equal to our standard value, k. If our test of significance favors the alternative, then we have proven with statistical rigor that our average performance is different from the standard, k, without making any statements regarding the direction of departure from k. Two-sided alternative hypotheses do not make any claims about the direction of departure of the mean from our standard. 2. The average performance is greater than the standard. H 0 : Average Performance (µ) Standard (k) H 1 : Average Performance (µ) > Standard (k) (Assume) (Prove) In this one-sided alternative hypothesis, H 1, we are interested in knowing whether our average performance is greater than k. Here, a rejection of the null hypothesis H 0 will show, with statistical rigor, that the average

8 Chapter 4: Comparing Measured Performance to a Standard 197 Are the thickness averages homogeneous? As we discussed in Chapter 2, it is important to verify the homogeneity assumption because our statistical analysis of the data depends on this assumption (Wheeler 2005). Can we treat the thickness average data as coming from a homogeneous population? We can use an individuals and moving range chart (Graph > Control Chart > IR) to check the homogeneity of the thickness averages. The moving range for observation 35, Figure 4.22b, is outside the upper control limit (UCL), giving us a signal that the data may not be homogeneous. When we look at the individuals chart we notice that the last four averages, rows 35 to 38, are below the lower control limit (LCL) of Å, which indicates that these averages are different from the rest. These four thickness averages correspond to the suspect run 10. In other words, the thickness averages are not homogenous, and the culprit seems to be run 10. Figure 4.22b Individual Measurements and Moving Range Chart of Oxide Thickness Data

9 Chapter 5: Comparing Two Measured Performances 225 any statements regarding the direction of departure in the two means. This is the most popular choice for many applications. A two-sided alternative hypothesis does not make any claims about the direction of departure of the mean from our standard. Sometimes, we want to prove with statistical rigor that the two population means are similar and not just assume it to be true. For example, many performance claims are stated in way to suggest that their performance is just as good as or no worse than a competitor s performance. This is equivalent to flipping the two hypotheses around and putting the equality statement in the alternative statement, H 1 : Average Performance (µ 1 ) = Average Performance (µ 2 ). This is referred to as an Equivalence Test and is discussed in Section 5.5. We assume that the average performances of the two populations are equal since we can never really prove it. 2. The Average performance of population 1 is greater than population 2 H 0 : Average Performance (µ 1 ) Average Performance (µ 2 ) (Assume) H 1 : Average Performance (µ 1 ) > Average Performance (µ 2 ) (Prove) In this one-sided alternative hypothesis, H 1, we are interested in knowing whether the average performance of population 1 is greater than the average performance of population 2. Here, a rejection of the null hypothesis, H 0, will show, with statistical rigor, that the average performance of population 1 is larger than the average performance of population 2. Proving the alternative hypothesis means that we have enough evidence (proof) to demonstrate, or establish, the validity of claims in the alternative hypothesis. The decision to write the hypothesis in this manner is application specific. For example, if we want to prove that a second entity (represented by µ 1 ) is superior in performance to the current (represented by µ 2 ) before we can use it, then we would want to set up an appropriate one-sided alternative hypothesis. For example, we might want to change suppliers for a liquid adhesive that we use in our products but only want to consider suppliers that have adhesives with a higher adhesion bond. This would lend itself to the following one-sided alternative hypothesis: H 1 : Avg. Bond Strength New Supplier > Avg. Bond Strength Existing Supplier.

10 378 Analyzing and Interpreting Continuous Data Using JMP: A Step-by-Step Guide Table 7.3 (continued) Model Term Name Description Least Squares Estimator β 1 Slope The slope is of primary interest because it represents the impact that the factor has on the response the rate of change. For every unit increase in X, Y increases (or decreases) by this many units. b = 1 n i= 1 n i= 1 ( x x) y i ( x x) i 2 i Model errors Considered to be a random variable representing the degree of error between the predicted and actual values for a given value of X. Assumed to be independent and normally distributed with mean = 0 and constant variance independent of the values of X. e = y yˆ i i i The last column in Table 7.3 provides the least squares equations for estimating the parameters in a simple linear regression model. What Is Least Squares? Least squares is an estimation method that seeks to minimize the sum of squares of the residuals (fitting errors), namely Σ(y i yt i ) 2. The best-fitting least squares line is the one that produces fitted values (yt i ) that minimize this sum of squares. Figure 7.2 shows a scatter plot of Y versus X for seven data points. In this plot the residuals (y i yt i ) are the lines projecting from each data point to the fitted line. The squares, defined by the residuals, represent the residuals squared (y i yt i ) 2, or each of the terms in the residuals sum of squares. The least squares line in the plot is the line that minimizes the sum of the areas of the seven squares. Figure 7.2 Least Squares Line and Residuals Squares

11 394 Analyzing and Interpreting Continuous Data Using JMP: A Step-by-Step Guide Figure 7.9 Einstein s Data: Constant Variance Check When the data exhibit variation that is not constant, we might see a funnel shape with the opening at either end of the x-axis. For illustrative purposes, a residuals plot showing heterogeneous variance is provided in Figure In this plot we see more spread in the studentized residuals for smaller values of the predicted response Y than we do for larger values of the predicted response Y. In other words, the variation is decreasing as Y increases. This could happen if, for example, it is harder to measure the response at lower values. There are methods, such as weighted least squares estimation, that enable us to deal with non-constant variance but they are beyond the scope of this book (for examples see Sections 9.3 and 9.4 of Draper and Smith [1998]).

12 428 Analyzing and Interpreting Continuous Data Using JMP: A Step-by-Step Guide the load cell. The slope = means that for each 1 pound increase in load, the deflection increases by inches. Figure 7.28 Enhanced Output for Load-Defl ection Calibration Study

13 Chapter 7: Characterizing Linear Relationships between Two Variables 443 A window into the future: Prediction Intervals In addition to performing a thorough check of the model assumptions, including a lack-of-fit test, it is helpful to examine the quality of the predictions from our estimated quadratic model: Deflection = *Load 3.161E 9*(Load 1575) 2 (7.15) For a linear regression model with one factor, the Fit Y by X platform can be used to visualize the prediction error. The JMP steps and output for the Einstein data were previously presented in Section These are repeated here for the load-deflection calibration study. 1. Select Analyze > Fit Y by X and highlight Deflection (in), and click Y, Response. Then select Load (lb) and click X, Factor. Finally, click OK. 2. From within the output window, click the contextual pop-up menu at the top of scatter plot and select Fit Polynomial > 2, quadratic. The quadratic line should appear in the scatter plot and a triangle with the label Polynomial Fit Degree = 2 should appear underneath the plot (Figure 7.39). 3. Click the triangle located right underneath the plot and select Confid Shaded Fit and Confid Shaded Indiv. The plot should now have two sets of bands surrounding the prediction line (Figure 7.39). Figure 7.39 Creating Shaded Confidence Bands in Fit Y by X

14 444 Analyzing and Interpreting Continuous Data Using JMP: A Step-by-Step Guide The final plot is shown in Figure 7.40 and it looks almost identical to the plot shown in Figure 7.39 without the shaded confidence intervals turned on. Why? For the calibration study, the fit of the quadratic model to our data has a very small RMSE = , which in turn results in a very small prediction error. That is why we cannot see the bands around the predicted values. Figure 7.40 Load-Defl ection Quadratic Model with Shaded Confi dence Bands

15 Chapter 7: Characterizing Linear Relationships between Two Variables 451 Load-Deflection Calibration curve Figure 7.46 shows a plot of Load as a function of Deflection. This is the calibration curve that can be used to predict the load using the measured deflection of the load cell. This curve is given by Equation 7.17a, which is one of the two solutions to the quadratic model given in equation Figure 7.46 Load-Defl ection Calibration Curve The calibration curve is (7.18) For a vehicle placed in the load-cell, equation 7.18 gives the weight corresponding to the measured deflection. Note that for a deflection of 0 equation 7.18 gives a weight of lb. In theory we expect a 0 weight object to give a 0 deflection. This is an indication that the load cell might have a downward bias represented, in this case, by the intercept. In addition to the plot we can also present a table with key deflection values and their corresponding predicted weight. Table 7.14 shows the predicted weights for a sample of deflections.

16 452 Analyzing and Interpreting Continuous Data Using JMP: A Step-by-Step Guide Table 7.14 Predicted Weights for Given Defl ections According to Equation 7.18 Deflection (in) Weight (lb) , , , , Step 7: Interpret the results and make recommendations As we always recommend, we must translate our statistical findings in terms of the context of the situation at hand, so we can make decisions based on the outcomes of the analysis. Therefore, we should always go back to the problem statement in Step 1 and our scientific hypothesis to make sure that we answered the questions that we set out to answer with our study. Recall that you are preparing for the certification by a nationally recognized standards bureau for a new high-capacity canister style load cell entering this market. In order to come up to speed, you reproduced an existing calibration standard for equipment and vehicles up to 3,000 lb, based on a historical study circa 1975 by a National Institute of Standards and Technology (NIST) scientist named Paul Pontius. This standard is based on establishing the relationship between a specified load and the corresponding deflection measured by the load cell. After running a well-designed study, you derived a calibration curve, which depicts deflection (in) as a function of the load (lb). You discovered that a simple linear regression model was not adequate for your calibration curve. While the impact to the predicted deflection was relatively small, a quadratic term was added to the model not only to get a better fit to the data, but also to improve the prediction ability. To see this, you calculated the average value of the absolute errors with Y Yt under both models. Under the linear model Average ( Y Yt )= , while under the quadratic model, Average ( Y Y t ) = This represents a one decimal place improvement in the absolute errors, and is smaller than the precision of the load cell, the pure error, = E 8 = (Figure 7.35), which is practically significant. You also established an inverse prediction equation, equation 7.18, which can be hard coded into the load cell software to obtain the weight for any vehicle or heavy equipment appropriate for this class of load cell.

Preparing Spatial Data

Preparing Spatial Data 13 CHAPTER 2 Preparing Spatial Data Assessing Your Spatial Data Needs 13 Assessing Your Attribute Data 13 Determining Your Spatial Data Requirements 14 Locating a Source of Spatial Data 14 Performing Common

More information

DISPLAYING THE POISSON REGRESSION ANALYSIS

DISPLAYING THE POISSON REGRESSION ANALYSIS Chapter 17 Poisson Regression Chapter Table of Contents DISPLAYING THE POISSON REGRESSION ANALYSIS...264 ModelInformation...269 SummaryofFit...269 AnalysisofDeviance...269 TypeIII(Wald)Tests...269 MODIFYING

More information

The SEQDESIGN Procedure

The SEQDESIGN Procedure SAS/STAT 9.2 User s Guide, Second Edition The SEQDESIGN Procedure (Book Excerpt) This document is an individual chapter from the SAS/STAT 9.2 User s Guide, Second Edition. The correct bibliographic citation

More information

Hypothesis Tests and Estimation for Population Variances. Copyright 2014 Pearson Education, Inc.

Hypothesis Tests and Estimation for Population Variances. Copyright 2014 Pearson Education, Inc. Hypothesis Tests and Estimation for Population Variances 11-1 Learning Outcomes Outcome 1. Formulate and carry out hypothesis tests for a single population variance. Outcome 2. Develop and interpret confidence

More information

SAS/STAT 13.1 User s Guide. The Four Types of Estimable Functions

SAS/STAT 13.1 User s Guide. The Four Types of Estimable Functions SAS/STAT 13.1 User s Guide The Four Types of Estimable Functions This document is an individual chapter from SAS/STAT 13.1 User s Guide. The correct bibliographic citation for the complete manual is as

More information

CHAPTER 10. Regression and Correlation

CHAPTER 10. Regression and Correlation CHAPTER 10 Regression and Correlation In this Chapter we assess the strength of the linear relationship between two continuous variables. If a significant linear relationship is found, the next step would

More information

Appendix A Summary of Tasks. Appendix Table of Contents

Appendix A Summary of Tasks. Appendix Table of Contents Appendix A Summary of Tasks Appendix Table of Contents Reporting Tasks...357 ListData...357 Tables...358 Graphical Tasks...358 BarChart...358 PieChart...359 Histogram...359 BoxPlot...360 Probability Plot...360

More information

Module 8: Linear Regression. The Applied Research Center

Module 8: Linear Regression. The Applied Research Center Module 8: Linear Regression The Applied Research Center Module 8 Overview } Purpose of Linear Regression } Scatter Diagrams } Regression Equation } Regression Results } Example Purpose } To predict scores

More information

Business Statistics. Lecture 9: Simple Regression

Business Statistics. Lecture 9: Simple Regression Business Statistics Lecture 9: Simple Regression 1 On to Model Building! Up to now, class was about descriptive and inferential statistics Numerical and graphical summaries of data Confidence intervals

More information

Chapter 27 Summary Inferences for Regression

Chapter 27 Summary Inferences for Regression Chapter 7 Summary Inferences for Regression What have we learned? We have now applied inference to regression models. Like in all inference situations, there are conditions that we must check. We can test

More information

Unit 27 One-Way Analysis of Variance

Unit 27 One-Way Analysis of Variance Unit 27 One-Way Analysis of Variance Objectives: To perform the hypothesis test in a one-way analysis of variance for comparing more than two population means Recall that a two sample t test is applied

More information

Introductory Statistics Neil A. Weiss Ninth Edition

Introductory Statistics Neil A. Weiss Ninth Edition Introductory Statistics Neil A. Weiss Ninth Edition Pearson Education Limited Edinburgh Gate Harlow Essex CM20 2JE England and Associated Companies throughout the world Visit us on the World Wide Web at:

More information

appstats27.notebook April 06, 2017

appstats27.notebook April 06, 2017 Chapter 27 Objective Students will conduct inference on regression and analyze data to write a conclusion. Inferences for Regression An Example: Body Fat and Waist Size pg 634 Our chapter example revolves

More information

Inferences for Regression

Inferences for Regression Inferences for Regression An Example: Body Fat and Waist Size Looking at the relationship between % body fat and waist size (in inches). Here is a scatterplot of our data set: Remembering Regression In

More information

SAS/STAT 15.1 User s Guide The GLMMOD Procedure

SAS/STAT 15.1 User s Guide The GLMMOD Procedure SAS/STAT 15.1 User s Guide The GLMMOD Procedure This document is an individual chapter from SAS/STAT 15.1 User s Guide. The correct bibliographic citation for this manual is as follows: SAS Institute Inc.

More information

SAS/STAT 15.1 User s Guide The SEQDESIGN Procedure

SAS/STAT 15.1 User s Guide The SEQDESIGN Procedure SAS/STAT 15.1 User s Guide The SEQDESIGN Procedure This document is an individual chapter from SAS/STAT 15.1 User s Guide. The correct bibliographic citation for this manual is as follows: SAS Institute

More information

ArcGIS 9 ArcGIS StreetMap Tutorial

ArcGIS 9 ArcGIS StreetMap Tutorial ArcGIS 9 ArcGIS StreetMap Tutorial Copyright 2001 2008 ESRI All Rights Reserved. Printed in the United States of America. The information contained in this document is the exclusive property of ESRI. This

More information

Pearson Education Limited Edinburgh Gate Harlow Essex CM20 2JE England and Associated Companies throughout the world

Pearson Education Limited Edinburgh Gate Harlow Essex CM20 2JE England and Associated Companies throughout the world Pearson Education Limited Edinburgh Gate Harlow Essex CM20 2JE England and Associated Companies throughout the world Visit us on the World Wide Web at: www.pearsoned.co.uk Pearson Education Limited 2014

More information

Warm-up Using the given data Create a scatterplot Find the regression line

Warm-up Using the given data Create a scatterplot Find the regression line Time at the lunch table Caloric intake 21.4 472 30.8 498 37.7 335 32.8 423 39.5 437 22.8 508 34.1 431 33.9 479 43.8 454 42.4 450 43.1 410 29.2 504 31.3 437 28.6 489 32.9 436 30.6 480 35.1 439 33.0 444

More information

Data Analyses in Multivariate Regression Chii-Dean Joey Lin, SDSU, San Diego, CA

Data Analyses in Multivariate Regression Chii-Dean Joey Lin, SDSU, San Diego, CA Data Analyses in Multivariate Regression Chii-Dean Joey Lin, SDSU, San Diego, CA ABSTRACT Regression analysis is one of the most used statistical methodologies. It can be used to describe or predict causal

More information

Chapter 31 The GLMMOD Procedure. Chapter Table of Contents

Chapter 31 The GLMMOD Procedure. Chapter Table of Contents Chapter 31 The GLMMOD Procedure Chapter Table of Contents OVERVIEW...1639 GETTING STARTED...1639 AOne-WayDesign...1639 SYNTAX...1644 PROCGLMMODStatement...1644 BYStatement...1646 CLASSStatement...1646

More information

review session gov 2000 gov 2000 () review session 1 / 38

review session gov 2000 gov 2000 () review session 1 / 38 review session gov 2000 gov 2000 () review session 1 / 38 Overview Random Variables and Probability Univariate Statistics Bivariate Statistics Multivariate Statistics Causal Inference gov 2000 () review

More information

SAS/STAT 12.3 User s Guide. The GLMMOD Procedure (Chapter)

SAS/STAT 12.3 User s Guide. The GLMMOD Procedure (Chapter) SAS/STAT 12.3 User s Guide The GLMMOD Procedure (Chapter) This document is an individual chapter from SAS/STAT 12.3 User s Guide. The correct bibliographic citation for the complete manual is as follows:

More information

Trendlines Simple Linear Regression Multiple Linear Regression Systematic Model Building Practical Issues

Trendlines Simple Linear Regression Multiple Linear Regression Systematic Model Building Practical Issues Trendlines Simple Linear Regression Multiple Linear Regression Systematic Model Building Practical Issues Overfitting Categorical Variables Interaction Terms Non-linear Terms Linear Logarithmic y = a +

More information

Multivariate Data Analysis Joseph F. Hair Jr. William C. Black Barry J. Babin Rolph E. Anderson Seventh Edition

Multivariate Data Analysis Joseph F. Hair Jr. William C. Black Barry J. Babin Rolph E. Anderson Seventh Edition Multivariate Data Analysis Joseph F. Hair Jr. William C. Black Barry J. Babin Rolph E. Anderson Seventh Edition Pearson Education Limited Edinburgh Gate Harlow Essex CM20 2JE England and Associated Companies

More information

Basic Statistics Exercises 66

Basic Statistics Exercises 66 Basic Statistics Exercises 66 42. Suppose we are interested in predicting a person's height from the person's length of stride (distance between footprints). The following data is recorded for a random

More information

Chapter 1. A Physics Toolkit

Chapter 1. A Physics Toolkit Chapter 1 A Physics Toolkit Chapter 1 A Physics Toolkit In this chapter you will: Use mathematical tools to measure and predict. Apply accuracy and precision when measuring. Display and evaluate data graphically.

More information

Introduction to Uncertainty and Treatment of Data

Introduction to Uncertainty and Treatment of Data Introduction to Uncertainty and Treatment of Data Introduction The purpose of this experiment is to familiarize the student with some of the instruments used in making measurements in the physics laboratory,

More information

The Model Building Process Part I: Checking Model Assumptions Best Practice (Version 1.1)

The Model Building Process Part I: Checking Model Assumptions Best Practice (Version 1.1) The Model Building Process Part I: Checking Model Assumptions Best Practice (Version 1.1) Authored by: Sarah Burke, PhD Version 1: 31 July 2017 Version 1.1: 24 October 2017 The goal of the STAT T&E COE

More information

Section 11: Quantitative analyses: Linear relationships among variables

Section 11: Quantitative analyses: Linear relationships among variables Section 11: Quantitative analyses: Linear relationships among variables Australian Catholic University 214 ALL RIGHTS RESERVED. No part of this work covered by the copyright herein may be reproduced or

More information

From Practical Data Analysis with JMP, Second Edition. Full book available for purchase here. About This Book... xiii About The Author...

From Practical Data Analysis with JMP, Second Edition. Full book available for purchase here. About This Book... xiii About The Author... From Practical Data Analysis with JMP, Second Edition. Full book available for purchase here. Contents About This Book... xiii About The Author... xxiii Chapter 1 Getting Started: Data Analysis with JMP...

More information

Ratio of Polynomials Fit One Variable

Ratio of Polynomials Fit One Variable Chapter 375 Ratio of Polynomials Fit One Variable Introduction This program fits a model that is the ratio of two polynomials of up to fifth order. Examples of this type of model are: and Y = A0 + A1 X

More information

The simple linear regression model discussed in Chapter 13 was written as

The simple linear regression model discussed in Chapter 13 was written as 1519T_c14 03/27/2006 07:28 AM Page 614 Chapter Jose Luis Pelaez Inc/Blend Images/Getty Images, Inc./Getty Images, Inc. 14 Multiple Regression 14.1 Multiple Regression Analysis 14.2 Assumptions of the Multiple

More information

Red Hot Half-Life Modeling Nuclear Decay

Red Hot Half-Life Modeling Nuclear Decay Red Hot Half-Life Modeling Nuclear Decay About this Lesson This lesson can be used in multiple places within a chemistry curriculum. It can be used with the atomic structure unit, a nuclear chemistry unit

More information

Probability and Statistics

Probability and Statistics Probability and Statistics Kristel Van Steen, PhD 2 Montefiore Institute - Systems and Modeling GIGA - Bioinformatics ULg kristel.vansteen@ulg.ac.be CHAPTER 4: IT IS ALL ABOUT DATA 4a - 1 CHAPTER 4: IT

More information

Algebra. Table of Contents

Algebra. Table of Contents Algebra...4 Patterns...5 Adding Real Numbers...7 Subtracting Real Numbers...9 Multiplying Real Numbers...11 Dividing Real Numbers...12 Order of Operations...13 Real-Number Operations with Absolute Value...16

More information

Analysis of 2x2 Cross-Over Designs using T-Tests

Analysis of 2x2 Cross-Over Designs using T-Tests Chapter 234 Analysis of 2x2 Cross-Over Designs using T-Tests Introduction This procedure analyzes data from a two-treatment, two-period (2x2) cross-over design. The response is assumed to be a continuous

More information

Practical Statistics for the Analytical Scientist Table of Contents

Practical Statistics for the Analytical Scientist Table of Contents Practical Statistics for the Analytical Scientist Table of Contents Chapter 1 Introduction - Choosing the Correct Statistics 1.1 Introduction 1.2 Choosing the Right Statistical Procedures 1.2.1 Planning

More information

Uncertainty, Error, and Precision in Quantitative Measurements an Introduction 4.4 cm Experimental error

Uncertainty, Error, and Precision in Quantitative Measurements an Introduction 4.4 cm Experimental error Uncertainty, Error, and Precision in Quantitative Measurements an Introduction Much of the work in any chemistry laboratory involves the measurement of numerical quantities. A quantitative measurement

More information

Chapter 12 - Part I: Correlation Analysis

Chapter 12 - Part I: Correlation Analysis ST coursework due Friday, April - Chapter - Part I: Correlation Analysis Textbook Assignment Page - # Page - #, Page - # Lab Assignment # (available on ST webpage) GOALS When you have completed this lecture,

More information

WISE Regression/Correlation Interactive Lab. Introduction to the WISE Correlation/Regression Applet

WISE Regression/Correlation Interactive Lab. Introduction to the WISE Correlation/Regression Applet WISE Regression/Correlation Interactive Lab Introduction to the WISE Correlation/Regression Applet This tutorial focuses on the logic of regression analysis with special attention given to variance components.

More information

Applied Regression Modeling

Applied Regression Modeling Applied Regression Modeling Applied Regression Modeling A Business Approach Iain Pardoe University of Oregon Charles H. Lundquist College of Business Eugene, Oregon WILEY- INTERSCIENCE A JOHN WILEY &

More information

Statistics for Managers using Microsoft Excel 6 th Edition

Statistics for Managers using Microsoft Excel 6 th Edition Statistics for Managers using Microsoft Excel 6 th Edition Chapter 13 Simple Linear Regression 13-1 Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value of

More information

1 Introduction to Minitab

1 Introduction to Minitab 1 Introduction to Minitab Minitab is a statistical analysis software package. The software is freely available to all students and is downloadable through the Technology Tab at my.calpoly.edu. When you

More information

Developmental Algebra Beginning and Intermediate - Preparing for College Mathematics

Developmental Algebra Beginning and Intermediate - Preparing for College Mathematics Developmental Algebra Beginning and Intermediate - Preparing for College Mathematics By Paul Pierce Included in this preview: Copyright Page Table of Contents Excerpt of Chapter 1 For additional information

More information

M&M Exponentials Exponential Function

M&M Exponentials Exponential Function M&M Exponentials Exponential Function Teacher Guide Activity Overview In M&M Exponentials students will experiment with growth and decay functions. Students will also graph their experimental data and

More information

Regression analysis is a tool for building mathematical and statistical models that characterize relationships between variables Finds a linear

Regression analysis is a tool for building mathematical and statistical models that characterize relationships between variables Finds a linear Regression analysis is a tool for building mathematical and statistical models that characterize relationships between variables Finds a linear relationship between: - one independent variable X and -

More information

STATISTICAL PROCESS CONTROL - THE IMPORTANCE OF USING CALIBRATED MEASUREMENT EQUIPMENT. CASE STUDY

STATISTICAL PROCESS CONTROL - THE IMPORTANCE OF USING CALIBRATED MEASUREMENT EQUIPMENT. CASE STUDY Proceedings of the 6th International Conference on Mechanics and Materials in Design, Editors: J.F. Silva Gomes & S.A. Meguid, P.Delgada/Azores, 26-30 July 2015 PAPER REF: 5376 STATISTICAL PROCESS CONTROL

More information

One-Way ANOVA. Some examples of when ANOVA would be appropriate include:

One-Way ANOVA. Some examples of when ANOVA would be appropriate include: One-Way ANOVA 1. Purpose Analysis of variance (ANOVA) is used when one wishes to determine whether two or more groups (e.g., classes A, B, and C) differ on some outcome of interest (e.g., an achievement

More information

The Purpose of Hypothesis Testing

The Purpose of Hypothesis Testing Section 8 1A:! An Introduction to Hypothesis Testing The Purpose of Hypothesis Testing See s Candy states that a box of it s candy weighs 16 oz. They do not mean that every single box weights exactly 16

More information

2.830J / 6.780J / ESD.63J Control of Manufacturing Processes (SMA 6303) Spring 2008

2.830J / 6.780J / ESD.63J Control of Manufacturing Processes (SMA 6303) Spring 2008 MIT OpenCourseWare http://ocw.mit.edu 2.830J / 6.780J / ESD.63J Control of Manufacturing Processes (SMA 6303) Spring 2008 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms.

More information

Forecasting. Copyright 2015 Pearson Education, Inc.

Forecasting. Copyright 2015 Pearson Education, Inc. 5 Forecasting To accompany Quantitative Analysis for Management, Twelfth Edition, by Render, Stair, Hanna and Hale Power Point slides created by Jeff Heyl Copyright 2015 Pearson Education, Inc. LEARNING

More information

Geog 210C Spring 2011 Lab 6. Geostatistics in ArcMap

Geog 210C Spring 2011 Lab 6. Geostatistics in ArcMap Geog 210C Spring 2011 Lab 6. Geostatistics in ArcMap Overview In this lab you will think critically about the functionality of spatial interpolation, improve your kriging skills, and learn how to use several

More information

Assumptions in Regression Modeling

Assumptions in Regression Modeling Fall Semester, 2001 Statistics 621 Lecture 2 Robert Stine 1 Assumptions in Regression Modeling Preliminaries Preparing for class Read the casebook prior to class Pace in class is too fast to absorb without

More information

PP

PP Algebra I Reviewers Nancy Akhavan Lee Richmond School Hanford, CA Kristi Knevelbaard Caruthers Elementary School Caruthers, CA Sharon Malang Mickey Cox Elementary School Clovis, CA Jessica Mele Valley

More information

Regression Analysis and Forecasting Prof. Shalabh Department of Mathematics and Statistics Indian Institute of Technology-Kanpur

Regression Analysis and Forecasting Prof. Shalabh Department of Mathematics and Statistics Indian Institute of Technology-Kanpur Regression Analysis and Forecasting Prof. Shalabh Department of Mathematics and Statistics Indian Institute of Technology-Kanpur Lecture 10 Software Implementation in Simple Linear Regression Model using

More information

Unit 10: Simple Linear Regression and Correlation

Unit 10: Simple Linear Regression and Correlation Unit 10: Simple Linear Regression and Correlation Statistics 571: Statistical Methods Ramón V. León 6/28/2004 Unit 10 - Stat 571 - Ramón V. León 1 Introductory Remarks Regression analysis is a method for

More information

Leverage Sparse Information in Predictive Modeling

Leverage Sparse Information in Predictive Modeling Leverage Sparse Information in Predictive Modeling Liang Xie Countrywide Home Loans, Countrywide Bank, FSB August 29, 2008 Abstract This paper examines an innovative method to leverage information from

More information

The Model Building Process Part I: Checking Model Assumptions Best Practice

The Model Building Process Part I: Checking Model Assumptions Best Practice The Model Building Process Part I: Checking Model Assumptions Best Practice Authored by: Sarah Burke, PhD 31 July 2017 The goal of the STAT T&E COE is to assist in developing rigorous, defensible test

More information

Basic Statistics. 1. Gross error analyst makes a gross mistake (misread balance or entered wrong value into calculation).

Basic Statistics. 1. Gross error analyst makes a gross mistake (misread balance or entered wrong value into calculation). Basic Statistics There are three types of error: 1. Gross error analyst makes a gross mistake (misread balance or entered wrong value into calculation). 2. Systematic error - always too high or too low

More information

National 5 Mathematics Assessment Practice Topic 6: Graphs of quadratic functions and solving quadratic functions

National 5 Mathematics Assessment Practice Topic 6: Graphs of quadratic functions and solving quadratic functions SCHOLAR Study Guide National 5 Mathematics Assessment Practice Topic 6: Graphs of quadratic functions and solving quadratic functions Authored by: Margaret Ferguson Heriot-Watt University Edinburgh EH14

More information

Spatial Analysis with ArcGIS Pro STUDENT EDITION

Spatial Analysis with ArcGIS Pro STUDENT EDITION Spatial Analysis with ArcGIS Pro STUDENT EDITION Copyright 2018 Esri All rights reserved. Course version 2.0. Version release date November 2018. Printed in the United States of America. The information

More information

Correlation and Regression (Excel 2007)

Correlation and Regression (Excel 2007) Correlation and Regression (Excel 2007) (See Also Scatterplots, Regression Lines, and Time Series Charts With Excel 2007 for instructions on making a scatterplot of the data and an alternate method of

More information

Methods for Identifying Out-of-Trend Data in Analysis of Stability Measurements Part II: By-Time-Point and Multivariate Control Chart

Methods for Identifying Out-of-Trend Data in Analysis of Stability Measurements Part II: By-Time-Point and Multivariate Control Chart Peer-Reviewed Methods for Identifying Out-of-Trend Data in Analysis of Stability Measurements Part II: By-Time-Point and Multivariate Control Chart Máté Mihalovits and Sándor Kemény T his article is a

More information

LAB 2 - ONE DIMENSIONAL MOTION

LAB 2 - ONE DIMENSIONAL MOTION Name Date Partners L02-1 LAB 2 - ONE DIMENSIONAL MOTION OBJECTIVES Slow and steady wins the race. Aesop s fable: The Hare and the Tortoise To learn how to use a motion detector and gain more familiarity

More information

Error Analysis, Statistics and Graphing Workshop

Error Analysis, Statistics and Graphing Workshop Error Analysis, Statistics and Graphing Workshop Percent error: The error of a measurement is defined as the difference between the experimental and the true value. This is often expressed as percent (%)

More information

Chapter 16. Simple Linear Regression and Correlation

Chapter 16. Simple Linear Regression and Correlation Chapter 16 Simple Linear Regression and Correlation 16.1 Regression Analysis Our problem objective is to analyze the relationship between interval variables; regression analysis is the first tool we will

More information

Chapter 10. Regression. Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania

Chapter 10. Regression. Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania Chapter 10 Regression Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania Scatter Diagrams A graph in which pairs of points, (x, y), are

More information

ncounter PlexSet Data Analysis Guidelines

ncounter PlexSet Data Analysis Guidelines ncounter PlexSet Data Analysis Guidelines NanoString Technologies, Inc. 530 airview Ave North Seattle, Washington 98109 USA Telephone: 206.378.6266 888.358.6266 E-mail: info@nanostring.com Molecules That

More information

Equipotential Lines and Electric Fields

Equipotential Lines and Electric Fields Physics Equipotential Lines and Electric Fields Plotting the Electric Field MATERIALS AND RESOURCES EACH GROUP 5 alligator clip leads 2 batteries, 9 V 2 binder clips, large computer LabQuest multimeter,

More information

Doubt-Free Uncertainty In Measurement

Doubt-Free Uncertainty In Measurement Doubt-Free Uncertainty In Measurement Colin Ratcliffe Bridget Ratcliffe Doubt-Free Uncertainty In Measurement An Introduction for Engineers and Students Colin Ratcliffe United States Naval Academy Annapolis

More information

Elementary Statistics in Social Research Essentials Jack Levin James Alan Fox Third Edition

Elementary Statistics in Social Research Essentials Jack Levin James Alan Fox Third Edition Elementary Statistics in Social Research Essentials Jack Levin James Alan Fox Third Edition Pearson Education Limited Edinburgh Gate Harlow Essex CM20 2JE England and Associated Companies throughout the

More information

Ratio of Polynomials Fit Many Variables

Ratio of Polynomials Fit Many Variables Chapter 376 Ratio of Polynomials Fit Many Variables Introduction This program fits a model that is the ratio of two polynomials of up to fifth order. Instead of a single independent variable, these polynomials

More information

Using SPSS for One Way Analysis of Variance

Using SPSS for One Way Analysis of Variance Using SPSS for One Way Analysis of Variance This tutorial will show you how to use SPSS version 12 to perform a one-way, between- subjects analysis of variance and related post-hoc tests. This tutorial

More information

A Second Course in Statistics Regression Analysis William Mendenhall Terry Sincich Seventh Edition......

A Second Course in Statistics Regression Analysis William Mendenhall Terry Sincich Seventh Edition...... A Second Course in Statistics Regression Analysis William Mendenhall Terry Sincich Seventh Edition...... Pearson Education Limited Edinburgh Gate Harlow Essex CM20 2JE England and Associated Companies

More information

Lab 1 Uniform Motion - Graphing and Analyzing Motion

Lab 1 Uniform Motion - Graphing and Analyzing Motion Lab 1 Uniform Motion - Graphing and Analyzing Motion Objectives: < To observe the distance-time relation for motion at constant velocity. < To make a straight line fit to the distance-time data. < To interpret

More information

SAS/STAT 13.1 User s Guide. Introduction to Survey Sampling and Analysis Procedures

SAS/STAT 13.1 User s Guide. Introduction to Survey Sampling and Analysis Procedures SAS/STAT 13.1 User s Guide Introduction to Survey Sampling and Analysis Procedures This document is an individual chapter from SAS/STAT 13.1 User s Guide. The correct bibliographic citation for the complete

More information

BASICS OF ANALYTICAL CHEMISTRY AND CHEMICAL EQUILIBRIA

BASICS OF ANALYTICAL CHEMISTRY AND CHEMICAL EQUILIBRIA BASICS OF ANALYTICAL CHEMISTRY AND CHEMICAL EQUILIBRIA BASICS OF ANALYTICAL CHEMISTRY AND CHEMICAL EQUILIBRIA BRIAN M. TISSUE Virginia Tech Department of Chemistry Blacksburg, VA Cover Design: Wiley Cover

More information

SAS/STAT 14.1 User s Guide. Introduction to Nonparametric Analysis

SAS/STAT 14.1 User s Guide. Introduction to Nonparametric Analysis SAS/STAT 14.1 User s Guide Introduction to Nonparametric Analysis This document is an individual chapter from SAS/STAT 14.1 User s Guide. The correct bibliographic citation for this manual is as follows:

More information

Principles and Problems. Chapter 1: A Physics Toolkit

Principles and Problems. Chapter 1: A Physics Toolkit PHYSICS Principles and Problems Chapter 1: A Physics Toolkit CHAPTER 1 A Physics Toolkit BIG IDEA Physicists use scientific methods to investigate energy and matter. CHAPTER 1 Table Of Contents Section

More information

UNIT 1 Equations and Their Graphs

UNIT 1 Equations and Their Graphs UNIT 1 Equations and Their Graphs ALGEBRA I Unit Length and Description: 8 weeks By the end of eighth grade students have learned to solve linear equations in one variable and have applied graphical and

More information

Business Statistics 41000: Homework # 5

Business Statistics 41000: Homework # 5 Business Statistics 41000: Homework # 5 Drew Creal Due date: Beginning of class in week # 10 Remarks: These questions cover Lectures #7, 8, and 9. Question # 1. Condence intervals and plug-in predictive

More information

Six Sigma Black Belt Study Guides

Six Sigma Black Belt Study Guides Six Sigma Black Belt Study Guides 1 www.pmtutor.org Powered by POeT Solvers Limited. Analyze Correlation and Regression Analysis 2 www.pmtutor.org Powered by POeT Solvers Limited. Variables and relationships

More information

Determination of Density 1

Determination of Density 1 Introduction Determination of Density 1 Authors: B. D. Lamp, D. L. McCurdy, V. M. Pultz and J. M. McCormick* Last Update: February 1, 2013 Not so long ago a statistical data analysis of any data set larger

More information

Uncertainty and Graphical Analysis

Uncertainty and Graphical Analysis Uncertainty and Graphical Analysis Introduction Two measures of the quality of an experimental result are its accuracy and its precision. An accurate result is consistent with some ideal, true value, perhaps

More information

Introduction to Nonparametric Analysis (Chapter)

Introduction to Nonparametric Analysis (Chapter) SAS/STAT 9.3 User s Guide Introduction to Nonparametric Analysis (Chapter) SAS Documentation This document is an individual chapter from SAS/STAT 9.3 User s Guide. The correct bibliographic citation for

More information

Econ 325: Introduction to Empirical Economics

Econ 325: Introduction to Empirical Economics Econ 325: Introduction to Empirical Economics Chapter 9 Hypothesis Testing: Single Population Ch. 9-1 9.1 What is a Hypothesis? A hypothesis is a claim (assumption) about a population parameter: population

More information

Inference with Simple Regression

Inference with Simple Regression 1 Introduction Inference with Simple Regression Alan B. Gelder 06E:071, The University of Iowa 1 Moving to infinite means: In this course we have seen one-mean problems, twomean problems, and problems

More information

California Common Core State Standards for Mathematics Standards Map Mathematics I

California Common Core State Standards for Mathematics Standards Map Mathematics I A Correlation of Pearson Integrated High School Mathematics Mathematics I Common Core, 2014 to the California Common Core State s for Mathematics s Map Mathematics I Copyright 2017 Pearson Education, Inc.

More information

Keller: Stats for Mgmt & Econ, 7th Ed July 17, 2006

Keller: Stats for Mgmt & Econ, 7th Ed July 17, 2006 Chapter 17 Simple Linear Regression and Correlation 17.1 Regression Analysis Our problem objective is to analyze the relationship between interval variables; regression analysis is the first tool we will

More information

Map your way to deeper insights

Map your way to deeper insights Map your way to deeper insights Target, forecast and plan by geographic region Highlights Apply your data to pre-installed map templates and customize to meet your needs. Select from included map files

More information

Pearson Education Limited Edinburgh Gate Harlow Essex CM20 2JE England and Associated Companies throughout the world

Pearson Education Limited Edinburgh Gate Harlow Essex CM20 2JE England and Associated Companies throughout the world Pearson Education Limited Edinburgh Gate Harlow Essex CM20 2JE England and Associated Companies throughout the world Visit us on the World Wide Web at: www.pearsoned.co.uk Pearson Education Limited 2014

More information

Important note: Transcripts are not substitutes for textbook assignments. 1

Important note: Transcripts are not substitutes for textbook assignments. 1 In this lesson we will cover correlation and regression, two really common statistical analyses for quantitative (or continuous) data. Specially we will review how to organize the data, the importance

More information

Simple Linear Regression for the Climate Data

Simple Linear Regression for the Climate Data Prediction Prediction Interval Temperature 0.2 0.0 0.2 0.4 0.6 0.8 320 340 360 380 CO 2 Simple Linear Regression for the Climate Data What do we do with the data? y i = Temperature of i th Year x i =CO

More information

********************************************************************************************************

******************************************************************************************************** QUESTION # 1 1. Let the random variable X represent the number of telephone lines in use by the technical support center of a software manufacturer at noon each day. The probability distribution of X is

More information

Regression, part II. I. What does it all mean? A) Notice that so far all we ve done is math.

Regression, part II. I. What does it all mean? A) Notice that so far all we ve done is math. Regression, part II I. What does it all mean? A) Notice that so far all we ve done is math. 1) One can calculate the Least Squares Regression Line for anything, regardless of any assumptions. 2) But, if

More information

Statistics Boot Camp. Dr. Stephanie Lane Institute for Defense Analyses DATAWorks 2018

Statistics Boot Camp. Dr. Stephanie Lane Institute for Defense Analyses DATAWorks 2018 Statistics Boot Camp Dr. Stephanie Lane Institute for Defense Analyses DATAWorks 2018 March 21, 2018 Outline of boot camp Summarizing and simplifying data Point and interval estimation Foundations of statistical

More information

Business Statistics. Lecture 10: Course Review

Business Statistics. Lecture 10: Course Review Business Statistics Lecture 10: Course Review 1 Descriptive Statistics for Continuous Data Numerical Summaries Location: mean, median Spread or variability: variance, standard deviation, range, percentiles,

More information

A Scientific Model for Free Fall.

A Scientific Model for Free Fall. A Scientific Model for Free Fall. I. Overview. This lab explores the framework of the scientific method. The phenomenon studied is the free fall of an object released from rest at a height H from the ground.

More information

Mathematics GRADE 8 Teacher Packet

Mathematics GRADE 8 Teacher Packet COMMON CORE Standards Plus Mathematics GRADE 8 Teacher Packet Copyright 01 Learning Plus Associates All Rights Reserved; International Copyright Secured. Permission is hereby granted to teachers to reprint

More information