Improve Forecasts: Use Defect Signals

Similar documents
GAMINGRE 8/1/ of 7

Computing & Telecommunications Services Monthly Report January CaTS Help Desk. Wright State University (937)

Computing & Telecommunications Services

REPORT ON LABOUR FORECASTING FOR CONSTRUCTION

BUSI 460 Suggested Answers to Selected Review and Discussion Questions Lesson 7

Process Behavior Analysis Understanding Variation

Tracking Accuracy: An Essential Step to Improve Your Forecasting Process

Jayalath Ekanayake Jonas Tappolet Harald Gall Abraham Bernstein. Time variance and defect prediction in software projects: additional figures

Time Series Analysis

Section II: Assessing Chart Performance. (Jim Benneyan)

Sales Analysis User Manual

Technical note on seasonal adjustment for M0

Four Basic Steps for Creating an Effective Demand Forecasting Process

Your World is not Red or Green. Good Practice in Data Display and Dashboard Design

Monitoring Platelet Issues - a novel approach CUSUM. Clive Hyam Blood Stocks Management Scheme London

SYSTEM BRIEF DAILY SUMMARY

Mr. XYZ. Stock Market Trading and Investment Astrology Report. Report Duration: 12 months. Type: Both Stocks and Option. Date: Apr 12, 2011

Determine the trend for time series data

YACT (Yet Another Climate Tool)? The SPI Explorer

Lecture Prepared By: Mohammad Kamrul Arefin Lecturer, School of Business, North South University

SYSTEM BRIEF DAILY SUMMARY

FEB DASHBOARD FEB JAN FEB MAR APR MAY JUN JUL AUG SEP OCT NOV DEC

Agile Estimation: Beyond the Myths. Slide: 1 9/25/2015 Webinar Sponsored by Computer Aid, Inc.

Smoothed Prediction of the Onset of Tree Stem Radius Increase Based on Temperature Patterns

2013 GROWTH INCENTIVES PROGRAM FAQS

2.1 Inductive Reasoning Ojectives: I CAN use patterns to make conjectures. I CAN disprove geometric conjectures using counterexamples.

Lecture Prepared By: Mohammad Kamrul Arefin Lecturer, School of Business, North South University

Salem Economic Outlook

Climatography of the United States No

A FACILITY MANAGER S INTRODUCTION TO WEATHER CORRECTION FOR UTILITY BILL TRACKING. John Avina, Director Abraxas Energy Consulting

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

AREP GAW. AQ Forecasting

YEAR 10 GENERAL MATHEMATICS 2017 STRAND: BIVARIATE DATA PART II CHAPTER 12 RESIDUAL ANALYSIS, LINEARITY AND TIME SERIES

DAILY QUESTIONS 28 TH JUNE 18 REASONING - CALENDAR

Akira Ito & Staffs of seasonal forecast sector

Annual Average NYMEX Strip Comparison 7/03/2017

Monitoring on Subsidence Claims. John Parvin Subsidence Claims Manager

In Centre, Online Classroom Live and Online Classroom Programme Prices

Chapter 3. Regression-Based Models for Developing Commercial Demand Characteristics Investigation

The science of learning from data.

Forecasting using R. Rob J Hyndman. 1.3 Seasonality and trends. Forecasting using R 1

Climatography of the United States No

Climatography of the United States No

S95 INCOME-TESTED ASSISTANCE RECONCILIATION WORKSHEET (V3.1MF)

FORECASTING COARSE RICE PRICES IN BANGLADESH

3. Problem definition

Winter Season Resource Adequacy Analysis Status Report

Climatography of the United States No

Climatography of the United States No

Climatography of the United States No

Climatography of the United States No

Climatography of the United States No

Climatography of the United States No

Climatography of the United States No

Climatography of the United States No

Climatography of the United States No

What Makes the XmR Chart Work?

Climatography of the United States No

Climatography of the United States No

Life Cycle of Convective Systems over Western Colombia

Climatography of the United States No

Climatography of the United States No

Climatography of the United States No

MONTE CARLO SIMULATION RISK ANALYSIS AND ITS APPLICATION IN MODELLING THE INCLEMENT WEATHER FOR PROGRAMMING CIVIL ENGINEERING PROJECTS

An alternative to Red-Yellow-Green Board Reports

Monthly Trading Report July 2018

Climatography of the United States No

982T1S2 Assignment Topic: Trigonometric Functions Due Date: 28 th August, 2015

Climatography of the United States No

Climatography of the United States No

Climatography of the United States No

Climatography of the United States No

ENGINE SERIAL NUMBERS

Climatography of the United States No

Climatography of the United States No

Atmospheric circulation analysis for seasonal forecasting

In this activity, students will compare weather data from to determine if there is a warming trend in their community.

Why does attention to web articles fall with time?

Climatography of the United States No

Climatography of the United States No

Approximating Fixed-Horizon Forecasts Using Fixed-Event Forecasts

The xmacis Userʼs Guide. Keith L. Eggleston Northeast Regional Climate Center Cornell University Ithaca, NY

Mountain View Community Shuttle Monthly Operations Report

Climatography of the United States No

WHEN IS IT EVER GOING TO RAIN? Table of Average Annual Rainfall and Rainfall For Selected Arizona Cities

Monthly Trading Report Trading Date: Dec Monthly Trading Report December 2017

Climatography of the United States No

Enhancements to the CTBTO IDC radionuclide processing pipeline for particulate samples achieving significant improvement of automatic products

CLIMATE OF THE ZUMWALT PRAIRIE OF NORTHEASTERN OREGON FROM 1930 TO PRESENT

Chapter 3 ANALYSIS OF RESPONSE PROFILES

Jackson County 2018 Weather Data 67 Years of Weather Data Recorded at the UF/IFAS Marianna North Florida Research and Education Center

Climatography of the United States No

Climatography of the United States No

Climatography of the United States No

Location. Datum. Survey. information. Etrometa. Step Gauge. Description. relative to Herne Bay is -2.72m. The site new level.

BOSELE NATIONAL PROVIDENT FUND JM BUSHA BONDPLUS

CWV Optimisation. TWG 12 th February 2014

Quantification of energy losses caused by blade icing and the development of an Icing Loss Climatology

Climatography of the United States No

Location. Datum. Survey. information. Etrometa. Step Gauge. Description. relative to Herne Bay is -2.72m. The site new level.

Transcription:

Improve Forecasts: Use Defect Signals Paul Below paul.below@qsm.com Quantitative Software Management, Inc. Introduction Large development and integration project testing phases can extend over many months It is important to predict when the defect detection rate will drop to acceptable levels SLIM Control can be used to track and forecast defect detection rates The peak of the defect Rayleigh curve is an important point in the testing phase Control charts are another tool (#)

Cumulative Found by Month, Project XYZ Cum Found Category Total,54,4,4 754 54 4 5 6 7 9 3 4 6 7 9 3 3 3 33 34 35 Apr May Jun Jul Aug Sep Oct Nov Dec Jan Feb Mar Apr May Jun Jul Aug Sep Oct Nov Dec '3 '4 4 Current Plan Actuals Current Forecast Green Control Bound Yellow Control Bound Project: Example Project Project has found fewer defects than estimated. What will happen next? (#3) Found by Month, Project XYZ Monthly Found Has the peak of the curve really been reached? Or was the last month misleading? If only we had another signal! 3 Date we need to forecast 5? 5 6 7 9 3 4 6 7 9 3 3 3 33 34 35 Apr May Jun Jul Aug Sep Oct Nov Dec Jan Feb Mar Apr May Jun Jul Aug Sep Oct Nov Dec '3 '4 5 Current Plan Actuals Green Control Bound Yellow Control Bound Project: Example Project (#4)

Comment on Defect Prediction It is possible to create very useful estimates of defects based on only a few key metrics. For example, I created a regression analysis to predict defects based on over recently completed software projects from the QSM database. This resulted in an adjusted R square of.537 using only the input variables the log of peak staff, the log of ESLOC, and the log of production rate (ESLOC per calendar month). Standardized residuals show a normal distribution with mean close to zero Large projects have multiple testing phases. Such models, with multiple control charts, can be used throughout. For example, with one Fortune 5 client, I found that merely using the number of prerelease defects was an excellent predictor of their go live release defects (R Square of over.7). (#5) Control Charts Every process displays variation. Some processes display controlled variation and others display uncontrolled variation. In the 9 s, Control Charts were invented by Walter A. Shewhart. In most of the rest of the th century the concepts were popularized by people such as W. Edwards Deming. Control Charts are a major part of Statistical Process Control, and are integrated into Six Sigma and CMMI. (#6) The engineer desires to reduce the variability in quality to an economic minimum. In other words, he wants (a) a rational method of prediction that is subject to minimum error, and (b) a means of minimizing variability in the quality of a given produce at a given cost of production. Walter A. Shewhart 3

Control Charts and Signals Purpose here is not to determine if the testing is in control or to improve product quality (both have been done, but are outside scope of this presentation, see sources at end) Purpose here is to determine if there has been a shift in quality of the underlying product I use Individuals and Moving Range charts (XmR), suitable for many situations including the periodic collection of items found in a given time It is important to calculate the control limits properly (see resources at end, or use a good stats tool) (#7) Rules, Rules There are a number of rules that are used to detect signals. The number of rules used and the definitions of the rules vary slightly from one source to another. However, the traditional use of control charts is best met by keeping the number of rules to a minimum, thereby reducing the chance of obtaining a false signal. But in our case, we don t want to miss a signal. In IBM SPSS, for example, there are possible rules that can be turned on or off: (#) One point above +3 Sigma, or one point below -3 Sigma out of last 3 above + Sigma, or of 3 below - Sigma 4 out of last 5 above + Sigma, or 4 out of 5 below - Sigma points above center line, or below center line 6 in a row trending up, or 6 trending down 4 in a row alternating up and down 4

Why the Rules? All uses of control charts walk this decision line. Shewhart originally used 3 sigma limits because he wanted to minimize false signals, which would incur the unnecessary cost of researching a problem that didn t exist. In other words, when he saw a signal he wanted to be almost completely certain it was real. Possible States (#9) Possible Decision Problem Not a Problem Investigate Correctly Type II error Accepted Ignore Type I error Correctly Rejected Another Example In this example, the balance between testing and fixing has not remained constant. For defects detected, five of the points show up as red, meaning they violated one of the rules These violations are not unusual. As mentioned previously, defect metrics commonly follow a Rayleigh distribution What about backlog burndown? Point # Violations for Points 4 4 points out of the last 5 below - sigma 6 Greater than +3 sigma 6 points out of the last 3 above + sigma 7 points out of the last 3 above + sigma 4 points out of the last 5 above + sigma 9 4 points out of the last 5 above + sigma (#) 5

XmR for Resolved / Discovered Balance between fixed and found has shifted. Something has changed. (#) Summary Control charts can be used to determine whether apparent changes in defect rates are significant. Especially, has the peak of the defect detection Rayleigh curve been reached? One use for this knowledge is to create and improve forecasts of Program completion, or software quality at key Program milestones. since we can make operationally verifiable predictions only in terms of future observations, it follows that with the acquisition of new data, not only may the magnitudes involved in any prediction change, but also our grounds for belief in it. Walter A. Shewhart (#) 6

Example from SLIM Training Quantitative Software Management, Inc. SLIM Control Example: But wait, there s more! From the SW-Wed Gateway example packaged with SLIM Control. Defect Closure & Backlog - Web Gateway Found Category Total 3 4 5 6 7 Fixed Category Total 3 4 5 6 7 5 5 5 5 4 6 4 6 4 6 Jan Mar May Jul Sep Nov Jan Mar May Jul Sep Nov Jan Mar May ' '3 '4 4 6 4 6 4 6 Jan Mar May Jul Sep Nov Jan Mar May Jul Sep Nov Jan Mar May ' '3 '4 Cummulative Found, Fixed & Open Defect closure is not keeping up with discovery rates. Found,, Open Defect Backlog 3 4 5 6 7 75 5 Fixed Open 4 6 4 6 4 6 Jan Mar May Jul Sep Nov Jan Mar May Jul Sep Nov Jan Mar May ' '3 '4 6 4 4 6 4 6 4 6 Jan Mar May Jul Sep Nov Jan Mar May Jul Sep Nov Jan Mar May ' '3 '4 75 5 Current Plan Actuals Green Control Bound Yellow Control Bound Red Control Bound (#4) 7

Let s Look at a Defect Forecast SW-Wed Gateway example packaged with SLIM Control. Arbitrary: Weighted every metric evenly that had Goodness of Fit Curve Fit Analysis - Web Gateway Curve Fit Analysis Report Forecast Assumptions Projection Date: /3/ Implied PI: 5.3 Curve Fit Weighted Results Projected Phase 3 Time: 9. Months Phase 3 End Date: /7/3 Life Cycle End Date: 3//4 Tradeoff Results Phase 3 Peak/Shape: na Tradeoff Results: na Metric Phase 3 Milestones Cum Eff SLOC Found (Moderate) Found (Serious) Found (Critical) Earned Tasks Use Cases Design Components Integration Builds Number of Data Points Used 7 5 5 5 4 Relative Weight (User) Goodness of Fit Metric Complete Metric Complete Poor Predicted Phase 3 Time (Months).4.5 7.6 7.5 6.. 3.4 Current Plan: Baseline Plan Current Forecast: Forecast as of /3/ Lifecycle includes R&D, C&T, P_Mnt (#5) SLIM Control Example Forecast predicts we are still a couple of months away from peak of defect discovery Rayleigh curve. Defect Closure & Backlog - Web Gateway Found Category Total 3 4 5 6 7 3 Fixed Category Total 3 4 5 6 7 3 5 5 5 5 4 6 4 6 4 6 Jan Mar May Jul Sep Nov Jan Mar May Jul Sep Nov Jan Mar ' '3 '4 4 6 4 6 4 6 Jan Mar May Jul Sep Nov Jan Mar May Jul Sep Nov Jan Mar ' '3 '4 Cummulative Found, Fixed & Open Defect closure is not keeping, up with discovery rates. Found, Fixed 6 4 Open 4 6 4 6 4 6 Jan Mar May Jul Sep Nov Jan Mar May Jul Sep Nov Jan Mar ' '3 '4 Open Defect Backlog 3 4 5 6 7 75 5 75 5 4 6 4 6 4 6 Jan Mar May Jul Sep Nov Jan Mar May Jul Sep Nov Jan Mar ' '3 '4 Current Plan Actuals Current Forecast Green Control Bound Yellow Control Bound Red Control Bound (#6)

SLIM Control Example XmR for Discovered No rule violations. Therefore, no significant change (month to month variation in defects discovered can be explained by random variation). (#7) SLIM Control Example XmR for Fixed No significant change. Note though that the moving range chart picked up on month as almost 3 sigma, when they ramped up fixing. (#) 9

SLIM Control Example XmR for Fixed / Discovered No significant change. Control charts agree with SLIM Control Forecast, in that we have no evidence that curve peak has been reached. (#9) Resources Statistical Method from the Viewpoint of Quality Control, Walter A. Shewhart. Dover, 96 edition, p.9 and p. 4. Why CMMI Maturity Level 5?, Michael Comps. Crosstalk, Jan-Feb,, pp 5-. Do Not Get Out of Control: Achieving Real-time Quality and Performance, Craig Hale and Mike Rowe. Crosstalk, Jan-Feb, pp. 4-. Understanding Statistical Process Control, Donald J. Wheeler. SPC Press,. Implementing Six Sigma: Smarter Solutions Using Statistical Methods, Second Edition, Forrest W. Breyfogle III. John Wiley & Sons, 3. The IFPUG Guide to IT and Software Measurement: A Comprehensive International Guide, IFPUG, ed. CRC Press,. Chapter 7, Paul Below, pp. 39-333. Paul Below paul.below@qsm.com http://www.qsm.com (#)

Extra Material (#) XmR Control Limits Individuals Chart (X) Center line is the mean of X, mx, the average of all the individual values Upper Control Limit is mx + (.66 * mr) Lower Control Limit is mx (.66 * mr) Moving Range Chart (mr) Center line is the mean of ranges, mr Upper Control Limit is 3.6 * mr Mean Range, mr, is the average of a moving range of. For each data point, except the first, the moving range of is the difference between that data point and the preceding one. Those ranges are averaged to get the mr. For in depth explanation, see Chapter 3 of Wheeler (on resources slide). (#)