T E C O L O T E R E S E A R C H, I N C.

Similar documents
Statistics for Business and Economics

Chapter 11: Simple Linear Regression and Correlation

Statistics for Economics & Business

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise.

Chapter 15 Student Lecture Notes 15-1

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6

e i is a random error

The Ordinary Least Squares (OLS) Estimator

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation

Chapter 9: Statistical Inference and the Relationship between Two Variables

β0 + β1xi. You are interested in estimating the unknown parameters β

Introduction to Regression

Learning Objectives for Chapter 11

Chapter 3. Two-Variable Regression Model: The Problem of Estimation

Basic Business Statistics, 10/e

a. (All your answers should be in the letter!

β0 + β1xi and want to estimate the unknown

x i1 =1 for all i (the constant ).

Econ Statistical Properties of the OLS estimator. Sanjaya DeSilva

Economics 130. Lecture 4 Simple Linear Regression Continued

Chapter 13: Multiple Regression

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands

Outline. Zero Conditional mean. I. Motivation. 3. Multiple Regression Analysis: Estimation. Read Wooldridge (2013), Chapter 3.

is the calculated value of the dependent variable at point i. The best parameters have values that minimize the squares of the errors

Chapter 4: Regression With One Regressor

Comparison of Regression Lines

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)

Unit 10: Simple Linear Regression and Correlation

SIMPLE LINEAR REGRESSION

Chapter 15 - Multiple Regression

Y = β 0 + β 1 X 1 + β 2 X β k X k + ε

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Lecture 6: Introduction to Linear Regression

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Polynomial Regression Models

[The following data appear in Wooldridge Q2.3.] The table below contains the ACT score and college GPA for eight college students.

Global Sensitivity. Tuesday 20 th February, 2018

Here is the rationale: If X and y have a strong positive relationship to one another, then ( x x) will tend to be positive when ( y y)

Topic 7: Analysis of Variance

β0 + β1xi. You are interested in estimating the unknown parameters β

Now we relax this assumption and allow that the error variance depends on the independent variables, i.e., heteroskedasticity

2016 Wiley. Study Session 2: Ethical and Professional Standards Application

Lecture 3 Stat102, Spring 2007

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore

Chapter 14 Simple Linear Regression

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction

Continuous vs. Discrete Goods

Biostatistics. Chapter 11 Simple Linear Correlation and Regression. Jing Li

STAT 3008 Applied Regression Analysis

Chapter Newton s Method

Feb 14: Spatial analysis of data fields

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

Statistical Evaluation of WATFLOOD

Laboratory 1c: Method of Least Squares

Parametric fractional imputation for missing data analysis. Jae Kwang Kim Survey Working Group Seminar March 29, 2010

Lecture 16 Statistical Analysis in Biomaterials Research (Part II)

Chapter 14 Simple Linear Regression Page 1. Introduction to regression analysis 14-2

Chapter 7 Generalized and Weighted Least Squares Estimation. In this method, the deviation between the observed and expected values of

where I = (n x n) diagonal identity matrix with diagonal elements = 1 and off-diagonal elements = 0; and σ 2 e = variance of (Y X).

GMM Method (Single-equation) Pongsa Pornchaiwiseskul Faculty of Economics Chulalongkorn University

17 - LINEAR REGRESSION II

NANYANG TECHNOLOGICAL UNIVERSITY SEMESTER I EXAMINATION MTH352/MH3510 Regression Analysis

Properties of Least Squares

Some basic statistics and curve fitting techniques

Primer on High-Order Moment Estimators

Laboratory 3: Method of Least Squares

Statistics MINITAB - Lab 2

Lecture Notes for STATISTICAL METHODS FOR BUSINESS II BMGT 212. Chapters 14, 15 & 16. Professor Ahmadi, Ph.D. Department of Management

Chap 10: Diagnostics, p384

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition)

Estimation: Part 2. Chapter GREG estimation

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

THE ROYAL STATISTICAL SOCIETY 2006 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE

Diagnostics in Poisson Regression. Models - Residual Analysis

4.3 Poisson Regression

STATISTICS QUESTIONS. Step by Step Solutions.

j) = 1 (note sigma notation) ii. Continuous random variable (e.g. Normal distribution) 1. density function: f ( x) 0 and f ( x) dx = 1

Generalized Linear Methods

A New Method for Estimating Overdispersion. David Fletcher and Peter Green Department of Mathematics and Statistics

since [1-( 0+ 1x1i+ 2x2 i)] [ 0+ 1x1i+ assumed to be a reasonable approximation

x = , so that calculated

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Limited Dependent Variables

UNR Joint Economics Working Paper Series Working Paper No Further Analysis of the Zipf Law: Does the Rank-Size Rule Really Exist?

Regression Analysis. Regression Analysis

Negative Binomial Regression

Statistics for Managers Using Microsoft Excel/SPSS Chapter 14 Multiple Regression Models

Kernel Methods and SVMs Extension

An (almost) unbiased estimator for the S-Gini index

STAT 3340 Assignment 1 solutions. 1. Find the equation of the line which passes through the points (1,1) and (4,5).

DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR. Introductory Econometrics 1 hour 30 minutes

Lecture 2: Prelude to the big shrink

Linear regression. Regression Models. Chapter 11 Student Lecture Notes Regression Analysis is the

/ n ) are compared. The logic is: if the two

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA

Chapter 3 Describing Data Using Numerical Measures

7.1. Single classification analysis of variance (ANOVA) Why not use multiple 2-sample 2. When to use ANOVA

PubH 7405: REGRESSION ANALYSIS. SLR: INFERENCES, Part II

Transcription:

T E C O L O T E R E S E A R C H, I N C. B rdg n g En g neern g a nd Econo mcs S nce 1973 THE MINIMUM-UNBIASED-PERCENTAGE ERROR (MUPE) METHOD IN CER DEVELOPMENT Thrd Jont Annual ISPA/SCEA Internatonal Conference Dr. Shu-Png Hu 14 June 001 shu@sb.tecolote.com

Outlne Objectve Applcaton for MUPE Error Term Assumptons Addtve Multplcatve Multplcatve Error Model Defntons and Propertes Canddate Error Forms Methodologes Used n Analyzng Multplcatve CERs Log-Error, WLS (by observaton), MPE, MUPE, & ZPB/MPE Comparson between MPE and ZPB/MPE Bas Characterstcs Heurstc Goodness-of-ft Measures Analyss of Examples from USCM7 and Other Sources Conclusons Page Brdgng Engneerng and Economcs Snce 1973

Objectve Fnd a regresson technque to model a multplcatve error term wthout bas It requres no transformaton and no correcton factor Page 3 Brdgng Engneerng and Economcs Snce 1973

Applcaton for MUPE Fan-shaped pattern exsts n database, or Data pattern not notceable (especally n small samples) Data range of dependent varable over one order of magntude Management controls proportonal errors Page 4 Brdgng Engneerng and Economcs Snce 1973

Addtve Error Term Addtve Error Term : y = f(x) + Y X Note: Error dstrbuton s ndependent of the scale of the project. (OLS) Page 5 Brdgng Engneerng and Economcs Snce 1973

Addtve Error Term Addtve Error Term : y = ax^b + Y X Note: Ths requres non-lnear regresson. Page 6 Brdgng Engneerng and Economcs Snce 1973

Multplcatve Error Term Multplcatve Error Term : y = (a + bx) * UpperBound f(x) LowerBound Y X Note: Ths requres non-lnear regresson. Page 7 Brdgng Engneerng and Economcs Snce 1973

Multplcatve Error Term Multplcatve Error Term : y = ax^b * UpperBound f(x) LowerBound Y X Note: Ths equaton s lnear n log space. Page 8 Brdgng Engneerng and Economcs Snce 1973

Error Term Assumptons - Background ADDITIVE ERROR MULTIPLICATIVE ERROR Dstrn Assumptons N(0, S ), ndependent LN(0, S ), ndependent Typcal Model Form Lnear y = a + b x Exponental y = a x b Hstorcal Ratonale (Mathematcal Convenence) Legtmate Reasons Absolute Errors Proportonal Errors What should be cost errors? Cost varaton s ndependent of the scale of the project Cost varaton s proportonal to the scale of the project Model form should not drve error term assumpton Error term should not drve model form Page 9 Brdgng Engneerng and Economcs Snce 1973

Multplcatve Error Model - MUPE Defnton for cost varaton: Y = f(x)* where E( ) = 1 and V( ) = f(x) Cost Y Note: E( (Y-f(X)) / f(x) ) = 0 V( (Y-f(X)) / f(x) ) = Some Drver X Page 10 Brdgng Engneerng and Economcs Snce 1973

Bas n a Regresson Equaton a based hgh regresson equaton: f(x) based Cost Y Some Drver X Page 11 Brdgng Engneerng and Economcs Snce 1973

Canddate Error (Resdual) Forms Log-Error ( ~ LN(0, ) ) Least squares n log space Resduals of the cost functon transformed nto log-space Error = Log (y ) - Log f(x ) Weghted Resduals Least squares n unt space by the recprocal of the observaton Error y f y by the recprocal of the predcted value ( x ) Error y f ( x f ( x ) ) Page 1 Brdgng Engneerng and Economcs Snce 1973

Regresson Methods to Implement Multplcatve Errors Least squares n log space Log-Error Model Least squares n unt space wth weghted resduals Weghted by observaton - WLS Weghted by predcted value MPE (Mnmum-Percentage Error) Method MUPE (Mnmum-Unbased-Percentage Error) Method ZPB/MPE Method (Constraned MPE Method) Page 13 Brdgng Engneerng and Economcs Snce 1973

Methodologes Analyses of real examples usng four dfferent methods MPE (Mnmum-Percentage Error) Method MUPE (Mnmum-Unbased-Percentage Error) Method Log-Error Model ZPB/MPE Method (Constraned MPE Method) Page 14 Brdgng Engneerng and Economcs Snce 1973

MPE and MUPE Methods Two possble ways to perform the optmzaton for the weghted least squares usng the predcted values n USCM7 MPE hgh bas due to smultaneous mnmzaton Mnmze ( x n 1 f ( x ) y f ) MUPE bas elmnated Mnmze ( x n 1 fk-1 ( x ) y f ) where k s the teraton number Page 15 Brdgng Engneerng and Economcs Snce 1973

Brdgng Engneerng and Economcs Snce 1973 Page 16 Constraned MPE Method (ZPB/MPE) Alternatve method to remove the hgh bas n MPE equatons for the general level of the functon Constraned Excel Solver soluton 1 ) ( ) ( n x f x f y Mnmze 0 ) ( ) ( 1 n x f x f y to Subject

Comparson between ZPB/MPE and MPE For most equatons (.e.,y = a X b Z c, Y = a + bx + cz, etc.) Senstvty coeffcents (assocated wth the drver varables) are the same between MPE & ZPB/MPE equatons Only leadng term or level of functon adjusted Fndngs also proven by mathematcal dervatons For trad equatons (.e., Y = a + b X c Z d ) All coeffcents changed Page 17 Brdgng Engneerng and Economcs Snce 1973

Bas Characterstcs of Dfferent Methods Log-error equatons wll be based low Bas can be adjusted usng a correcton factor,.e., exp (1 p ) n p = number of estmated coeffcents Resduals weghted by the observaton wll be based low Bas cannot be easly justfed MPE equatons wll be based hgh Bas n equaton can be adjusted n smple cases by a correcton factor 1 ( 1 ) mpe 1 mupe MUPE (noted as IRLS) wll be asymptotcally unbased MUPE has zero proportonal error for ponts n the database, t s also proven to produce consstent parameter estmates. Page 18 Brdgng Engneerng and Economcs Snce 1973

Brdgng Engneerng and Economcs Snce 1973 Page 19 Heurstc Goodness-of-ft Measures p n y y y n 1 ˆ ˆ Multplcatve error (standard error of model): Average percentage error (APE)*: Note: *APE s termed as average bas n USCM7. n y y y n 1 ˆ ˆ n OLS y y p n SEE 1 ˆ 1

MPE Method s based Based hgh for the level of the functon Correcton for bas by a correcton factor: CF = 1 MPE Based low for varance / standard error Correcton for bas n estmate of varance by: CF = 1 / (1 MPE) average percentage error (APE) Correcton for bas n APE by: CF = 1 / (1 APE MPE ) Note: True APE = APE MPE / (1 APE MPE ) Page 0 Brdgng Engneerng and Economcs Snce 1973

Heurstc Goodness-of-ft Measures Pearson s Correlaton Squared (r ): r Cov( Y, Yˆ) Var( Y ) * Var( Yˆ) r = R n OLS Pearson s correlaton coeffcent measures the lnear assocaton between y (actual cost) and y-hat (predcted cost), t cannot explan the actual devaton between y and y-hat f the model s not OLS Page 1 Brdgng Engneerng and Economcs Snce 1973

R-Squared: Heurstc Goodness-of-ft Measures R SSE ( y yˆ ) ( yˆ y ) 1 1 ( SST ( y y ) ( y y ) SSR SST n OLS ) Adjusted R-Squared: Adj ˆ SSE ( n p ) ( y y ). R 1 1 SST ( n 1) ( y y ) ( n ( n p ) 1) Note: R-Squared s a measure of the amount of varaton about the mean explaned by the ftted equaton n OLS Page Brdgng Engneerng and Economcs Snce 1973

Analyss of Examples Comparsons of examples of complex equatons drawn from Unmanned Spacecraft Cost Model, 7th Edton (USCM7) and other sources 6 examples from USCM7 COMM/TT&C Dgtal Electroncs TT&C Dgtal Electroncs TT&C RF Dstrbuton EPS Generaton examples from other sources Analyze and compare results usng 4 dfferent methods Page 3 Brdgng Engneerng and Economcs Snce 1973

COMM/TTC Dgtal Electroncs NR CER (n = 11) MPE: Y = 11.4 * Wt.787 * NumLnks.853 ( s =.1, r =.94 ) MUPE: Y = 194.1 * Wt.804 * NumLnks.85 ( s =.1, r =.94 ) Log-Lnear: Y = 186.61 * Wt.811 * NumLnks.815 (s =.31, r =.94 ) ZPB/MPE: Y = 04.0 * Wt.787 * NumLnks.853 ( s =.1, r =.94 ) Page 4 Brdgng Engneerng and Economcs Snce 1973

COMM/TTC Dgtal Electroncs NR CER (n = 11) MPE: Y = 359.88 * Weght ( s =.58, r =.54, R = -.0 ) (+43%) MUPE: Y = 51.08 * Weght ( s =.69, r =.54, R =.5 ) Log-Error: Y = 0.48 * Weght ( s =.69, r =.54, R =.50 ) (-19%) ZPB/MPE: Y = 51.08 * Weght ( s =.69, r =.54, R =.5 ) Page 5 Brdgng Engneerng and Economcs Snce 1973

Comparson Chart for Weght based CER 45000 NR Cost (9$K) 40000 35000 30000 5000 0000 15000 10000 5000 NR COST MPE MUPE Log Error 0 10 0 30 40 50 60 70 80 90 100 110 Weght (lbs) Page 6 Brdgng Engneerng and Economcs Snce 1973

COMM/TTC Dgtal Electroncs NR CER n USCM7 MPE: Y = 345.78 * Weght ( s =.53, r =.65, R =.7 ) (+35%) average bas = 6% n USCM7 MUPE: Y = 56.88 * Weght ( s =.6, r =.65, R =.64 ) Log-Error: Y = 18.95 * Weght ( s =.58, r =.65, R =.63 ) (-15%) Y * CF = 55.34*Weght ZPB/MPE: Y = 56.88 * Weght ( s =.6, r =.65, R =.64 ) Note: The average bas measures lsted n USCM7 are based low. Page 7 Brdgng Engneerng and Economcs Snce 1973

TT&C Dgtal Electroncs Recur CER (n = 16) MPE: Y = 3.41 * Wt.9 * Nbox.659 * NLnks 1.091 ( s =.6, r =.93, R =.90 ) MUPE: Y = 19.08 * Wt.96 * Nbox.68 * NLnks 1.09 ( s =.6, r =.9, R =.91 ) Log-Lnear: Y = 17.60 * Wt.967 * Nbox.69 * NLnks 1.09 ( s =.6, r =.93, R =.91 ) ZPB/MPE: Y =.0 * Wt.9 * Nbox.659 * NLnks 1.091 ( s =.6, r =.9, R =.90 ) Page 8 Brdgng Engneerng and Economcs Snce 1973

TT&C Dgtal Electroncs Scatter Plot T 1 (9 $ M ) 10 9 8 7 6 5 4 3 1 0 0 10 0 30 40 50 60 70 80 TT&C Comp Weght (lbs) Page 9 Brdgng Engneerng and Economcs Snce 1973

TTC Dgtal Electroncs Recur CER (n = 16) MPE: Y = 103.33 * Wt ( s =.54, r =.15, R = -.8 ) (+37%) MUPE: Y = 75.4 * Wt ( s =.63, r =.15, R =.11 ) Log-Error: Y = 63.5 * Wt ( s =.59, r =.15, R =.11 ) (-16%) ZPB/MPE: Y = 75.4 * Wt ( s =.63, r =.15, R =.11 ) Page 30 Brdgng Engneerng and Economcs Snce 1973

TT&C RF Dstrbuton Comp Rec CER (n = 13) MPE: Y = -7.386 + 9.18 * Wt + 70.68 * Actve ( s =.56, r =.47, R =.4 ) (-5%, +5%, +85%) MUPE: Y = -7.043 + 7.899 *Wt + 38.0 * Actve ( s =.67, r =.46, R =.35 ) Log-Error: Y = -6.165 + 5.08 * Wt + 30.93 * Actve ( s =.60, r =.47, R =.9 ) (+1%, -10%, -0%) ZPB/MPE: Y = -5.616 +.19 * Wt + 53.74 * Actve ( s =.64, r =.47, R =.33) (+0%, -0%, +41%) Note: ZPB/MPE s 76% of MPE Page 31 Brdgng Engneerng and Economcs Snce 1973

EPS Generaton Recur CER ( n = 17 ) MPE: Y = 58.6 + 0.0979 *CellNum ( s =.39, r =.80, R =.51 ) MUPE: Y = 488.54 + 0.0864 *CellNum ( s =.41, r =.80, R =.71 ) Log-Error: Y = 437.787 + 0.080 *CellNum ( s =.47, r =.80, R =.73 ) MPE/ZPB: Y = 506.366 + 0.085 *CellNum ( 87% of MPE ) ( s =.41, r =.80, R =.71 ) Page 3 Brdgng Engneerng and Economcs Snce 1973

SEPM Factors MPE: Y =.54 * Lot# -.718 * Rate -1.0 ( s =.167, r =.84 ) MUPE: Y = 1.93 * Lot# -.713 * Rate -0.99 ( s =.169, r =.84 ) Log-Error: Y = 1.65 * Lot# -.71 * Rate -0.983 ( s =.167, r =.84 ) ZPB/MPE: Y = 1.98 * Lot# -.718 * Rate -1.0 ( s =.169, r =.84 ) (97.5% of MPE) Page 33 Brdgng Engneerng and Economcs Snce 1973

Example from ISPA MPE: Y = 377.337-79.11 * X -0.141 (s =.664, r =.4 ) MUPE: Y = - 1.145 + 53.734 * X 0.658 (s =.871, r =.36 ) Log-Error: Y = 3.51 + 30.17 * X 0.990 (s =1.06, r =.3 ) ZPB/MPE: Y = - 67.649 + 16.666 * X 0.47 (s =.89, r =.40 ) Note: The above equatons are dfferent n both magntude and sgn. Page 34 Brdgng Engneerng and Economcs Snce 1973

Conclusons MUPE method does not requre transformaton or correcton factor. Tradtonal goodness-of-ft measures may not be adequate. Tradtonal statstcs hold for lnear equatons. Revew multplcatve error, adjusted R, resdual plot, percentage error table, etc. for MUPE equatons. Pearson s correlaton s not senstve to dfferent fttng methods. Use t wth cauton. ZPB/MPE method does not change the senstvtes of MPE equaton; t only lowers the level of MPE equaton by a certan percentage *. Ths fndng s also proven by mathematcal dervatons. MPE method requres correcton factors for both standard error and average percentage error because they are based low. Log-error equatons wth correcton factors are very close to MUPE equatons n most cases. MPE and MUPE do not always converge, especally n learnng curve analyss. If no convergence, use log-error models nstead * true for most equatons Page 35 Brdgng Engneerng and Economcs Snce 1973