The Comparison of Stochastic and Deterministic DEA Models

Similar documents
Chance Constrained Data Envelopment Analysis The Productive Efficiency of Units with Stochastic Outputs

Data Envelopment Analysis within Evaluation of the Efficiency of Firm Productivity

A DIMENSIONAL DECOMPOSITION APPROACH TO IDENTIFYING EFFICIENT UNITS IN LARGE-SCALE DEA MODELS

USING LEXICOGRAPHIC PARAMETRIC PROGRAMMING FOR IDENTIFYING EFFICIENT UNITS IN DEA

Subhash C Ray Department of Economics University of Connecticut Storrs CT

A Slacks-base Measure of Super-efficiency for Dea with Negative Data

Symmetric Error Structure in Stochastic DEA

Determination of Economic Optimal Strategy for Increment of the Electricity Supply Industry in Iran by DEA

Finding the strong defining hyperplanes of production possibility set with constant returns to scale using the linear independent vectors

CENTER FOR CYBERNETIC STUDIES

Selective Measures under Constant and Variable Returns-To- Scale Assumptions

ABSTRACT INTRODUCTION

A METHOD FOR SOLVING 0-1 MULTIPLE OBJECTIVE LINEAR PROGRAMMING PROBLEM USING DEA

Further discussion on linear production functions and DEA

AN IMPROVED APPROACH FOR MEASUREMENT EFFICIENCY OF DEA AND ITS STABILITY USING LOCAL VARIATIONS

A DEA- COMPROMISE PROGRAMMING MODEL FOR COMPREHENSIVE RANKING

Sensitivity and Stability Analysis in Uncertain Data Envelopment (DEA)

Second-order cone programming formulation for two player zero-sum game with chance constraints

Revenue Malmquist Index with Variable Relative Importance as a Function of Time in Different PERIOD and FDH Models of DEA

Ranking Decision Making Units with Negative and Positive Input and Output

Prediction of A CRS Frontier Function and A Transformation Function for A CCR DEA Using EMBEDED PCA

Appendix A Taylor Approximations and Definite Matrices

Evaluation of Dialyses Patients by use of Stochastic Data Envelopment Analysis

Searching the Efficient Frontier in Data Envelopment Analysis INTERIM REPORT. IR-97-79/October. Pekka Korhonen

IPRODUCTION FUNCTIONSU) TEXAS UNIV AT AUSIN CENTER FOR ICYBERNETI CAUDIES ACHURNESET AL MAY 83CCS-459

MULTI-COMPONENT RETURNS TO SCALE: A DEA-BASED APPROACH

Sensitivity and Stability Radius in Data Envelopment Analysis

A SECOND ORDER STOCHASTIC DOMINANCE PORTFOLIO EFFICIENCY MEASURE

Data Envelopment Analysis and its aplications

PRIORITIZATION METHOD FOR FRONTIER DMUs: A DISTANCE-BASED APPROACH

Modeling undesirable factors in efficiency evaluation

Groups performance ranking based on inefficiency sharing

Optimization Problems with Probabilistic Constraints

Department of Economics Working Paper Series

Machine Learning. Support Vector Machines. Fabio Vandin November 20, 2017

Equivalent Standard DEA Models to Provide Super-Efficiency Scores

Joint Use of Factor Analysis (FA) and Data Envelopment Analysis (DEA) for Ranking of Data Envelopment Analysis

Marginal values and returns to scale for nonparametric production frontiers

PRINCIPAL COMPONENT ANALYSIS TO RANKING TECHNICAL EFFICIENCIES THROUGH STOCHASTIC FRONTIER ANALYSIS AND DEA

A Fuzzy Data Envelopment Analysis Approach based on Parametric Programming

Data envelopment analysis

A Simple Characterization

Centre for Efficiency and Productivity Analysis

Darmstadt Discussion Papers in ECONOMICS

A reference-searching based algorithm for large-scale data envelopment. analysis computation. Wen-Chih Chen *

Optimal weights in DEA models with weight restrictions

Measuring the efficiency of assembled printed circuit boards with undesirable outputs using Two-stage Data Envelopment Analysis

Finding Closest Targets in Non-Oriented DEA Models: The Case of Convex and Non-Convex Technologies

A Survey of Some Recent Developments in Data Envelopment Analysis

NEGATIVE DATA IN DEA: A SIMPLE PROPORTIONAL DISTANCE FUNCTION APPROACH. Kristiaan Kerstens*, Ignace Van de Woestyne**

Chapter 11: Benchmarking and the Nonparametric Approach to Production Theory

THE PERFORMANCE OF INDUSTRIAL CLUSTERS IN INDIA

Włodzimierz Ogryczak. Warsaw University of Technology, ICCE ON ROBUST SOLUTIONS TO MULTI-OBJECTIVE LINEAR PROGRAMS. Introduction. Abstract.

Investigación Operativa. New Centralized Resource Allocation DEA Models under Constant Returns to Scale 1

Mathematics for Economics and Finance

CHAPTER 4 MEASURING CAPACITY UTILIZATION: THE NONPARAMETRIC APPROACH

INEFFICIENCY EVALUATION WITH AN ADDITIVE DEA MODEL UNDER IMPRECISE DATA, AN APPLICATION ON IAUK DEPARTMENTS

Indian Institute of Management Calcutta. Working Paper Series. WPS No. 787 September 2016

On Kusuoka Representation of Law Invariant Risk Measures

21. Solve the LP given in Exercise 19 using the big-m method discussed in Exercise 20.

36106 Managerial Decision Modeling Linear Decision Models: Part II

Scenario Grouping and Decomposition Algorithms for Chance-constrained Programs

Solving an optimization problem of a profit calculation taking into account fixed costs in Excel

CSCI5654 (Linear Programming, Fall 2013) Lectures Lectures 10,11 Slide# 1

Summary of the simplex method

LITERATURE REVIEW. Chapter 2

IE 5531 Midterm #2 Solutions

COMP 652: Machine Learning. Lecture 12. COMP Lecture 12 1 / 37

1/1 AD-A FOUNDATIONS OF DATA ENVELOPMENT ANALYSIS FOR PARETO-KOOPARNS EFFICIENT EM..(U) TEXAS UNIV AT AUSTIN

Stochastic Programming: From statistical data to optimal decisions

Special Cases in Linear Programming. H. R. Alvarez A., Ph. D. 1

A New Group Data Envelopment Analysis Method for Ranking Design Requirements in Quality Function Deployment

LECTURE NOTE #8 PROF. ALAN YUILLE. Can we find a linear classifier that separates the position and negative examples?

Lectures 6, 7 and part of 8

Computing Efficient Solutions of Nonconvex Multi-Objective Problems via Scalarization

Risk Aggregation with Dependence Uncertainty

Research Article A Data Envelopment Analysis Approach to Supply Chain Efficiency

A Rothschild-Stiglitz approach to Bayesian persuasion

A ratio-based method for ranking production units in profit efficiency measurement

Application of Rough Set Theory in Performance Analysis

Reformulation of chance constrained problems using penalty functions

1 The linear algebra of linear programs (March 15 and 22, 2015)

Integer programming: an introduction. Alessandro Astolfi

Cooperative bargaining: independence and monotonicity imply disagreement

Complete Closest-Target Based Directional FDH Measures of Efficiency in DEA

Inverse Optimization for Linear Fractional Programming

A Criterion for the Stochasticity of Matrices with Specified Order Relations

Department of Social Systems and Management. Discussion Paper Series

CHAPTER 3 THE METHOD OF DEA, DDF AND MLPI

Jeff Howbert Introduction to Machine Learning Winter

Stochastic geometric optimization with joint probabilistic constraints

A Rothschild-Stiglitz approach to Bayesian persuasion

Distributionally Robust Convex Optimization

Review of Methods for Increasing Discrimination in Data Envelopment Analysis

Lecture 3 January 28

Mixed Integer Linear Programming Formulations for Probabilistic Constraints

L0CI..'.'M496C099F0124N

A Parametric Simplex Algorithm for Linear Vector Optimization Problems

Mixed input and output orientations of Data Envelopment Analysis with Linear Fractional Programming and Least Distance Measures

A Stochastic-Oriented NLP Relaxation for Integer Programming

Transcription:

The International Scientific Conference INPROFORUM 2015, November 5-6, 2015, České Budějovice, 140-145, ISBN 978-80-7394-536-7. The Comparison of Stochastic and Deterministic DEA Models Michal Houda, Jana Klicnarová 30 Abstract: The idea of deterministic data envelopment analysis (DEA) models is well-known. If we start to study the stochastic DEA models, it seems at first glance, that the ideas of these two models are totally different. The aim of this paper is to explain that there is a really nice connection between the ideas of stochastic and deterministic models. Key words: Data Envelopment Analysis Deterministic DEA Stochastic DEA Efficiency JEL Classification: C44 C61 1 Introduction The idea of the piecewise-linear convex hull as a frontier estimation for effective units goes back to Farrell (1957) and it was consider by a few authors in following years. The main attention received this topic when Charnes, Cooper and Rhodes (1978) presented their paper, where the Data Envelopment Analysis (DEA) was introduced. Since then there were published a large number of papers and books which extended these results and DEA methodology see for example Coelli (2009). The basic idea of DEA is the comparison of some units which are characterized by several inputs and outputs. The main aim of this analysis is to identify so-called efficient units. More precisely, the aim is to estimate frontier function and to measure the efficiencies of units relative to this estimated frontier. The other possible point of view at this problem comes from the decision making theory. In fact, we have some units and we know there evaluation according to several criteria, some of them are cost type, some of them are benefit type. Our aim is to find such units (alternatives in the terms of multiple attribute decision making), which are not dominated and which are Pareto efficient. The solving of DEA problems leads to the problem of mathematical programming, in case of deterministic models to linear programming, in case of stochastic model to non-linear programming problems. The paper is organized as follows. In the section 2 we introduce the basic notation necessary for the setting of these problems. Section 3 introduces the deterministic DEA models and the stochastic DEA models are given in section 4. To show the connection between deterministic and stochastic models, we present the deterministic models in nonclassical way. We do not use a formulation based on the idea of optimization of efficiency which is widely used for deterministic models, but we use a dual approach an approach of dominated alternatives. This allows us to show a close connection between deterministic and stochastic formulation of the problem. 2 Methods To set up the models, let us introduce the following notation. By, we denote a -th decision making unit, where 1,, and is a number of units in question. Then, we write for input matrix where each column,, represents the input vector of -th decision unit. On the other hand, the rows of the matrix represent the values of individual inputs, that is,,, gives the values for the -th input of all units. In the same way, we use as the output matrix, and, the corresponding output vectors (column and row vectors). The key object in our investigation represents the production possibility set, denoted in short. It is the set of all possible/allowed combination of inputs and outputs. This set is given by the data, it varies from analysis to analysis. It is also important to remark that as mentioned above matrices and are not variables (as usual in optimization models) but they represent the data (i.e., inputs and outputs of analyzed units). The analysis of the efficiency of a unit is closely related to the definition of the dominance property. As we will see, its definition varies depending upon the production possibility set chosen. As a consequence, the efficiency of the unit 30 Mgr. Michal Houda, Ph.D, University of South Bohemia in České Budějovice, Faculty of Economics, Department of Applied Mathematics and Informatics, Studentská 13, CZ-37005 České Budějovice, e-mail: houda@ef.jcu.cz 30 RNDr. Jana Klicnarová, Ph.D, University of South Bohemia in České Budějovice, Faculty of Economics, Department of Applied Mathematics and Informatics, Studentská 13, CZ-37005 České Budějovice, e-mail: janaklic@ef.jcu.cz

The Comparison of Stochastic and Deterministic DEA Models 141 in question will be defined individually for each model considered and cannot be seen as a universal property of the unit. The unit which is considered for efficiency, will be denoted through the paper. 3 Discrete and Continuous Production Possibility Sets In this section, we describe traditional deterministic DEA models appearing in the literature. Apart from the usual economical formulations (based on comparing the input/output ratio), we concentrate on the characterization of these models based on production possibility sets, which later facilitates the transition to their stochastic versions. 3.1 Discrete Production Possibility Set Additive Model We start our investigation with the production possibility set composed from only the actually observed units, that is,,,,. (1) To characterize the efficiency of the units with respect to, we perform a simple pairwise comparison between units using the following definition of efficiency dominance. Definition 1 (Cooper, Huang, Li, 2004) We say that dominates with respect to if and with strict inequality holding for at least one of the components in the input or output vector. We say that is efficient with respect to if there is no unit dominating with respect to. Bowlin et al. (1984) formulated a so-called additive model with binary constraints as follows: maximize subject to for all inputs 1,,, for all outputs 1,,, 1, 0, 1,, 0. The decision variables, are the slack and surplus variables, respectively, for the inequalities,. (We will generally refer to them as slacks in the rest of the paper.) As there can be only one element of he binary variables equal to one (say -th, that is 1), this element identifies the unit (namely ) for pairwise comparison with, at each solution (iteration). Maximizing slacks ensures that, at an optimal solution, is the most dominant unit, with respect to. At optimum, if all slacks are zero, then does not dominate in sense of Definition 1, hence there is no unit dominating, henceforth is efficient. Figure 1 illustrates the procedure: the most dominating unit for is the unit. As the slacks and surpluses are nonzero in this case, is not efficient. The units,,, and do not have dominating units in sense of Definition 1, hence they are efficient. Figure 1 Example of the additive DEA model (2)

Michal Houda, Jana Klicnarová 142 Due to the presence of integrality constraints, this optimization problem unfortunately does not have its dual form, that is, with objective in a traditional input/output ratio form. Its continuous relaxation leads to the model which is described in the following section. 3.2 Continuous (Convex) Production Possibility Set BCC Model A continuous relaxation of (2), given originally by Banker, Cooper, Charnes (1984) and known as BCC model, supposes the production possibility set to be the convex hull of observed units. That is,,,,,,,, 1, 0. (3) To characterize the efficiency of the units with respect to, we easily extend Definition 1 to deal with the newly defined production possibility set. Definition 2 (Cooper, Huang, Li, 2004) Let,,, be two input-output pairs from the production possibility set. We say that, dominates, with respect to if and with strict inequality holding for at least one of the components in the input or output vector. We say that is efficient with respect to S if there is no pair, dominating with respect to. The output oriented BCC optimization model (Banker, Cooper, Charnes, 1978) is formulated as follows: maximize subject to for all inputs 1,,, for all outputs 1,,, 1, 0,, 0, unconstrained, where is a non-archimedean infinitesimal (a positive number smaller than any other positive number). The decision variables, are again the slack and surplus variables for the inequalities,. The additional decision variable represent the inefficiency factor of outputs of. If 1, then with, is sometimes called weakly efficient unit. Indeed, there could be more optimal solutions with 1 if we set 0; including the sum of slacks/surpluses into the objective, we select the most dominant among these solutions (c. f. the additive model), that is, with all slacks equal to zero. Non-Archimedean infinitesimal ensures that this sum of slacks and surpluses does not hurt the value of the objective function. We arrive at sufficient optimality conditions: Proposition 1 (Cooper, Huang, Li, 2004) If (a) 1, and (b) 0, where * represents the optimum of (4), then is (fully) efficient with respect to. Figure 2 demonstrates this situation: is represented by the shaded region, its frontier is piecewise-linear line connecting units,,, and. Points lying on the right-most part of the frontier (to the right of ) are solutions of (4) having 1, but the maximal slacks are not zero for these point there is a point (namely ) which has the same value of the output but with smaller inputs, that is, dominating our considered points in sense of Definition 2. Figure 2 Example of the convex output oriented (BCC) DEA model (4)

The Comparison of Stochastic and Deterministic DEA Models 143 For any inefficient unit, the non-zero elements of the variable identify a point on the efficient frontier found as a convex combination of so-called peer units. Point on Figure 2 is not efficient (with 1): it is its only peer unit which has the optimal outputs with the same level of inputs. The dual form of (4) is formulated by minimize subject to for all units 1,,, 1 (dual constraint for ),,, unconstrained, (5) which is the traditional formulation of BCC model (minimizing the input/output ratio). Non-Archimedean infinitesimal ensures that all inputs and outputs are considered in the analysis. The variable, dual for the convexity constraint 1, is known as the variable return to scale factor. 3.3 Continuous (Linear) Production Possibility Set CCR Model Further relaxation of (4), removing the convexity constraint 1, results in the model given originally by Charnes, Cooper, Rhodes (1978) and known as CCR model. The production possibility set in this case is the convex cone containing observed units:,,,,,,, 0. (6) To characterize the efficiency of the units with respect to, we can use Definition 2, just replacing with. The output oriented CCR optimization model (Charnes, Cooper, Rhodes, 1984) is formulated as follows: maximize subject to for all inputs 1,,, for all outputs 1,,, 0,, 0, unconstrained, (7) The interpretations of the model and optimality conditions are analogous to the convex case. The situation is demonstrated on Figure 3; you can notice the production possibility set in conic form. Figure 3 Example of the linear output-oriented CCR DEA model The dual form of (7) is formulated by minimize subject to for all units 1,,, 1 (dual constraint for ),,. (8)

Michal Houda, Jana Klicnarová 144 The missing variable (i. e., 0) is manifestation of the property of constant return to scale. 4 Stochastic Extensions to Production Possibility Sets The data represented by the matrices, are usually not deterministic. We will now consider the inputs and outputs to be random variables. Hence, the matrices, and corresponding are also random. To deal with resulting random constraints, we also define a (sufficiently small) tolerance or risk level 0;1. It is worth to note here that the notion of stochastic efficiency is not uniquely determined but adapted to a particular situation. 4.1 Stochastic additive model In the case of discrete production possibility set defined by (1). Definition 3 (Cooper et al., 1988) We say that is not stochastically dominates with respect to if for each 0, 1, 1 we have P,. (9) The stochastic extension of the additive 0 1 model reads simply as : maxp,. (10) It is now easy to see the optimality condition for stochastic efficiency in this case: Proposition 2 (Cooper, Huang, Li, 2004) is -stochastically efficient with respect to if and only if. The concept of stochastic efficiency is illustrated on Figure 4. Conclusions about efficiency must be made with respect to uncertain position of the input-output points: for example, units and can exchange their positions so that is dominating for a particular instance. In stochastic terms, both and must be indicated as efficient if the probability of such event is sufficiently high. The only remaining non-efficient points are the units and for which this probability is small. Figure 4 Example of the stochastic additive DEA model 4.2 Stochastic continuous model Continuous extension to additive stochastic dominance is straightforward: Definition 4 (Cooper et al., 1988) We say that is -stochastically efficient with respect to S if for each 0 with 1 we have P,. (11) Applying necessary and sufficient conditions for stochastic dominance (see Cooper et al., 1988), the stochastic extension to BCC takes the following form known as almost 100% confidence chance-constrained problem

The Comparison of Stochastic and Deterministic DEA Models 145 maxp 0 subject to P 1 for all inputs 1,,, P 1 for all outputs 1,,, 1, 0,, 0. (12) Proposition 3 (Cooper et al., 1988) If is -stochastically efficient, then. If then is not stochastically efficient. 4.3 Marginal chance-constrained model Relaxing the almost 100% (namely 1 confidence, Cooper et al. (2002, 2003) introduced a marginal chanceconstrained optimization model for BCC of the form maximize subject to P 1 for all inputs 1,,, P 1 for all outputs 1,,, 1, 0,, 0. Problem (13) implies the following definition of marginal stochastic efficiency: (13) Definition 5 (Cooper et al., 2002) We say that is marginally -stochastically efficient if (a) 1, and (b) for all (alternate) optimal solutions, where * represents the optimum of (13). Problem (13) is a special case of the standard individual chance-constrained problem. If the constraint rows follow normal distribution, (13) reduces to a quadratic optimization problem (involving normal quantiles Φ ). We refer to Prékopa (2003) to a survey of solution methods for such problems; see also Cheng, Houda, Lisser (2015) for a recent contribution (involving dependency properties and second-order cone programming methods). 5 Conclusions In the paper, we presented several types of deterministic and stochastic models. The non-classical way of the construction of deterministic models based on domination of units allows us to show a close connection between deterministic and stochastic models. We also presented that these deterministic models coincide with classical ones, because they are in fact their dual forms. References Banker, R. D., Charnes, A., & Cooper, W. W. (1984). Some models for estimating technical and scale inefficiencies in data envelopment analysis. Management science, 30(9), 1078 1092. Bowlin, W. F., Brennan, J., Cooper, W. W. & Sueyoshi, T. (1984). DEA Models for Evaluating Efficiency Dominance. Research Report, The University of Texas, Center for Cybernetic Studies, Austin, Texas. Charnes, A., Cooper, W. W. & Rhodes, E. (1978). Measuring the efficiency of decision making units. European Journal of Operational research, 2(6), 429 444. Cheng, J., Houda, M. & Lisser, A (2015). Chance constrained 0 1 quadratic programs using copulas. Optimization Letters, 9(7), 1283 1295. Coelli, T.J. et al. (2009). An Introduction to Efficiency and Productivity Analysis. 2 nd ed. Springer. Cooper, W.W., Deng, H., Huang, Z. & Li, S.X. (2002). Chance constrained programming approaches to technical efficiencies and inefficiencies in stochastic data envelopment analysis. Journal of the Operational Research Society, 53(12), 1347 1356. Cooper, W.W., Deng, H., Huang, Z. & Li, S.X. (2004). Chance constrained programming approaches to congestion in stochastic data envelopment analysis. European Journal of Operational Research, 155(2), 487 501. Cooper, W. W., Huang, Z. M., Lelas, V., Lix, S. X. & Olesen, O. B. (1998). Chance Constrained Programming Formulations for Stochastic Characterizations of Efficiency and Dominance in DEA. Journal of Productivity Analysis, 9(1), 53 79. Cooper, W. W., Huang, Z. M. & Li, S. X. (2004). Chance Constrained DEA. In W. W. Cooper, L. M. Seiford & J. Zhou (Eds.), Handbook on Data Envelopment Analysis (pp. 229 264). Boston: Kluwer. Copper, W. W., Seiford, L. M. & Tone, K. (2007). Data Envelopment Analysis. A Comprehensive Text with Models, Application, References and DEA-Solver Software, 2nd ed. New York: Springer. Farell, M. J. (1957). The Measurement of Productive Efficiency. Journal of the Royal Statistical Society. Series A (General), 120(3), 253 290. Prékopa, A. (2003). Probabilistic programming. In A. Ruszczyński & A. Shapiro (Eds.), Stochastic Programming. Handbooks in Operations Research and Management Science, Vol. 10. Amsterdam: Elsevier, 267 352.