ABSTRACT INTRODUCTION

Similar documents
Chance Constrained Data Envelopment Analysis The Productive Efficiency of Units with Stochastic Outputs

MULTI-COMPONENT RETURNS TO SCALE: A DEA-BASED APPROACH

Joint Use of Factor Analysis (FA) and Data Envelopment Analysis (DEA) for Ranking of Data Envelopment Analysis

Equivalent Standard DEA Models to Provide Super-Efficiency Scores

Data Envelopment Analysis and its aplications

A DIMENSIONAL DECOMPOSITION APPROACH TO IDENTIFYING EFFICIENT UNITS IN LARGE-SCALE DEA MODELS

AN IMPROVED APPROACH FOR MEASUREMENT EFFICIENCY OF DEA AND ITS STABILITY USING LOCAL VARIATIONS

USING LEXICOGRAPHIC PARAMETRIC PROGRAMMING FOR IDENTIFYING EFFICIENT UNITS IN DEA

Data envelopment analysis

Determination of Economic Optimal Strategy for Increment of the Electricity Supply Industry in Iran by DEA

Data Envelopment Analysis within Evaluation of the Efficiency of Firm Productivity

Further discussion on linear production functions and DEA

Special Cases in Linear Programming. H. R. Alvarez A., Ph. D. 1

Prediction of A CRS Frontier Function and A Transformation Function for A CCR DEA Using EMBEDED PCA

Ranking Decision Making Units with Negative and Positive Input and Output

Classifying inputs and outputs in data envelopment analysis

A Simple Characterization

The Comparison of Stochastic and Deterministic DEA Models

Identifying Efficient Units in Large-Scale Dea Models

Searching the Efficient Frontier in Data Envelopment Analysis INTERIM REPORT. IR-97-79/October. Pekka Korhonen

Finding the strong defining hyperplanes of production possibility set with constant returns to scale using the linear independent vectors

Department of Economics Working Paper Series

PRIORITIZATION METHOD FOR FRONTIER DMUs: A DISTANCE-BASED APPROACH

Subhash C Ray Department of Economics University of Connecticut Storrs CT

Indian Institute of Management Calcutta. Working Paper Series. WPS No. 787 September 2016

CHAPTER 3 THE METHOD OF DEA, DDF AND MLPI

A Slacks-base Measure of Super-efficiency for Dea with Negative Data

Sensitivity and Stability Radius in Data Envelopment Analysis

Selective Measures under Constant and Variable Returns-To- Scale Assumptions

Measuring the efficiency of assembled printed circuit boards with undesirable outputs using Two-stage Data Envelopment Analysis

European Journal of Operational Research

Sensitivity and Stability Analysis in Uncertain Data Envelopment (DEA)

Mark your answers ON THE EXAM ITSELF. If you are not sure of your answer you may wish to provide a brief explanation.

Modeling undesirable factors in efficiency evaluation

Symmetric Error Structure in Stochastic DEA

A ratio-based method for ranking production units in profit efficiency measurement

The Uniqueness of the Overall Assurance Interval for Epsilon in DEA Models by the Direction Method

PRINCIPAL COMPONENT ANALYSIS TO RANKING TECHNICAL EFFICIENCIES THROUGH STOCHASTIC FRONTIER ANALYSIS AND DEA

A METHOD FOR SOLVING 0-1 MULTIPLE OBJECTIVE LINEAR PROGRAMMING PROBLEM USING DEA

Chapter 2 Output Input Ratio Efficiency Measures

MEASURING THE EFFICIENCY OF WORLD VISION GHANA S EDUCATIONAL PROJECTS USING DATA ENVELOPMENT ANALYSIS

Research Article A Data Envelopment Analysis Approach to Supply Chain Efficiency

CENTER FOR CYBERNETIC STUDIES

NEGATIVE DATA IN DEA: A SIMPLE PROPORTIONAL DISTANCE FUNCTION APPROACH. Kristiaan Kerstens*, Ignace Van de Woestyne**

Mixed input and output orientations of Data Envelopment Analysis with Linear Fractional Programming and Least Distance Measures

INEFFICIENCY EVALUATION WITH AN ADDITIVE DEA MODEL UNDER IMPRECISE DATA, AN APPLICATION ON IAUK DEPARTMENTS

Measuring Geographic Access to Primary Care Physicians

A reference-searching based algorithm for large-scale data envelopment. analysis computation. Wen-Chih Chen *

Evaluation of Dialyses Patients by use of Stochastic Data Envelopment Analysis

Was the bell system a natural monopoly? An application of data envelopment analysis

THE PERFORMANCE OF INDUSTRIAL CLUSTERS IN INDIA

Revenue Malmquist Index with Variable Relative Importance as a Function of Time in Different PERIOD and FDH Models of DEA

CHAPTER 4 MEASURING CAPACITY UTILIZATION: THE NONPARAMETRIC APPROACH

Department of Social Systems and Management. Discussion Paper Series

A DEA- COMPROMISE PROGRAMMING MODEL FOR COMPREHENSIVE RANKING

A Fuzzy Data Envelopment Analysis Approach based on Parametric Programming

Application of Rough Set Theory in Performance Analysis

LITERATURE REVIEW. Chapter 2

Dummies and Interactions

Statistics in medicine

EFFICIENCY ANALYSIS UNDER CONSIDERATION OF SATISFICING LEVELS

Bivariate Relationships Between Variables

Review of ranking methods in the data envelopment analysis context

A buyer - seller game model for selection and negotiation of purchasing bids on interval data

Microeconomic Theory. Microeconomic Theory. Everyday Economics. The Course:

STA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis. 1. Indicate whether each of the following is true (T) or false (F).

Partial Input to Output Impacts in DEA: Production Considerations and Resource Sharing among Business Subunits

An Optimization Methodology (DEA Analysis) : A Case Study in Similipal

Predictive Modeling Using Logistic Regression Step-by-Step Instructions

Chapter 2 Network DEA Pitfalls: Divisional Efficiency and Frontier Projection

Finding Relationships Among Variables

Regression Analysis. BUS 735: Business Decision Making and Research. Learn how to detect relationships between ordinal and categorical variables.

Joint Variable Selection for Data Envelopment Analysis via Group Sparsity

Data envelopment analysis and its key variants utilized in the transport sector

Support Vector Machines

PRIORITIZATION METHOD FOR FRONTIER DMUs: A DISTANCE-BASED APPROACH

ST3241 Categorical Data Analysis I Two-way Contingency Tables. 2 2 Tables, Relative Risks and Odds Ratios

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007)

ESTIMATING ELASTICITIES OF INPUT SUBSTITUTION USING DATA ENVELOPMENT ANALYSIS NOAH JAMES MILLER

A Novel Sparse Orthogonal Matrix Construction over the Fields of Characteristic 2

STAC51: Categorical data Analysis

Logistic Regression Models for Multinomial and Ordinal Outcomes

Chapter 2: Describing Contingency Tables - I

Describing Contingency tables

A Parametric Simplex Algorithm for Linear Vector Optimization Problems

An Introduction to Linear Programming

Data Envelopment Analysis with metaheuristics

Logistic Regression: Regression with a Binary Dependent Variable

Part 1. The Review of Linear Programming

A New Group Data Envelopment Analysis Method for Ranking Design Requirements in Quality Function Deployment

Two-stage network DEA: when intermediate measures can be treated as outputs from the second stage

2.4. Conditional Probability

Sensitivity and Stability Analysis in DEA on Interval Data by Using MOLP Methods

7. Assumes that there is little or no multicollinearity (however, SPSS will not assess this in the [binary] Logistic Regression procedure).

Estimating most productive scale size using data envelopment analysis

Machine Learning Basics Lecture 7: Multiclass Classification. Princeton University COS 495 Instructor: Yingyu Liang

ij i j m ij n ij m ij n i j Suppose we denote the row variable by X and the column variable by Y ; We can then re-write the above expression as

A Note on the Scale Efficiency Test of Simar and Wilson

Chapte The McGraw-Hill Companies, Inc. All rights reserved.

A Survey of Some Recent Developments in Data Envelopment Analysis

Transcription:

Implementation of A Log-Linear Poisson Regression Model to Estimate the Odds of Being Technically Efficient in DEA Setting: The Case of Hospitals in Oman By Parakramaweera Sunil Dharmapala Dept. of Operations Management & Business Statistics College of Commerce & Economics, Sultan Qaboos University Sultanate of Oman ABSTRACT In this paper, we propose a methodology that comprises Data Envelopment Analysis (DEA) and log-linear Poisson regression to estimate the odds of being technically efficient. Here we evaluate technical efficiency of 45 hospitals in the Sultanate of Oman, using both Charnes-Cooper-Rhodes (CCR) and Banker-Charnes-Cooper (BCC) DEA models with 4 inputs and 2 outputs. Then we compare these efficiency evaluations by estimating the odds of a hospital being technically efficient, using a log-linear regression model under independent Poisson sampling. INTRODUCTION DEA has been used as a tool to evaluate technical efficiency in various studies, but its use in health care services has been reported only a few times (Tavares, 2002). One study reported in 1994, by Jon A. Chiligerian, discussed an important application of DEA to evaluate the physicians efficiency in hospitals. Although it inspired the present study of this paper, the setting and the objective of Chiligerian s study differ very much from ours, and here we evaluate the efficiency of hospitals. RESEARCH METHODOLOGY In the first stage of our analysis, we use BCC and CCR models to compute DEA-efficiency scores of the 45 hospitals in the year 2000. The inputs considered are, recurrent expenditure, monetary value of drugs and equipment, bed capacity, total staff, and the outputs are, total length of stay (LOS) and live discharges. In the second stage, we form contingency tables for cross-classification of hospitals, where the qualitative variables are DEA-efficiency (CCR/BCC) and category of hospitals. The hospitals are divided into 3 categories: Local, County, and Regional. These cross-classifications show the count data under independent Poisson sampling. Next, we formulate the log-linear regression model and estimate the expected cell-counts for the efficient/inefficient hospitals of the 3 categories. Finally, we establish the relationship between the expected cell-count and the odds of a hospital being efficient and compute the latter using the regression coefficient estimates.

PRELIMINARIES OF DATA ENVELOPMENT ANALYSIS DEA is a mathematical programming methodology that provides non-parametric measures of optimal relative efficiency. It identifies the Decision Making Units (DMUs) on the efficient productivity frontier as efficient firms and DMUs that are interior to that frontier as inefficient firms. Many outputs and inputs can be analyzed simultaneously for an arbitrary number of observations. Relative efficiency measurements are computed DMU-by-DMU across all of the DMUs under consideration, for the same inputs and outputs of data. DMU c denotes the selected DMU for comparison. The input/output data entries must be non-negative, with zero entries allowed. (Charnes et al., 1978) A DEA data domain consists of n DMUs. The selected DMU c (c = 1, 2,, n) is characterized by an input vector X c and an output vector Y c. U, the output multiplier, V, the input multiplier, and λ, a real vector, are unknowns that need to be determined by solving the respective linear programming (LP) model stated below. (All vectors are column vectors, and [...] T stands for transpose). As a modification to the original input-oriented CCR (Charnes et al., 1978) and BCC (Banker et al., 1985) models, Thompson et al. (1993) introduced the following LP formulations (Archimedean form) of the CCR and BCC model for DMU c. CCR Model (Dual) Max z = U T Y c s.t. V T X c = 1 (2) U T Y - V T X 0 U 0, V 0 BCC Model (Dual) Max z = U T Y c + u * s.t. V T X c = 1 (3) U T Y - V T X + u * I 0 u * unrestricted U 0, V 0 The efficiency value of DMU c in CCR or BCC model is denoted by the optimal value θ c *. A DMU c with θ c * = 1 is said to be scale efficient. The class of scale efficient DMUs can be partitioned into 3 sub-classes; (1) DEA-extreme-efficient DMUs in class E, which are at the vertices on the frontier, (2) DEA-non-extreme-efficient DMUs in class E', which are on the frontier between vertices, and (3) DEA-inefficient DMUs in class F which are on the extended frontier. DMU c with 0 < θ c * < 1 is said to be scale inefficient and in class N (Charnes et al. 1991).

Scale efficient classes E and E' are also called 'technically efficient'. Class F is scale efficient but not technically efficient because the optimal slacks are present, and class N is both scale and technically inefficient. All these classes are mutually exclusive. In our terminology, E Υ E' form the 'technically efficient' class, and F Υ N form the technically inefficient class. LOG-LINEAR MODEL In a cross-classification of Poisson count data for independent qualitative variables, X and Y, with r rows and c columns respectively, the expected cell-count M i,j is given by the following regression model: Ln (M ij ) = β 0 + α i X i + β j Y j ; i = 1, 2,, r; j = 1, 2,,c. (4) (Everitt & Dunn, 1991) Here α i and β j represent the effects of row categories of X and column categories of Y, respectively. X i and Y j are dummy variables for row-i and column-j. The parameters to be estimated are, α 1, α 2, β 0, β 1, β 2, and β 3. Note: There are no interaction terms of X i and Y j in (4), by our assumption that the variables are independent. That means (4) considers only the main effects. In our contingency table, X = Efficiency of CCR (BCC) model; 1 = Efficient, 2 = Inefficient Y = Hospital category; 1 = Regional, 2 = County, 3 = Local N ij = Number of hospitals in cell (i, j); i = 1, 2, and j = 1, 2, 3 Regional County Local Efficient N 1,1 N 1,2 N 1,3 Infficient N 2,1 N 2,2 N 2,3 Then model (4) becomes, Ln (M ij ) = β 0 + α 1 X 1 + α 2 X 2 + β 1 Y 1 + β 2 Y 2 + β 3 Y 3 (5) RESULTS (I) DEA computations of CCR and BCC efficiency scores provided the following contingency tables. CCR Regional County Local Efficient 2 1 3 Inefficient 11 5 23

BCC Regional County Local Efficient 6 1 7 Inefficient 7 5 19 From the above tables, we make one important observation. The efficiency of 6 county hospitals did not change under the two models; 4 regional hospitals that were inefficient under CCR model became efficient under the BCC model, and 4 local hospitals that were inefficient under CCR model became efficient under the BCC model. The Chi-squared independence test performed on both tables confirmed the independence of variables, Efficiency and Hospital Category. (II) Log-Linear regressions run in SPSS on both tables provided the following design matrix with parameters α 1 and β 3 set to zero. Model (5) becomes, Ln (M ij ) = β 0 + α 2 X 2 + β 1 Y 1 + β 2 Y 2 (6) Efficy, Hospital Constant X 2 Y 1 Y 2 1, 1 1 0 1 0 2, 1 1 1 1 0 1, 2 1 0 0 1 2, 2 1 1 0 1 1, 3 1 0 0 0 2, 3 1 1 0 0 [1] For CCR, we obtained the following estimates: b 0 = 1.2432, a 2 = 1.8718, b 1 = -0.6931, b 2 = -1.4663 Estimated model: Ln (M ij ) = 1.2432 + 1.8718 X 2 0.6931 Y 1 1.4663 Y 2 M ij = exp(1.2432 + 1.8718 X 2 0.6931 Y 1 1.4663 Y 2 ); i = 1, 2; j = 1, 2, 3 (7) [2] For BCC, we obtained the following estimates b 0 = 2.0905, a 2 = 0.7949, b 1 = -0.6931, b 2 = -1.4663

Estimated model: Ln(m ij ) = 2.0905 + 0.7949 X 2 0.6931 Y 1 1.4663 Y 2 M ij = exp(2.0905 + 0.7949 X 2 0.6931 Y 1 1.4663 Y 2 ); i = 1, 2; j = 1, 2, 3 (8) (III) Computation of the Odds of being Efficient Variable X being dichotomous, we use M ij to compute the Odds. Expected count M ij = N P(X = i,y = j) = N P(X = i, * ) P(*, Y = j) ; i = 1, 2; j = 1, 2, 3. Here the * indicates any row/column, and N is the total counts. Then, M 1j = N P(X =1,*) P(*, Y = j), and M 2j = N P(X =2, * ) P(*, Y = j). Let, p = P(X =1,*) = P( Hospital of any category is Efficient). Then, P(X =2, * ) = 1- P(X =1, *) = 1- p, and M 1j / M 2j = p /(1-p) = Odds; j = 1, 2, 3 With our design matrix and the estimated models, we can compute the Odds for each of the 2 models. [1] For CCR, M ij = exp(1.2432 + 1.8718 X 2 0.6931 Y 1 1.4663 Y 2 ); j = 1, 2, 3 For example, j =1; Y 1 =1, Y 2 =0. Then, p /(1-p) = M 11 / M 21 = exp(1.2432 0.6931) / exp(1.2432 + 1.8718 0.6931) = exp(-1.8718) = 0.1538 We obtain the same value for other 2 hospital categories. So the odds of a hospital being efficient than being inefficient is 0.1538 Note: Log(Odds) = Ln [p/(1-p)] = -1.8718 = -a 2 [2] For BCC, M ij = exp(2.0905 + 0.7949 X 2 0.6931 Y 1 1.4663 Y 2 ); i = 1, 2; j = 1, 2, 3 For example, j =1; Y 1 =1, Y 2 =0. Then, p /(1-p) = M 11 / M 21 = exp(2.0905 0.6931) / exp(1.2432 + 0.7949 0.6931) = exp(-0.7949) = 0.4516 We obtain the same value for other 2 hospital categories. So the odds of a hospital being efficient than being inefficient is 0.4516

CONCLUSION Summarizing the above results, we can say the following. The expected number of efficient/inefficient hospitals can be computed using model (7). Log of Odds is equal to the negative coefficient of the dummy variable X 2 (Inefficient). This means, Odds of being efficient are the same for all hospital categories, which is a direct implication of independence of the variables. The Odds under BCC model are higher than those under CCR model by a ratio of 0.4516 to 0.1538. The 6 county hospitals display constant returns-to-scale where as 4 regional hospitals and 4 local hospitals that became efficient under the BCC model, display variable returns-to-scale. This increase in efficiency by 8 hospitals may have contributed to the increase in the Odds under the BCC model. FUTURE RESEARCH The log-linear model (4) is based on independence of the variables X and Y. If they are dependent, then the interaction terms are present and γ ij represent their effect in the model. Ln (M ij ) = β 0 + α i X i + β j Y j + γ ij X i Y j ; i = 1, 2,, r; j = 1, 2,,c. (9) In the absence of independence, the formulation in (III) for p/(1-p) becomes invalid, and the following Linear-Logistic model could be used to estimate the Odds. Ln [p/(1-p)] = β 0 + α i X i + β j Y j + γ ij X i Y j ; i = 1, 2,, r; j = 1, 2,,c. (10) REFERENCES Banker, R. D., A. Charnes and W. W. Cooper, Models for estimating technical and scale efficiencies in DEA, Management Science, vol. 30(1984), pp. 1078-1092 Charnes A., W. W. Cooper and E. Rhodes, Measuring efficiency of decision making units, European Journal of Operational Research, vol.2 (1978), pp. 429-444 Charnes A., W. W. Cooper and R. M. Thrall, A structure for classifying and characterizing efficiency and inefficiency in Data Envelopment Analysis, Journal of Productivity Analysis, vol. 2 (1991), pp. 197-237 Chilingerian, Jon A., Exploring why some physicians hospital practices are more efficient: Taking DEA inside the hospital, Data Envelopment Analysis: Theory, Methodology and Applications, KluwerAcademic Publishers, (1994), pp. 167-193 Everitt, B.S., Graham Dunn, Applied Multivariate Data Analysis, J. Wiley, (1991) Tavares, Gabriel, (2002), "A bibliography of data envelopment analysis (1978-2001)", Rutcor Research Report (RRR) 01-2002, RUTCOR, Rutgers University. Thompson, R. G., P.S. Dharmapala and R.M. Thrall, Importance for DEA of zeros data and multipliers, Journal of Productivity Analysis, vol. 4 (1993), pp. 337-348