ANALELE ŞTIINŢIFICE ALE UNIVERSITĂŢII ALEXANDRU IOAN CUZA DIN IAŞI Tomul LII/LIII Ştiinţe Economice 2005/2006

Similar documents
DISCRIMINANT ANALYSIS IN THE STUDY OF ROMANIAN REGIONAL ECONOMIC DEVELOPMENT

Regional economic disparities in Romania

Section III: Poverty Mapping Results

A regional perspective on the spatial concentration in Romania s international trade in 2011

Identification of vulnerable areas in municipal corporation of Greater Mumbai due to extreme events based on socio economic indicators

Poverty statistics in Mongolia

BUILDING SOUND AND COMPARABLE METRICS FOR SDGS: THE CONTRIBUTION OF THE OECD DATA AND TOOLS FOR CITIES AND REGIONS

Applying cluster analysis to 2011 Census local authority data

Available online at Analele Stiintifice ale Universitatii Al. I. Cuza din Iasi Seria Geologie 58 (1) (2012) 53 58

UNCTAD Single-year Expert Meeting on Tourism s Contribution to Sustainable Development

Understanding China Census Data with GIS By Shuming Bao and Susan Haynie China Data Center, University of Michigan

Tracey Farrigan Research Geographer USDA-Economic Research Service

THE IMPACT OF COMPENSATION PAYMENTS ON EMPLOYMENT, IN REGIONAL STRUCTURES

National Spatial Development Perspective (NSDP) Policy Coordination and Advisory Service

CHANGES IN THE STRUCTURE OF POPULATION AND HOUSING FUND BETWEEN TWO CENSUSES 1 - South Muntenia Development Region

How rural the EU RDP is? An analysis through spatial funds allocation

Egypt Public DSS. the right of access to information. Mohamed Ramadan, Ph.D. [R&D Advisor to the president of CAPMAS]

Poverty assessment me-thodology.

16540/14 EE/cm 1 DG E 1A

Building the Sustainable Network of Settlements on the Caspian Sea Region of Kazakhstan

THE TOURISTIC ACTIVITY OF THE NORTH-EAST REGION IN THE PERIOD

Linking Industry and Occupation Clusters in Regional Economic Development

Improving rural statistics. Defining rural territories and key indicators of rural development

Indicator: Proportion of the rural population who live within 2 km of an all-season road

Educational structure and the structure of economic activities as one of the key factors explaining the differences in development in Slovenia

Households or locations? Cities, catchment areas and prosperity in India

Trip Generation Model Development for Albany

GIS in Locating and Explaining Conflict Hotspots in Nepal

National SDG Data Hubs: Statistical integration and geographic disaggregation (Philippines)

Summary prepared by Amie Gaye: UNDP Human Development Report Office

General considerations of regional policy

AN ECONOMETRICAL MODEL FOR CALCULATING THE ROMANIAN GROSS DOMESTIC PRODUCT

A Comprehensive Method for Identifying Optimal Areas for Supermarket Development. TRF Policy Solutions April 28, 2011

Measuring Poverty. Introduction

CountrySTAT REGIONAL BASIC ADMINISTRATOR TRAINING for GCC MEMBER STATES. CountrySTAT STATISTICS COMPONENT

Measuring Regional Development with RIV Models: A Case Study for Romania and the European Union

CRP 608 Winter 10 Class presentation February 04, Senior Research Associate Kirwan Institute for the Study of Race and Ethnicity

International Journal of Computing and Business Research (IJCBR) ISSN (Online) : APPLICATION OF GIS IN HEALTHCARE MANAGEMENT

A User s Guide to the Federal Statistical Research Data Centers

National Statistics 2001 Area Classifications

Measuring Disaster Risk for Urban areas in Asia-Pacific

Application of Geographic Information Systems for Government School Sites Selection

The Index of Human Insecurity

Johns Hopkins University Fall APPLIED ECONOMICS Regional Economics

Topic 4: Changing cities

Set-up of subnational statistical systems Statistical Development Strategy for the State of Jalisco México, 2016

Compact guides GISCO. Geographic information system of the Commission

The World Bank Decentralized Community Driven Services Project (P117764)

THE ROLE OF GEOSPATIAL AT THE WORLD BANK

Introduction to Development. Indicators and Models

Analysis of Gross Domestic Product Evolution under the Influence of the Final Consumption

MPI and Multidimensional Poverty Analysis What we knew before joining West Bank and Gaza Poverty Assessment team. Nobuo Yoshida April 21, 2011

CHALLENGES IN CONDUCTING SURVEYS/CENSUSES

1 Introduction. Station Type No. Synoptic/GTS 17 Principal 172 Ordinary 546 Precipitation

By Lillian Ntshwarisang Department of Meteorological Services Phone:

Neighborhood social characteristics and chronic disease outcomes: does the geographic scale of neighborhood matter? Malia Jones

Methodological issues in the development of accessibility measures to services: challenges and possible solutions in the Canadian context

SHORT ANALYSIS OF REGIONAL DISPARITIES ÎN ROMÂNIA

USING GMDH IN ECOLOGICAL AND SOCIO-ECONOMICAL MONITORING PROBLEMS

GIS FOR MAZOWSZE REGION - GENERAL OUTLINE

Tackling urban sprawl: towards a compact model of cities? David Ludlow University of the West of England (UWE) 19 June 2014

Spatial Analysis of Public Services (schools) in Nablus City Using the Tool of Geographic Information System (GIS)

Modeling the economic growth of Arctic regions in Russia

UNIVERSITY OF NAIROBI COLLEGE OF HUMANITIES AND SOCIAL SCIENCES FACULTY OF ARTS & SCHOOL OF PHYSICAL SCIENCES

DEZVOLTAREA REGIONALĂ PRIN TURISM

Classification methods

Socio-Economic and Ecological Indicators of the Metropolitan Area of Bucharest

UNIVERSITY OF KWAZULU-NATAL EXAMINATION: JUNE 2012 Howard College and Westville Campuses

The ESPON Programme. Goals Main Results Future

Spatial Disparities and Development Policy in the Philippines

Disaggregation according to geographic location the need and the challenges

Joanne N. Halls, PhD Dept. of Geography & Geology David Kirk Information Technology Services

Creating a Geodemographic Classification

NÁRODOHOSPODÁŘSKÝ OBZOR REVIEW OF ECONOMIC PERSPECTIVES, VOL. 10, ISSUE 4, 2010, pp , DOI: /v

THE IMPACT OF ICT GROWTH ON HOUSEHOLDS AND MUNICIPALITIES IN THE CZECH NUTS-3 REGIONS: THE APPLICATION OF CLUSTER ANALYSIS

A Case Study of Regional Dynamics of China 中国区域动态案例研究

R E SEARCH HIGHLIGHTS

Development of modal split modeling for Chennai

Geospatial Information for Urban Sprawl Planning and Policies Implementation in Developing Country s NCR Region: A Study of NOIDA City, India

Accessibility, rurality, remoteness an investigation on the Island of Sardinia, Italy

THE ROLE OF RURAL TOURISM FOR POVERTY ALLEVIATION AND RURAL COMMUNITY DEVELOPMENT

ECONOMETRIC ANALYSIS OF THE COMPANY ON STOCK EXCHANGE

Targeting the Poor. Towards evidence-based implementation. Johan A. Mistiaen (World Bank)

BOOK REVIEWS. ESPON Atlas: Territorial Dimensions of the Europe 2020 Strategy. ESPON 2013 Programme, Luxembourg, 2013, 66 p. ISBN

URBAN CHANGE DETECTION OF LAHORE (PAKISTAN) USING A TIME SERIES OF SATELLITE IMAGES SINCE 1972

Social Vulnerability Index. Susan L. Cutter Department of Geography, University of South Carolina

AAG CENTER FOR GLOBAL GEOGRAPHY EDUCATION Internationalizing the Teaching and Learning of Geography

Multidimensional Poverty in Colombia: Identifying Regional Disparities using GIS and Population Census Data (2005)

Introducing the WISERD Geoportal. WISERD DATA TEAM Dr Robert Berry & Dr Richard Fry, University of Glamorgan Dr Scott Orford, Cardiff University

Strategic Regional Planning and Regional Employment in Greece: A Clustering Analysis Approach

Multilevel modeling and panel data analysis in educational research (Case study: National examination data senior high school in West Java)

Poverty and Inclusion in the West Bank and Gaza. Tara Vishwanath and Roy Van der Weide

CONSTRUCTING THE POVERTY AND OPPORTUNITIES/PUBLIC SERVICES MAPS INFORMATION MANAGEMENT. Background: Brazil Without Extreme Poverty Plan

Difference in regional productivity and unbalance in regional growth

How proximity to a city influences the performance of rural regions by Lewis Dijkstra and Hugo Poelman

Innovation and Regional Growth in the European Union

Analysis of the Impact of Education Poverty Alleviation on Economic Growth in Shaanxi Province

A Visualization Model Based on Adjacency Data

EUROPEAN INVESTMENT BANK

MICRO-SCALE GEOSTATISTICAL ANALYSIS OF THE LEVEL OF DEVELOPMENT. CASE STUDY: MOUNTAINOUS AND SUB- CARPATHIAN AREA OF IALOMIŢA HYDROGRAPHIC BASIN

Transcription:

ANALELE ŞTIINŢIFICE ALE UNIVERSITĂŢII ALEXANDRU IOAN CUZA DIN IAŞI Tomul LII/LIII Ştiinţe Economice 2005/2006 POVERTY MAPPING OF ROMANIAN COUNTIES USING CLUSTER ANALYSIS ALINA MĂRIUCA IONESCU Abstract Poverty mapping plays a very important role in giving a visual representation of the intensity of poverty incidence by geographic area. This study tries to group Romanian counties taking into account several indicators that describe the various dimensions of poverty. The method used for grouping is cluster analysis. The main objective of this paper is to show how poverty mapping using cluster analysis can be employed as a tool to identify homogenous poverty clusters of counties and help to reduce poverty by necessary resource allocation. Thus, the policy-makers can easily detect the most poverty affected areas and the poverty s specific in these areas and can be guided to target the poor for the best resource allocation to alleviate poverty. Key words: poverty, poverty mapping, poverty alleviation, cluster analysis 1 Introduction Poverty mapping is used for spatial identification of the poor (on which this paper concentrates) and serves to target poverty-alleviation programs from rural anti poverty programs to allocation of public services. It can assist practitioners in the formulation and implementation of poverty reduction, food security and sustainable development strategies, and in the monitoring of progress in poverty alleviation. The choice of a poverty-mapping methodology depends on the objectives of the poverty mapping exercise, philosophical views on poverty, limits on data and analytical capacity and cost. Data availability is a fundamental constraint in choosing a povertymapping method. This constraint has two levels: the existence of data, and access to existing data. For example, using average values from disaggregated geographical units such as counties/districts (as it is used in this study) instead of household-unit data has the advantages that data requirements are less stringent and national statistical agencies may be more likely to release county/district level averages on request. In this study, the method used for achieving poverty mapping is cluster analysis. It is applied in order to identify homogenous poverty clusters of counties aiming to reduce poverty by necessary resource allocation. 2 The method and the variables Cluster analysis is the art of finding groups in data. It aims to form groups in such a way that objects in the same group are similar to each other, whereas objects in different groups are as dissimilar as possible [Kaufman & Rousseeuw, 1990, 1]. The measurements PhD student, Department of Statistics, Faculty of Economy and Business Administration, University Alexandru Ioan Cuza Iaşi, e-mail: ali.ionescu@gmail.com

Poverty Mapping of Romanian Counties Using Cluster Analysis 381 used in cluster analysis can be organized in an n-by-p matrix, where the rows correspond to the objects (or cases) and the columns correspond to the variables [Kaufman & Rousseeuw, 1990, 4]. In this example the cases are the n = 41 Romanian counties for which there are considered p=13 continuous variables describing poverty that cover almost all the dimensions of this phenomenon: persons employed in agriculture, as a percentage of the employed population PAGR; persons living in rural areas, as a percentage of the total population PRUR; poverty rate RPOV; gross investments (thousands of millions current prices) GINV; life expectancy at birth (years) LEXP; gross enrolment ratio primary, secondary and tertiary schools (%) GER; unemployment rate (%) RUNP; infant mortality rate (per 1000 live births) RINM; proportion of the population without sustainable access to electricity (%) PWE; proportion of the population without sustainable access to water (%) PWW; population per physician P/PH; average net nominal monthly earnings AVER; criminality rate (persons definitively convicted per 100000 inhabitants) RCRI. The Squared Euclidian Distance is chosen as proximity measure as it is frequently employed when working with interval data. Letting x i and x j represent two cases (counties) in the p-variate space (where p=13), the squared Euclidian distance between the two items x i and x j is defined as the sum of squared differences between the values for the 13 2 2 items: d = ( x x ). ij f = 1 if jf The variables considered in this study are expressed in different measurement units: years, persons, thousands millions of ROL, percentage etc. Therefore, data values are standardized using z scores so as to equalize the effect of variables measured on different scales. The reference year is 2002. For poverty rate the only available results at county level are for 2003. The data sources are UNDP Romania's National Human Development Report for 2001-2002 and CASPIS s (The Anti-Poverty and Social Inclusion Commission from Romania) statistics. Statistical data processing was conducted using SPSS software. As the city of Bucharest presents extreme values for some of the considered variables, it is not included in the study and needs to be investigated separately. A principal components analysis was performed to verify if the chosen variables are relevant for this study. The high values of the extraction communalities show that all the variables fit well with the factor solution and could be kept in the analysis. Due to the fact that the investigated population s size is relatively small (41counties), there are used hierarchical methods of clustering. In order to determine the most appropriate method for this study, there were applied all the seven hierarchical clustering methods available in SPSS. The resulting dendograms showed that Ward s method has differentiated the counties in the most clearly way and has found the most compact clusters. 3 Results of cluster analysis 3.1 Number of clusters There is no exact procedure for determining the number of clusters. To evaluate the number of clusters, one may always plot the criterion used to join clusters versus the number of clusters [Timm, 2002, 534]. For example, a shape elbow in the plot of distances versus the number of clusters may be an indication of the number of clusters.

382 ALINA MĂRIUCA IONESCU Table no. 1 Agglomeration Schedule Cluster Combined Stage Cluster First Next Stage Cluster 1 Cluster 2 Coefficients Cluster 1 Cluster 2 Stage 1 7 26 1.214 0 0 21 M 30 1 2 157.402 17 24 33 31 16 41 173.065 19 0 35 32 9 28 190.769 25 26 37 33 1 11 208.875 30 23 36 34 7 10 227.734 21 29 35 35 7 16 249.398 34 31 38 36 1 6 271.881 33 0 39 37 3 9 294.920 28 32 40 38 7 25 333.013 35 27 39 39 1 7 390.379 36 38 40 40 1 3 520.000 39 37 0 Source: Results obtained with SPSS Table no. 1 shows how the counties are clustered together at each stage of the cluster analysis. The Coefficients column indicates the distance between the two clusters (or cases) joined at each stage. The values here depend on the proximity measure (Squared Euclidian Distance) and linkage method (Ward s method) used in the analysis. Fig.1 The agglomeration schedule coefficients plot For a good cluster solution, a sudden jump can be seen in the distance coefficient as it can be read down the table. The stage before the sudden change indicates the optimal stopping point for merging clusters. For this example, we should consider using a 9, 6 or 3- cluster solution. For a better visualization of this criterion the hockey stick plot of agglomeration schedule coefficients is displayed in figure 1. It can be easily seen that 9 clusters remain after stage 32, 6 clusters after stage 35 and 3 clusters after stage 38. These three solutions are also illustrated in figure 2 which presents the dendogram. At a first look the dendogram shows three obvious clusters, which can be interpreted as it follows: one cluster consists of the most developed counties (AG, PH, MS, AB, DJ, IS, CT, HD, BV, SB, TM, CJ), another cluster refers to the counties with moderate intensity of poverty (BR, CS, CV, HR, GL, GJ, DB, OT, VL, BN, GR, SJ, IF, AR, BH, MM, SM) and the other cluster groups the most poverty affected counties (CL, IL, TR, TL, BC, MH, NT, BZ, SV, VN, BT, VS). The counties groupings in 6 or in 9 clusters are the solutions that

Poverty Mapping of Romanian Counties Using Cluster Analysis 383 differentiate the most clearly the clusters and identify the special case of Vaslui County that forms a cluster by itself. Fig.2 Ward s dendogram for the 3, 6 and 9 clusters solution 3.2 Territorial distribution of the clusters Fig. 3 Territorial distributions of the clusters for the 3 and 6 cluster solutions The 9 clusters solution reproduces well enough the geographic repartition of Romanian counties as it groups by twos and threes neighbor counties. This solution is efficient when preparing anti-poverty programs and policies that are to be applied to small areas. If one wants to develop programs that focus on large areas (like regions of a country) then it is recommended to use the 6 clusters classification as it reproduces more clearly the geographical map of counties. The 3 clusters-solution provides the possibility of identifying three main directions for poverty alleviation programs and policies: monitoring activities for the cluster that consists of the most developed counties, specific measures for improvement of socio-economic indicators for the cluster of counties with moderate intensity of poverty and allocation of important resources and implementation of radical programs for the cluster of the most poverty affected counties. 3.3 Clusters profiles Once the clusters are obtained, it is generally useful to describe each group using some descriptive tools to create a better understanding of the differences that exist among the created groups. In order to characterize the clusters, there are computed descriptive statistics (means) for each cluster (table no. 2).

384 ALINA MĂRIUCA IONESCU Table no. 2 Descriptive statistics (means) for each of the 6 clusters Clust. 1 2 3 4 5 6 PAGR 51.46 36.14 56.51 42.71 25.44 39.83 PRUR 59.80 49.75 59.90 58.05 30.70 50.80 RPOV 0.37 0.29 0.40 0.32 0.23 0.27 GINV 4129.27 11632.83 1497.00 5100.00 16271.17 5800.00 LEXP 70.79 71.32 71.10 71.34 71.20 69.48 GER 59.04 70.20 58.00 60.64 75.03 63.78 RUNP 9.80 8.47 15.90 9.44 8.60 4.68 RINM 20.16 18.63 23.20 13.82 17.72 18.08 PWE 2.58 2.32 6.80 2.07 1.50 2.40 PWW 39.16 30.57 53.50 34.12 11.20 21.85 P/PH 832.36 464.83 971.00 709.54 397.50 528.25 AVER 3391392.91 3622089.00 33.6616.00 3668304.31 3793331.17 3231682.50 RCRI 459.64 319.83 547.00 347.15 322.17 419.25 Source: Results obtained with SPSS The highest level of poverty is registered by cluster 3, formed of Vaslui County that is extremely affected by unemployment; in 2002 unemployment rate was 15.9%. Population sustainable access to electricity and water (essential elements for a decent life), is very low: 6.8% (a great percentage comparing to other clusters) of population doesn t have sustainable access to electricity and more than a half of this county population doesn t have access to sustainable water. These facts and the very small number of physicians in this area could be the causes for the highest infant mortality rate. The population of Vaslui County is the most poverty affected in almost all the dimensions of this phenomenon: the lowest gross enrolment ratio, the lowest access to health services, to water and electricity, the highest criminality rate. The economy of Vaslui County is in crisis as it presents the lowest level of gross investments (3 times lower than the next cluster and 11 times lower than the richest cluster). This could be a justification of the very high unemployment rate together with the highest proportion of persons occupied in agriculture (over 56% of the total employed population) and of persons living in rural areas (almost 60% of the total population). Another especially poverty affected cluster is cluster 1 (BC, MH, NT, BZ, SV, VN BT) which presents alarming values for all the variables. Cluster 5, that includes CT, HD, BV, SB, TM, CJ, presents the highest standard of living as it is characterized by: a very low percentage of rural population (30.7%) and of population employed in agriculture (25.44%) comparing to other clusters, the lowest poverty rate, the highest level of gross investments, the highest gross enrolment ratio, the lowest unemployment rate and infant mortality rate, the best population access to health services and to utilities (water and electricity). The high standard of living is also reflected by one of the lowest criminality rate. Another group of counties with low level of poverty is cluster 2 (AG, PH, AB, MS, DJ, IS) that presents closer values to those of cluster 5 for most of the considered variables. 4 Conclusions Cluster analysis permitted to group the 41 counties of Romania in homogenous groups considering the poverty dimensions such as: health, education, unemployment. According to each cluster s profile there could be designed and developed specific poverty alleviation programs that take into account poverty intensity in each considered dimension. Therefore, to the clusters that present deprivations in health dimension of

Poverty Mapping of Romanian Counties Using Cluster Analysis 385 poverty should be designed and applied appropriate programs to improve the access to health services. For clusters with low level of investments and high unemployment rates there can be allocated resources to stimulate investments so as to create new jobs and reduce unemployment. The programs that focus on infrastructure development can target the groups of counties characterized by low sustainable access to electricity and water correlated with high percentage of rural population and population employed in agriculture. In conclusion, cluster analysis employed in poverty mapping may be of a real utility in designing poverty reduction programs and policies as it permits to detect the most poverty affected areas and the poverty s specific in these areas and help the policy-makers to target the poor for the best resource allocation to alleviate poverty. Bibliography Anderberg, M. R., Cluster Analysis for applications, Academic Press, New York, 1973 Davis, B., Choosing a method for poverty mapping, 2003, la http://www.povertymap.net/publications/doc/cmpm%20davis%2013%20apr03%20sec.p df, accessed on 3 January 2006. Everitt, B., Landau, S., Leese, M., Cluster analysis, 4th Edition, Edward Arnold Publishers Ltd., London, 2001. Garson, G. D., Quantitative Research in Public Administration, PA 765 Statnotes: An Online Textbook, 2005, la http://www2.chass.ncsu.edu/garson/pa765/cluster.htm, accessed on 5 December 2005. Jaba, E., Statistica, Ediţia a treia, Editura Economică, Bucureşti, 2002. Kaufman, L. and Rousseeuw, P. J., Finding groups in data: An introduction to cluster analysis, John Wiley & Sons, New York, 1990. Timm, N., Applied Multivariate Analysis, Springer Text in Statistics, 2002. *** www.caspis.ro, accessed on 10 December 2005. *** www.spss.com, accessed on 12 November 2005. *** www.undp.ro, accessed on 20 November 2005.