Classification of fragile states based on machine learning

Similar documents
10-701/ Machine Learning Mid-term Exam Solution

There is no straightforward approach for choosing the warmup period l.

Evapotranspiration Estimation Using Support Vector Machines and Hargreaves-Samani Equation for St. Johns, FL, USA

Boosting. Professor Ameet Talwalkar. Professor Ameet Talwalkar CS260 Machine Learning Algorithms March 1, / 32

Information-based Feature Selection

Optimally Sparse SVMs

Support vector machine revisited

Research Article A New Second-Order Iteration Method for Solving Nonlinear Equations

Chapter 7. Support Vector Machine

Some New Iterative Methods for Solving Nonlinear Equations

Machine Learning. Ilya Narsky, Caltech

ECE 901 Lecture 12: Complexity Regularization and the Squared Loss

Comparison Study of Series Approximation. and Convergence between Chebyshev. and Legendre Series

Modified Decomposition Method by Adomian and. Rach for Solving Nonlinear Volterra Integro- Differential Equations

Introduction to Machine Learning DIS10

Electricity consumption forecasting method based on MPSO-BP neural network model Youshan Zhang 1, 2,a, Liangdong Guo2, b,qi Li 3, c and Junhui Li2, d

Research Article A Unified Weight Formula for Calculating the Sample Variance from Weighted Successive Differences

1 Review of Probability & Statistics

6.867 Machine learning

Machine Learning Theory (CS 6783)

1 Duality revisited. AM 221: Advanced Optimization Spring 2016

Double Stage Shrinkage Estimator of Two Parameters. Generalized Exponential Distribution

Intro to Learning Theory

The prediction of cement energy demand based on support vector machine Xin Zhang a, *, Shaohong Jing b

Empirical Process Theory and Oracle Inequalities

REGRESSION WITH QUADRATIC LOSS

Machine Learning Brett Bernstein

CSCI567 Machine Learning (Fall 2014)

CSCI567 Machine Learning (Fall 2014)

Lecture 9: Boosting. Akshay Krishnamurthy October 3, 2017

Regression with quadratic loss

Machine Learning Theory Tübingen University, WS 2016/2017 Lecture 11

The target reliability and design working life

ADVANCED SOFTWARE ENGINEERING

Comparison of the COG Defuzzification Technique and Its Variations to the GPA Index

Correspondence should be addressed to Wing-Sum Cheung,

Hypothesis Testing. Evaluation of Performance of Learned h. Issues. Trade-off Between Bias and Variance

Convergence of Random SP Iterative Scheme

Machine Learning Theory Tübingen University, WS 2016/2017 Lecture 12

Investigating the Significance of a Correlation Coefficient using Jackknife Estimates

Uniform Strict Practical Stability Criteria for Impulsive Functional Differential Equations

Linear Classifiers III

Research Article Research on Application of Regression Least Squares Support Vector Machine on Performance Prediction of Hydraulic Excavator

A NEW CLASS OF 2-STEP RATIONAL MULTISTEP METHODS

BIOINF 585: Machine Learning for Systems Biology & Clinical Informatics

Week 1, Lecture 2. Neural Network Basics. Announcements: HW 1 Due on 10/8 Data sets for HW 1 are online Project selection 10/11. Suggested reading :

Research on Dependable level in Network Computing System Yongxia Li 1, a, Guangxia Xu 2,b and Shuangyan Liu 3,c

A collocation method for singular integral equations with cosecant kernel via Semi-trigonometric interpolation

Admin REGULARIZATION. Schedule. Midterm 9/29/16. Assignment 5. Midterm next week, due Friday (more on this in 1 min)

ON POINTWISE BINOMIAL APPROXIMATION

An Alternative Scaling Factor In Broyden s Class Methods for Unconstrained Optimization

Presenting A Framework To Study Linkages Among Tqm Practices, Supply Chain Management Practices, And Performance Using Dematel Technique

Lecture 11: Decision Trees

Math 113 Exam 3 Practice

Analysis of Experimental Measurements

G. R. Pasha Department of Statistics Bahauddin Zakariya University Multan, Pakistan

Machine Learning Regression I Hamid R. Rabiee [Slides are based on Bishop Book] Spring

NEW FAST CONVERGENT SEQUENCES OF EULER-MASCHERONI TYPE

Lecture 15: Learning Theory: Concentration Inequalities

Interval Intuitionistic Trapezoidal Fuzzy Prioritized Aggregating Operators and their Application to Multiple Attribute Decision Making

Research Article Structural Reliability Assessment by Integrating Sensitivity Analysis and Support Vector Machine

The improvement of the volume ratio measurement method in static expansion vacuum system

ME 539, Fall 2008: Learning-Based Control

Mixtures of Gaussians and the EM Algorithm

Similarity Solutions to Unsteady Pseudoplastic. Flow Near a Moving Wall

CHAPTER 10 INFINITE SEQUENCES AND SERIES

6.867 Machine learning, lecture 7 (Jaakkola) 1

DERIVING THE 12-LEAD ECG FROM EASI ELECTRODES VIA NONLINEAR REGRESSION

On the Theory of Learning with Privileged Information

Learning Bounds for Support Vector Machines with Learned Kernels

THE APPEARANCE OF FIBONACCI AND LUCAS NUMBERS IN THE SIMULATION OF ELECTRICAL POWER LINES SUPPLIED BY TWO SIDES

Improved Class of Ratio -Cum- Product Estimators of Finite Population Mean in two Phase Sampling

Outline. Linear regression. Regularization functions. Polynomial curve fitting. Stochastic gradient descent for regression. MLE for regression

Math 113 Exam 3 Practice

Intermittent demand forecasting by using Neural Network with simulated data

Solution of Final Exam : / Machine Learning

Lecture 2: Monte Carlo Simulation

Robust Resource Allocation in Parallel and Distributed Computing Systems (tentative)

REGRESSION (Physics 1210 Notes, Partial Modified Appendix A)

OPTIMAL ALGORITHMS -- SUPPLEMENTAL NOTES

Monte Carlo Integration

Common Coupled Fixed Point of Mappings Satisfying Rational Inequalities in Ordered Complex Valued Generalized Metric Spaces

Multilayer perceptrons

Research on real time compensation of thermal errors of CNC lathe based on linear regression theory Qiu Yongliang

Linear Associator Linear Layer

Lecture 4. Hw 1 and 2 will be reoped after class for every body. New deadline 4/20 Hw 3 and 4 online (Nima is lead)

Higher-order iterative methods by using Householder's method for solving certain nonlinear equations

Expectation-Maximization Algorithm.

Output Analysis (2, Chapters 10 &11 Law)

Introduction to Artificial Intelligence CAP 4601 Summer 2013 Midterm Exam

SOME TRIBONACCI IDENTITIES

As metioed earlier, directly forecastig o idividual product demads usually result i a far-off forecast that ot oly impairs the quality of subsequet ma

Overview. Structured learning for feature selection and prediction. Motivation for feature selection. Outline. Part III:

Teaching Mathematics Concepts via Computer Algebra Systems

Ada Boost, Risk Bounds, Concentration Inequalities. 1 AdaBoost and Estimates of Conditional Probabilities

Hybridized Heredity In Support Vector Machine

THE NUMERICAL SOLUTION OF THE NEWTONIAN FLUIDS FLOW DUE TO A STRETCHING CYLINDER BY SOR ITERATIVE PROCEDURE ABSTRACT

Variance Penalizing AdaBoost

Pattern recognition systems Laboratory 10 Linear Classifiers and the Perceptron Algorithm

COMPLEX FACTORIZATIONS OF THE GENERALIZED FIBONACCI SEQUENCES {q n } Sang Pyo Jun

Transcription:

Classificatio of fragile states based o machie learig Yuqua Li 1, Hehua Yao 1 1 Departmet of Computer Techology ad Applicatio, Qighai Uiversity, Xiig, Chia Abstract. The study of fragile states has become a sigificat issue i global security, developmet ad poverty at preset. The existig classificatio methods of fragile state, which is a simple additio to the atioal idex ad threshold segmetatio, is ot reasoable eough. We itroduce a ew method based o machie learig. With this method, it will be easier ad more reasoable to classify a coutry. We use two kids of classifier, oe of which is the support vector machie, ad the other is the gradiet boosted regressio trees. Both models have flaws, so we use esemble learig techiques to combie them. First of all, subjective labellig of a part of the atioal data to allows the machie to lear why a coutry becomes vulerable from these data, ad how to classify the vulerability class of a coutry. The, we traied the model with the data, ad divided fragile states ito four categories successfully (Alert, Warig, Stable ad Sustaiable). For the classificatio result, our model got a 93% test error rate, ad a 96% traiig error rate, which is better tha 77% with the threshold segmetatio method. 1 Itroductio At preset the study of fragile states or of the coutry's vulerability research has become a iteratioal academics ad policy makers to discuss i today's global security, developmet ad poverty problems [1~3], as oe of the core issues. Uderstadig the vulerability of the state, ad the cosequeces of its cosequeces, has oly recetly emerged as a field of study [4]. For the past five years, how to classify a fragile state has become a iteratioal group's mai priorities [5~6]. Some wellkow uiversities ad other research istitutes i may coutries have set up research ceters to discuss the issue of "fragile states", ad academic papers are costatly beig rolled out for the grad view [7~9]. I the past, fragile states were simply separated by subjective thresholds [10~11]. First, the staff evaluates ecoomic, evirometal ad other idicators (scores from 0 to 10). The, they simply add up ad choose thresholds accordig to some rules, separatig differet coutries usig these thresholds. I fact, it is ot fair or reasoable for some coutries to do so. A typical example: a coutry with 11 idicators is perfect, but oe of them, huma rights, is very low with oly two poits. At this time, the huma rights idicator will directly affect the coutry's vulerability directly, ad ca affect the coutry's vulerability through idirect ifluece (such as ecoomic iequality, group grievace). This coutry is supposed to be classified as edagered, but it is recogized as excellet by threshold segmetatio method. 2 Support Vector Machie SVM is a ew geeratio machie learig method proposed by Vapik o the basis of statistical learig theory [12]. It has sigificat advatages i solvig small sample, oliear, high dimesio ad other problems. At preset SVM has bee widely used i may fields of classificatio problems ad regressio problems, ad has a good predictio effect. SVC is a abbreviatio of support vector classificatio ad is a importat brach of support vector machie (SVM) [13]. The classificatio schematic diagram is as follows: Figure1. SVC schematic diagram Give traiig vectors x i R p,,,, i two classes, ad a vector y {1, 1}, SVC solves the followig primal problem: The Authors, published by EDP Scieces. This is a ope access article distributed uder the terms of the Creative Commos Attributio Licese 4.0 (http://creativecommos.org/liceses/by/4.0/).

mi 1 ω, b, ζ 2 ωt ω + C ζ i Subject to y i (ω T φ(x i ) + b) 1 ζ i ζ i 0, i = 1 Its dual is Subject to (1) (2) mi 1 α 2 αt Q e T α (3) y T α = 0 0 α i C, i = 1 where is the vector of all oes, C > 0 is the upper boud, Q is a by positive semidefiite matrix, Q ij y i y j K(x i, x j ), where K(x i, x j ) = φ(x i ) T φ(x j ) is the kerel. Here traiig vectors are implicitly mapped ito a higher (maybe ifiite) dimesioal space by the fuctio φ. The decisio fuctio is: (4) sg( y i α i K(x i, x) + ρ) (5) 3 Gradiet Boosted Regressio Trees Gradiet Boosted Regressio Trees is a geeralizatio of boostig to arbitrary differetiable loss fuctios [14~16]. GBRT is a accurate ad effective off-the-shelf procedure that ca be used for both regressio ad classificatio problems. Gradiet Tree Boostig models are used i a variety of areas icludig Web search rakig ad ecology [16]. The classificatio schematic diagram is as follows: GBRT cosiders additive models of the followig form: M F(x) = γ m h m (x) m=1 where h m (x) are the basis fuctios which are usually called weak learers i the cotext of boostig. Gradiet Tree Boostig uses decisio trees of fixed size as weak learers. Decisio trees have a umber of abilities that make them valuable for boostig, amely the ability to hadle data of mixed type ad the ability to model complex fuctios. Similar to other boostig algorithms GBRT builds the additive model i a forward stage wise fashio: F m (x) = F m 1 (x) + γ m h m (x) (7) At each stage the decisio tree h m (x) is chose to miimize the loss fuctio L give the curret model F m 1 ad its fit F m 1 (x i ) F m (x) = F m 1 (x) + argmi h (6) L (y i, F m 1 (x i ) + h(x)) (8) The iitial model F 0 is problem specific, for leastsquares regressio oe usually chooses the mea of the target values. Gradiet Boostig attempts to solve this miimizatio problem umerically via steepest descet: The steepest descet directio is the egative gradiet of the loss fuctio evaluated at the curret model F m 1 which ca be calculated for ay differetiable loss fuctio: F m (x) = F m 1 (x) γ m F L(y i, F m 1 (x i )) (9) Where the step legth γ m is chose usig lie search: γ m = argmi LL(y i, F m 1 (x i ) γ L(y i, F m 1 (x i )) ) γ F m 1 (x i ) (10) 4 SVC-GBRT model Whether it is the SVC model or the GBRT model, the expressio of the sigle model may ot be eough, so we itegrate the two usig itegrated learig ad get the fial itegrated learig model. The fial classificatio schematic diagram is as follows: Figure 2. GBRT schematic 2

Figure3. SVC-GBRT schematic diagram SVC-GBRT is the esemble of SVC ad GBRT. The priciples of SVC ad GBRT are show i the chapter 2 ad 3. Below are the traiig formulas for esemble learig. For i = 1, 2,, : S i = Trai(S i 1 ) G i = Trai(G i ) F i = Esemble(S i, G i ) S: SVC model G: GBRT model F: SVC-GBRT model We trai SVC ad GBRT separately, ad fially use esemble learig to combie them. We use the stackig algorithm to itegrate. First, the sub-classifier is used to calculate the probability distributio of the target i each category. The the probabilities of multiple classifiers are added ad averaged. You get the fial probability distributio. 5 Classificatio of fragile states All the data comes from the World Bak. We wet to their website ad dowloaded the relevat data ad made a lot of aalysis ad processig. We got the followig data (from 2007 to 2015 of every coutry), ad we added the variace ad rage idicators for some ubalaced distributio of idicators. We labelled each coutry year by year, usig huma aotatio. No Table 1. Data features Features 1 Security Apparatus 2 Factioalized Elites 3 Group Grievace 4 Ecoomy 5 Ecoomic Iequality 6 Huma Flight ad Brai Drai 7 State Legitimacy 8 Public Services 9 Huma Rights 10 Demographic Pressures 11 Refugees ad IDPs 12 Exteral Itervetio 13 Variace 14 Rage We split up the data set we previously obtaied, ad we got the traiig set, validatio set, ad test set for cross validatio. The code was writte by Pytho, the the model was traied through cross-validatio, ad the best traiig iteratios was 600. Fially, we obtaied the followig classificatio error curve(figure 5). We used SVC ad GBRT as the compariso, as show i the figure 5, the test error rate of the SVC- GBRT model reached a astoishig 0.07 (while GBRT was 0.12, ad the SVC was 0.14). O the other had, the traiig error rate of the SVC-GBRT model reached a astoishig 0.04 (while GBRT was 0.5 ad SVC was 0.14). 3

Figure5. Traiig ad testig error curves After the model traiig was completed, we used the model to predict a typical coutry, the Cetral Africa Republic, ad the results were as follows(figure 6). As we ca see from the figure 6, the probability of Africa beig set for Alert is 0.92, the probability of Africa beig set for Warig is 0.3, the probability of Africa beig set for Stable is 0.1. the probability of Africa beig set for Sustaiable is 0.04. Fially, the coutry was classified as Alert. 6 CONCLUSIONS Obviously, the traditioal threshold segmetatio method does ot classify the vulerability class of a coutry reasoably ad fairly. Therefore, we itroduce a ew classificatio method, usig machie learig method to classify the vulerability class of a coutry. First of all, subjective labellig of a part of the atioal data to allows the machie to lear why a coutry becomes vulerable from these data. Through model traiig ad testig, we successfully used the model to separate the coutry ito four types (Alert, Warig, Stable ad Sustaiable). Ad the accuracy of the test data set was 93%, which was better tha 77% by the threshold segmetatio method. Figure6. Classificatio result of the Cetral Africa Republic i 2010 Suppose that out aotatio is reasoable eough, we made a compariso betwee machie learig classificatio results ad previous threshold segmetatio results. Table 2. A compariso betwee differet methods of classificatio Method Error rate Accuracy Machie Learig 7% 93% Threshold Segmetatio 23% 77% Fud Qighai uiversity youth research fud 2016. Refereces 1. Naude, W. & Satos- Paulio, A. U. Fragile States: Causes, Costs, ad Resposes. Oxford: Oxford Uiversity Press, 2011: 23. 2. Mcloughli, C.. Topic guide o fragile states. Birmigham: Uiversity of Birmigham, UK.2012: 6-29. 3. Sai, F. G.. Evaluatig State Performace: A Critical View of State Failure ad Fragility Idex. Europea Joural of Developmet Research, 2011, 23(1): 20-42. 4. Hake, N. &, Messer, J.J. Fragile States Idex2014. Foreig Policy (July -August)2014: 10. 4

5. Mekhaus, K.. State Fragility as A Wicked Problem. PRISM, 2010, 1(2): 85-100. 6. Mata, J. F. & Ziaja S.. Users Guide o Measurig Fragility. Bo: Germa Developmet Isitute, 2009: 24/5 7. Ziaja S.. What Do Fragility Idices Measure? Z Vgl Polit Wiss, 2012, 6: 39-64. 8. Call, C.. The Fallacy of The Failed States. Third World Quarterly, 2008, (8): 1491-1507. 9. Grävigholt, J. & Ziaja S.. State Fragility: To-wards a Multi- Dimesioal Empirical Typology. DIE Discussio Paper, 2012. 10. Kapla, S.. Idetify Truly Fragile States. The Washigto Quarterly, 2014, (37)1: 49-63. 11. Marshall, M. & Cole, B.. Global Report2011: Coflict, Goverace ad State Fragility. Foreig Policy Bulleti, 2011, 18(5). 12. Support-vector etworks, C. Cortes, V. Vapik - Machie Learig, 20, 273-297 (1995). 13. Scikit-lear: Machie Learig i Pytho, Pedregosa et al., JMLR 12, pp. 2825-2830, 2011. 14. Breima L, Friedma J, Olshe R. Classificatio ad regressio trees. et al.. 1984 15. Friedma J H. Greedy fuctio approximatio: A gradiet boostig machie. The Aals of Statistics. 2001 16. Stochastic gradiet boostig[j]. Jerome H. Friedma. Computatioal Statistics ad Data Aalysis. 2002 (4) 5