arxiv: v2 [math.oc] 19 Jan 2010

Similar documents
Isotonic regression and isotonic projection

DS-GA 1002 Lecture notes 0 Fall Linear Algebra. These notes provide a review of basic concepts in linear algebra.

Chapter 1. Preliminaries

CHARACTERIZATION OF (QUASI)CONVEX SET-VALUED MAPS

Topology Proceedings. COPYRIGHT c by Topology Proceedings. All rights reserved.

A Dykstra-like algorithm for two monotone operators

TORIC WEAK FANO VARIETIES ASSOCIATED TO BUILDING SETS

Lecture 1: Basic Concepts

Lecture Notes 1: Vector spaces

Topological properties

A Characterization of Polyhedral Convex Sets

PROJECTIONS ONTO CONES IN BANACH SPACES

CHEBYSHEV INEQUALITIES AND SELF-DUAL CONES

Functional Analysis. Franck Sueur Metric spaces Definitions Completeness Compactness Separability...

Lecture 9 Monotone VIs/CPs Properties of cones and some existence results. October 6, 2008

SOME STABILITY RESULTS FOR THE SEMI-AFFINE VARIATIONAL INEQUALITY PROBLEM. 1. Introduction

V (v i + W i ) (v i + W i ) is path-connected and hence is connected.

Linear Algebra. Preliminary Lecture Notes

Convex Optimization Theory. Chapter 5 Exercises and Solutions: Extended Version

On Riesz-Fischer sequences and lower frame bounds

arxiv: v1 [math.oc] 10 Mar 2015

Jónsson posets and unary Jónsson algebras

Research Division. Computer and Automation Institute, Hungarian Academy of Sciences. H-1518 Budapest, P.O.Box 63. Ujvári, M. WP August, 2007

arxiv:math/ v1 [math.fa] 26 Oct 1993

4 Hilbert spaces. The proof of the Hilbert basis theorem is not mathematics, it is theology. Camille Jordan

z -FILTERS AND RELATED IDEALS IN C(X) Communicated by B. Davvaz

Summer School: Semidefinite Optimization

A Primer in Econometric Theory

EQUICONTINUITY AND SINGULARITIES OF FAMILIES OF MONOMIAL MAPPINGS

In English, this means that if we travel on a straight line between any two points in C, then we never leave C.

A DECOMPOSITION THEOREM FOR FRAMES AND THE FEICHTINGER CONJECTURE

A NICE PROOF OF FARKAS LEMMA

Hilbert spaces. 1. Cauchy-Schwarz-Bunyakowsky inequality

We simply compute: for v = x i e i, bilinearity of B implies that Q B (v) = B(v, v) is given by xi x j B(e i, e j ) =

c 1998 Society for Industrial and Applied Mathematics

Math Camp Lecture 4: Linear Algebra. Xiao Yu Wang. Aug 2010 MIT. Xiao Yu Wang (MIT) Math Camp /10 1 / 88

THE STRUCTURE OF 3-CONNECTED MATROIDS OF PATH WIDTH THREE

Where is matrix multiplication locally open?

Elements of Convex Optimization Theory

Fixed point of ϕ-contraction in metric spaces endowed with a graph

Fixed point theorems for Ćirić type generalized contractions defined on cyclic representations

Direction of Movement of the Element of Minimal Norm in a Moving Convex Set

On the simplest expression of the perturbed Moore Penrose metric generalized inverse

HILBERT SPACES AND THE RADON-NIKODYM THEOREM. where the bar in the first equation denotes complex conjugation. In either case, for any x V define

arxiv: v1 [math.oc] 21 Mar 2015

BEST APPROXIMATIONS AND ORTHOGONALITIES IN 2k-INNER PRODUCT SPACES. Seong Sik Kim* and Mircea Crâşmăreanu. 1. Introduction

Tree sets. Reinhard Diestel

CHAPTER 2: CONVEX SETS AND CONCAVE FUNCTIONS. W. Erwin Diewert January 31, 2008.

Determining a Binary Matroid from its Small Circuits

The Cyclic Decomposition of a Nilpotent Operator

Auerbach bases and minimal volume sufficient enlargements

On Unitary Relations between Kre n Spaces

Linear Algebra. Preliminary Lecture Notes

SPACES ENDOWED WITH A GRAPH AND APPLICATIONS. Mina Dinarvand. 1. Introduction

CHAPTER 5. Birkhoff Orthogonality and a Revisit to the Characterization of Best Approximations. 5.1 Introduction

Prime and irreducible elements of the ring of integers modulo n

On the order of the operators in the Douglas Rachford algorithm

Realization Plans for Extensive Form Games without Perfect Recall

More Powerful Tests for Homogeneity of Multivariate Normal Mean Vectors under an Order Restriction

Some notes on Coxeter groups

Existence of Solutions to Split Variational Inequality Problems and Split Minimization Problems in Banach Spaces

The best generalised inverse of the linear operator in normed linear space

A MAXIMUM PRINCIPLE FOR A MULTIOBJECTIVE OPTIMAL CONTROL PROBLEM

On duality gap in linear conic problems

The equivalence of Picard, Mann and Ishikawa iterations dealing with quasi-contractive operators

POLARS AND DUAL CONES

Some Inequalities for Commutators of Bounded Linear Operators in Hilbert Spaces

Existence and data dependence for multivalued weakly Ćirić-contractive operators

Decomposition of Riesz frames and wavelets into a finite union of linearly independent sets

BCOL RESEARCH REPORT 07.04

(c) For each α R \ {0}, the mapping x αx is a homeomorphism of X.

MATH 167: APPLIED LINEAR ALGEBRA Least-Squares

Analysis-3 lecture schemes

CHAPTER 7. Connectedness

Chapter 2 Metric Spaces

Normal Fans of Polyhedral Convex Sets

A Geometrical Analysis of a Class of Nonconvex Conic Programs for Convex Conic Reformulations of Quadratic and Polynomial Optimization Problems

Lecture 2: Linear Algebra Review

Finite-Dimensional Cones 1

Lecture 7: Semidefinite programming

LINEAR EQUATIONS WITH UNKNOWNS FROM A MULTIPLICATIVE GROUP IN A FUNCTION FIELD. To Professor Wolfgang Schmidt on his 75th birthday

The Nearest Doubly Stochastic Matrix to a Real Matrix with the same First Moment

Introduction. Chapter 1. Contents. EECS 600 Function Space Methods in System Theory Lecture Notes J. Fessler 1.1

A MATROID EXTENSION RESULT

Chapter 2 Linear Transformations

ANOTE ON DIRECTLY ORDERED SUBSPACES OF R n. 1. Introduction

arxiv: v1 [math.co] 10 Aug 2016

implies that if we fix a basis v of V and let M and M be the associated invertible symmetric matrices computing, and, then M = (L L)M and the

ON COMPLEMENTED SUBSPACES OF SUMS AND PRODUCTS OF BANACH SPACES

Characterizations of the finite quadric Veroneseans V 2n

SUPPLEMENT TO CHAPTER 3

NOTES ON VECTOR-VALUED INTEGRATION MATH 581, SPRING 2017

ISOLATED SEMIDEFINITE SOLUTIONS OF THE CONTINUOUS-TIME ALGEBRAIC RICCATI EQUATION

MATH 22A: LINEAR ALGEBRA Chapter 4

Rose-Hulman Undergraduate Mathematics Journal

Projections onto convex sets on the sphere

MATROID PACKING AND COVERING WITH CIRCUITS THROUGH AN ELEMENT

Best proximity problems for Ćirić type multivalued operators satisfying a cyclic condition

1.3 Linear Dependence & span K

MTH 2032 SemesterII

Transcription:

Rapid heuristic projection on simplicial cones arxiv:1001.1928v2 [math.oc] 19 Jan 2010 A. Ekárt Computer Science, Aston University Aston Triangle, Birmingham B4 7ET United Kingdom email: a.ekart@aston.ac.uk A. B. Németh Faculty of Mathematics and Computer Science Babeş Bolyai University, Str. Kogălniceanu nr. 1-3 RO-400084 Cluj-Napoca, Romania email: nemab@math.ubbcluj.ro S. Z. Németh School of Mathematics, The University of Birmingham The Watson Building, Edgbaston Birmingham B15 2TT, United Kingdom email: nemeths@for.mat.bham.ac.uk October 23, 2018 Abstract A very fast heuristic iterative method of projection on simplicial cones is presented. It consists in solving two linear systems at each step of the iteration. The extensive experiments indicate that the method furnishes the exact solution in more then 99.7 percent of the cases. The average number of steps is 5.67 (we have not found any examples which required more than 13 steps) and the relative number of steps with respect to the dimension decreases dramatically. Roughly speaking, for high enough dimensions the absolute number of steps is independent of the dimension. 1 Introduction Projection on polyhedral cones is one of the important problems applied optimization is confronted with. Many applied numerical optimization methods uses projection on polyhedral cones as the main tool. In most of them, projection is part of an iterative process which involve its repeated application (see e. g. problems of image reconstruction [1], nonlinear complementarity [4, 9], etc.). Hence, it is important to get fast projection algorithms. 1991 A M S Subject Classification. Primary 90C33; Secondary 15A48, Key words and phrases. Metric projection on simplicial cones 1

The main streems of the current methods in use rely on the classical von Neumann algorithm (see e.g. the Dykstra algorithm [3, 2, 10]), but they are rather expensive for a numerical handling (see the numerical results in [7] and the remark preceding section 6.3 in [5]). Finite methods of projections are of combinatorial nature which reduces their applicability to low dimensional ambient spaces. Recently we have given a simple projection method exposed in note [8] for projecting on so called isotone projection cones. Isotone projection cones are special simplicial cones, and due to their good properties we can project on them in n steps, where n is the dimension of the ambient space. In the first part of that note we have explained our approach by considering the problem of projection on simplicial cones by giving an exact method based on duality. This method has combinatorial character and therefore it is inefficient. More recently we observed that a heuristic method based on the same ideas gives surprisingly good results. This note describes the theoretical foundation of the heuristic method and draws conclusions based on millions of numerical experiments. Projection on polyhedral cones is a problem of high impact on scientific community. 1 2 The simplicial cone and its polar Let R n be an n-dimensional Euclidean space endowed with a Cartesian reference system. We assume that each point of R n is a column vector. We shall use the term cone in the sense of closed convex cone. That is, the nonempty closed subset K R n in our terminology is a cone, if K + K K, and tk K whenever t R, t 0. Let m n and e 1,..., e m be m elements in R n. Denote cone{e 1,..., e m } = {λ 1 e 1 + + λ m e m : λ i 0, i = 1,..., m}, the cone engendered by e 1,..., e m. Then, cone{e 1,..., e m } = {Ev : v R m + }, (1) where E = (e 1,..., e m ) is the matrix with columns e 1,..., e m and R m + is the non-negative orthant in R m. Suppose that e 1,..., e n R n are linearly independent elements. Then, the cone K = cone{e 1,..., e n } = {λ 1 e 1 + + λ n e n : λ i 0, i = 1,..., n} = {Ev : v R m + }, with E the matrix from (1) for m = n, is called simplicial cone. Denote N = {1, 2,..., n}. The polar of K is the set (2) K = {x R n : x y 0, y K}. (3) K = K is called the dual of K. K is called subdual, if K K. This is equivalent to the condition e l e k 0, l, k N. 1 see the popularity of the Wikimization page Projection on Polyhedral Cone at http://www.convexoptimization.com/wikimization/index.php/special:popularpages. 2

Lemma 1 The polar of the simplicial cone (2) can be represented in the form K = {µ 1 u 1 +... µ n u n : µ i 0, i = 1,..., n}, (4) where u i (i = 1,..., n) is a solution of the system e j u i = 0, j = 1,..., n, j i, e i u i = 1 (u i is normal to the hyperplane span{e 1,..., e i 1, e i+1,..., e n } in the opposite direction to the halfspace that contains e i ). Thus, K = {Ux : x R n +} with R n + = {x = (x 1,..., x n ) : x i 0, i = 1... n} and U = (E 1 ). (5) For simplicity we shall call U the polar matrix of E. The columns of U are {u i : i = 1,..., n}. Proof. Let y = n j=1 µj u j and z = n i=1 αi e i for any non-negative real numbers α i and µ j. The inner produce of y and z is non-positive because y z = n n n α i µ j e i u j = α i µ i 0 i=1 j=1 But y is an arbitrary element of the right hand side of (4) and z is an arbitrary element of K, thus we can conclude that the righ hand side of (4) is a subset of K. The vectors u 1,..., u n are linearly independent. (This can be verified by assuming the contrary, and by multiplying the subsequent relation by e j to get a contradiction.) Hence for y K we have the representation By (3), i=1 y = β 1 u 1 + + β n u n. e k y = β k 0 so β k 0 which prove that y is an element of the right hand side of (4). Thus we can conclude that K is a subset of the right hand side of (4). The formula (5) of Lemma 1 is equivalent to the formula (380) of [1]. Corollary 1 For each subset I of indices in N, the vectors e i, i I, u j, j I c (where I c the complement of I with respect to N) are linearly independent. Proof. Assume that α i e i + β j u j = 0 (6) j I c i I 3

for some reals α i and β j. By the mutual orthogonality of the vectors e i, i I and u j, j I c it follows, by multiplication of the relation (6) with i I αi e i and respectively with j I c βj u j, that α i e i = 0 i I and Hence, α i = β j = 0 must hold. j I c β j u j = 0. The cone K 0 K is called a face of K if from x K 0, y K and x y K it follows that y K 0. The face K 0 is called a proper face of K, if K 0 K. Lemma 2 If K is the cone (2) and K is the cone (4), then for every subset of indices I = {i 1,..., i k } N the set F I = cone{e i : i I} = {x K : x u j = 0 : i I c } (7) (with F I = {0} if I = ) is a face of K. If i h i l whenever h l, then F I is for k > 0 a nonempty set in R n of dimension k. (In the sense that F I spans a subspace of R n of dimension k.) Every face of K is equal to F I for some I N. If I N then F I is a proper face. Proof. The relation in (7) follows from the definition of the vectors u j in Lemma 1, while the assertion on the dimension of F I is obvious. Suppose that x F I and y K with y x. Then (x y) u j = y u j 0, j I c and y u j 0, j N, because y K. Thus y u j = 0, j I c, hence y F I, showing that F I is a face. Suppose that x F for F an arbitrary proper face of K. Since x K, by the definition of the vectors u j, x u j 0 for j N. If x u j < 0, j N, then there exists a positive scalar t with (x ty) u j 0, j N. Hence, x ty K and thus ty x. But then ty F and since F is a cone, y F. This means that K F, that is, F cannot be a proper face. We have to show that F has a representation like (7). By the above reasoning, for each x F there exist some index i N with with x u i = 0. If F = {0} we have the representation (7) with I =. If F {0}, take x in the relative interior of F and let I be the complement in N of the maximal set of indices j with x u j = 0. (I must be a nonempty, proper subset of N since x 0.) Take y F arbitrarily. By the definition of x, x ty F for some sufficiently small t > 0. Hence, (x ty) u i 0, i N. (8) By y F K we also have y u i 0, i N. If y u j < 0 for some j I c, then (8) would imply x u j ty u j < 0, 4

which is a contradiction. Hence, we must have y u j = 0, j I c ; and accordingly F {z K : z u j = 0, j I c }. (9) Suppose that y K and y u j = 0, j I c. From definition we have x u i < 0 for each i I, whereby for a sufficiently large t > 0, (tx y) u i 0, i N. Hence, tx y is in the polar of K, which by Farkas lemma is K (This follows in fact, in our case, also by the symmetry of the vectors e i and u j in the formulae of Lemma 1.) Thus 0 y tx, whereby 0 (1/t)y x. Since F is a face of K, we have (1/t)y F and since it is also a cone, y F. This proves the converse of the inclusion in (9) and completes the proof. Thus a maximal proper face of K is of the form K i0 = cone{e i : i N \ {i 0 }} = cone{e i : i N, e i u i0 = 0}, hence it is also called the face of K orthogonal to u i0. Similarly, we have a maximal proper face of K orthogonal to some e j0. An equivalent result to the one presented in Lemma 2 is given by the Cone Table 1 on page 179 of [1]. Thus a maximal proper face of K is of the form K i0 = cone{e i : i N \ {i 0 }} = cone{e i : i N, e i u i0 = 0}, hence it is also called the face of K orthogonal to u i0. Similarly, we have a maximal proper face of K orthogonal to some e j0. Let F = cone{e i : i I} and F = cone{u j : j I c }. Then, from the above results it follows that and F = {x K : x u j = 0, j I c } F = {y K : y e i = 0, i I}. The faces F K and F K of the above form are called a pair of orthogonal faces where F is called the orthogonal face of F and F is called the orthogonal face of F. 3 Finite method of projection on a simplicial cone For an arbitrary u R n denote u = u u. Let K R n be an arbitrary cone and K its polar, and C R n an arbitrary closed convex set. Recall that the projection mapping P C : H H on C is well defined by P C x C and x P C x = min{ x y : y C}. Then, Moreau s decomposition theorem asserts: 5

Theorem 1 (Moreau, [6]) For x, y, z R n the following statements are equivalent: (i) z = x + y, x K, y K and x y = 0. (ii) x = P K z and y = P K z. Suppose now, that K is a simplicial cone in R n. We shall use the representation (2) for K and the representation (4) for K. Hence, e i u j = δ i j, i, j = 1,..., n where δj i the Kronecker symbol. As a direct implication of Moreau s decomposition theorem and the constructions in the preceding section we have: Theorem 2 Let x R n. For each subset of indices I N, x can be represented in the form x = α i e i + β j u j (10) i I j I c with I c the complement of I with respect to N, and with α i and β j real numbers. Among the subsets I of indices, there exists exactly one (the cases I = and I = N are not excluded) with the property that for the coefficients in (10) one has β j > 0, j I c and α i 0, i I. For this representation it holds that P K x = i I α i e i, α i 0, (11) and P K x = j I c β j u j, β j > 0. (12) Proof. The first assertion is the consequence of Corollary 1. The projections P K x and P K x as elements of K and K, respectively can be represented as n P K x = α i e i, α i 0 (13) and P K x = i=1 n β j u j, β j 0. (14) j=1 To prove existence, let I c = {j N : β j > 0} and let I be the complement of I c in the set N of indices. For an arbitrary element z R n, denote P K z = (P Kz). If α j > 0 would hold in (14), for some j I c, then by Lemma 1 it would follow that P K x P K x < 0, which contradicts the theorem of Moreau. Hence, (13) can be written in the form (11) and (14) can be written in the form (12). Therefore, Theorem 1 implies x = P K x + P K x = i I α i e i + j I c β j u j, where α i 0, i I and β j > 0, j I c. 6

To prove uniqueness, suppose that in the representation (10) of x we have α i 0, β i = 0 for i I and β j > 0, α j = 0 for j I c, where I is a subset of N, and I c is the complement of I in N (the cases I = and I = N are not excluded). Then representations (11) and (12) follow from Theorem 1 by using the mutual orthogonality of the vectors e i, i I and u j, j I c. From (11) and the uniqueness of the projection P K x it follows that I is unique. From this theorem it follows that a given simplicial cone K R n determines a partition of the space R n in 2 n cones in the sense that R n = cone{e i, u j : i I, j I c } I N and for two different sets I of indices the respective cones do not contain common interior points. The cones in the above union are exactly the sums of orthogonal faces. This theorem suggests the following algorithm for finding the projection P K x: Step 1. For the subset I N we solve the following linear system in α i x e l = i I α i e i e l, l I. (15) Step 2. Then, we select from the family of all subsets in N the subfamily of subsets I for which the system possesses non-negative solutions. Step 3. For each I we solve the linear system in β j x u k = j I c β j u j u k, k I c. (16) By Theorem 2 among these systems there exists exactly one with non-negative solutions. By this theorem, for corresponding I and for the solution of the system (15), we must have P K x = α i e i. i I This algorithm requires that we solve 2 n linear systems of at most n equations in Step 1 (15) and another 2 systems in Step 2 (16). (Observe that all these systems are given by Gram matrices, hence they have unique solutions.) Perhaps this great number of systems can be substantially reduced, but it still remains considerable. Remark 1 If K is subdual; that is, if e k e l 0, k, l N, the above algorithm can be reduced as follows: By supposing that we have got the representation (10) of x with non-negative coefficients, we multiply both sides of (10) by an arbitrary e l. If x e l < 0 then l cannot be in I, otherwise the relations u j e l = 0, j I c and e i e l 0, i I would furnish a contradiction. Thus, we have to look for the set I of indices (for which we have to solve the system (15)) among the subfamilies of {i N : x e i 0}. (Arguments like this can be used, as it was done e. g. in [7] for the Dykstra algorithm, to eliminate some hyperplanes while comkputing successive approximations of the solution.) 7

Obviously, the proposed method is inefficient. It was presented by A. B. Németh and S. Z. Németh in [8] as a preparatory matherial for an efficient algorithm for so called isotone projection cones only. For isotone projection cones we can obtain the projection of a point in at most n steps, where n is the dimension of the space. Isotone projection cones are special simplicial cones. Even if there are important isotone projection cones in applications, they are rather particular in the family of simplicial cones. 4 Heuristic method for projection onto a simplicial cone Regardless the inconveniences of the above presented exact method, which follow from its combinatorial character, it suggests an interesting heuristic algorithm. To explain its intuitive background we consider again the simplicial cone and its polar K = cone{e 1,..., e n } K = cone{u 1,..., u n } given by Lemma 1. Take an arbitrary x R n. We are seeking the projection P K x. If e i x 0, i N, then x K = ker P K, hence P K x = 0. If u j x 0, j N, then x K, and hence P Kx = x. We can assume that x / K K. Hence, x projects in a proper face of K and in a proper face of K. Take an arbitrary family I N of indices. Then, the vectors e i, u j : i I, j I c entgender by Corollary 1 a reference system in R n. Then, x = i I α i e i + j I c β j u j (17) with some α i, β j R. (As far as the family I N of indices is given, we can determine the coefficients α i and β j, according to Theorem 2, by solving the systems (15) and (16).) If we have α i 0, β j 0 : i I, j I c, then from Theorem 2 we obtain P K x = i I α i e i and P K x = j I c β j u j. In this case x is projected onto face F = cone{e i : i I} ortogonally along the subspace engendered by the elements {u j : j I c }, roughly speaking, along the orthogonal face F of F. Suppose that β j < 0 for some j I c. Then, considering the reference system entgendered by e i, u j, i I, j I c, x lies in its orthant with negative j th 8

coordinate, that is in the direction of the vector u j. By construction, e j and u j form an obtuse angle. Hence the angle of e j and u j is an acute one. Thus there is a real chance that in a new reference system in which e j replaces u j, the coordinate of x with respect to e j has the same sign as its coordinate with respect to u j, that is positive (or at least non-negative). If we have α i < 0 for some i I, then by similar reasoning it seems to be advantageous to replace e i with u i, and so on. Thus, we arrive to the following step in our algorithm: Substitute u j with e j if β j < 0 and substitute e i with u i if α i < 0 and solve the systems (15) and (16) for the new configuration of indices I. We shall call this step an iteration of the heuristic algorithm. Then, repeat the procedure for the new configuration of I and so on, until we obtain a representation (17) of x with all the coefficients non-negative. 5 Experimental results The heuristic algorithm was programmed in Scilab, an open source platform for numerical computation. 2 Experiments were performed on numerical examples for 2, 3, 5, 10, 15, 20, 25, 30, 50, 75, 100, 200, 300, 500 dimensional cones. The algorithm was performed on 100000 random examples for each of the problem sizes 2,..., 100. Statistical analysis on a subset of 10000 examples from the set of 100000 examples for size 100 indicates no significant difference in overall results and performance, therefore we subsequenlty reduced the number of experiments on larger problem sizes. 10000 random examples were used for sizes 200 and 300 and 1000 examples for size 500, as the time needed by the algorithm increases with size. Table 1 shows the experimental results. For each problem size, the averages of all runs are shown, together with a confidence interval at confidence level 95% where appropriate. 3 The Changes column indicates the total number of swaps u j for e j and e j for u j, respectively, before reaching the solution. The Iterations column indicates the number of iterations (as defined in Section 4) the algorithm performed before reaching a solution. The Iterations with increases column shows the percentage of iterations where the number of changes increased from the previous iteration. We noticed that in the majority of iterations the number of changes decreased, which led to the quick convergence of the algorithm in the vast majority of cases. In all examples the starting point for the search was the e 1,..., e n base. The final column shows the percentage of problems where the algorithm was aborted due to going in a loop by allocating in some iteration a set of e j s and u j s that were encountered in a previous iteration. The percentage of loops was exponentially decreasing as the size increased and we did not observe any loops in any experiments on problem sizes of 30 or above. Overall, loops were observed in 0.1% of the experiments, so the heuristic algorithm was successful 99.9% of the time. A solution that we see for solving the problems that lead to a loop is to restart the algorithm from a different initial set of e j s and u j s. The problems ending in a loop were excluded from the detailed analysis that follows. 2 http://www.scilab.org/ 3 Any difference less than ±0.5 for integers and ±0.1 for percentages, respectively, is not shown, as deemed irrelevant for the analysis. 9

Table 1: Number of changes, iterations, iterations where the number of changes increased and loops for the various cone dimensions Size Changes Iterations Iterations with Loops increases [%] [%] 2 1 1 0 4.382 3 2 1 3.9 ± 0.1 4.278 5 4 2 13 ± 0.2 1.396 10 11 3 26 ± 0.3 0.273 15 17 4 30.3 ± 0.3 0.029 20 24 4 31.6 ± 0.3 0.007 25 30 4 31.2 ± 0.3 0.003 30 37 4 29.9 ± 0.3 50 64 5 26.8 ± 0.3 75 97 5 24.2 ± 0.3 100 131 5 23.8 ± 0.3 200 267 ± 1 6 19.9 ± 0.8 300 409 ± 1 6 26.2 ± 0.9 500 700 ± 5 7 25.7 ± 2.7 More detailed analysis of the three main performance indicators of changes, iterations and iterations with increases was performed using boxplots as shown in Figures 1, 2 and 3. Although the total number of changes performed increases linearly with problem size (at a rate of less than 2 n, even if considering maximum numer of changes), this does not affect the performance substantially (see Figure 1, where the results were split into two parts for a clearer view). 90 900 80 800 70 700 Number of changes 60 50 40 30 Number of changes 600 500 400 300 20 200 10 100 0 0 2 3 5 10 15 20 25 30 50 75 100 200 300 500 Size Size Figure 1: Boxplots of number of changes performed for the various cone dimensions The number of iterations is the crucial indicator of performance. As shown in Figure 2 the number of iterations reaches at most 11 (for sizes 15 and 20), but in 75% of cases has a value of 7 or below. We ran a few experiments on larger sizes, up to 1750, and the largest number of iterations we obeserved was 13. Running experiments on very large problem sizes is problematic due to computer memory limitations and the Scilab built-in solving of linear systems. 4 4 Note that the time needed by one iteration substantially increases with problem size n as one iteration involves solving a linear system with n equations and n variables. 10

The major benefit of this heuristic algorithm is the small number of iterations even for very large number of cone dimensions. 12 10 8 Iterations 6 4 2 0 2 3 5 10 15 20 25 30 50 75 100 200 300 500 Size Figure 2: Boxplots of number of iterations needed for the various cone dimensions As our heuristic algorithm seems to converge quickly, we wanted to know how frequently it deviates from the optimal path. An optimal path would consist of iterations with decreasing number of changes. Figure 3 shows the boxplots for the number of iterations where an increase in the number of changes took place. The maximum number of such iterations over all experiments is 4, but 75% of examples involved only one or no such iteration. This provides an explanation for the fast convergence: the algorithm very rarely deviates from the optimal path. 6 Conclusion We presented a heuristic method of projection on simplicial cones based on Moreau s decomposition theorem. The heuristic algorithm presented in this note iteratively finds the projection onto a simplicial cone in a surprisingly small number of steps even for large cone dimensions in 99.9.% of the cases. We attribute the success to the fact that the algorithm rarely deviates from the optimal path, in every iteration it usually has to change less base values than in the previous iteration. We are planning to further extend the algorithm with random restart hoping to achieve 100% success rate. Acknowledgements S. Z. Németh was supported by the Hungarian Research Grant OTKA 60480. The authors are grateful to J. Dattorro for many helpful conversations. 11

4 3 Increases 2 1 0 2 3 5 10 15 20 25 30 50 75 100 200 300 500 Size Figure 3: Boxplots of number of iterations with increases in number of changes needed for the various cone dimensions References [1] J. Dattorro. Convex Optimization & Euclidean Distance Geometry. Mεβoo, 2005, v2009.04.11. [2] F. Deutsch and H. Hundal. The rate of convergence of Dykstra s cyclic projections algorithm: the polyhedral case. Numer. Funct. Anal. Optim., 15(5-6):537 565, 1994. [3] R. L. Dykstra. An algorithm for restricted least squares regression. J. Amer. Stat. Assoc., 78(384):273 242, 1983. [4] G. Isac and A. B. Németh. Projection methods, isotone projection cones, and the complementarity problem. J. Math. Anal. Appl., 153(1):258 275, 1990. [5] T. Ming, T. Guo-Liang, F. Hong-Bin, and Ng. Kai Wang. A fast EM algorithm for quadratic optimization subject to convex constraints. Statist. Sinica, 17(3):945 964, 2007. [6] J. J. Moreau. Décomposition orthogonale d un espace hilbertien selon deux cônes mutuellement polaires. C. R. Acad. Sci., 255:238 240, 1962. [7] P. M. Morillas. Dykstra s algorithm with strategies for projecting onto certain polyhedral cones. Applied Mathematics and Computation, 167(1):635 649, 2005. [8] A. B. Németh and S. Z. Németh. How to project onto an isotone projection cone. Linear Algebra Appl., submitted. 12

[9] S. Z. Németh. Iterative methods for nonlinear complementarity problems on isotone projection cones. J. Math. Anal. Appl., 350(1):340 347, 2009. [10] X. Shusheng. Estimation of the convergence rate of Dykstra s cyclic projections algorithm in polyhedral case. Acta Math. Appl. Sinica (English Ser.), 16(2):217 220, 2000. 13