Algorithms with Performance Guarantee for Some Quadratic Euclidean Problems of 2-Partitioning a Set and a Sequence

Similar documents
On Some Euclidean Clustering Problems: NP-Hardness and Efficient Approximation Algorithms

On a Quadratic Euclidean Problem of Vector Subset Choice: Complexity and Algorithmic Approach

An FPTAS for a Vector Subset Search Problem

On a Problem of Choosing Elements in a Family of Sequences

Decomposition Algorithm of Generalized Horner Scheme for Multidimensional Polynomial Evaluation

Newton Method with Adaptive Step-Size for Under-Determined Systems of Equations

The FAD-methodology and Recovery the Thermal Conductivity Coefficient in Two Dimension Case

Numerical Algorithm for Optimal Control of Continuity Equations

The Problem of Linear-Quadratic Programming

Newton s and Linearization Methods for Quasi-variational Inequlities

On a Statement of Problem of Control Synthesis the Process of Heating the Bar

LOWER BOUNDS FOR THE UNCAPACITATED FACILITY LOCATION PROBLEM WITH USER PREFERENCES. 1 Introduction

Prime Cost Analysis in the Model of Gas Fields

On Solving the Problem of Optimal Probability Distribution Quantization

Investigation of Optimal Control Problem without Initial Conditions

Control of Chaos in Strongly Nonlinear Dynamic Systems

A Bimatrix Game with Fuzzy Payoffs and Crisp Game

genetic local search. for the Servers Load Balancing Problem

Complexity of local search for the p-median problem

Towards a Fixed Parameter Tractability of Geometric Hitting Set Problem for Axis-Parallel Squares Intersecting a Given Straight Line

Enforcing Speed Limits Using a Randomization Strategy

Hyperbolic set covering problems with competing ground-set elements

Limitations of Algorithm Power

Parameter Identification of an Endogenous Production Function

Models for the Control of Technical Systems Motion Taking into Account Optimality Conditions

International Journal "Information Theories & Applications" Vol.14 /

An 0.5-Approximation Algorithm for MAX DICUT with Given Sizes of Parts

Exact Algorithms for 2-Clustering with Size Constraints in the Euclidean Plane

Identification of Parameters of the Basic Hydrophysical Characteristics of Soil

Optimal Control for Radiative Heat Transfer Model with Monotonic Cost Functionals

Approximation Algorithms for Maximum. Coverage and Max Cut with Given Sizes of. Parts? A. A. Ageev and M. I. Sviridenko

8 Knapsack Problem 8.1 (Knapsack)

NP-COMPLETE PROBLEMS. 1. Characterizing NP. Proof

Algorithms. Outline! Approximation Algorithms. The class APX. The intelligence behind the hardware. ! Based on

Min-max-min Robust Combinatorial Optimization Subject to Discrete Uncertainty

On complete permutation polynomials 1

This means that we can assume each list ) is

Approximation Algorithms for Re-optimization

Extremal Graphs Having No Stable Cutsets

EXERCISES SHORTEST PATHS: APPLICATIONS, OPTIMIZATION, VARIATIONS, AND SOLVING THE CONSTRAINED SHORTEST PATH PROBLEM. 1 Applications and Modelling

A Generalization of Komlós s Theorem on Random Matrices

On Algorithm for the Minimum Spanning Trees Problem with Diameter Bounded Below

On values of vectorial Boolean functions and related problems in APN functions

ARCS IN FINITE PROJECTIVE SPACES. Basic objects and definitions

Convergence of Solutions of an Optimal Control Problem for SP1 and Rosseland Approximations

NUMBER SYSTEMS. Number theory is the study of the integers. We denote the set of integers by Z:

Two hours. To be provided by Examinations Office: Mathematical Formula Tables. THE UNIVERSITY OF MANCHESTER. 29 May :45 11:45

MATH 409 LECTURES THE KNAPSACK PROBLEM

Key Words: cardinal B-spline, coefficients, moments, rectangular rule, interpolating quadratic spline, hat function, cubic B-spline.

Maximum Likelihood Estimation for Mixtures of Spherical Gaussians is NP-hard

Graph. Supply Vertices and Demand Vertices. Supply Vertices. Demand Vertices

Mean Value Theorem. MATH 161 Calculus I. J. Robert Buchanan. Summer Department of Mathematics

Notes on Complexity of the Simple Assembly Line Balancing Problem

Journal of Symbolic Computation. On the Berlekamp/Massey algorithm and counting singular Hankel matrices over a finite field

DISTINGUISH HARD INSTANCES OF AN NP-HARD PROBLEM USING MACHINE LEARNING

Copyright 2013 Springer Science+Business Media New York

Single Machine Scheduling with Generalized Total Tardiness Objective Function

Mean Value Theorem. MATH 161 Calculus I. J. Robert Buchanan. Summer Department of Mathematics

VARIETIES OF ABELIAN TOPOLOGICAL GROUPS AND SCATTERED SPACES

Reachability relations and the structure of transitive digraphs

Integer factorization is in P

Reductions. Reduction. Linear Time Reduction: Examples. Linear Time Reductions

Invited Review OPTIMAL RECOMBINATION IN GENETIC ALGORITHMS FOR COMBINATORIAL OPTIMIZATION PROBLEMS PART I

Chapter 11. Approximation Algorithms. Slides by Kevin Wayne Pearson-Addison Wesley. All rights reserved.

On the mean connected induced subgraph order of cographs

arxiv: v1 [math.oc] 3 Jan 2019

MATH 326: RINGS AND MODULES STEFAN GILLE

Multi-criteria approximation schemes for the resource constrained shortest path problem

Building Graphs from Colored Trees

1 2 0 ERDŐS AND GRAHAM It is clear that one can inscribe k 2 squares into a unit square so that the sum of their circumferences is 4k. Erdös conjectur

ON MATCHINGS IN GROUPS

Efficient approximation algorithms for the Subset-Sums Equality problem

Sparse analysis Lecture II: Hardness results for sparse approximation problems

12. LOCAL SEARCH. gradient descent Metropolis algorithm Hopfield neural networks maximum cut Nash equilibria

PROPAGATION OF A MODE-I CRACK UNDER THE IRWIN AND KHRISTIANOVICH BARENBLATT CRITERIA

). In an old paper [11], I. N. Sanov

Show that the following problems are NP-complete

Bin packing and scheduling

Some Geometric and Algebraic Aspects of Domain Decomposition Methods

Normalizers of group algebras and mixing

DAA 8 TH UNIT DETAILS

CSC 8301 Design & Analysis of Algorithms: Lower Bounds

EVALUATING MISCLASSIFICATION PROBABILITY USING EMPIRICAL RISK 1. Victor Nedel ko

a + bi by sending α = a + bi to a 2 + b 2. To see properties (1) and (2), it helps to think of complex numbers in polar coordinates:

Generalization of some extremal problems on non-overlapping domains with free poles

Chapter 1 Preliminaries

THE COMPLEXITY OF DISSOCIATION SET PROBLEMS IN GRAPHS. 1. Introduction

arxiv: v1 [math.nt] 8 Apr 2016

CHAPTEER - TWO SUBGROUPS. ( Z, + ) is subgroup of ( R, + ). 1) Find all subgroups of the group ( Z 8, + 8 ).

Nonnegative Integral Subset Representations of Integer Sets

Reachability relations and the structure of transitive digraphs

CS5314 Randomized Algorithms. Lecture 18: Probabilistic Method (De-randomization, Sample-and-Modify)

COLORING CLEAN AND K 4 -FREE CIRCLE GRAPHS

An Asymptotic Property of Schachermayer s Space under Renorming

Approximation Preserving Reductions

Essential facts about NP-completeness:

Vertex-Coloring Edge-Weighting of Bipartite Graphs with Two Edge Weights

Theoretical Computer Science

Out-colourings of Digraphs

ON THE NP-COMPLETENESS OF SOME GRAPH CLUSTER MEASURES

Transcription:

Algorithms with Performance Guarantee for Some Quadratic Euclidean Problems of 2-Partitioning a Set and a Sequence Alexander Kel manov Sobolev Institute of Mathematics Acad. Koptyug avenue 4, Novosibirsk State University Pirogova str. 1, 630090 Novosibirsk, Russia kelm@math.nsc.ru Vladimir Khandeev Sobolev Institute of Mathematics Acad. Koptyug avenue 4, Novosibirsk State University Pirogova str. 1, 630090 Novosibirsk, Russia khandeev@math.nsc.ru Abstract We consider the problems of 2-partitioning a finite set and a finite sequence of points in Euclidean space into clusters (subsets or subsequences) minimizing the sum of squared distances between cluster elements and the corresponding cluster centers. It is assumed that the center of one of the desired clusters is the origin, while the center of the other cluster is unknown and determined as the mean value over cluster elements. In this work, we present a short survey on some results for these problems and a new result: a randomized algorithm for the sequence partitioning problem. 1 Introduction In this work, we consider two strongly NP-hard discrete optimization problems. The goal of our work is to present a short survey on some recent results for these problems. The problems under consideration are closely related to the well-known strongly NP-hard 2-MSSC (Minimum Sum-of-Squares Clustering) problem [Aloise et al., 2009]. This problem (also known as 2-Means) is to find a partition of a finite set Y of points from a Euclidean space into two subsets (clusters) C and Y \ C such that where y(c) = 1 C y C y y(c) 2 + y C y Y\C y y(y \ C) 2 min, 1 y and y(y \ C) = C y Y\C y are the centroids (the geometric centers) of sets C and Y \ C, respectively. The example of an input set of points on a plane is presented at Fig. 1. One needs to find a partition of this set into two clusters and find their centroids. It is assumed that the points are scattered around these centroids (the points of different clusters are colored differently). Copyright c by the paper s authors. Copying permitted for private and academic purposes. In: Yu. G. Evtushenko, M. Yu. Khachay, O. V. Khamisov, Yu. A. Kochetov, V.U. Malkova, M.A. Posypkin (eds.): Proceedings of the OPTIMA-2017 Conference, Petrovac, Montenegro, 02-Oct-2017, published at http://ceur-ws.org 298

Figure 1: Example of the input set The main difference of the considered problems from the 2-MSSC problem is that the center of one the clusters is fixed at a certain point in the Euclidean space (see the following sections); without loss of generality we can assume that this point is the origin. Additionally, in one of the problems under consideration the input is a sequence rather than a set, and there are constraints on the indices of elements of subsequences that are included in one cluster or another. Despite the similarities, the considered problems and the 2-MSSC problem are not equivalent. Therefore, the both considered problems need an individual study. Also, the problems under consideration are important for applications, e.g. for approximation theory, statistics, data mining, the remote sensing, geophysics, biometrics, medicine and technical diagnostics, processing the speech signals, electronic intelligence, radiolocation, hydroacoustics, etc. 2 Partitioning a Set Problem 1. Given: a set Y = {y 1,..., y N } of points from R q and a positive integer number M. Find: a subset C Y that minimizes the objective function S(C) = y y(c) 2 + y 2, y C y Y\C where y(c) = 1 C y C y is the centroid (the geometric center) of set C, under constrain C = M. Figure 2: Example of the input set 299

The example of an input set of points on a plane is presented at Fig. 2. One needs to find a partition of this set into two clusters. Unlike 2-MSSC problem, in Problem 1 one needs to find the centroid of only one cluster. It is assumed that the points of this cluster (shown by dark color) are scattered around its unknown centroid and that the points of the second one (shown by white color) are scattered around the origin. The strong NP-hardness of Problem 1 is shown in [Kel manov & Pyatkin, 2009]. In the case of fixed space dimension the problem is solvable in polynomial time [Gimadi et al., 2010, Shenmaier, 2016]. A 2-approximation algorithm for Problem 1 is presented in [Dolgushev & Kel manov, 2011]. The running time of the algorithm is O(qN 2 ). A polynomial-time approximation scheme for Problem 1 with a O(qN 2/ε+1 (9/ε) 3/ε )- time complexity, where ε is an arbitrary relative error, is substantiated in [Dolgushev et al., 2015]. In [Kel manov & Khandeev, 2015a], a randomized algorithm for Problem 1 is presented. For an established parameter value, the algorithm finds an approximate solution of the problem in polynomial time for given values of the relative error and failure probability; the conditions are established under which the algorithm is asymptotically exact and runs in polynomial time. Algorithm A 1 (randomized algorithm). Input: Y, M, and a positive integer parameter k. Step 1. Generate a multiset T by independently and randomly choosing k elements one after another (with replacement) from Y. Step 2. For each nonempty H T, compute the centroid y(h) and form a subset B M (y(h)) consisting of the M points from Y, having the largest projections onto the direction specified by this centroid. Compute S(B M (y(h))). Step 3. In the family of solutions found at Step 2, choose the subset C A = B M (y(h)) for which S(B M (y(h))) is minimal. If there are several solutions with the same minimal value, take an arbitrary of them. Output: C A. Theorem 1. For arbitrary δ (0, 1) and positive integer t k, algorithm A 1 finds a (1 + 1 δt )- approximate solution of Problem 1 in O(2 k q(k + N)) time with probability not less than 1 (δ + α), where α = t 1 ( k ) ( ) i ( ) k i. M i=0 i N 1 M N Corollary 1. Assume that M βn, where β (0, 1) is a constant. Then, given ε > 0 and γ (0, 1) for the fixed parameter k = max( 2 β 2 γε, 8 β ln 2 γ ) algorithm A 1 finds a (1 + ε)-approximate solution of Problem 1 with probability not less than 1 γ in O(qN) time. Below are the conditions for algorithm A 1 to be asymptotically exact. Theorem 2. Under the conditions of Theorem 1, let k = log 2 N, δ = (log 2 N) 1/2, t = km 2N. Assume that M βn, where β (0, 1) is a constant. Then algorithm A 1 finds a (1 + ε N )-approximate solution of Problem 1 with probability 1 γ N in O(qN 2 ) time, where ε N 2 β (log 2 N) 1/2, γ N (log 2 N) 1/2 + N β 8 ln 2. A pseudopolynomial algorithm which finds an optimal solution in the case of integer components of the points in the input set and fixed space dimension is proposed in [Kel manov & Khandeev, 2015b]. Algorithm A 2 (exact pseudopolynomial algorithm). Input: Y, M. Step 1. Given x G, where G is a multidimensional cubic uniform in each coordinate grid of size 2D with the distance 1 M between the nodes and the center at the origin, and D is the maximal modulus of the coordinates of input points, construct the subset B M (x) consisting of the M points from Y, having the largest projections onto the direction specified by x. Calculate Q(B M (x), x) = y B M (x) y x 2 + y Y\B M (x) y 2. Step 2. Find the point x A = arg min Q(B M(x), x) (if there are several points with the same minimal value, x G take an arbitrary of them) and the corresponding subset B M (x A ). As a solution to the problem, take C A = B M (x A ). Output: C A. Theorem 3. Suppose that, under the conditions of Problem 1, the points of Y have integer components lying in [ D, D]. Then algorithm A 2 finds an optimal solution in O(qN(2MD + 1) q ) time. In the case of fixed space dimension q the algorithm is pseudopolynomial, since under this condition, the running time of the algorithm is estimated by O(N(MD) q ). In [Kel manov & Khandeev, 2016], it was proved that, unless P=NP, in the general case of Problem 1 there is no fully polynomial-time approximation scheme (FPTAS). In addition, such a scheme was presented for the case of fixed space dimension. Algorithm A 3 (approximation scheme). 300

Input: Y, M and ε > 0. For each point y Y Steps 1 5 are executed. Step 1. Construct the set B M (y) of the M points from Y, having the largest projections onto the direction specified by y. Step 2. Compute S(B M (y)), h(y, ε) = 2ε qm S(B 1 M (y)) and H(y) = M S(B M(y)). Step 3. If S(B M (y)) = 0, then return the set B M (y) as a result C A produced by the algorithm and exit; otherwise, go to the next step. Step 4. Construct a multidimensional cubic uniform in each coordinate grid G(y, h, H + h/2) of size 2H + h with the distance h between the nodes and the center at the point y. Step 5. For each node x of grid G(y, h, H + h/2), construct the set B M (x) of the M points from Y, having the largest projections onto the direction specified by x, and compute S(B M (x)). Step 6. In the family {B M (x) x G(y, h, H + h/2), y Y} of sets, choose, as a solution C A, the set B M (x) for which S(B M (x)) is minimal. If there are several solutions with the same minimal value, take an arbitrary of them. Output: C A. Theorem 4. For any fixed ε > 0 algorithm A 3 finds a (1 + ε)-approximate solution of Problem 1 in O(qN 2 2q ( ε + 2)q ) time. In the case of fixed space dimension q the algorithm implements an FPTAS, since under this condition, the running time of the algorithm is estimated by O(N 2 (1/ε) q/2 ). 3 Partitioning a Sequence Problem 2. Given: a sequence Y = (y 1,..., y N ) of points from R q, and some positive integer numbers T min, T max and M. Find: a subset M = {n 1,..., n M } N = {1,..., N} such that F (M) = y j y(m) 2 + y i 2 min, j M where y(m) = 1 M i M y i, under constraints i N \M 1 T min n m n m 1 T max N, m = 2,..., M, (1) on the elements of (n 1,..., n M ). In this problem, (1) define the constraints on the elements of the cluster {y i i M} with unknown centroid. In [Kel manov & Pyatkin, 2013], the variant of Problem 2 was under study with T min and T max as parameters, and it was established that Problem 2 is strongly NP-hard for every T min < T max. In the trivial case when T min = T max, this problem is solvable in polynomial time. A 2-approximation polynomial algorithm with complexity O(N 2 (MN + q)) was proposed in [Kel manov & Khamidullin, 2014]. For the case where the components of the points are integers and the dimension q is fixed, the exact pseudopolynomial algorithm was introduced in [Kel manov et all., 2017a]. Algorithm A 4 (exact pseudopolynomial algorithm). Input: Y, T min, T max, M. Step 1. For each point x G, where G is a multidimensional cubic uniform in each coordinate grid of size 2D with the distance 1 M between the nodes and the center at the origin, and D is the maximal modulus of the coordinates of input points, find the optimal solution M x of the problem G x (M) = (2 y n, x x 2 ) max (2) n M under constraints (1) on the elements of M. Compute G x (M x ). Step 2. Find the point x A = arg max x G Gx (M x ) (if there are several points with the same maximal value, take an arbitrary of them) and the corresponding subset M A = M x A. Output: M A. 301

At Step 1 the optimal solution of the problem (2) is found in O(N(M(T max T min + 1) + q)) time using the dynamic programming scheme [Kel manov & Khamidullin, 2014]. Theorem 5. Suppose that, under the conditions of Problem 2, the points of Y have integer components lying in [ D, D]. Then algorithm A 4 finds an optimal solution in O(N(M(T max T min + 1) + q)(2md + 1) q ) time. In the case of fixed space dimension q the algorithm is pseudopolynomial, since under this condition, the running time of the algorithm is estimated by O(N 3 (MD) q ). In [Kel manov et al., 2016], an approximation algorithm was proposed which for the fixed dimension q of the space implements the FPTAS scheme. Algorithm A 5 (approximation scheme). Input: Y, T min, T max, M and ε > 0. For each point y Y Steps 1 5 are executed. Step 1. Find [Kel manov & Khamidullin, 2014] an optimal solution M y of problem (2) for x = y. Step 2. Compute F (M y ), h = 2ε qm F (My 1 ) and H = M F (My ). Step 3. If F (M y ) = 0, then return the set M y as a result M A produced by the algorithm and exit; otherwise, go to the next step. Step 4. Construct a multidimensional cubic uniform in each coordinate grid G(y, h, H + h/2) of size 2H + h with the distance h between the nodes and the center at the point y. Step 5. For each node x of grid G(y, h, H + h/2) construct [Kel manov & Khamidullin, 2014] an optimal solution M x of problem (2) and compute F (M x ). Step 6. In the family {M x x G(y, h, H + h/2), y Y} of sets, choose, as a solution M A, the set M x, for which F (M x ) is minimal. If there are several solutions with the same minimal value, take an arbitrary of them. Output: M A. Theorem 6. For any fixed ε > 0 algorithm A 5 finds a (1 + ε)-approximate solution of Problem 2 in O(N 2 2q (M(T max T min + 1) + q)( ε + 2)q ) time. In the case of fixed space dimension q the algorithm implements an FPTAS, since under this condition, the running time of the algorithm is estimated by O(MN 3 (1/ε) q/2 ). In [Kel manov et al., 2017b], a randomized algorithm for Problem 2 is presented. Algorithm A 6 (randomized algorithm). Input: Y, T min, T max, M, and a positive integer parameter k. Step 1. Generate a multiset T by independently and randomly choosing k elements one after another (with replacement) from Y. Step 2. For each nonempty H T, compute the centroid y(h) and construct [Kel manov & Khamidullin, 2014] an optimal solution M x of problem (2) for x = y(h). Step 3. In the family of solutions found at Step 2, of sets, choose a set M A = M x for which F (M x ) is minimal. If there are several solutions with the same minimal value, take an arbitrary of them. Output: M A. Theorem 7. For arbitrary δ (0, 1) and positive integer t k, algorithm A 6 finds a (1 + 1 δt )-approximate solution of Problem 2 in O(2 k (qk + N(M(T max T min + 1) + q))) time with probability not less than 1 (δ + α), ( k i ) ( M N ) i ( 1 M N ) k i. where α = t 1 i=0 Corollary 2. Assume that M βn, where β (0, 1) is a constant. Then, given ε > 0 and γ (0, 1) for the fixed parameter k = max( 2 β 2 γε, 8 β ln 2 γ ) algorithm A 6 finds a (1 + ε)-approximate solution of Problem 2 with probability not less than 1 γ in O(qMN 2 ) time. Theorem 8. Under the conditions of Theorem 7, let k = log 2 N, δ = (log 2 N) 1/2, t = km 2N. Assume that M βn, where β (0, 1) is a constant. Then algorithm A 6 finds a (1 + ε N )-approximate solution of Problem 2 with probability 1 γ N in O(qMN 2 (T max T min + 1)) time, where ε N 2 β (log 2 N) 1/2, γ N β (log 2 N) 1/2 + N 8 ln 2. Since T max T min + 1 N, under the specified conditions the running time of the algorithm is estimated by O(qMN 3 ). 4 Conclusion In the paper we present a short survey on some results for two problems of 2-partitioning a finite set and a finite sequence of points in Euclidean space into clusters. Also we present a new randomized algorithm for the 302

sequence partitioning problem. For an established parameter value, the algorithm finds an approximate solution in polynomial time for fixed values of the relative error and failure probability. The conditions are found under which the algorithm is asymptotically exact and has polynomial-time complexity. The issue of great interest is the substantiation of faster randomized algorithms for the problems and the search for subclasses of the problems for which linear time and sub-linear time randomized algorithms can be designed. Acknowledgements This work was supported by Russian Foundation for Basic Research, Projects 15-01-00462, 16-31-00186, 16-07- 00168. References [Aloise et al., 2009] Aloise, D., Deshpande, A., Hansen, P., & Popat, P. (2009). NP-hardness of Euclidean sumof-squares clustering. Mach. Learn., 75(2), 245 248. [Kel manov & Pyatkin, 2009] Kel manov, A. V., & Pyatkin, A.V. (2009). Complexity of certain problems of searching for subsets of vectors and cluster analysis. Comput. Math. and Math. Phys., 49(11), 1966 1971. [Gimadi et al., 2010] Gimadi, E. Kh., Pyatkin, A. V., & Rykov, I. A. (2010). On polynomial solvability of some problems of a vector subset choice in a Euclidean space of fixed dimension. J. Appl. Ind. Math., 4(1), 48 53. [Shenmaier, 2016] Shenmaier, V. V. (2016). Solving some vector subset problems by Voronoi diagrams. J. Appl. Ind. Math., 10(4), 560 566. [Dolgushev & Kel manov, 2011] Dolgushev, A. V., & Kel manov, A. V. (2011). An approximation algorithm for solving a problem of cluster analysis. J. Appl. Ind. Math., 5(4), 551 558. [Dolgushev et al., 2015] Dolgushev, A. V., Kel manov, A. V., & Shenmaier, V. V. (2015). Polynomial-time approximation scheme for a problem of partitioning a finite set into two clusters (in Russian). Tr. Inst. Mat. Mekh., 21(3), 100 109. [Kel manov & Khandeev, 2015a] Kel manov, A. V., & Khandeev, V. I. (2015). A randomized algorithm for twocluster partition of a set of vectors. Comput. Math. Math. Phys., 55(2), 330 339. [Kel manov & Khandeev, 2015b] Kel manov, A. V., & Khandeev, V. I. (2015). An exact pseudopolynomial algorithm for a problem of the two-cluster partitioning of a set of vectors. J. Appl. Ind. Math., 9(4), 497 502. [Kel manov & Khandeev, 2016] Kel manov, A. V., & Khandeev, V. I. (2016). Fully polynomial-time approximation scheme for a special case of a quadratic euclidean 2-clustering problem. J. Appl. Ind. Math., 56(2), 334 341. [Kel manov & Pyatkin, 2013] Kel manov A. V., & Pyatkin A. V. (2013). On Complexity of Some Problems of Cluster Analysis of Vector Sequences. J. Appl. Indust. Math., 7(3), 363 369. [Kel manov & Khamidullin, 2014] Kel manov A. V., & Khamidullin S. A. (2014). An Approximating Polynomial Algorithm for a Sequence Partitioning Problem. J. Appl. Indust. Math., 8(2), 236 244. [Kel manov et all., 2017a] Kel manov A.V., Khamidullin S.A., & Khandeev V.I. (2017). Exact Pseudopolynomial Algorithm for one Sequence Partitioning Problem. Automat. Remote Control, 78(1), 66 73. [Kel manov et al., 2016] Kel manov A.V., Khamidullin S.A., & Khandeev V.I. (2016). A Fully Polynomial-Time Approximation Scheme for a Sequence 2-Cluster Partitioning Problem. J. Appl. Ind. Math., 10(2), 209 219. [Kel manov et al., 2017b] Kel manov A.V., Khamidullin S.A., & Khandeev V.I. (2017). A randomized algorithm for a sequence 2-clustering problem (in Russian). Comput. Math. and Math. Phys. (accepted). 303