Operations Research Transactions ›› 2022, Vol. 26 ›› Issue (1): 113-124.doi: 10.15960/j.cnki.issn.1007-6093.2022.01.008

Previous Articles     Next Articles

A local search analysis for the uniform capacitated $k$-means problem with penalty

Jiachen JU1, Qian LIU2*,*(), Zhao ZHANG3, Yang ZHOU2   

  1. 1. Department of Operations Research and Information Engineering, College of Mathematics, Beijing University of Technology, Beijing 100124, China
    2. School of Mathematics and Statistics, Shandong Normal University, Jinan 250014, Shandong, China
    3. College of Mathematics and Computer Science, Zhejiang Normal University, Jinhua 321004, Zhejing, China
  • Received:2021-06-10 Online:2022-03-15 Published:2022-03-14
  • Contact: Qian LIU E-mail:lqqsh@163.com

Abstract:

The classical $k$-means problem has numerous application in the real world. Given a set of data points $D$ in ${\mathbb R}^d$ and an integer $k$, the aim is to select $k$ points in ${\mathbb R}^d$ as a central set $S$, such that the sum of the square of the distance between each point and its closest center in $S$ is minimized. It is a NP-hard problem and has many generalizations, one of which is the uniform capacitated $k$-means problem with penalty. Compared with the classical $k$-means problem, the penalty implies that each data point has a penalty cost, when the distance from some data points to their nearest center is greater than the penalty cost, they contribute the penalty cost. As for the uniform capacity, it suggests that each center is connected at most $U$ times. For this variant problem, we design a local search algorithm which selects at most $(3+\alpha)k$ centers and reaches $\beta$-approximate, where $\alpha>34$, $\beta>\frac{\alpha+34}{\alpha-34}$.

Key words: $k$-means problems, penalty, uniform capacity, bi-criteria, local search, approximation algorithms

CLC Number: