Operations Research and Management Science ›› 2021, Vol. 30 ›› Issue (12): 35-41.DOI: 10.12005/orms.2021.0381

• heory Analysis and Methodology Study • Previous Articles     Next Articles

K-means Clustering Optimized by Lightning Attachment Procedure Optimization

GAO Wen-xin, LIU Sheng, XIAO Zi-ya   

  1. School of Management, Shanghai University of Engineering Science, Shanghai 201620, China
  • Received:2019-09-25 Online:2021-12-25

闪电分叉过程算法优化的K-means聚类

高文欣, 刘升, 肖子雅   

  1. 上海工程技术大学 管理学院,上海 201620
  • 通讯作者: 刘升(1966-),男,湖北黄石人,教授,博士,主要研究方向:人工智能,智能计算;
  • 作者简介:高文欣(1995-),女,蒙古族,河北承德人,硕士研究生,主要研究方向:群智能计算、智能计算、大数据处理与分析;肖子雅(1994-),女,江苏徐州人,硕士研究生,主要研究方向为智能算法、项目调度与优化。
  • 基金资助:
    国家自然科学基金资助项目(61075115,61673258);上海市自然科学基金资助项目(19ZR1421600)

Abstract: K-means clustering algorithm is a commonly algorithm applied in data mining and data analysis, but it has the disadvantages of relying on the initial value and easy to fall into the local optimum. For these shortcomings, this paper proposes an improved K-means clustering which is optimized by the lightning attachment procedure Optimization (LAPO) , which overcomes the difficulty of selecting the initial value of the clustering algorithm. This optimum improves the accuracy of the K-means clustering algorithm, and reduces the possibility of falling into a local optimum. Six real data sets are selected from the UCI data set for simulation experiments. The results show that the improved clustering algorithm has better accuracy and robustness.

Key words: clustering, lightning attachment procedure optimization, data processing, K-means

摘要: K-means聚类算法是在数据挖掘和数据分析中一种常用算法,但是其存在依赖初始值和易陷入局部最优值的缺陷,针对这些不足,本文提出一种闪电分叉过程算法优化的K-means聚类,克服聚类算法在初始值选择困难的问题,提高K-means聚类算法的求解精度,降低陷入局部最优的可能性。从UCI数据集中选取6个真实的数据集进行仿真实验,结果表明本文改进后的聚类算法有更好的求解精度和鲁棒性。

关键词: 聚类, 闪电分叉过程算法, 数据处理, K-均值聚类

CLC Number: