Operations Research and Management Science ›› 2021, Vol. 30 ›› Issue (4): 155-162.DOI: 10.12005/orms.2021.0124

• Application Research • Previous Articles     Next Articles

Modeling and Solution for Load Balancing Optimization in Distributed Stream Processing System Management

TANG Ying-feng1, CHEN Shi-ping2   

  1. 1. Academic Affairs Section, Shanghai University of International Business and Economics, Shanghai 201620, China;
    2. Management School, University of Shanghai for Science and Technology, Shanghai 200093, China
  • Received:2019-01-02 Online:2021-04-25

分布式数据流处理系统管理中负载均衡问题建模与求解

唐颖峰1, 陈世平2   

  1. 1.上海对外贸经贸大学 教务处,上海 201620;
    2.上海理工大学 管理学院,上海 200093
  • 作者简介:唐颖峰(1983-),男,甘肃嘉峪关人,助理研究员,博士,主要研究方向为云计算、数据挖掘;陈世平(1964-),男,教授,博士生导师,博士,主要研究方向为云计算、计算机网络通信、信息检索。
  • 基金资助:
    国家自然科学基金资助项目(61170277,61472256);上海市教委科研创新重点项目(12zz137);上海市一流学科建设项目(S1201YLXK)

Abstract: The paper concentrates on the load balancing problem in the management of distributed stream processing system. The operation mechanism of distributed stream processing system and the causes of unbalanced load of nodes is expounded, and an optimization scheme for load balancing adjustment is proposed in the paper. The model of the proposed optimization scheme is then established, and the applicable conditions of the model are theoretically analyzed. Then the model is solved with ant colony optimization, and the algorithm is then improved to meet the real-time requirement of distributed stream processing system.Finally, the validity of the model and its solution in solving the node load balancing problem in the management of distributed stream processing system is proved by experiments.

Key words: system management, distributed stream processing system, load balancing;combinatorial optimization problem, ant colony optimization

摘要: 对分布式数据流处理系统管理中,处理节点负载均衡问题进行了研究。阐述了分布式数据流处理系统的运行机理以及节点负载不均衡的成因,并提出了对系统负载均衡调整的优化方案;对提出的优化方案建立模型,并对模型的适用条件进行理论分析;然后采用蚁群算法对模型进行求解,并针对分布式数据流处理系统实时性的需求对算法进行改进;最后用实验证明本文所建立的模型及其求解方法对于解决分布式数据流处理系统管理中节点负载均衡问题的有效性。

关键词: 系统管理, 分布式数据流处理系统, 负载均衡, 组合优化问题, 蚁群算法

CLC Number: