运筹与管理 ›› 2025, Vol. 34 ›› Issue (7): 1-8.DOI: 10.12005/orms.2025.0200

• 理论分析与方法探讨 •    下一篇

多任务并行云工作流调度模型与算法

王阳, 王洪普, 范琼瑜, 刘海潮   

  1. 西北工业大学 管理学院,陕西 西安 710072
  • 收稿日期:2023-07-21 发布日期:2025-11-04
  • 通讯作者: 王阳(1985-),女,陕西西安人,教授,博士生导师,研究方向:智能决策与优化。Email: yangw@nwpu.edu.cn。
  • 基金资助:
    国家自然科学基金资助项目(71971172,72371200);陕西省社会科学基金资助项目(2019S051);西北工业大学特色文科发展计划——青年创新能力培养项目(23GH030611);西北工业大学博士论文创新基金资助项目(CX2022057)

Model and Algorithm for Cloud Workflows Scheduling Considering Multi-task Parallelism

WANG Yang, WANG Hongpu, FAN Qiongyu, LIU Haichao   

  1. School of Management, Northwestern Polytechnical University, Xi'an 710072, China
  • Received:2023-07-21 Published:2025-11-04

摘要: 随着云计算在各领域的深入应用,云工作流调度问题所面临的任务数量正在急剧增长。本文研究多任务并行云工作流调度问题,考虑了任务的资源要求、任务特性和资源多样性之间的匹配,以及多个任务在同一虚拟机上并行处理等现实场景。构建了混合整数规划模型,并设计了基于破坏重构的模拟退火算法进行求解,以实现云工作流的高效调度。该算法使用贪心策略生成初始解,通过破坏和重构操作产生邻域解,并使用Metropolis准则确定邻域解的接受规则,避免陷入局部最优。为了验证本文所设计的基于破坏重构的模拟退火算法的有效性,将其与文献中的期限分解算法进行了对比。实验结果表明,在相同求解时间内,基于破坏重构的模拟退火算法求解质量整体优于期限分解算法。

关键词: 云计算, 工作流调度, 模拟退火, 期限分解

Abstract: Workflow is a series of related tasks connected by control flow and data dependencies. Many large-scale scientific computing problems in various fields can be represented as workflow models. These workflow models involve complex computational tasks, increasing data volume and computational requirements, which cannot be met by a single computer. With the rapid development of cloud computing technology and the increasing number of large-scale scientific computing problems, the scheduling problem of workflows in cloud computing environments has gained widespread attention. Developing scheduling strategies to map tasks in workflows to cloud resource nodes is an NP-hard problem, and the quality of scheduling algorithms affects the performance of the scheduling system, resources utilization, and user satisfaction. Therefore, studying the workflow scheduling problem in cloud computing environments is of significant importance.
Existing literature has extensively studied the workflow scheduling problem under different user service quality requirements, different constraint conditions, and specific application scenarios and optimization objectives. However, most studies have not considered the parallel execution of tasks on the same virtual machine and have ignored the matching degree between task characteristics and resources diversity. To address the shortcomings in existing research, this paper studies the cloud workflows scheduling considering multi-task parallelism. Specifically, it aims to minimize the total cost while meeting the deadline requirements of workflows. The proposed approach also considers the parallel execution of multiple independent tasks on the same virtual machine to reduce data transmission between virtual machines. Finally, the matching degree between task characteristics and virtual machine resource diversity is taken into account to improve processing speed and resource utilization.
Firstly, this paper constructs a mathematical model for the cloud workflows scheduling considering multi-task parallelism. Then, it designs a simulated annealing algorithm based on destruction and reconstruction to solve this model to achieve efficient scheduling and cost optimization of workflows. The simulated annealing algorithm first uses a greedy algorithm to generate an initial feasible solution. Then, starting from the initial solution, it generates neighborhood solutions through destruction and reconstruction operations. It uses the Metropolis criterion in simulated annealing to determine the acceptance rule for neighborhood solutions. If the newly generated solution is better than the current solution, the new solution is accepted. Otherwise, it accepts a worse solution with a certain probability, which helps to jump out of local optima to some extent. The algorithm repeats the destruction, reconstruction, and solution update operations until the pre-set solving time is reached, and then terminates the algorithm. Finally, the best solution obtained in the iteration process is taken as the approximate optimal solution.
To verify the effectiveness of the proposed algorithm, simulation experiments are conducted using the cloud simulation platform CloudSim. CyberShake, Epigenomics, Inspiral, and Sipht workflows with different scales, released by the Pegasus project, are used as test cases, and the CPU and memory configuration requirements of tasks in different types of workflows are determined based on literature. Besides, considering that the deadline decomposition algorithm is an effective algorithm for solving cloud workflow scheduling problems in the current literature, this paper designs a deadline decomposition algorithm for this problem and compares it with the simulated annealing algorithm through experiments. The deadline decomposition algorithm first divides tasks in the workflow into levels, grouping tasks of different levels into different task sets. Next, the algorithm uses the deadline decomposition strategy to set appropriate sub-deadlines for each task set. After that, it prioritizes the tasks and generates a task scheduling list that meets the pre and post dependency relationships. Finally, taking into account both time and cost factors, the algorithm assigns virtual machines to tasks in different sets in sequence.
The initial temperature value is an important factor that affects the global search performance of the simulated annealing algorithm, while the cooling rate and iteration number are closely related to the solving performance of the simulated annealing algorithm. Therefore, this paper first determines the optimal parameter values for the initial temperature, cooling rate, and iteration number through experiments. Then, the algorithm performance of the deadline decomposition algorithm and the simulated annealing algorithm under the optimal parameter values are compared. The experimental results show that the simulated annealing algorithm based on destruction and reconstruction designed in this paper has better solving performance than the deadline decomposition algorithm. But the algorithm's solving time is relatively long, and stability needs to be improved. Overall, the proposed algorithm can effectively help cloud service providers develop workflow scheduling plans, improve resources utilization and user service quality, and maximize the interests of both cloud service providers and users.

Key words: cloud computing, workflow scheduling, simulated annealing, deadline division

中图分类号: