A three-stage heuristic task scheduling for optimizing the service level agreement satisfaction in device-edge-cloud cooperative computing

Yongxuan Sang; Junqiang Cheng; Bo Wang; Ming Chen

doi:10.7717/peerj-cs.851

A three-stage heuristic task scheduling for optimizing the service level agreement satisfaction in device-edge-cloud cooperative computing

Yongxuan Sang¹, Junqiang Cheng², Bo Wang ¹, Ming Chen¹

1Zhengzhou University of Light Industry, Zhengzhou, China

2Europe-Aisa Hi-tech and Digital Technology Company Limited, Zhengzhou, China

DOI: 10.7717/peerj-cs.851

Published: 2022-01-18
Accepted: 2021-12-20
Received: 2021-10-11

Academic Editor: Junaid Shuja

Subject Areas: Computer Architecture, Computer Networks and Communications, Distributed and Parallel Computing, Embedded Computing, Mobile and Ubiquitous Computing
Keywords: Edge cloud, Task offloading, Cloud computing, Task scheduling

Copyright: © 2022 Sang et al.
Licence: This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited.

Cite this article: Sang Y, Cheng J, Wang B, Chen M. 2022. A three-stage heuristic task scheduling for optimizing the service level agreement satisfaction in device-edge-cloud cooperative computing. PeerJ Computer Science 8:e851 https://doi.org/10.7717/peerj-cs.851

The authors have chosen to make the review history of this article public.

Abstract

Device-edge-cloud cooperative computing is increasingly popular as it can effectively address the problem of the resource scarcity of user devices. It is one of the most challenging issues to improve the resource efficiency by task scheduling in such computing environments. Existing works used limited resources of devices and edge servers in preference, which can lead to not full use of the abundance of cloud resources. This article studies the task scheduling problem to optimize the service level agreement satisfaction in terms of the number of tasks whose hard-deadlines are met for device-edge-cloud cooperative computing. This article first formulates the problem into a binary nonlinear programming, and then proposes a heuristic scheduling method with three stages to solve the problem in polynomial time. The first stage is trying to fully exploit the abundant cloud resources, by pre-scheduling user tasks in the resource priority order of clouds, edge servers, and local devices. In the second stage, the proposed heuristic method reschedules some tasks from edges to devices, to provide more available shared edge resources for other tasks cannot be completed locally, and schedules these tasks to edge servers. At the last stage, our method reschedules as many tasks as possible from clouds to edges or devices, to improve the resource cost. Experiment results show that our method has up to 59% better performance in service level agreement satisfaction without decreasing the resource efficiency, compared with eight of classical methods and state-of-the-art methods.

Introduction

With the development of computer and communications technology as well as the growing need for the quality of human life, smart devices, e.g., smartphones and Internet of Thing (IoT) devices, have become more and more popular. As shown in the Cisco Annual Internet Report (CAIR) (Cisco, 2020) released in March 2020, networked devices will be increased from 18.4 billion in 2018 to 29.3 billion in 2023, and IoT devices will account for 50 percent by 2023. Juniper Research has reported similar results in 2018, IoT devices will grow at 140% over the next 4 years (Sorrel, 2018). Because of limited resources, plenty of user devices cannot satisfy their respective requirements at most time (Ghasempour, 2019; Liu et al., 2019b), as Internet applications has undergone rapid growth in both variety and complexity with the development of artificial intelligence algorithms (e.g., deep neural networks) and communication technologies (e.g., 5G and Wifi 6) (Wang et al., 2020b).

To address the resource scarcity problem of user devices, several researchers exploited the low latency edge resources (Balasubramanian et al., 2020) and the abundant cloud resources (Strumberger et al., 2019). Only one of them cannot address the problem effectively due to either the limited resource of edges or the poor network performance of clouds (Wang et al., 2019b). Thus, with the integration of respective benefits of edge and cloud computing, device-edge-cloud cooperative computing (DE3C) (Hong et al., 2019) is an effective way, where edges and clouds are employed jointly for expanding the resource capacity of user devices.

Task scheduling or offloading is an effective way for optimizing the task performance and the resource efficiency for DE3C, which decides the location (the corresponding device, an edge or a cloud) where each task to be processed (offloading decision) and the computing resources which each task performs on in a specified order (task assignment and ordering) (Wang et al., 2020b; Islam et al., 2021). Therefore, several works have proposed various task scheduling methods trying to optimize the response time (Han et al., 2019; Meng et al., 2019; Meng et al., 2020; Apat et al., 2019; Ren et al., 2019; Liu et al., 2019a; Wang et al., 2021), the resource cost (Mahmud et al., 2020; Gao et al., 2019; Chen et al., 2019) or the profit (Chen et al., 2020; Yuan & Zhou, in press) for providing services in DE3C. These works were concerned on addressing only one or two sub-problems of task scheduling, e.g., offloading decision or/and task assignment, and thus cannot provide global optimal solutions. In addition, a lot of existed works did not employ local device resources without network latency, even though a lot of smart devices have been equipped with computing resources almost equivalent to personal computers (Wu et al., 2019), nowadays.

Motivation

This paper focuses on the joint problem of offloading decision, task assignment and task ordering, to optimize the profit for service providers in DE3C, by improving the Service Level Agreement (SLA) satisfaction. An SLA is enforced when a user uses a provider’s service. If the SLA is fail to be satisfied, the provider must pay a penalty, which reduces the provider’s profit. In addition, an SLA violation reduces the provider’s reputation, and thus may lead to a loss of some potential users (Papadakis-Vlachopapadopoulos et al., 2019), while plenty of related works only concerned on improving the response time or the resource cost, which is in contradiction to the SLA satisfaction optimization, as better response times or less resources generally result in a less number of completed tasks. In addition, to our best knowledge, all existed work prioritised local resources (devices) for processing tasks and rented resources from clouds only when local and edge resources are not enough, as local resources are cheap and have no network latency. They did not consider that some tasks processed locally or in edges can be assigned to clouds to save some local or edge resources for completing tasks that cannot be finished by cloud resources.

For example, there are four tasks, t₁, t₂, t₃, and t₄, to be scheduled in a DE3C with one device, one edge server, and a cloud. The information of tasks and resources are shown in Table 1. The times consumed by t₁ and t₂ are 50 s, 51 s, and 35 s, respectively, when they are scheduled to the device, the edge server, and the cloud, and the consumed times are 5 s, 6 s, and 12.5 s, respectively, for t₃ and t₄. With the scheduling order of t₁, t₂, t₃, and t₄, and the idea of using device resources first, t₁, t₃, and t₂ are respectively scheduled to the device, the edge server, and the cloud, but the requirements of t₄ cannot be satisfied. But if scheduling tasks with the priority order of the cloud, the edge server, and the device, requirements of all tasks can be satisfied, where t₁ and t₂ are scheduled to the cloud and t₃ and t₄ are scheduled to the device.

Table 1:

The information of DE3C system for the case motivating our work.

a. task requirements
Task	Computing resource amount	Transferred data amount	Deadline
t₁	100 GHz	100 Mbit	50 s
t₂	100 GHz	100 Mbit	50 s
t₃	10 GHz	100 Mbit	10 s
t₄	10 GHz	100 Mbit	10 s

b. resource configurations
Resource	Computing capacity	Transmission bandwidth
Device	2 GHz	∞
Edge	2 GHz	100 Mbps
Cloud	4 GHz	10 Mbps

DOI: 10.7717/peerjcs.851/table-1

Contribution

This paper focuses on maximizing the SLA satisfaction, i.e., optimizing the number of tasks whose hard-deadlines are met, in DE3C by task scheduling, by exploiting both the low network latency of edges and the rich computing resources of clouds. To address the task scheduling problem in DE3C, the paper formulates it into a binary nonlinear programming (BNLP) for the SLA satisfaction optimization. In order to solve the problem in polynomial time, a heuristic method is designed based on the idea of least accumulated slack time first (LASTF) (Wang et al., 2020a) and earliest deadline first (EDF) (Benoit, Elghazi & Robert, 2021). In brief, the contributions of this paper can be summarized as followings.

The task scheduling problem of DE3C is formulated as a BNLP with two objectives, where the major one is maximizing the number of tasks whose requirements are satisfied¹ and the second one is maximizing the resource utilization.
A three-stage heuristic task scheduling method (TSSLA) is designed for DE3C. The first stage is to exploit the abundant computing resource of clouds, to finish as many tasks as possible,² by pre-scheduling tasks in the resource priority order of clouds, edges, and devices. In the second stage, TSSLA reschedules tasks from edges to respective devices to make some edge resources free for completing more tasks, and schedules remaining unscheduled tasks to edges. At the last stage, our method reschedules tasks from clouds to respective devices or edges, to reduce the resource cost. For task scheduling in each location, the proposed method respectively employs LASTF and EDF to assign the computing core for each task and decide the task execution order.
Simulated experiments are conducted referring to recent related works and the reality, to evaluate our proposed heuristic method. Experiment results verify our method having a much better performance than eight of classical and state-of-the-art methods in optimizing SLA satisfaction.

The rest of this paper is organized as follows. The second section illustrates the related works. Third section present the formulation of the task scheduling problem this paper concerned. Fourth section presents the three-stage heuristic method designed, and analyses its time complexity. The subsequent section evaluates the performance of our task scheduling method by simulated experiments. And finally, the last section concludes this paper.

Related work

As DE3C is one of the most effective ways to solve the problem of insufficient resources of smart devices and task scheduling is a promising technology to improve the resource efficiency, several researchers have focused on the design of efficient task scheduling methods in various DE3C environments (Wang et al., 2020b).

To improve the response time of tasks, the method proposed by Apat et al. (2019) iteratively assigned the task with the least slack time to the edge server closest to the user. Tasks are assigned to the cloud when they cannot be finished by edges. Their work did not consider the task scheduling on each server. OnDisc, proposed by Han et al. (2019), heuristically dispatched a task to the server providing the shortest additional total weighted response time (WRT), and sees the cloud as a server, to improve overall WRT. Stavrinides & Karatza (2019) proposed a heuristic method for improving the deadline miss Ratio. Their proposed method respectively employed EDF and earliest finish time first for task selection and resource allocation, and tried to fill a task before the input data is ready for the next task to execute.

The above research focused on the performance optimization for task execution, but did not concern the cost of used resources. In general, a task requires more resources for a better performance, and thus there is a trade-off between the task performance and the resource cost. Therefore, several works concerned the optimization of the resource cost or the profit for service providers. For example, Chen et al. (2020) presented a task scheduling method to optimize the profit, where the value of a task was proportional to the resource amounts and the time it took, and resources were provided in the form of VM. Their proposed method first classified tasks based on the amount of its required resources by K-means. Then, their method used Kuhn–Munkres method to solve the optimal matching of tasks and VMs with profit maximization for the VM class and the task class closest to the VM class, where all VMs were seen as one VM class. This work ignored the heterogeneity between edge and cloud resources, which may lead to resource inefficiency (Kumar et al., 2019). Li, Wang & Luo (2020) tried to optimize the finish time and the cloud resource usage cost. Their proposed method first made the offloading decision for each task, adopting the artificial fish swarm algorithm improved by simulated annealing method for calculating the probability of updating bulletin, to avoid falling into local optimal solution. Then their method greedily assigned a task to the computing node with the minimum utilization in edges or the cloud. This work focused on media delivery applications, and thus considered that each task can be divided into multiple same-sized subtasks for parallel process, which limited its application scope. The method proposed by Mahmud et al. (2020) iteratively assigned the offloaded application to the first computational instance such that all requirements are satisfied and the profit merit is minimum, where cloud-based instances were sorted behind edge-based instances and the profit merit was defined as the ratio of the profit and the slack time.

All of the aforementioned methods employed only edge and cloud resources for task processing, even though most of user devices have been equipped with various computing resources (Wu et al., 2019) which have zero transmission latency for users’ data. To exploit all the advantages of the local, edge and cloud resources, some works are proposed to address the task scheduling problem for DE3C. The method presented in Lakhan & Li (2019) first tried several existed task order method, e.g., EDF, EFTF, and LSTF, and selected the result with the best performance for task order. Then, the method used existed pair-wise decision methods, TOPSIS (Liang & Xu, 2017) and AHP (Saaty, 2008), to decide the position for each task’s execution, and applied a local search method exploiting random searching for the edge/cloud. For improving the delay, the approach presented in Miao et al. (2020) first decided the amounts of data that is to be processed by the device and an edge/cloud computing node, assuming each task can be divided into two subtasks with any data size. Then they considered to migrate some subtasks between computing nodes to further improve the delay, for each task. The method proposed in Zhang et al. (2019) iteratively assigned the task required minimal resources to the nearest edge server that can satisfy all of its requirements. Ma et al. (2022) proposed a load balance method for improving the revenue for edge computing. The proposed method allocated the computing resources of the edge node with the most available cores and the smallest move-up energy to the new arrived task. To improve the total energy consumption for executing deep neural networks in DE3C with deadline constraints, Chen et al. (2022) proposed a particle swarm optimization algorithm using mutation and crossover operators for population update. Wang et al. (2021) leveraged reinforcement learning with sequence-to sequence neural network for improving the latency and the device energy in DE3C. Machine learning-based or metaheuristic-based approaches may achieve a better performance than heuristics, but in general, they consume hundreds to tens of thousands more time, which makes them not applicable to make online scheduling decisions.

All of these existed research concerned only one or two problems of offloading decision, task assignment, and task ordering, which leads to suboptimal solutions. In addition, they were not fully explored the advantage of abundant cloud resources, as they considered assigning tasks to the cloud only when local and edge resources are exhausted. To address these issues, this paper designs a heuristic method for optimizing SLA satisfaction in DE3C. To our best knowledge, this is the first attempt to jointly address the problems of offloading decision, task assignment, and task ordering.

Problem Formulation

The DE3C environment considered in this paper is composed of various user devices, multiple edges (short for edge computing centers), and one cloud,³ as shown in Fig. 1. Each device launches one or more tasks for processing data it collected from user behaviors or surroundings. Each task can be processed locally, or offloaded to an edge covering the device or the cloud. When a task is offloaded, its processed data must be transmitted from the device to the edge or the cloud before its processing, by various communication links, e.g., wireless networks, telecommunications, etc. There have been some research on data transmission in advance to improve the network latency by predicting task offloading decisions (Zhang et al., 2017), which is complementary to our work. In this paper, the data is assumed to be transmitted only after the offloading decision is made for each task, which can avoid the waste of network resources due to the failure of predicting.

The considered DE3C system is consisted of M user devices ( $M$ = {m₁, m₂, .., m_M}), E edges ( $E$ = {e₁, e₂, …, e_E}), and one public cloud. For device m_j, there are n_j cores ( ${CM}_{j}$ = {cm_j,1, cm_j,2, …, cm_{j,n_j}}), and each core has g_j computing capacity. In edge e_k, s_k servers are deployed, which are represented as $S_{k}$ = {s_k,1, s_k,2, …, s_{k,s_k}}. Each edge server (say s_k,l) has n_k,l cores, each with g_k,l computing capacity. For satisfying users’ requirements when local and edge resources are not enough, assuming there are V cloud servers⁴ ( $V$ = {v₁, v₂, …, v_V}) rented from the cloud. Cloud server v_r has n_{v_r} cores, and the capacity of each core is g_{v_r}. The price of v_r is p_r per unit time. The bandwidth of transmitting data from device m_j to s_k,l and v_r are represented as b_j,k,l and b_j,r, respectively, which can be easily calculated by transmission channel state data (Chen et al., 2016; You et al., 2017; Du et al., 2019). If a device is not in the coverage of an edge, corresponding bandwidths are set to infinity.

Assuming there are T tasks, $T = \{t_{1}, t_{2}, \dots, t_{T}\}$ , requested by users for processing in the DE3C, and each task can be processed in the device launching it (locally), an edge server communicated with the device, or a cloud server. For each task, say t_i, it has in_i data must be processed, and has f_i processing length. To make our method wider application, assuming the data size to be processed and the processing length are independent for each task in any computing node. Task t_i must be finished before its deadline d_i defined by corresponding SLA.⁵ Assuming d₁ ≤ d₂ ≤ ... ≤ d_T without loss of generality. The following binary variables are defined for the following formulations. (1) $x_{i, j} = \{\begin{matrix} 1, if t_{i} is launched by m_{j} \\ 0, otherwise \end{matrix}, \forall i \in [1, T], \forall j \in [1, M] .$ (2) $x_{i, j, h} = \{\begin{matrix} 1, if t_{i} is processed by hth core in m_{j} \\ 0, otherwise \end{matrix}, \forall h \in [1, n_{j}], \forall i \in [1, T], \forall j \in [1, M] .$ (3) $y_{i, k, l, h} = \{\begin{matrix} 1, if t_{i} is processed by hth core in s_{k, l} \\ 0, otherwise \end{matrix}, \forall h \in [1, n_{k, l}], \forall i \in [1, T], \forall j \in [1, M], \forall l \in [1, s_{k}] .$ (4) $z_{i, r, h} = \{\begin{matrix} 1, if t_{i} is processed by hth core in v_{r} \\ 0, otherwise \end{matrix}, \forall h \in [1, n_{v_{r}}], \forall i \in [1, T], \forall r \in [1, V] .$

This paper does not consider the cooperative computing between devices (Hong et al., 2019), and thus each task cannot be processed by other users’ devices, i.e., (5) $x_{i, j, h} = 0, \forall h \in [1, n_{j}], \forall x_{i, j} = 0, \forall i \in [1, T], \forall j \in [1, M] .$

Each task can be only assigned to one core in the DE3C,⁶ and thus (6) $\sum_{j = 1}^{M} \sum_{h = 1}^{n_{j}} x_{i, j, h} + \sum_{k = 1}^{E} \sum_{l = 1}^{s_{k}} \sum_{h = 1}^{n_{k, l}} y_{i, k, l, h} + \sum_{r = 1}^{V} \sum_{h = 1}^{n_{v_{r}}} z_{i, r, h} \leq 1, \forall i \in [1, T] .$ Then the number of completed tasks is (7) $N = \sum_{i = 1}^{T} (\sum_{j = 1}^{M} \sum_{h = 1}^{n_{j}} x_{i, j, h} + \sum_{k = 1}^{E} \sum_{l = 1}^{s_{k}} \sum_{h = 1}^{n_{k, l}} y_{i, k, l, h} + \sum_{r = 1}^{V} \sum_{h = 1}^{n_{v_{r}}} z_{i, r, h}) .$

when t_i is processed locally, its execution time is its computing time as there is no data transmission, i.e., (8) $τ_{i} = f_{i} / g_{j}, \forall x_{i, j} = 1, \forall i \in [1, T], \forall j \in [1, M] .$ As all tasks assigned to a core can be finished before their respective deadlines if and only if each task can be finished within its deadline when they are processed in ascending order of the deadline (Pinedo, 2016), the finish time of each task processed by a core can be calculated by assuming tasks are processed in the EDF scheme in the core. Then, if t_i is processed in the hth core of device m_j, i.e., x_i,j,h = 1, its start time is the accumulated execution time of tasks that have earlier deadlines than it and are processed in the same core, i.e., $\sum_{i i = 1}^{i - 1} (x_{i, j, h} \cdot τ_{i i})$ . Thus, the finish time of t_i can be formulated as (9) $f t_{i, j, h} = x_{i, j, h} \cdot \sum_{i i = 1}^{i} (x_{i i, j, h} \cdot τ_{i i}), \forall h \in [1, n_{j}], \forall i \in [1, T], \forall j \in [1, M] .$ where ft_i,j,h = 0 when t_i is not assigned to h^th core of device m_j. In this situation, the deadline constraints can be formulated into (10) $f t_{i, j, h} \leq d_{i}, \forall h \in [1, n_{j}], \forall i \in [1, T], \forall j \in [1, M] .$ when a task is offloaded to an edge or the cloud, its execution time is formed from the data transfer time and computing time. If a task is scheduled to a core of an edge server or a cloud server, its computing can be started only when its data transmission is complete and the core has finished all tasks that have earlier deadlines and are assigned to the core (recall that EDF scheduling provides the best solution for a core in SLA optimization). The earliest completion time of data transmission for a task offloaded to an edge server or the cloud is (11) $d t_{i, k, l, h} = y_{i, k, l, h} \cdot \sum_{i i = 1}^{i} (y_{i i, k, l, h} \cdot \frac{i n_{i i}}{b_{j, k, l}}), \forall h \in [1, n_{k, l}], \forall i \in [1, T], \forall j \in [1, M], \forall l \in [1, s_{k}],$ or (12) $d t_{i, v_{r}, h} = z_{i, r, h} \cdot \sum_{i i = 1}^{i} (z_{i i, r, h} \cdot \frac{i n_{i i}}{b_{r}}), \forall h \in [1, n_{v_{r}}], \forall i \in [1, T], \forall r \in [1, V] .$ And the ready time of a core for computing an offloaded task in an edge or the cloud is the latest finish time of tasks that are offloaded to the same core and have earlier deadlines, which can be formulated into (13) $r t_{i, k, l, h} = y_{i, k, l, h} \cdot {max}_{i i = 1}^{i - 1} (y_{i i, k, l, h} \cdot f t_{i i, k, l, h}), \forall h \in [1, n_{k, l}], \forall i \in [1, T], \forall j \in [1, M], \forall l \in [1, s_{k}],$ or (14) $r t_{i, v_{r}, h} = z_{i, r, h} \cdot \sum_{i i = 1}^{i - 1} (z_{i i, r, h} \cdot f t_{i i, v_{r}, h}), \forall h \in [1, n_{v_{r}}], \forall i \in [1, T], \forall r \in [1, V],$ where ft_i,k,l,h and ft_{i,v_r,h} represent the finish time of task t_i when it is assigned to hth core in the edge server s_k,l and the cloud server v_r respectively. For a core processing an offloaded task, its start time is a later of its ready time and the completion time of the input data.⁷ Thus, the finish time of offloaded tasks can be calculated as followings (15) $f t_{i, k, l, h} = y_{i, k, l, h} \cdot (max \{d t_{i, k, l, h}, r t_{i, k, l, h}\} + \frac{f_{i}}{g_{k, l}}), \forall h \in [1, n_{k, l}], i \in [1, T], \forall l \in [1, s_{k}], \forall k \in [1, E] .$ (16) $f t_{i, v_{r}, h} = z_{i, r, h} \cdot (max \{d t_{i, v_{r}, h}, r t_{i, v_{r}, h}\} + \frac{f_{i}}{g_{v_{r}}}), \forall h \in [1, n_{v_{r}}], \forall i \in [1, T], \forall r \in [1, V] .$ And the deadline constraints of task processing in edges and the cloud can be formulated into (17) $f t_{i, k, l, h} \leq d_{i}, \forall h \in [1, n_{k, l}], i \in [1, T], \forall j \in [1, M], \forall l \in [1, s_{k}] .$ (18) $f t_{i, v_{r}, h} \leq d_{i}, \forall h \in [1, n_{v_{r}}], \forall i \in [1, T], \forall r \in [1, V] .$

For a computing node (a device, an edge server or a cloud server), the time occupied for processing tasks is the latest finish time of all tasks assigned to the node, i.e., (19) $u t_{m_{j}} = {max}_{i = 1}^{T} {max}_{h = 1}^{n_{j}} f t_{i, j, h}, \forall j \in [1, M],$ (20) $u t_{s_{k, l}} = {max}_{i = 1}^{T} {max}_{h = 1}^{n_{k, l}} f t_{i, k, l, h}, \forall l \in [1, s_{k}], \forall k \in [1, E],$ (21) $u t_{v_{r}} = {max}_{i = 1}^{T} {max}_{h = 1}^{n_{v_{r}}} f t_{i, v_{r}, h}, \forall r \in [1, V],$ where $u t_{N}$ is the use time of the computing node $N$ for finishing tasks assigned to it. Then, the occupied resource amount ( $o r_{N}$ ) of a computing node for task processing are respectively (22) $o r_{m_{j}} = u t_{m_{j}} \cdot g_{j} \cdot n_{j}, \forall j \in [1, M],$ (23) $o r_{s_{k, l}} = u t_{s_{k, l}} \cdot g_{k, l} \cdot n_{k, l}, \forall l \in [1, s_{k}], \forall k \in [1, E],$ (24) $o r_{v_{r}} = u t_{v_{r}} \cdot g_{v_{r}} \cdot n_{v_{r}}, \forall r \in [1, V],$ And the consumed computing resource amount $c r_{N}$ can be quantified by the accumulated processing length of its finished tasks for each computing node, i.e., (25) $c r_{m_{j}} = \sum_{i = 1}^{T} \sum_{h = 1}^{n_{j}} x_{i, j, h} \cdot f_{i}, \forall j \in [1, M],$ (26) $c r_{s_{k, l}} = \sum_{i = 1}^{T} \sum_{h = 1}^{n_{k, l}} y_{i, k, l, h} \cdot f_{i}, \forall l \in [1, s_{k}], \forall k \in [1, E],$ (27) $c r_{v_{r}} = \sum_{i = 1}^{T} \sum_{h = 1}^{n_{v_{r}}} z_{i, r, h} \cdot f_{i}, \forall r \in [1, V],$ Thus, the computing resource utilization of devices, edge servers, and cloud servers are respectively (28) $U_{d e v i c e} = \frac{\sum_{j = 1}^{M} c r_{m_{j}}}{\sum_{j = 1}^{M} o r_{m_{j}}},$ (29) $U_{e d g e} = \frac{\sum_{k = 1}^{E} \sum_{l = 1}^{s_{k}} c r_{s_{k, l}}}{\sum_{k = 1}^{E} \sum_{l = 1}^{s_{k}} o r_{s_{k, l}}},$ (30) $U_{c l o u d} = \frac{\sum_{r = 1}^{V} c r_{v_{r}}}{\sum_{r = 1}^{V} o r_{v_{r}}},$ and the overall resource utilization of the DE3C system is (31) $U = \frac{\sum_{j = 1}^{M} c r_{m_{j}} + \sum_{k = 1}^{E} \sum_{l = 1}^{s_{k}} c r_{s_{k, l}} + \sum_{r = 1}^{V} c r_{v_{r}}}{\sum_{j = 1}^{M} o r_{m_{j}} + \sum_{k = 1}^{E} \sum_{l = 1}^{s_{k}} o r_{s_{k, l}} + \sum_{r = 1}^{V} o r_{v_{r}}} .$

Based on the above formulation, the task scheduling problem optimizing the SLA satisfaction can be modelled as (32) $Maximizing N + U$ subject to (33) $(1) - (31)$ where the objective Eq. (32) is to maximize the number of finished tasks, and to maximize the overall computing resource utilization when the finished task number cannot be improved. The decision variables include x_i,j,h (h ∈ [1, n_j], j ∈ [1, M], i ∈ [1, T]), y_i,k,l,h (h ∈ [1, n_k,l], l ∈ [1, s_k], k ∈ [1, E], i ∈ [1, T]), and z_i,r,h (h ∈ [1, n_{v_r}], r ∈ [1, V], i ∈ [1, T]). This problem is binary nonlinear programming (BNLP), which can be solved by existed tools, e.g., lp_solve (Berkelaar et al., 2020). These tools are implemented based on branch and bound, which is not applicable to large-scale problems. Therefore, a heuristic method is proposed to solve the problem in polynomial time in the next section.

Three-stage Heuristic Task Scheduling

This section presents the proposed hybrid heuristic method, called TSSLA (Three-Stage scheduling optimizing SLA), to address the task scheduling problem stated in the previous section, which coordinates the richness of cloud computing resources and the low transmission delay of local and edge resources, to optimize the SLA satisfaction.

The proposed hybrid heuristic method includes three stages, where the first stage tries to satisfy deadlines of as many tasks as possible, by prioritising the usage of abundant cloud computing resources and assigning tasks that cannot be finished in the cloud to edges or corresponding devices. In the second stage, TSSLA tries to make full use of local resources and release some edge resources by rescheduling several tasks from edges to corresponding devices, and to exploit edge resources shared by multiple users for finishing more tasks. At the last stage, our method aims at optimizing the cost of cloud resources, by rescheduling as many tasks as possible from the cloud to corresponding devices and edges. Algorithm 1 outlines the proposed three-stage hybrid heuristic task scheduling.

As shown in Algorithm 1, in the first stage, TSSLA first pre-assigns all tasks that can be finished by cloud resources to the cloud (line 1 in Algorithm 1). For each task, TSSLA pre-rents a one-core VM instance with the best cost performance. In real world, a public cloud, e.g., Amazon EC2 (Amazon, 2020), provides various VM types configured with different core numbers, and for each type⁸, e.g., c6g.* in Amazon EC2, VM instances have a same price per core. Thus, TSSLA pre-rents one-core VM instances in this stage. After this step, there is no task can be finished in the cloud, and TSSLA pre-assigns remain tasks to edge servers employing LASTF and EDF for computing core selection and task ordering in each core, respectively (see Algorithm 2). Then, TSSLA schedules tasks to each device adopting Algorithm 2.

At the second stage, TSSLA examines each task assigned to edges. If the task can be finished in its device, TSSLA reassigns the task to the device, and the edge has more available resources for processing unassigned tasks. Thus, after that, TSSLA repeats step 2 in the first stage, which assigns remain tasks to edge servers by Algorithm 2. Now, no more tasks can be finished by DE3C resources, thus TSSLA only improves the resource usage in the last stage.

TSSLA employs two approaches to improve the resource efficiency in the third stage. One approach is trying to reassign as many tasks as possible from the cloud to local devices and edge servers because local resources and edge server resources are cheaper and have much less network latency, compared with cloud resources. And another is to consolidate tasks assigned to the cloud for improving the cost efficiency by reducing the idle time of VM instances, as cloud resources are charged by time unit, e.g., hour.

Therefore, in the third stage, TSSLA examines each task assigned to the cloud, and if the task can be finished by corresponding device, TSSLA reassigns the task to the device (see line 6 in Algorithm 1). Otherwise, TSSLA checks whether the task’s requirements can be satisfied by an edge server, and if so reassigns the task to the edge server (see line 7 in Algorithm 1). After these reassignments, TSSLA reassigns the task that has assigned to the next VM to one of its previous VMs by using Algorithm 2, and re-rents idle VMs. This step reduces the idle time of VMs, which improves the cost efficiency, as the rent time of each VM is round up to times of the charge unit for its cost. For example, if one user rents a VM for 1.8 h, and the cloud provider charges $0.1 per hours, the user must pay $0.2 ($0.1/hours ×⌈1.8⌉hours) for the VM.

TSSLA employs Algorithm 2 to implement all of these above task assignments in each device, edges, and the cloud, which decides which core to process the task. The detail is shown in the following.

As shown in Algorithm 2, to select an available computing core for a task, TSSLA traverses each available core (line 2 in Algorithm 2), and calculates the accumulated slack time with the assumptions that the task is assigned to the core and all tasks are executed in the ascending order of their deadlines (lines 3–8 in Algorithm 2). Then, TSSLA allocates the core providing LAST to the task (lines 9–15 in Algorithm 2). For each available core, if the assignment of the task results in at least one deadline violation (line 4 in Algorithm 2), Algorithm 2 returns false which means the requirements of the task cannot be satisfied by any of these available computing cores (line 17 in Algorithm 2).

Results

This section conducts simulated experiments designed by referring to related works and real worlds, to evaluate the performance of the proposed method.

Experiment design

A DE3C system is established, which is composed of one public cloud, an edge, and 10 user devices. The set of various system parameters are referring to that of Du et al. (2019), Chen et al. (2016), Alkhalaileh et al. (2020), the Gaia Cluster (University of Luxembourg, 2020), as well as Amazon EC2 (Amazon, 2020), which are detailed as followings and shown in Table 2. One hundred tasks are generated randomly. Each task is randomly associated to one device that is regarded as the device launching the task. The length and the size of each task are randomly set in ranges of [1, 2000] GHz and [20, 500] MB, respectively, to cover small to large tasks. The computing capacities of each core in a device, an edge server, a cloud VM are randomly in the ranges [1, 2] GHz, [2, 3] GHz, and [2, 3] GHz, respectively. The number of computing core is randomly set in the ranges [1, 4] and [4, 8], respectively for each device and each edge server. The number of servers is set in [1, 4] for the edge. The price of each core is $0.01 per hour for cloud VMs. The bandwidths for transmitting data from a device to the edge and the cloud are in [10, 100] Mbps and [1, 10] Mbps respectively.

Table 2:

The parameters of simulated DE3C system.

Tasks			Device	Edge server	Cloud VM
Number	100	Number	10	[1, 4]	–
Computing length	[1, 2000] GHz	Core number	[1, 4]	[4, 8]	–
Processed data size	[20, 500] MB	Capacity per core	[1, 2] GHz	[2, 3] GHz	[2, 3] GHz
Deadline	[500, 1500] s	Bandwidth	–	[10, 100] Mbps	[1, 10] Mbps
		Price	–	–	$0.01/hour

DOI: 10.7717/peerjcs.851/table-2

The performance of TSSLA is compared with the following classical or state-of-the-art scheduling methods designed for DE3C system. As done in all of the existed works (the best of our knowledge), the following methods employ local resources first, and the cloud resource at last, for processing tasks.

FF (First Fit) (Bays, 1977) iteratively assigns the first task to the first core satisfying its requirements.
FFD (First Fit Decreasing) (B.V. & Guddeti, 2018) iteratively assigns the largest task to the first core satisfying its requirements.
EDF (Earliest Deadline First) (Benoit, Elghazi & Robert, 2021) iteratively assigns the task with the earliest deadline to the first core satisfying its requirements.
BF (Best Fit) (Zhao & Kim, 2020) iteratively assigns the first task to the core that satisfies its requirements and provides the latest finish time for the task.
EFTF (Earliest Finish Time First) which is the basic idea of the method proposed by Liu et al. (2019a), iteratively assigns the first task to the core providing the earliest finish time.
EDF_EFTF, the idea of Stavrinides’s and Karatza’s proposed method (Stavrinides & Karatza, 2019), respectively employs EDF and EFTF for task selection and resource selection.
LSTF (Least Slack Time First) (Michel et al., 2021) iteratively assigns the first task to the core providing the least slack time.
LSSRF (Least Size-Slack time ratio First), the idea employed by Mahmud et al. (2020), iteratively assigns the first task to the core providing the maximal ratio between the profit and the slack time. The length is regarded as the profit for each task.

The performance metrics used to quantify the performance of each task scheduling method include the followings.

SLA satisfaction can be quantified by the amount of finished tasks in number, length, and processed data size, which are respectively calculated by Eqs. (7), (34), and (35). The larger value is better for the metric. For the length and the processed data size of finished tasks, the followings report the percentages of that of all launched tasks. (34) $l e n = \sum_{i = 1}^{T} ((\sum_{j = 1}^{M} \sum_{h = 1}^{n_{j}} x_{i, j, h} + \sum_{k = 1}^{E} \sum_{l = 1}^{s_{k}} \sum_{h = 1}^{n_{k, l}} y_{i, k, l, h} + \sum_{r = 1}^{V} \sum_{h = 1}^{n_{v_{r}}} z_{i, r, h}) \cdot f_{i}) .$ (35) $s i z e = \sum_{i = 1}^{T} ((\sum_{j = 1}^{M} \sum_{h = 1}^{n_{j}} x_{i, j, h} + \sum_{k = 1}^{E} \sum_{l = 1}^{s_{k}} \sum_{h = 1}^{n_{k, l}} y_{i, k, l, h} + \sum_{r = 1}^{V} \sum_{h = 1}^{n_{v_{r}}} z_{i, r, h}) \cdot i n_{i}) .$
Resource utilization is one of the most popular metrics to quantify the resource efficiency, which is the ratio between amounts of consumed resources and occupied resources, i.e., U calculated by Eq. (31). It is better for a higher value.
Makespan is the latest finish time of tasks, which can be achieved by Eq. (36). Earlier makespan means faster processing rate, and thus is better. (36) $m a k e s p a n = {max}_{i = 1}^{T} \{max \{{max}_{j = 1}^{M} {max}_{h = 1}^{n_{j}} f t_{i, j, h}, {max}_{k = 1}^{E} {max}_{l = 1}^{s_{k}} {max}_{h = 1}^{n_{k, l}} f t_{i, k, l, h}, {max}_{r = 1}^{V} {max}_{h = 1}^{n_{v_{r}}} f t_{i, v_{r}, h}\}\} .$
Cost efficiency is the length of tasks processed by per-dollar resource in the cloud, as calculated by Eq. (37). It is a metric for quantifying the resource efficiency in clouds. A greater value is better. (37) $C e f f = \frac{l e n}{\sum_{r = 1}^{V} ⌈ c r_{v_{r}} \cdot p_{r} ⌉}$

Experiment results

SLA satisfaction

Figure 2 shows the performance of various task scheduling methods in SLA satisfaction. As shown in the figure, TSSLA has 22.2%–27.6%, 47.3%–59.1%, and 25.4%–32.6% better SLA satisfaction performance compared with other methods in task number, task computing length, and processed data size, respectively. The superiority of our method is allocating computing resources according to the scarcity degree of resources. TSSLA prefers using the abundant computing resources of the cloud, and employs scarce computing resources of edges and devices for processing tasks cannot be finished by the cloud due to the poor network performance between users and their cloud, in its first stage. In contrary, other methods prefer local resources or nearby edge resources, aiming at providing the best performance for each task with minimal resource costs. But these methods result in several local and nearby edge resources that are used by some tasks which can be finished by the cloud, and these resources can be reserved for processing other tasks whose demands cannot be satisfied by the cloud. Thus, our method has a better performance than other methods in SLA satisfaction. Based on the idea of our method, these works can be improved by reassigning some tasks from local devices or edges to the cloud, to make some resources idle for finishing remaining unassigned tasks.

Figure 2 also shows that, except TSSLA, EDF has the best performance in optimizing SLA satisfaction in devices, and LSTF achieves the most number of completed tasks in the edge. Even so, all methods except TSSLA have comparable performance in SLA optimization overall. The reason why EDF is better than FF, FFD, BF and EFTF in SLA optimization in the devices is that EDF prioritizes the demands of tasks with tight deadlines, and thus postpones tasks with more slack time, which can finish more tasks with tight deadlines compared with other methods. Besides, EDF yields an optimal schedule for maximizing the number of finished tasks in each core (Pinedo, 2016). After completing more tasks locally, there are fewer tasks can be finished in edges or the cloud, as shown in Fig. 2, when applying EDF. This phenomenon does not occur when employing TSSLA. TSSLA satisfies more tasks not only in local devices but also in the edge, compared with other methods (except EDF in devices and LSFT in the edge). This is mainly because TSSLA assigns tasks that can be finished by both the cloud and local devices or the edge to the cloud at the first, which results in more available local and edge resources for processing tasks whose demands can be satisfied only by local devices or the edge. This further verifies the high efficiency of our method. The main reason why LSTF provides the most number of completed tasks at the edge is that it completes much fewer tasks by local resources, compared with other methods, and thus leaves more tasks with loose deadlines to the edge for processing.

Figure 3: (A–D) The overall computing resources of the DE3C, devices, edge servers, and cloud servers, when applying various scheduling methods.

Download full-size image

DOI: 10.7717/peerjcs.851/fig-3

Resource utilization

As shown in Fig. 3, TSSLA achieves almost same overall resource utilization to BF, and 4.1%–54.6% higher than other methods, which verifies that our method provides high resource efficiency for task processing in DE3C environments. The reason that TSSLA achieves a higher overall resource utilization, compared with other methods, is because it provides a much better utilization than others in the edge, as shown in Fig. 3C. In addition, TSSLA completes more than half of tasks’ computing length at the edge, as shown in Fig. 2B. In general, more tasks processed by the edge or the cloud means higher resource utilization in the edge or the cloud (shown in Figs. 2, 3C, and 3D). This is because more tasks can result in less ratio between the amounts of idle computing resources and occupied computing resources, as the data transmission and the data computing can be parallel for different tasks, in each core. Compared with LSTF, TSSLA achieves 41.7% higher utilization, as shown in Fig. 3C, although it completes 22.5%, 13.3%, and 21.1% less tasks in number, computing length, and processed data size, respectively, as shown in Fig. 2, in the edge. The main reason is that TSSLA has a greater ratio between the computing length and the processed data size of tasks completed by the edge than LSTF, which leads to less idle computing resources, and thus results in a higher computing resource utilization. This phenomena can be exploited to design heuristic scheduling methods for optimizing the efficiency of computing resources in edges and clouds.

The reason why BF achieves the highest overall resource utilization, as shown in Fig. 3A, is that the tasks processed by the cloud is the least in each SLA metric, as shown in Fig. 2, and thus the low utilization of cloud computing resources has the lowest impact on the overall resource utilization.

The utilization of cloud computing resources is much less than that of edge computing resources for each method, as shown in Fig. 3D, which is because the network performance of the cloud is much worse, leading more idle computing resources due to the longer time of data transmission, compared with that of the edge. Thus, it would be good to assign tasks with small data sizes to the cloud. Tasks processing less data usually have larger computing length, and thus there is a trade-off between the limited computing resources of devices or edges and the poor network performance of the cloud, which is one of our considerations for designing highly efficient scheduling methods in future.

Figure 4: The latest finish time of tasks in the DE3C, when applying various scheduling methods.

Download full-size image

DOI: 10.7717/peerjcs.851/fig-4

Makespan

Figure 4 shows the makespan when applying various scheduling methods. As shown in the figure, TSSLA has a larger makespan than other methods except LSTF, as the DE3C completes the most computing length and processes the most data of tasks when applying TSSLA, and the makespan is usually increased with the completed computing length and the processed data size. In fact, TSSLA has only about 20% larger makespan, but it completes more than 47.3% more computing length and processes more than 25.4% more data than other methods except LSTF. In addition, LSTF has larger makespan to TSSLA, even though it completes much less computing length and processes much fewer data than TSSLA. These results validate the efficiency of our methods further.

The major reason of LSTF having the largest makespan is that it completes the most number of tasks in both the edge and the cloud, and the local device provides much less processing time than an edge and especially the cloud for a task due to the data transmission time from the device to the edge and the cloud.

Cost efficiency

Figure 5 shows the cost efficiency for task processing in the cloud. TSSLA has a comparable cost efficiency to LSTF and is better than others. Comparing Figs. 3D and 5, we can see that the relative performance of the cost efficiency is almost same as that of the computing resource utilization. This is mainly because the cloud computing resources are charged by the use time. Thus, in most time, the resource utilization and the cost efficiency are equivalent for quantifying the usage efficiency of cloud computing resources. The reason why LSTF has a good cost efficiency is that it completes the most number of tasks in the cloud, as illustrated in ‘Resource Utilization’.

Conclusions

This paper studies on the SLA satisfaction optimization in a device-edge-cloud cooperative computing (DE3C) environment. This paper first formulates the problem into a BNLP, and then proposes a heuristic scheduling method, named TSSLA, to solve the problem in polynomial-time complexity. TSSLA consists of three heuristic stages which respectively exploit the abundant computing resources of the cloud, the shared resources of edges, and the low/zero network latency of edge and device resources, for optimizing the number of tasks whose requirements are satisfied and the resource efficiency. Experiment results confirm the superior performance of TSSLA in optimizing SLA satisfaction and resource efficiency.

In fact, our method improves the SLA satisfaction and the resource efficiency by improving the collaboration ability among devices, edges, and clouds to exploit all of their benefits. This idea can be also applied to other hybrid computing systems, e.g., multi-clouds, hybrid clouds, which is one of our future work.

This paper focuses on the task scheduling for DE3C environments, assuming the data is transmitted to the computing node only when the offloading decision is made for each task. Caching data in edge servers and especially the cloud in advance can improve the performance of task executions. Thus, the prediction of offloading decisions and the caching strategy in DE3C will be studied in the future. In addition, the design of cache-aware task scheduling methods will be concerned to improve the benefits of caching strategies.

Supplemental Information

Source code in C for implementing task scheduling methods

init_resources() randomly generates parameters of device, edge server, and cloud resources. init_tasks() randomly generates parameters of tasks. ff() implements FF. ffd() implements FFD. edf() implements EDF. bestf() implements BF. eftf() implements EFTF. edf_eftf() implements EDF_EFTF. lstf() implements LSTF. small_ratio_size_slack_first() implements LSSRF. heuristic() implements TSSLA.

DOI: 10.7717/peerj-cs.851/supp-1

Download

This paper considers the number of finished tasks with hard-deadlines as the SLA satisfaction metric. The proposed approach is compatible to any other metric.

In this paper, finishing/completing a task means finishing the task within its deadline, i.e., satisfying the task’s SLA requirement.

Multiple clouds can be seen as one big public cloud including the resources provisioned by these clouds.

The cloud resources can be provided in the form of virtual machine (VM), physical machine (PM), or both. The form of resource provisioning does not affect the application of our method.

This paper considers hard deadline, where the service provider has no gains for a task if the task is finished after its deadline. The scheduling of soft deadline tasks is considered as a future work.

In this paper, the redundant execution for a task is not employed to improve the task performance (Liu et al., 2019a) as it cost more resources. Each task is assumed to be processed by only one core, as done in many published articles. A task executed on more than one cores usually can be decomposed into several subtasks. One is suggested to refer to our previous work (Wang et al., 2019a) which studied on the task scheduling with parallelism awareness, which complements this work.

Usually the amount of result data is very small, and its transmission time is negligible compared with that of input data or the computing time. Thus, in this paper, the transmission time of the result data is ignored, as done by many previous works (Zhao & Zhou, 2019; Chen & Hao, 2018; Hong et al., 2019).

This paper assumes that cloud resources are provided on demand, and leave the concern of other provisioning scheme, e.g., spot, as a future work.

[1] Alkhalaileh M, Calheiros RN, Nguyen QV, Javadi B. 2020. Data-intensive application scheduling on Mobile Edge Cloud Computing. Journal of Network and Computer Applications 167:102735

[2] Amazon. 2020. Amazon web services –cloud computing services.

[3] Apat HK, s. Compt B, Bhaisare K, Maiti P. 2019. An optimal task scheduling towards minimized cost and response time in fog computing infrastructure. In: 2019 international conference on information technology (ICIT). 160-165

[4] Balasubramanian V, Otoum S, Aloqaily M, Al Ridhawi I, Jararweh Y. 2020. Low-latency vehicular edge: a vehicular infrastructure model for 5G. Simulation Modelling Practice and Theory 98:101968

[5] Bays C. 1977. A comparison of next-fit, first-fit, and best-fit. Communications of the ACM 20(3):191192

[6] Benoit A, Elghazi R, Robert Y. 2021. Max-stretch minimization on an edge-cloud platform. In: 2021 IEEE international parallel and distributed processing symposium (IPDPS). Piscataway. IEEE. 766-775

[7] Berkelaar M, Dirks J, Eikland K, Notebaert P, Ebert J, Gourvest H. 2020. lpsolve: a mixed integer linear programming (MILP) solver.

[8] B.V. N, Guddeti RMR. 2018. Heuristic-based IoT application modules placement in the fog-cloud computing environment. In: 2018 IEEE/ACM international conference on utility and cloud computing companion (UCC Companion). Piscataway. IEEE. 24-25

[9] Chen L, Guo K, Fan G, Wang C, Song S. 2020. Resource constrained profit optimization method for task scheduling in edge cloud. IEEE Access 8:118638-118652

[10] Chen M, Hao Y. 2018. Task offloading for mobile edge computing in software defined ultra-dense network. IEEE Journal on Selected Areas in Communications 36(3):587-597

[11] Chen X, Jiao L, Li W, Fu X. 2016. Efficient multi-user computation offloading for mobile-edge cloud computing. IEEE/ACM Transactions on Networking 24(5):2795-2808

[12] Chen X, Zhang J, Lin B, Chen Z., Wolter K, Min G. 2022. Energy-efficient offloading for DNN-Based smart IoT systems in cloud-edge environments. IEEE Transactions on Parallel and Distributed Systems 33(3):683-697

[13] Chen Y, Zhang N, Zhang Y, Chen X. 2019. Dynamic computation offloading in edge computing for Internet of Things. IEEE Internet of Things Journal 6(3):4242-4251

[14] Cisco. 2020. Cisco annual internet report (2018–2023)

[15] Du J, Zhao L, Chu X, Yu FR, Feng J, I CL. 2019. Enabling low-latency applications in LTE-A based mixed fog/cloud computing systems. IEEE Transactions on Vehicular Technology 68(2):1757-1771

[16] Gao G, Xiao M, Wu J, Huang H, Wang S, Chen G. 2019. Auction-based VM allocation for deadline-sensitive tasks in distributed edge cloud. IEEE Transactions on Services Computing 14(6):1702-1716

[17] Ghasempour A. 2019. Internet of things in smart grid: architecture, applications, services, key technologies, and challenges. Inventions 4(1):1-22

[18] Han Z, Tan H, Li X, Jiang SH, Li Y, Lau FCM. 2019. OnDisc: online latency-sensitive job dispatching and scheduling in heterogeneous edge-clouds. IEEE/ACM Transactions on Networking 27(6):2472-2485

[19] Hong Z, Chen W, Huang H, Guo S, Zheng Z. 2019. Multi-hop cooperative computation offloading for industrial IoT-Edge-Cloud computing environments. IEEE Transactions on Parallel and Distributed Systems 30(12):2759-2774

[20] Islam A, Debnath A, Ghose M, Chakraborty S. 2021. A survey on task offloading in multi-access edge computing. Journal of Systems Architecture 118:102225

[21] Kumar M, Sharma S, Goel A, Singh S. 2019. A comprehensive survey for scheduling techniques in cloud computing. Journal of Network and Computer Applications 143:1-33

[22] Lakhan A, Li X. 2019. Content aware task scheduling framework for mobile workflow applications in heterogeneous mobile-edge-cloud paradigms: cATSA framework. In: 2019 ISPA/BDCloud/SocialCom/SustainCom. 242-249

[23] Li C, Wang C, Luo Y. 2020. An efficient scheduling optimization strategy for improving consistency maintenance in edge cloud environment. The Journal of Supercomputing 76:6941-6968

[24] Liang D, Xu Z. 2017. The new extension of TOPSIS method for multiple criteria decision making with hesitant Pythagorean fuzzy sets. Applied Soft Computing 60:167-179

[25] Liu L, Tan H, Jiang SH-C, Han Z, Li X-Y, Huang H. 2019a. Dependent task placement and scheduling with function configuration in edge computing. In: Proceedings of the international symposium on quality of service, IWQoS’19. New York, NY, USA. Association for Computing Machinery. 1-10

[26] Liu Y, Yang C, Jiang L, Xie S., Zhang Y. 2019b. Intelligent edge computing for IoT-based energy management in smart cities. IEEE Network 33(2):111-117

[27] Ma Z, Zhang S, Chen Z, Han T, Qian Z, Xiao M, Chen N, Wu J, Lu S. 2022. Towards revenue-driven multi-user online task offloading in edge computing. IEEE Transactions on Parallel and Distributed Systems 33(5):1185-1198

[28] Mahmud R, Srirama SN, Ramamohanarao K, Buyya R. 2020. Profit-aware application placement for integrated FogCloud computing environments. Journal of Parallel and Distributed Computing 135:177-190

[29] Meng J, Tan H, Li X, Han Z, Li B. 2020. Online deadline-aware task dispatching and scheduling in edge computing. IEEE Transactions on Parallel and Distributed Systems 31(6):1270-1286

[30] Meng J, Tan H, Xu C, Cao W, Liu L, Li B. 2019. Dedas: online task dispatching and scheduling with bandwidth constraint in edge computing. In: IEEE INFOCOM 2019 - IEEE conference on computer communications. Piscataway. IEEE. 2287-2295

[31] Miao Y, Wu G, Li M, Ghoneim A, Al-Rakhami M, Hossain MS. 2020. Intelligent task prediction and computation offloading based on mobile-edge cloud computing. Future Generation Computer Systems 102:925-931

[32] Michel O, Bifulco R, Rétvári G, Schmid S. 2021. The programmable data plane: abstractions, architectures, algorithms, and applications. ACM Computing Surveys 54(4):82

[33] Papadakis-Vlachopapadopoulos K, González RS, Dimolitsas I, Dechouniotis D, Ferrer AJ, Papavassiliou S. 2019. Collaborative SLA and reputation-based trust management in cloud federations. Future Generation Computer Systems 100:498-512

[34] Pinedo ML. 2016. Scheduling: theory, algorithms, and systems (5th edition). Cham, Switzerland: Springer. 13-32

[35] Ren J, Yu G, He Y, Li GY. 2019. Collaborative cloud and edge computing for latency minimization. IEEE Transactions on Vehicular Technology 68(5):5031-5044

[36] Saaty TL. 2008. Decision making with the analytic hierarchy process. International Journal of Services Sciences 1(1):83-98

[37] Sorrel S. 2018. The Internet of Things: consumer industrial & public services 2018–2023. Sunnyvale, CA, USA

[38] Stavrinides GL, Karatza HD. 2019. A hybrid approach to scheduling real-time IoT workflows in fog and cloud environments. Multimedia Tools and Applications 78(17):24639-24655

[39] Strumberger I, Bacanin N, Tuba M, Tuba E. 2019. Resource scheduling in cloud computing based on a hybridized whale optimization algorithm. Applied Sciences 9(22)

[40] University of Luxembourg. 2020. The Gaia Cluster - HPC @ Uni.lu.

[41] Wang J, Hu J, Min G, Zhan W, Zomaya A, Georgalas N. 2021. Dependent task offloading for edge computing based on deep reinforcement learning. IEEE Transactions on Computers Epub ahead of print Nov 26 2021

[42] Wang J, Pan J, Esposito F, Calyam P, Yang Z, Mohapatra P. 2019b. Edge cloud offloading algorithms: issues, methods, and perspectives. ACM Computing Surveys 52(1)

[43] Wang B, Song Y, Cao J, Cui X, Zhang L. 2019a. Improving task scheduling with parallelism awareness in heterogeneous computational environments. Future Generation Computer Systems 94:419-429

[44] Wang B, Song Y, Wang C, Huang W, Qin X. 2020a. A study on heuristic task scheduling optimizing task deadline violations in heterogeneous computational environments. IEEE Access 8:205635-205645

[45] Wang B, Wang C, Huang W, Song Y, Qin X. 2020b. A survey and taxonomy on task offloading for edge-cloud computing. IEEE Access 8:186080-186101

[46] Wu C-J, Brooks D, Chen K, Chen D, Choudhury S, Dukhan M, Hazelwood K, Isaac E, Jia Y, Jia B, Leyvand T, Lu H, Lu Y, Qiao L, Reagen B, Spisak J, Sun F, Tulloch A, Vajda P, Wang X, Wang Y, Wasti B, Wu Y, Xian R, Yoo S, Zhang P. 2019. Machine learning at facebook: understanding inference at the edge. In: 2019 IEEE international symposium on high performance computer architecture (HPCA). Piscataway. IEEE. 331-344

[47] You C, Huang K, Chae H, Kim B. 2017. Energy-efficient resource allocation for mobile-edge computation offloading. IEEE Transactions on Wireless Communications 16(3):1397-1411