Figures
Abstract
Containers have emerged as a more portable and efficient solution than virtual machines for cloud infrastructure providing both a flexible way to build and deploy applications. The quality of service, security, performance, energy consumption, among others, are essential aspects of their deployment, management, and orchestration. Inappropriate resource allocation can lead to resource contention, entailing reduced performance, poor energy efficiency, and other potentially damaging effects. In this paper, we present a set of online job allocation strategies to optimize quality of service, energy savings, and completion time, considering contention for shared on-chip resources. We consider the job allocation as the multilevel dynamic bin-packing problem that provides a lightweight runtime solution that minimizes contention and energy consumption while maximizing utilization. The proposed strategies are based on two and three levels of scheduling policies with container selection, capacity distribution, and contention-aware allocation. The energy model considers joint execution of applications of different types on shared resources generalized by the job concentration paradigm. We provide an experimental analysis of eighty-six scheduling heuristics with scientific workloads of memory and CPU-intensive jobs. The proposed techniques outperform classical solutions in terms of quality of service, energy savings, and completion time by 21.73–43.44%, 44.06–92.11%, and 16.38–24.17%, respectively, leading to a cost-efficient resource allocation for cloud infrastructures.
Citation: Canosa-Reyes RM, Tchernykh A, Cortés-Mendoza JM, Pulido-Gaytan B, Rivera-Rodriguez R, Lozano-Rizk JE, et al. (2022) Dynamic performance–Energy tradeoff consolidation with contention-aware resource provisioning in containerized clouds. PLoS ONE 17(1): e0261856. https://doi.org/10.1371/journal.pone.0261856
Editor: Jacopo Soldani, University of Pisa, ITALY
Received: July 15, 2021; Accepted: December 10, 2021; Published: January 20, 2022
Copyright: © 2022 Canosa-Reyes et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the manuscript and its Supporting Information files.
Funding: Andrei Tchernykh and Jorge M. Cortés-Mendoza. Russian Foundation for Basic Research (RFBR) and Chelyabinsk Region, project number 20-47-740005. https://www.rfbr.ru/rffi/ru/ Mikhail Babenko. Russian Foundation for Basic Research (RFBR), Sirius University of Science and Technology, JSC Russian Railways and Educational Fund “Talent and success”, project number 20-37-51004. https://www.rfbr.ru/rffi/ru/ The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
I. Introduction
Nowadays, data centers are growing exponentially due to cloud services’ popularization [1]. Cloud Service Providers (CSPs) use this kind of infrastructure to offer different tools and resources. They are mostly grouped into several types of services: Software as a Service (SaaS), Platform as a Service (PaaS), Infrastructure as a Service (IaaS), Storage as a Service (STaaS), Communications as a Service (CaaS), Network as a Service (NaaS), Monitoring as a Service (MaaS), a rapidly grooving Serverless computing, etc. [2]. Virtual Machines (VMs) and Containers (CTs) are the backbones of the virtualized services provided on-demand.
The efficient use of the data center infrastructures is fundamental for users and CSPs. For example, the low utilization of the servers is a critical factor for energy consumption inefficiency. Traditionally, researchers have focused on CPU utilization, where the VM placement problem is usually solved by NP-hard bin-packing. Few studies consider additional factors, such as reducing power consumption and avoiding SLA violations due to resource contention.
From a business perspective, reducing energy consumption leads to a significant reduction of environmental concerns and costs, hence reducing the final price for the user and increasing earns for CSPs [3].
The optimization of energy consumption is a challenge despite the diversity of existing energy management strategies. The energy efficiency of clouds has become a crucial research issue [2], mainly after introducing the green computing paradigm. An underlying technology involved in resource utilization and energy consumption of data centers is virtualization.
Resource virtualization refers to the abstraction of the hardware in a computer. Efficient resource usage, portability, scalability, and fast deployment are advantages of virtualization in the cloud infrastructure. The Virtual Machine Monitor (VMM), also called the hypervisor, is the main component of the hardware virtualization systems. This software allows the simultaneous execution of multiple guest VMs on a single server [4]. Some responsibilities of the VMM are strengthening the isolation between VMs and the management of the hardware resource.
The Operating System-level (OS-level) virtualization can be used either alternatively or in addition to hardware virtualization. Virtual CTs can be created on each OS [5,6]. Nowadays, the containerization technique is a buzzword for the cloud industry, especially in data centers [7]. The CTs are beneficial for CSPs since they can be more densely packed than VMs.
Each CT provides an isolated space for the user by encapsulating a group of processes separated from others in the system. The CTs can share the host kernel, libraries, and binaries, making them adequate technology to be used in scientific workflow platforms [8]. Examples of container implementations include Linux Containers (LXD), Docker, Kubernetes, OpenVZ, Singularity, etc.
In general, efficient job scheduling and load balancing over computational nodes remain challenging in massive, dynamic, elastic, diverse, and heterogeneous computational environments such as clouds.
The introduction of CTs technology brought several advantages to the cloud domain, but relevant topics in its field must be improved. Scheduling, load balancing, security, energy consumption, among others, are current topics of high importance to make cloud environments more efficient and accessible for the users.
Several researchers address the problem of energy consumption and/or makespan in data centers. The speed scaling method is highly used to aboard problems where the main objective is to save the energy subjected to the execution time constraint [8–11].
Unfortunately, these strategies affect the Quality of Service (QoS), which can be expressed using priorities, and Service Level Agreements (SLA), among others. Moreover, priority-based job scheduling introduces additional challenges because jobs with higher priorities must be served before those with lower priorities. Furthermore, priorities might influence the resource assignment, e.g., high-priority jobs might receive higher computing power than lower-priority jobs.
Several contention-aware resource allocation strategies to reduce energy consumption and increase performance are proposed [12–16]. Resource contention emphasizes avoiding co-hosted applications that contend for shared resources. These works study the power consumption in environments with bare metal and VMs infrastructure. However, environments with CTs can increase the number of applications competing for shared resources, increasing energy consumption.
This paper focuses on the online jobs’ allocation that contends for resources in a container-based cloud environment. Our contribution is multifold:
- We present job allocation as the multilevel dynamic bin-packing problem that provides a lightweight runtime solution that minimizes contention and energy consumption while maximizing utilization.
- The energy model considers types of applications and their execution on shared resources generalized by the job concentration paradigm.
- Two and three-level hybrid heuristic algorithms with container selection, capacity distribution, and contention-aware allocation are designed to consolidate resources to maximize utilization and reduce resource contention.
- Extensive experiments with eighty-six scheduling heuristics considering scientific workloads with memory and CPU-intensive jobs demonstrate that proposed techniques outperform classical solutions in terms of energy savings and completion time.
The proposed heuristics use different amounts of information in the allocation process. They improve the performance of well-known heuristics and provide a good compromise between QoS, makespan, and energy. We demonstrate that they are suitable for containers in cloud environments as more efficient and valuable solutions.
The rest of the paper is structured as follows. Section II discusses related work. Section III defines the infrastructure, scheduling, and energy model for allocating jobs into a container-based cloud. Section IV describes the processing speed models of the jobs. Section V discusses job allocation strategies. Section VI introduces the experimental setup. Section VII presents the experimental analysis. Finally, we conclude and discuss future work in Section VIII.
II. Related work
The interest and usage of container-based technologies in cloud environments have been growing significantly. Several works pointed out that CTs are the future of clouds [17].
Several container technologies were developed, where each of them provides different deployment, management, orchestration, and communication. In this section, we review some of these container-based technologies. Then, we present the latest advances in the area of our interest.
An LXD container [18] is an OS-level virtualization technique that runs multiple isolated Linux systems on a single Linux control host. The LXD resides directly on top of the host kernel. Hence, it does not need any extra abstraction layer or another guest OS. A namespace provides an interface for the Linux kernel features and isolations of resources for each CT. The control group manages the allocation of resources (metering and limiting).
Docker [19] is an open-source container project that simplifies the deployment of services with the methodology of one process per container. The Docker Container Engine, an additional abstraction layer, runs a single application in each virtualized instance. This methodology of execution attaches the CT lifetime with the finish time of the application. Docker daemon constructs a writable layer at the top of the CT read-only image to execute the processes.
Kubernetes [20] is an open-source system for managing the lifecycle of heterogeneous containerized applications (deployment, scaling, orchestration, etc.). The smallest deployable computing unit (POD), created and managed by Kubernetes, encapsulates a set of containers tightly coupled with some shared resources. It groups containers with shared storage/network resources for easy management of logical units of an application. A POD can be replicated along with several machines for scalability and fault tolerance purposes. Still, two services that listen on the same port cannot be deployed inside a POD.
OpenVZ [21] is a container-based virtualization technology to manage multiple secure and isolated Linux containers on a single physical server. Virtual Private Servers (VPSs) and CTs are basic units that share hardware and run on the same OS kernel as the host system. Kernel namespaces allow each CT to have an independent set of resources.
Singularity [22] is an open-source scientific container solution supporting an application in existing and traditional High-Performance Computing (HPC) resources. It offers computing mobility, reproducibility, user freedom, and integration with any scientific computational workflow. A Singularity CT image encapsulates the OS-system environment and all application dependencies necessary to run a defined workflow.
Table 1 shows the relevant characteristics of the related works. It highlights a research gap in the domain of CTs and resource contention. Our approach focuses on the completion time and energy consumption of allocation strategies in container-based cloud environments under resource contention.
Xu et al. [23] discuss the brownout model to reduce data center energy consumption. This approach can reduce energy consumption by selectively and dynamically deactivating optional application components. The experimental evaluation considers two types of hosts and four types of VMs.
Xu and Buyya [24] increase the functionality of the brownout to reduce data center energy consumption. They proposed a brownout-based approximate Markov Decision Process approach to improve tradeoffs between energy-saving and user discounts.
Xu et al. [25] propose several scheduling algorithms for managing microservices and CTs to reduce power consumption with QoS constraints. They achieve better performance in both objectives than the baseline algorithms.
Xu and Buyya [26] develop BrownoutCon to deal with overloads and reduce power consumption on cloud systems. It is integrated with Docker Swarm [36] to demonstrate its efficiency in managing containers under several policies.
Gholipoura et al. [27] propose a multi-criteria decision-making method for cloud environments with VMs and CTs. The consolidation defines the migration policy based on the virtual resource, CTs, and VMs.
Piraghaj et al. [28] present a set of allocation policies for energy saving on the Containers as a Service (CaaS) cloud paradigm. The authors propose an architecture and several algorithms to minimize energy consumption while maintaining the required SLA and QoS.
Khan et al. [29] study consolidation algorithms for effective migration of VMs, CTs, and applications to save energy without negatively impacting service performance. The results show the impact between the migration of applications and the migration of VMs, the resource consolidation considering various workloads, resources, and datacenter set-ups.
The cloud-based solutions community has also generated Internet of Things (IoT) research using VMs and CTs. The IoT workloads contain workflows with different types of jobs: CPU Intensive (CI), Memory Intensive (MI), and Bandwidth Intensive (BI) [33].
Celesti et al. [30] study the overhead costs of CT virtualization on an IoT device with a Raspberry Pi and Docker engine. The authors highlight the overhead introduced by container virtualization in a real scenario.
Dambreville et al. [31] discuss energy consumption reduction by introducing a prediction process in the scheduling task. The authors modify the available servers to fit the prediction and schedule all of the jobs on the available servers.
Cui and Xiaoqing [32] propose a scheduling solution in cloud computing for workflow tasks based on genetic algorithms. The fitness function defined the weighted sum of the user-defined deadline and the total cost of the workflow application execution. A top-down leveling is defined as the longest path from the task to the leaf task in the workflow for each task. This level is used as a task priority. According to a descending order of these priorities, the tasks are scheduled for the available hosts.
Tchernykh et al. [33] analyze several solutions for a digital twin workflow allocation in the virtual resource in a cloud infrastructure. The goal is to minimize the rent cost and satisfy the computational resources demand; the workloads include CI, MI, and BI jobs in order to model applications with different requirements. The authors propose allocation algorithms based on heuristics, metaheuristics, and mixed-integer programming to find low-cost solutions.
Several approaches focus on resource contention issues, where co-hosted applications contend for shared resources increasing energy consumption and reducing performance.
Armenta-Cano et al. [12,13] propose a resource allocation model for online energy-aware scheduling with job concentration. The authors characterize the energy consumption of applications and their combinations. It is used for heterogeneous job allocation to avoid resource contention.
Sheikhalishahi et al. [14] propose a multilevel resource contention-aware scheduling for energy-efficient in distributed systems. The authors define a resource contention metric for high-performance computing workloads. The approach models the interaction between system and scheduler information concerning jobs and resources.
Muraña et al. [15] study the power consumption for scientific computing applications in multicore systems. The authors evaluate the power consumption of applications in single and combined executions on Intel and AMD servers. The results indicate a strong dependency between the type of applications and power consumption.
Van Beek et al. [16] develop a CPU-contention predictor for Business-Critical Workloads (BCW) in data centers. It estimates performance degradation for VMs, hence, the risks of SLA violations.
Lovász et al. [34] present an energy-performance aware model for VMs allocation in heterogeneous servers as a variant of the multidimensional vector packing problem. The authors propose a prediction model to estimate performance degradation when different services are consolidated.
Blagodurov et al. [35] propose scheduling strategies to mitigate different resource contention sources on multicore processors. The authors define a classification scheme for contention in a cache space, memory controller, memory bus, and prefetching hardware.
In general, the different types of containers allow running a higher number and variety of applications in a single resource. Moreover, the energy consumption of the system is increased if the applications contend for resources. The following section defines the model to characterize job allocation in a container cloud environment aware of resource contention.
III. Model
In this section, we formulate the infrastructure, job, and energy consumption models, and define the optimization criteria of the job allocation problem. The main notations are summarized in Table A1 of Appendix.
A. Scheduling model
The IaaS environment is represented by a set of M servers (processors), P = {p1, p2,…,pM}. Each server pk, for all k = 1,…,M, has a maximum processing capacity Qk expressed in Millions of Instructions Per Second (MIPS), and runs a set of mk containers, . Each container , for all i = 1,…,mk, has a processing capacity (CPU quota) , expressed in MIPS, such that .
The total number of containers running in the infrastructure is denoted by defines the subset of jobs running in the container and is the number of SLA violations.
We consider a set of n independent jobs, J = {j1, j2,…,jn} that must be scheduled on the set of containers . Each job jj is described by a tuple (rj, sj, wj, ρj, oj) that consists of its release time rj≥0, the minimum required processing speed sj in MIPS, the total amount of work wj in millions of instructions, the job type ρj (CI and MI), and the priority expressed by an integer value oj.
rj is not available before the job is submitted, and ρj is only used for the system to compute the energy consumption. At any given time t, a processing speed might be assigned to job jj, when then the resource incurs in SLA violations for jj in ci.
Additionally, wj can be used as an estimator of the finish time of jj, concerning . Several techniques can be used to estimate an accuracy value of wj.
In this paper, we limit its use to identify CTs with a major amount of pendant processing. The idea is to study dynamic performance degradation and energy consumption increase due to shared resource contention and present consolidation heuristics based on contention-aware resource provisioning.
We consider this problem as a special case of multilevel dynamic bin-packing (online and non-clairvoyant). Bins represent CTs, and the jobs define the contribution to CT utilization. An additional element to the traditional dynamic bin-packing focuses on the contention-aware distribution of available processing capacity.
The processing speed of a job can be changed during its execution. It can be increased and reduced but should not be lower than the minimum required processing speed sj to satisfy SLA. When a job finishes its execution, the available processing capacity of the container is distributed between running jobs according to the strategies described in Section IV.
Cmax is the maximum finishing time (completion time or makespan) of all jobs.
Let fi be the finishing time of jj job. Cmax is defined as: (1)
The total number of SLA violations is calculated as follows: (2)
In this paper, we consider three conflicting objectives for optimization: SLAv, E, and Cmax: (3)
The measure of the energy consumption E depends on the model. Traditional energy models do not take into account the types of applications and heterogeneity of workloads.
The next section describes an energy model that considers system performance degradation due to jobs contend for resources [12].
B. Energy model
Let us describe the energy consumption model used in our experimental evaluation. The power consumption of the system is defined by (4) where Eop(t) is the energy consumption of the infrastructure at time t calculated as a sum of the energy consumption of all individual processors k at time t: (5)
Two types of consumption are distinguished in the processor: static and dynamic. The static one is the power consumed by the component without performing useful work due to the leakage current, also known as idle power or base power eidle. The dynamic energy consumption is produced when an application utilizes components during its execution eused.
The energy consumption of the processor k at time t, of Eq (5), is computed as: (6) where o(t) = 1, if the processor is on at time t, and o(t) = 0, otherwise, and the constant value does not depend on the time t.
The energy consumption of the processor k at time t depends on its utilization level and concentration of different types of jobs: (7) where defines a constant for the maximum energy consumption of the processor at full capacity, 0≤Fk(t)≤1 is the portion of the power consumed by different types of jobs, and gk is a fraction of the total energy consumption introduced by job type combinations.
Known energy models are based on the observation that CPU utilization is highly correlated with overall energy consumption. Linear and nonlinear models are frequently used. In this work, we follow a nonlinear hybrid energy consumption model proposed in [12,13] as a function of CPU utilization and concentration of jobs of different types.
We calculate Fk(t) as a sum of the portion of the power consumed by each job as follows: (8) where defines the fraction of energy consumption of a given job type d when its processor utilization is at time t.
Fig 1 shows of CI and MI applications versus CPU utilization. We see that each job type contributes differently to the total energy consumption with the same CPU utilization.
Results are obtained by the well-known performance tool suite LIKWID for the GNU/Linux operating system and SysBench benchmark.
The server used in the experiments is an Express x3650 M4, with two Xeon Ivy Bridge processors E5-2650v2 95W, default clock speed of 2.6 GHz. Each processor has eight cores and two threads per core (16 with hyperthreading), level 1 memory of 32 kB, level 2 of 256 kB, level 3 of 20 MB. Two Non-Uniform Memory Access (NUMA) domains of 32 GB each, with a total memory of 64 GB are used. The server OS is a CentOS Linux release 7.1.1503.
The combination of applications is considered in the following manner. Let us assume two types of jobs. The total CPU utilization is calculated as: (9) where and are the utilization of all jobs of type CI and MI, respectively, executed on the processor at time t. They are calculated as: (10) (11) where is a processing speed assigned to job jj at time t.
is a fraction of the total energy consumption introduced by the job types’ combination.
is the concentration (proportion) of the job CI (quantity of job utilization in a processor utilization) at time t. It is a measure of the total fraction of CPU capacity consumption of all jobs of a given type in the total capacity. (12) (13) when jobs just with one type are running on the processor.
Fig 2 shows versus for two job types. We see that gk = 1 when MI jobs are not executed, hence and , or only MI jobs are executed, hence and .
According to Fig 2, we see a reduction in energy consumption when jobs of both types are allocated to the processor.
The coefficient with the lowest energy consumption for CI and MI concentration is between 0.125 and 0.88. For instance, the concentration 0.125 is obtained for one MI and seven CI applications per eight-core processor. The concentration 0.88 is obtained for seven MI and one CI applications. If both concentrations are about 0.5, gk = 0.84.
The concentration implies that all applications in the processor are MI type, , and vice versa. In both cases, there is no reduction in energy consumption because .
A Lagrange interpolating polynomial represents the energy profile of each job type [15]. We use the obtained equations to estimate the energy consumption of jobs in the energy model.
IV. Processing capacity model
In this section, we focus on three models of the processing capacity distribution between jobs during allocation and execution: required capacity, full proportional capacity, and full priority capacity.
A. Required capacity
In the Required Capacity model (RC), the assigned job speed cannot be higher than the minimum required processing speed sj to satisfy SLA.
The main idea of the RC model is to limit the utilization of CTs to avoid that jobs conflict for shared physical resources. We limit the utilization of the CTs to avoid the saturation of the resource due to several CTs can be hosted in it.
The threshold of the job allocation and available capacity redistribution is the minimum processing speed of each job. Formally, (14)
Fig 3 illustrates an example of the job allocation with the RC model. We consider three jobs with the following specifications: j1 = (0, 300, 3*1014, 1), j2 = (50, 400, 2*1014, 3), j3 = (70, 500, 3.5*1014, 2), and container with capacity MIPS.
Initially, the scheduler defines the processing speed of j1 to s′1(0) = 300 because it is available at time 0. When j2 arrives, at time t = 50, the scheduler assigns the required processing speed s′2(50) = 400. At time t = 70, job j3 cannot be allocated in the container because does not have enough available resources to fulfill s′3(70) = 500. So, the processing capacity is distributed using Eq (14), violating SLA.
Job j2 ends its execution at the time f2 = 646, and the scheduler updates the speed of jobs in the container, s′1(646) = 300 and s′3(646) = 500. The finish time of jobs are f3 = 866, and f1 = 1,096.
The two additional models keep resource utilization in full. The only difference is the distribution of the exceeding capacity, additional capacity after satisfying the basic requirement of jobs running in the CT.
B. Full proportional capacity
In the Full Proportional capacity model (Prop), unallocated available capacity in the container is distributed proportional among jobs. Formally, we assume that ji is assigned to the container . (15)
Fig 4 shows an example of allocation jobs with Prop model. Similar to the previous example, we consider the three jobs j1, j2, and j3 and one container .
Job j1 is available at time 0, with the Prop model, j1 can use the full capacity of the container s′1(0) = 1,000. Job j2 arrives at time t = 50, the capacity of is divided between both jobs according to Eq (15), so the scheduler defines the jobs processing speed with s′1(50) = 429 and s′2(50) = 571. During the execution of job j3, from t = 70 to t = 635, the speed values for the three jobs in the container are 250, 334, and 416, respectively. Job j2 ends its execution at a time f2 = 635 and the speed of the jobs in the container are updated to s′1(635) = 375, and s′3(635) = 625. The finish times of jobs j3, and j1 are f3 = 819, and f1 = 850.
C. Full priority capacity
In the Full Priority capacity model (Prio), unallocated or available capacity in the container is distributed between jobs based on their priorities. Formally, it assumes that jj is assigned to the i container, (16) where (17)
Fig 5 presents an example of allocation jobs with the Prio model. Similar to the two previous examples, we consider three jobs j1, j2, j3, and one container .
The scheduler assigns the total CPU capacity to job j1 at time 0, s′1(0) = 1,000. Job j2 arrives at time t = 50, then the capacity of is distributed between both jobs according to Eq (16), the speed of both jobs is set up to s′1(50) = 250 and s′2(50) = 750. When job j3 arrives, the scheduler reassigns the CPU capacity and the values of processing change to s′1(70) = 167, s′2(70) = 500, and s′3(70) = 333. Job j2 ends its execution at the time f2 = 440 and the CPU capacity is redistributed between jobs j1 and j3, in this way s′1(440) = 333 and s′3(440) = 667. After 340 seconds, job j3 finishes its execution and all the CPU capacity is reassigned to j1. The finish time of both jobs are f3 = 780 and f1 = 850. Note that two of the three jobs finish their execution before the previous examples.
V. Job allocation
This section describes the set of scheduling heuristics for job allocation. They are similar to the well-known dynamic bin-packing problem, a variation of the classical NP-hard optimization problem.
Several heuristics were proposed to affront bin-packing problems producing solutions in a reasonable time frame. Both time and performance are fundamental in online resource allocation. Therefore, we propose heuristic-based scheduling strategies for the job allocation to CTs.
The scheduler performs two main tasks: CT selection and CPU assignation. In the selection stage, the scheduler decides whether the job is placed into one of the available CTs or a new CT should be invoked.
In the CPU assignation stage, the scheduler establishes the capacity inside the CT to execute the job. We use different factors in the system to allocate the jobs in the CT selection stage.
A. Container selection strategies
As an example, we assume that the scheduler follows the RC model. According to this model, the processing speed assigned to a job jj at time t must never be higher than the minimum required processing speed sj, i.e., .
The first two baseline strategies are:
- Random (Rand)–the scheduler allocates a job to a random container .
- Round Robin (RR)–the scheduler allocates a job using a Round Robin strategy, distributing to containers in rotation.
These strategies are “blind” because they do not use specific information about job allocations. Other strategies take into account information about the job to be assigned and CTs.
We consider three scenarios in the selection of a CT:
- At least one container has sufficient capacity to execute the arriving job without SLA violation.
- At least one container is not running at its full capacity.
- All containers are using their maximum capacity.
Several policies can be used to face each scenario. A specialized policy per scenario can provide strategies with better results.
Whenever a job arrives, the scheduler uses the first policy to select a CT with enough capacity to execute it. If the job cannot be allocated without SLA violation, the scheduler employs a second policy to choose between CTs with available capacity. Finally, if all containers are running at full capacity, the scheduler uses the third policy to choose a CT.
Table 2 presents the selection policies of CTs that have sufficient capacity to execute the arriving job. Table 3 shows the policies to select one CT among sufficient capacity containers. The preferred candidates are those that are not running at their full capacity.
Finally, we use two policies for CT selection when all CTs are working with full capacity: MinTasks and MinSLAv. Both strategies are described in Table 3.
B. Resources allocation strategies
Once a CT is selected, the scheduler assigns the CPU capacity to a job inside the selected CT. If the CT is selected with enough capacity, there are no SLA violations; otherwise, SLA violations can occur.
The capacity assignment is provided by the three models defined in section IV.
- Required capacity model (RC).
- Proportional capacity model (Prop).
- Priority capacity model (Prio).
RC strategies combine three policies to allocate jobs to CTs. For instance, FFit–MinTask–MinSLAv strategy selects between CTs with enough capacity using the FFit policy. MinTask policy is used when some CTs have free capacity but not enough to satisfy the capacity requested by the job, and MinSLAv policy selects CT at maximum capacities. The RC strategy reassigns capacity when it is overpassed by the requested capacity of the jobs.
As an example, Algorithm 1 presents the FFit–MinTask–MinSLAv strategy.
The worst-case occurs when all CTs are full and active. Then the scheduler searches in the list of active CTs to allocate the job (line 2) with complexity O(k) for k containers. It cannot assign the job at this stage, so it uses the second strategy (line 8) with the same complexity O(k). Finally, the strategy runs the procedure in line 10 with O(k) complexity. The procedure is performed each time a job arrives, so the algorithm has an asymptotical complexity O(nk).
Algorithm 1: FFit – MinTask – MinSLAv strategy
Input: List of active CTs (CTA) and job ji
Output: Index of container to execute job ji
1.ctIndex ← -1
2.ctIndex ← FFit (CTA, ji)
3. if ctIndex < 0 then
4. if can create a new]
5.container is true then
6. create a new
7.container nCT
8. add nCT to CTA and
9.ctIndex ← index of nCT
10. else
11. ctIndex ← MinTask
12.(CTA, ji)
if ctIndex < 0
then
ctIndex ←
MinSLAv (ji)
reassign CPU speed in
container ctIndex with RC
return ctIndex
In Prop and Prio strategies, the CT selection is a combination of two policies. CT always has full capacity, but the strategy can estimate if an allocation generates SLA violation. For instance, with the FFit–MinTask–Prio strategy, the scheduler selects a CT using FFit strategy as a first policy and MinTask as the second policy. Once a container has been selected, Prio model reassigns the processing capacity using Eq (16). Algorithm 2 presents the FFit–MinTask strategy.
Similar to Algorithm 1, the complexity of Algorithm 2 is bounded by O(k) in lines 2 and 8, so the complexity of the procedure is defined by O(nk). Note, if we can create an unlimited number of CTs in the infrastructure then both procedures have an asymptotical factor of (n2) in the worst-case.
Algorithm 2: FFit – MinTask strategy
Input: List of active CTs (CTA) and job ji
Output: Index of container to execute job ji
ctIndex ← -1
ctIndex ← FFit (CTA, ji)
if ctIndex < 0 then
1. if can create a new
2.container is true then
3. create a new
4.container nCT
5. add nCT to CTA
6.and ctIndex ← index of nCT
7. else
8. ctIndex ← MinTask
9.(ji)
10.reassign CPU speed in
container ctIndex with Prio
return ctIndex
VI. Experimental setup
All experiments are performed on the standard trace-based simulator CloudSim v3.0.3 [37]. It supports the modeling and simulation of large-scale cloud computing environments. CloudSim includes classes and interfaces such as data centers, single computing nodes, an autonomous platform for modeling clouds, service intermediaries, provisioning policies, allocation strategies, among others.
We extended its functionality with our scheduling algorithms, energy model, dynamic jobs arrival, container deployment, statistical analysis, and workload processing. The simulator represents an excellent tool to develop our experiments [38].
Algorithms are implemented using jdk 1.8.0_221 64-bit. The execution was performed by a computer with Windows 10 Pro OS, an Intel (R) Core (TM) i5-8400HQ CPU 2.8 GHz, 8 GB of memory, and 500 GB of HDD.
A. Scenario
We use a two-tier topology with M processors at the first level and m containers at the second level. We perform an experimental analysis with a limited but a representative number of physical resources. In the experimental environment based on CTs on bare metal, two CTs can be executed in one physical resource. Under this scenario, the jobs of both CTs contend for the underlay resources.
The setup defines a basic cloud environment that simplifies the analysis and evaluation of the proposed strategies and maintains the relevant elements of real environments. This configuration can be generalized considering the characteristics and size of resources, CTs capabilities, and workloads. Moreover, the energy model of specific resources and job energy profiles (i.e., BI) and their combination (concentration).
eidle and emax values are defined according to [12,13]. The collected data from a Power Distributor Unit (PDU) showed that the eidle of the server is 86 Watts (W) on average. A similar procedure under a fully busy processor was performed to find emax. Table 4 presents the experimental setup.
B. Workload
The workload is based on HPC traces of parallel workloads [39] and grids from Grid Workload Archive [40]. We use the Standard Workload Format (SWF) with two additional fields to describe the requested work speed and the type of work. Several filters were applied to workloads to eliminate jobs with inconsistent information.
The performance of our strategies was evaluated in a homogeneous resource environment using 30 workloads of 24 hours to obtain valid statistical values. Variability of the arriving time, size, and types in the workload is fundamental to evaluate our strategies’ efficiency under different scenarios properly.
Fig 6 shows the number of jobs of two types during 30 days. We observe that there is no predominance of one type of job in logs. Some weeks have more jobs of type CI, and others more jobs of type MI. The total number of jobs in the workload is 109,345. Day twenty-two has the biggest workload among all, with more than 20,000 jobs.
CI jobs request high computational power in their processing, e.g., calculate prime numbers, sorting, search, graph traversal, etc. In MI jobs, the job processing is limited by the speed of memory to feed data to the processor, e.g., work with datasets much larger than the available cache on the system.
Fig 7 shows a histogram with the total number of jobs arriving per hour on an average of 30 days. It can be seen that most jobs arrive between 7 AM and 3 PM.
C. Analysis methodology
The optimization problem is to minimize three conflict objectives: SLAv, Cmax, and E, see Eq (3). In multi-objective optimization, one solution can represent the best solution concerning Cmax, while another solution could be the best one concerning E or SLAv. The goal is to obtain a set of compromise solutions that represents a good approximation to the Pareto front.
If a solution cannot be improved in terms of all objective functions, then the outcome is Pareto optimal. Pareto fronts can be compared via visual observation of the solution space and formal approaches.
Set Coverage (SC) metric is a formal and statistical approach to calculate the proportion of solutions dominated between two sets [41]. Larger values of SC represent a better approximation of the Pareto front. The dominance operator is not symmetric, so it is necessary to compute the dominance of one set over the other and vice versa.
A multi-objective optimization problem can be simplified to a single objective through different methods. The prevalent approach is the aggregation method, which defines weights for each objective to model preferences and performs a single weighted sum. That is, the preferences can explicitly specify the importance of every criterion or relative importance between criteria.
The degradation in performance (relative error) indicates the ratio of a metric generated by the algorithms to the best-found solution [42]. The analysis is conducted as follows. First, the degradation in performance of each strategy is computed as follows: (18)
Then, the strategies are ranked based on the average of their computed values considering all the scenarios. The best strategy with the lowest average performance degradation has a rank of 1. Low ranks identify strategies that perform reliably well over different scenarios, they represent a good compromise for all test cases.
One disadvantage of the degradation approach is to use the mean values to analyze the results. Hence, they can be influenced by a small portion of data with a large deviation. For deeper analysis, we present the strategies’ performance profiles to help with the interpretation of the data.
The Performance Profile (PP) defines a non-decreasing, piecewise constant function δ(τ) that presents the probability that a ratio τ is within a factor of the best ratio. The function δ(τ) is the cumulative distribution function. Strategies with a large probability for small τ are to be preferred [43].
To choose an adequate strategy, we compare the performance of all strategies by the three analysis methodologies: degradation, PP, and SC.
VII. Experimental evaluation
In this section, we present results of evaluation of 86 strategies. Their names are combinations of strategies on the first, second, third levels, and capacity models. Table 5 shows 56 strategies for RC model. Table 6 shows 14 strategies that can be combined with Prop and Prio model (28 in total), Random (Rand), and Round Robin (RR).
The performance of our strategies is evaluated based on the simulation of 30 days of 24 hours each. Runtime of one strategy simulation of the 24 hours workload takes about 1–2 second, on average.
Let us consider the best and worst performance strategies for each objective independently. The results of other strategies were omitted for simplicity and clarity.
Fig 8 presents SLAv for the best and worst strategies during 30 days on 24 hours average. WF_MinTask_Prio has the best performance. Its SLAv varies from 27 to 5,294 per day. SLAv of WF_MinTask_Prop varies from 0 to 7,752. RR and Rand are the worst strategies, their SLAvs vary from 636 and 716 to 16,653 and 15,869, respectively.
Fig 9 shows Cmax of the best and worst strategies during 30 days on 24 hours average. Comparing these strategies, we see that the total Cmax of all jobs of a day varies between 32.33 and 54.44 hours on average. MaxSLAv_MaxCap_MinTask, FF_MinTask_Prio, and WF_MinTask_Prio are the best strategies showing a similar behavior. RR and Rand are the worst strategies.
Fig 10 presents E during 30 days on 24 hours average. BF_MinTask_Prop is the best strategy. It provides allocation of jobs that consume between 48 and 157 Kw per day. The worst two strategies are RR and Rand with energy consumption between 70 and 71 to 200 and 220 Kw per day, respectively.
A. Degradation in performance analysis
Table 7 presents the average degradation of SLAv, E, Cmax, and their means, see Eq (18). The last five columns contain the ranking of SLAv, E, Cmax, Rank mean, Rank, their means, and the final ranking. Rank SLAv is based on the number of SLA violations. Rank E depends on the energy degradation. Rank Cmax refers to the position to the completion of the total jobs. The most significant values are highlighted in bold and red.
The Rank refers to the relative position to all 86 strategies. It considers the mean of SLAv, E and Cmax. The best strategies in each criterion are highlighted.
For SLA violations, the best strategies are WF_MinTask_Prop, MinAW_MinTask_Prop, and MinSLAv_MinTask_Prop with 22.3%, 24.7%, and 72.3% away from the best solutions found, respectively. RR and Rand are 12,236.5% and 12,955.1% worse.
For energy consumption, the best strategy is BF_MinTask_Prop, with an average E degradation 0.027. It allocates the arriving job to the container with the smallest available capacity. If any container has enough space to provide the required speed to the job, the scheduler selects the container with the least number of tasks. MaxSLAv_MinTask_Prio and MaxSLAv_MinTask_Prop follow the best strategy with degradation 0.037 and 0.038, respectively.
For Cmax degradation, WF_MinTask_Prop is the strategy with the best performance with a degradation of 0.087. The arriving job is assigned to the container with the biggest available capacity. At the second level of assignation, it selects the container with the least number of tasks. The second and third best strategies with degradations of 0.87 and 0.96 are FF_MinTask_Prio and MaxSLAv_MaxCap_MinTask.
According to the final Rank, strategies with a good compromise between SLAv, E and Cmax, are BF_MinTask_Prop, MinAW_MinTask_Prop, and MaxSLAv_MinTask_Prop. The final Rank of RR is 85. It means that at least 84 strategies performed better than RR. Rand and RR provide 129,551% of worse solutions with respect to the best solutions found in the simulation. While, BF_MinTask_Prop is away from the best solutions between 78.1% for SLA, 2.7% for E, and 10.3% for Cmax.
B. Performance profile analysis
Fig 11 shows the PP of SLAv in the interval τ = [1, 4]. WF_MinTask_Prop generastes all solutions in the interval τ = [1, 2.5]. Hence, it does not produce solutions with SLAv more than 2.5 worse than the best found. MinAW_MinTask_Prop generates 96.7% solution in the same interval. Both strategies have similar performance.
Fig 12 presents PP of Cmax in the interval τ = [1.0, 1.3]. WF_MinTask_Prop provides 67% of its solutions within a factor of 1.13 from the best found. It is followed by FF_MinTask_Prio, with 83% of their solution within a factor of 1.18. MinAW_MaxCap_MinTask strategy generates all the solutions within a factor of 1.24.
Fig 13 shows PP of E in the interval τ = [1.0, 1.14]. All solutions of FF_MinSLAv_Prio are within a factor of 1.1. It has the best global performance. Two strategies with a high probability of offering good performance are MinSLAv_MinTask_Prop and MaxSLAv_MinTask_Prio. The first strategy provides 70% of its solutions within a factor of 1.05. The second strategy has 95% of solutions below a factor of 1.08.
Fig 14 presents the PP of eight strategies according to mean E and Cmax in the interval τ = [1.0, 1.3]. If we choose a factor of 1.08 from the best result as the scope of our interest, MaxSLAv_MinTask_Prio has a 72% probability of being the winner. FF_MinTask_Prio is the strategy with 92% of solutions within a factor of 1.18. MaxSLAv_MaxCap_MinTask is the strategy with all its solution within a factor of 1.28.
We see that the best strategies include MinTask allocation. Minimizing the number of the running of jobs on the resource reduces possible contention and SLAv increasing the assigned speed to each job. Hence, it reduces the energy consumption and job completion time.
C. Set coverage analysis
Let us analyze Pareto fronts of strategies considering three optimization criteria: SLA, E and Cmax.
Two sets of non-dominated solutions are compared using the SC metric. The rows of the Table 8 show the values SC(A, B) for the dominance of strategy A over strategy B. The columns indicate SC(B, A), that is, the dominance of B over A.
The column “Average best” shows SC of row A over each of the best strategies. The column “Average all” shows SC of row A over all studied strategies. The rankings “Rank best” and “Rank all” are based on the average dominance over best and all studied strategies, respectively.
Similarly, the last four rows show average dominance B over A, average dominance B, and two ranks in each column, respectively. The higher ranking implies that the front is better. The most significant values are highlighted in bold and red.
According to the SC metric, the strategies with the best compromise between SLAv, E and Cmax are WF_MinTask_Prop, BF_MinTask_Prop, and MaxSLAv_MinTask_Prop considering all strategies, and WF_MinTask_Prop, BF_MinTask_Prop, and MinAW_MinTask_Prop according to the best strategies.
WF is trying to allocate jobs with more capacity left to avoid resource contention. WF_MinTask_Prop improves the non-dominated fronts of other strategies in the range of 10–36.7%, with 33.3% for the best strategies and 51.6% for all strategies, on average, occupying the first and the second ranks, respectively. We see that WF_MinTask_Prop can provide solutions with better quality, on three objectives, with respect to other strategies.
SC(A, WF_MinTask_Prop) displays that strategy is dominated by other strategies with a maximum of 16.7%. These ranges show that other strategies have a better performance than WF_MinTask_Prop. In general, a desired strategy exhibits the behavior of WF_MinTask_Prop. SC(A, RR) and SC(A, Rand) are equal to 1, hence, all the strategies dominate RR and Rand. Moreover, SC(RR, B) and SC(Rand, B) are zero, so any solution of Rand and RR is dominated by other strategies. We see that similar to the performance profile analysis, best strategies include MinTask allocation. Moreover, the MinTask and Prop provide the four best strategies in both ranks.
VIII. Conclusion
Consolidation of services is a key technology to reduce resource overprovisioning and energy consumption. However, when multiple jobs and processes run on a multicore CPU, they can compete for shared resources such as caches, memory controllers, memory buses, prefetching hardware, disks, networking, etc. This resource contention can invoke performance degradation defeating the benefits of the consolidation, leading to QoS violation and increasing energy consumption.
In this paper, we consider potentially damaging effects of job consolidation. We propose a novel method of consolidation based on the job concentration paradigm that avoids allocation of jobs of the same type on shared resources. It reduces resource contention and makes job placement more efficient with the energy-utilization tradeoff.
We present online job allocation heuristics for heterogeneous infrastructures as a multilevel variant of the dynamic bin-packing problem considering container selection, capacity distribution, and contention-aware allocation. We study eighty-six scheduling heuristics. We distinguish them depending on the type and amount of information they use for allocation. We provide their comprehensive analysis on a real workload.
The results show that proposed allocation strategies carefully guided by the energy, performance, and QoS provide a reduction of 21.73–43.44%, 44.06–92.11%, and 16.38–24.17% of these criteria, respectively, with respect to known strategies used in the literature.
However, further study is required to assess allocation strategies on bigger variety of infrastructures, processors, and containers. It is important to analyze network-intensive, disk-intensive, etc. job types, combining their execution over different scenarios.
Supporting information
S1 Fig. Time, SLA violations, Cmax and E for all the strategies.
https://doi.org/10.1371/journal.pone.0261856.s001
(XLSX)
S5 Fig. Performance profile average of E and Cmax.
https://doi.org/10.1371/journal.pone.0261856.s005
(XLSX)
References
- 1. Zhu X., Young D.,·Watson B. J., Wang Z., Rolia J.,·Singhal S., et al. 1000 islands: An integrated approach to resource management for virtualized data centers. In Cluster Computer, 2009; 12(1): 45–57. https://doi.org/10.1007/s10586-008-0067-6.
- 2. Lee Y. C., & Zomaya A. Y. Energy efficient utilization of resources in cloud computing systems. The Journal of Supercomputing, 2012; 60(2): 268–280. https://doi.org/10.1007/s11227-010-0421-3.
- 3. Vafamehr A., & Khodayar M. E. Energy-aware cloud computing. The Electricity Journal, 2018; 31(2): 40–49. https://doi.org/10.1016/j.tej.2018.01.009.
- 4.
Kumar D., & Magloire A. F. F. Hypervisor based performance characterization: XEN/KVM. In 2017 2nd International Conference on Telecommunication and Networks (TEL-NET), IEEE, 2017: 1–4. https://doi.org/10.1109/TEL-NET.2017.8343570.
- 5.
Seibold M., Wolke A., Albutiu M., Bichler M., Kemper A., & Setzer T. Efficient deployment of main-memory DBMS in virtualized data centers. 2012 IEEE Fifth International Conference on Cloud Computing, IEEE, 2012: 311–318. https://doi.org/10.1109/CLOUD.2012.13.
- 6.
U-Chupala P., Watashiba Y., Ichikawa K., Date S., & Iida H. Container Rebalancing: Towards Proactive Linux Containers Placement Optimization in a Data Center. In 2017 IEEE 41st Annual Computer Software and Applications Conference (COMPSAC), IEEE, 2017: 788–795. https://doi.org/10.1109/COMPSAC.2017.94.
- 7.
Kovács Á. Comparison of different linux containers. In 2017 40th International Conference on Telecommunications and Signal Processing (TSP). IEEE, 2017: 47–51. https://doi.org/10.1109/TSP.2017.8075934.
- 8.
Jin X., Zhang F., & Liu Z. Discrete min-energy scheduling on restricted parallel processors. In 2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and PhD Forum, IEEE, 2013: 2226–2229. https://doi.org/10.1109/IPDPSW.2013.43.
- 9.
Jin X., Zhang F., Song Y., Fan L., & Liu Z. Energy-efficient scheduling with time and processors eligibility restrictions. In European Conference on Parallel Processing. Springer, Berlin, Heidelberg, 2013: 66–77. https://doi.org/10.1007/978-3-642-40047-6_10.
- 10.
Liu N., Dong Z., & Rojas-Cessa R. Task and server assignment for reduction of energy consumption in datacenters. In 2012 IEEE 11th International Symposium on Network Computing and Applications, IEEE, 2012: 171–174. https://doi.org/10.1109/NCA.2012.42.
- 11. Cotes-Ruiz I.T., Prado R.P., García-Galán S., Muñoz-Expósito J.E., Ruiz-Reyes N. Dynamic Voltage Frequency Scaling Simulator for Real Workflows Energy-Aware Management in Green Cloud Computing. PLoS ONE, 2017; 12(1): e0169803. pmid:28085932
- 12.
Armenta-Cano F. A. Heterogeneous Jobs Concentration Energy Model and Consolidation Strategies in Cloud. PhD Thesis, CICESE research center. Mexico, 2018. Available from: https://biblioteca.cicese.mx/catalogo/tesis/ficha.php?id=25075.
- 13. Armenta-Cano F., Tchernykh A. N., Cortés-Mendoza J. M., Yahyapour R., Drozdov A. Y. E., Bouvry P., et al. Min_c: Heterogeneous Concentration Policy for Power Aware Scheduling. Proceedings of the Institute for System Programming of the RAS, 2015; 27(6): 355–380. https://doi.org/10.15514/ISPRAS-2015-27(6)-23.
- 14. Sheikhalishahi M., Grandinetti L., Wallace R. M., & Vazquez‐Poletti J. L. Autonomic resource contention‐aware scheduling. Software: Practice and Experience, 2015; 45(2): 161–175. https://doi.org/10.1002/spe.2223.
- 15. Muraña J., Nesmachnow S., Armenta F., & Tchernykh A. Characterization, modeling and scheduling of power consumption of scientific computing applications in multicores. Cluster Computing, 2019; 22(3): 839–859. https://doi.org/10.1007/s10586-018-2882-8.
- 16.
Van Beek V., Oikonomou G., & Iosup A. A CPU contention predictor for business-critical workloads in cloud datacenters. In 2019 IEEE 4th International Workshops on Foundations and Applications of Self* Systems (FAS* W), IEEE, 2019: 56–61. https://doi.org/10.1109/FAS-W.2019.00027.
- 17.
Singh S., & Singh N. Containers & Docker: Emerging roles & future of Cloud technology. 2016 2nd International Conference on Applied and Theoretical Computing and Communication Technology (iCATccT), IEEE, 2016: 804–807. https://doi.org/10.1109/ICATCCT.2016.7912109.
- 18.
Linux containers, [cited 2021 Jul 10]. Available from: https://linuxcontainers.org.
- 19.
Docker, [cited 2021 Jul 10]. Available from: https://www.docker.com.
- 20.
Kubernetes, [cited 2021 Jul 10]. Available from: https://kubernetes.io/es.
- 21.
Open source container-based virtualization for Linux, [cited 2021 Jul 10]. Available from: https://openvz.org.
- 22. Kurtzer GM, Sochat V, Bauer MW. Singularity: Scientific containers for mobility of compute. PLoS ONE, 2017; 12(5): e0177459. pmid:28494014
- 23. Xu M., Dastjerdi A. V., & Buyya R. Energy Efficient Scheduling of Cloud Application Components with Brownout IEEE Transactions on Sustainable Computing, 2016; 1(2): 40–53. https://doi.org/10.1109/TSUSC.2017.2661339.
- 24.
Xu M., & Buyya R. Energy efficient scheduling of application components via brownout and approximate markov decision process. In International Conference on Service-Oriented Computing. Springer, Cham, 2017: 206–220. https://doi.org/10.1007/978-3-319-69035-3_14.
- 25. Xu M., Toosi A. N., & Buyya R. iBrownout: An Integrated Approach for Managing Energy and Brownout in Container-based Clouds. IEEE Transactions on Sustainable Computing, 2018; 4(1): 53–66. https://doi.org/10.1109/TSUSC.2018.2808493.
- 26. Xu M., & Buyya R. BrownoutCon: A software system based on brownout and containers for energy-efficient cloud computing. Journal of Systems and Software, 2019; 155: 91–103. https://doi.org/10.1016/j.jss.2019.05.031.
- 27. Gholipour N., Arianyan E., & Buyya R. A novel energy-aware resource management technique using joint VM and container consolidation approach for green computing in cloud data centers. Simulation Modelling Practice and Theory, 2020; 104: 102127. https://doi.org/10.1016/j.simpat.2020.102127.
- 28.
Piraghaj S. F. Energy-Efficient Management of Resources in Enterprise and containers-based cloud. PhD Thesis, The University of Melbourne. Australia, 2016. Available from: http://www.cloudbus.org/students/SarehPhDThesis2016.pdf.
- 29. Khan A. A., Zakarya M., Khan R., Rahman I. U., & Khan M. An energy, performance efficient resource consolidation scheme for heterogeneous cloud datacenters. Journal of Network and Computer Applications, 2020; 150: 102497. https://doi.org/10.1016/j.jnca.2019.102497.
- 30.
Celesti A., Mulfari D., Fazio M., Villari M., & Puliafito A. Exploring Container Virtualization in IoT Clouds. In 2016 IEEE international conference on Smart Computing (SMARTCOMP). IEEE, 2016: 1–6. https://doi.org/10.1109/SMARTCOMP.2016.7501691.
- 31.
Dambreville A., Tomasik J., Cohen J., & Dufoulon F. Load Prediction for Energy-Aware Scheduling for Cloud Computing Platforms. In 2017 IEEE 37th International Conference on Distributed Computing Systems (ICDCS). IEEE, 2017: 2604–2607. https://doi.org/10.1109/ICDCS.2017.201.
- 32.
Cui Y., & Xiaoqing Z. Workflow tasks scheduling optimization based on genetic algorithm in clouds. In 2018 IEEE 3rd International Conference on Cloud Computing and Big Data Analysis (ICCCBDA). IEEE, 2018: 6–10. https://doi.org/10.1109/ICCCBDA.2018.8386458
- 33.
Tchernykh A., Facio-Medina A., Pulido-Gaytan B., Rivera-Rodriguez R., Cortés-Mendoza J. M., Radchenko G., et al. Toward digital twins’ workload allocation on clouds with low-cost microservices streaming interaction. In 2020 Ivannikov Ispras Open Conference (ISPRAS). IEEE, 2020: 115–121. https://doi.org/10.1109/ISPRAS51486.2020.00024.
- 34. Lovász G., Niedermeier F., & De Meer H. Performance tradeoffs of energy-aware virtual machine consolidation. Cluster Computing, 2013; 16(3): 481–496. https://doi.org/10.1007/s10586-012-0214-y.
- 35. Blagodurov S., Zhuravlev S., & Fedorova A. Contention-Aware Scheduling on Multicore Systems. ACM Transactions on Computer Systems (TOCS), 2010; 28 (4): 1–45. https://doi.org/10.1145/1880018.1880019.
- 36.
Docker Swarm, https://docs.docker.com/engine/swarm.
- 37. Calheiros R. N., Ranjan R., Beloglazov A., De Rose C. A., & Buyya R. CloudSim: A toolkit for modeling and simulation of cloud computing environments and evaluation of resource provisioning algorithms. Software: Practice and experience, 2011; 41 (1): 23–50. https://doi.org/10.1002/spe.995.
- 38.
Alshammari D., Singer J., & Storer T. Does CloudSim Accurately Model Micro Datacenters?. In 2017 IEEE 10th International Conference on Cloud Computing (CLOUD). IEEE, 2017: 705–709. https://doi.org/10.1109/CLOUD.2017.97.
- 39.
Feitelson D. G. Parallel Workloads Archive [cited 2021 Jul 10]. Available from: http://www.cs.huji.ac.il/labs/parallel/workload.
- 40.
Delft T. U. The Grid Workloads Archive. [cited 2021 Jul 10]. Available from: http://gwa.ewi.tudelft.nl.
- 41.
Zitzler E. Evolutionary algorithms for multiobjective optimization: Methods and applications. PhD thesis, Swiss Federal Institute of Technology. Zurich, 1999. Available from: https://sop.tik.ee.ethz.ch/publicationListFiles/zitz1999a.pdf.
- 42. Tsafrir D., Etsion Y., & Feitelson D. G. Backfilling using system-generated predictions rather than user runtime estimates. EEE Transactions on Parallel and Distributed Systems, 2007: 18 (6): 789–803. https://doi.org/10.1109/TPDS.2007.70606.
- 43. Dolan E. D., Moré J. J., & Munson T. S. Optimality Measures for Performance Profiles. SIAM Journal on Optimization, 2006; 16(3): 891–909. https://doi.org/10.1137/040608015.