1 Introduction

Cloud computing is an internet-based service infrastructure that provides on-demand, pay-as-you-go access to configurable resources such as storage, servers, networks, services, and other applications, offering users pervasive services that can be rapidly provisioned and released with minimal management effort [1, 2]. Day by day, cloud computing grows out of heterogeneous distributed computing, autonomic computing, utility computing, and grid computing. Heterogeneous distributed computing supports a variety of service paradigms such as social networking, computational applications, telecommunication, and web services. Through these benefits of cloud computing, users can make use of processing resources and a virtualized view of IT resources on demand. These resources can be used to deliver different types of services such as Workflow as a Service (WaaS), Infrastructure as a Service (IaaS), Network as a Service (NaaS), Platform as a Service (PaaS), Storage as a Service (StaaS), Software as a Service (SaaS), and Data as a Service (DaaS) [3,4,5,6]. Cloud computing also allows anyone to provision services, virtual hardware, and runtime environments with a credit card. However, as a newly developed technology, cloud computing still faces several issues that must be addressed, such as dynamic resource provisioning, virtualization technologies and large computing infrastructure for cloud service providers (CSPs), availability and storage for large-scale data processing, protection and confidentiality of resources, legal issues arising in different countries, and loss of data [2, 6, 7]. Some researchers and scientists utilize CSPs for computation-intensive applications and for running large-scale data through workflow applications.

Workflow is a specific model used for scientific applications in various domains having dependent and communicating components [8]. Generally, a workflow application is modeled as a Directed Acyclic Graph (DAG), with a node set representing tasks and an edge set representing the dependencies among the tasks via directed edges between them. Workflow allocation faces several challenges due to its dynamic nature and heterogeneity, as well as the search for suitable, geographically distributed resources to construct the allocation decision while optimizing Quality of Service (QoS) parameters in a cloud computing environment. Workflows are used in several applications from different domains such as genomics, earthquake analysis, gravitational wave detection, astronomy, healthcare, project planning, chemical reactions, and supply chain management. Since a huge amount of data is processed in the cloud daily, the task allocation mechanism is critical and must be computed efficiently. Workflow task allocation is one of the most common application models, particularly in design, business planning, and scientific fields. Therefore, many researchers have addressed the workflow task allocation problem from many aspects, such as computational time minimization [9,10,11,12,13,14,15], minimum budget assurance [16, 17], least energy consumption [18, 19], improving throughput and server performance [20, 21], and employing security measures [22,23,24,25,26,27,28,29,30,31,32] in heterogeneous and other efficient computing [33, 34] for a single workflow. Further, work has also been reported on multiple workflow allocation to improve turnaround time, response time, and flowtime [35,36,37,38,39,40,41,42,43], budget [43, 44], and energy [45, 46] by aggregating a batch of workflows prior to allocation.
In workflow allocation, managing precedence constraints (execution order) is one of the challenging issues. Precedence constraints among workflow tasks are preserved by various methods such as ranking methods [11] and level attribute methods [10, 38, 39]. In recent times, many researchers have been working to ensure security constraints in the IaaS cloud and the trust of cloud users so that their information and applications are protected and managed effectively. However, the creation of ad-hoc security solutions, each targeting a very small part of the whole problem, makes a fair and sound evaluation of the state of the art difficult. We start from the view that the cloud computing paradigm can be fully exploited only if the contributions of cloud users and CSPs to security constraints are widened by improving their mutual trust. Software security assurance mechanisms accordingly improve the confidence level of cloud users and cloud transparency, so that CSPs behave as expected [47]. In line with standard software security assurance, cloud security assurance is defined as gaining reasonable confidence that applications and/or infrastructure will reliably satisfy one or more security constraints, and that processes perform as expected despite attacks and failures [48]. Since assurance is a broader notion than security in a cloud environment for any type of workflow application, it comprises methodologies for collecting and validating evidence supporting security constraints. Moreover, various challenging works have been proposed for secure workflow allocation, but only for a single workflow [22,23,24,25,26,27,28,29,30,31,32], lagging behind for multiple workflows. Hence, emerging work needs to exploit security services for multiple workflow applications to defend security-critical applications from threats in the IaaS cloud, where batch mode processing is necessary.
In addition, security constraints in the cloud user's request increase the security overhead (in terms of time), which in turn increases the turnaround time, flowtime, and operational cost [49], but reduces the risk or failure probability [24, 28].

In this paper, a security-prioritized multiple workflow allocation (SPMWA) model is proposed by integrating a security-prioritized mapping scheme for the Infrastructure-as-a-Service cloud computing environment. It is expected that incorporating a security-prioritized allocation scheme under precedence constraints will enhance the performance of workflow processing in risk-prone environments. SPMWA gives higher priority to tasks with high security demands so that they are allocated to more trustworthy virtual machines, thereby minimizing the failure probability of the system. The contributions of the proposed model are as follows:

  • A Security Prioritized Multiple Workflows Allocation (SPMWA) model is proposed by incorporating a security prioritization scheme in workflow task allocation to minimize the failure probability of the system for the cloud computing environment.

  • A security model is introduced to estimate the failure probability of the system. This model also calculates the total number of task failures to assess how often tasks with a given security requirement end up being allocated to VMs exhibiting insufficient trust.

  • Workflows are partitioned according to depth level to maintain precedence constraints. Communication requirements among the workflow tasks are estimated by considering the edge weights and machine distances.

  • An idle gap reduction policy is employed to utilize the idle gaps generated during the allocation process by accommodating successor tasks from the next partitions.

  • We expect that integrating such security measures would aid in designing a more robust workflow task allocation model for networked and high security-sensitive applications.

  • The experimental results of SPMWA are compared with state-of-the-art workflow models from the literature. For this, we have taken two multiple workflow allocation strategies, namely the sequential-based strategy and the merge-based strategy [35]. These strategies, together with security-prioritized allocation, are incorporated into HEFT [11] to develop two versions of the Security Prioritized Multiple workflow allocation (SPM) model, namely SPM1 (merge-based) and SPM2 (sequential-based). For performance comparison, LBSIR [39] and HEFT [11] have also been included.

  • The performance evaluation of the proposed model has been carried out on random multiple workflows and real-world scientific application graphs namely Montage [50], CyberShake [50], and LIGO [51].

  • The proposed model can be used in many emerging security-sensitive domains such as batch mode transaction processing, chemical reactions, supply chain management, project management, and the Pegasus project for other scientific workflow applications.

The basic structure of the paper is as follows: Sect. 2 presents the standard and current literature review related to the work. Section 3 describes the system model for the proposed methodology. Section 4 describes the proposed model design with an algorithmic template, a simple illustration, and a time complexity analysis. The performance evaluation for the proposed work with experimental results is discussed in Sect. 5. The conclusion and future work are presented in Sect. 6.

2 Related work

Workflow allocation has been a common research topic in distributed computing environments for decades and has evolved together with changes in technology. In the last few years, much of the research has focused on workflow allocation problems using DAG heuristics [8, 11]. The workflow allocation mapping problem is well established as NP-complete [52]. Consequently, various heuristic algorithms have been developed to achieve sub-optimal solutions for resource allocation in the cloud environment. Some reported works deal with independent task allocation considering VM placement [53], cost-effective task allocation [54], security-aware task allocation [55,56,57], etc. On the other hand, many other models have presented mechanisms for allocating dependent, communicating workflow tasks. In this section, the various models are further categorized into single workflow allocation [9,10,11,12,13,14,15,16,17,18,19], security-aware workflow allocation [22,23,24,25,26,27,28,29,30,31,32], and multiple workflow allocation models [35,36,37,38,39,40,41,42,43,44,45,46].

2.1 Single workflow allocation

In single workflow allocation, a single DAG of tasks is mapped onto parallel resources. The most common DAG-based scheduling algorithms are designed for a single workflow on heterogeneous distributed systems, such as [9,10,11,12,13,14,15,16,17,18,19]. Dynamic-Level Scheduling (DLS) [9] is one of the first algorithms that tracks the availability of resources and thus allows a task to be scheduled onto a currently busy resource. DLS does not guarantee the minimum processing time for a task. Furthermore, unlike more recent algorithms, it does not attempt to exploit idle time gaps between two tasks on the same resource. The Levelized Minimum Time (LMT) algorithm [10] is very simple: it assigns tasks, level by level according to precedence, to the resource with the lowest processing time in a heterogeneous distributed system. The CPOP [11] algorithm attains better allocations than LMT and is similar to DLS with lower time complexity. The main characteristic of CPOP is that all tasks belonging to the workflow's critical path are assigned to a single resource. HEFT [11] is one of the best DAG list scheduling algorithms, with quadratic time complexity; it aims to reduce both time complexity and schedule length. A modified HEFT has also been proposed to reduce processing time in a cloud computing environment [12]. Other versions of HEFT have also been proposed, such as Predict Earliest Finish Time (PEFT) [13], Communication-aware Earliest Finish Time (CEFT) [14], and Dependency-ratio Bundling Earliest Finish Time (DBEFT) [15], to reduce the schedule length. The PEFT algorithm defines task priority based on an optimistic cost table and aims to execute tasks in minimum processing time. The CEFT algorithm is based on a task duplication heuristic using a communication ratio, with task execution order determined by upward rank. DBEFT is also a list-based task duplication scheduling algorithm that reduces communication costs.
DBEFT improves the scheduling length ratio over PEFT, CEFT, and HEFT. Some work has also been proposed for multi-objective workflow allocation, reducing makespan and economic cost [16, 17] and optimizing time and energy [18, 19].

2.2 Security-aware workflow allocation

Security plays an essential role in e-commerce and digital transaction processing systems. In the field of distributed information-sharing networks, significant work considering security constraints [22,23,24,25,26,27,28,29,30,31,32] has been reported for heterogeneous distributed environments like grid/cloud computing, to share and process resources with trustworthy contributing peers. The authors of [22] presented a task allocation strategy with security constraints and deadlines for parallel applications, with the objectives of optimizing security parameters and processing time; however, it is implemented on homogeneous clusters. In [23], the authors present a trust-based allocation model for scientific workflows to improve the stability of the schedule. In [24], the authors developed the security-driven scheduling (SDS) model for heterogeneous distributed systems to optimize makespan, speedup, and risk probability. SDS prioritizes task allocation to a suitable processor using estimated security overhead. The strategy presented in [25], namely Cloud-DLS, incorporates dynamic trust-based task allocation into the DLS algorithm in the cloud environment. The objective of Cloud-DLS is to assure task execution and minimize processing time. In [26], the authors propose a trust service-oriented workflow allocation (TMOWS) model to minimize execution time and cost simultaneously in a cloud environment using fuzzy membership functions. TMOWS meets the security demands of the users and other constraints through balance factors. The model in [27] maintains the reliability of services by combining direct and recommended trust, avoiding discrete-event and workflow application failures; it finds the best solution while concurrently satisfying deadline conditions. The work in [28] presents a novel security-sensitive workflow allocation with task duplication (SOLID) scheme.
SOLID has three features: first, it selects duplicated predecessor tasks, which helps avoid data encryption and transmission time, advancing the start time of the task; second, it defines the latest finish time of each workflow task; and third, it ensures these tasks finish on the cheapest resources, with the aim of minimizing makespan and monetary cost. In [29], trust-based stochastic workflow scheduling (TSS) is proposed to minimize makespan with increased speedup, using the TSS trust model for security estimation including both direct trust and reputation relationships. The strategy presented in [30] for security-aware workflow allocation (SAWA) reduces the number of failed tasks; SAWA selects tasks for allocation by depth level. In [32], the authors present a security-prioritized HEFT (SPHEFT) algorithm for the cloud computing environment to optimize the guarantee ratio. SPHEFT integrates security requirements into the HEFT [11] algorithm by giving higher priority to tasks with a high degree of security constraints. SPHEFT creates clusters of tasks with distinct upward rank values and then sorts each cluster by the security demand of the tasks. Thus, tasks with higher security demand get more chances to execute first, improving the guarantee ratio over the HEFT algorithm.
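The SPHEFT-style priority ordering described above can be sketched as follows. This is an illustrative Python reconstruction under our own naming (function and variable names are ours, not the authors'): tasks are clustered by distinct upward-rank values, and each cluster is then re-sorted by security demand so that high-demand tasks are dispatched first.

```python
# Illustrative sketch of SPHEFT-style task ordering (assumed semantics):
# cluster tasks by distinct upward-rank values, then sort each cluster by
# security demand so high-demand tasks are scheduled first.
from collections import defaultdict

def security_prioritized_order(tasks, upward_rank, security_demand):
    """tasks: list of task ids; upward_rank, security_demand: dicts keyed by id."""
    clusters = defaultdict(list)
    for t in tasks:
        clusters[upward_rank[t]].append(t)
    order = []
    # Higher upward rank is scheduled earlier (standard HEFT convention).
    for rank in sorted(clusters, reverse=True):
        # Within a rank cluster, higher security demand goes first.
        order.extend(sorted(clusters[rank], key=lambda t: -security_demand[t]))
    return order

ranks = {"T1": 3.0, "T2": 3.0, "T3": 1.0}
demands = {"T1": 0.2, "T2": 0.9, "T3": 0.5}
print(security_prioritized_order(["T1", "T2", "T3"], ranks, demands))
# → ['T2', 'T1', 'T3']
```

Here T1 and T2 share an upward rank, so the higher-demand T2 is promoted ahead of T1, while the lower-ranked T3 still runs last.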

2.3 Multiple workflow allocation

In multiple workflow allocation, the tasks of more than one workflow are grouped to form a batch for processing on suitable machines to achieve the desired QoS parameters. To tackle multiple workflow allocation problems, Bittencourt and Madeira [35] first introduced four strategies to schedule multiple workflows, namely the sequential-based strategy, gap search strategy, interleave strategy, and merge-based strategy. The sequential-based strategy schedules the workflows sequentially, one after another, on the available resources. The gap search strategy works like the first strategy but finds the gaps between already scheduled tasks, and tasks from the workflow at hand that fit are then scheduled into the found gaps without interfering with their start times. The interleave strategy combines the first and second strategies but schedules tasks of each workflow in turns, interleaving their tasks in the schedule of the available resources. The merge-based strategy merges all workflows into a single one and then schedules the resulting workflow as a single workflow. A significant number of works have been proposed for multiple workflow allocation to reduce turnaround time [38, 39]. The strategy proposed in [36] aggregates multiple workflows to achieve near-optimal throughput for heterogeneous cloud computing. In [37], the authors analyze allocation strategies for multiple workflows comprising two and four stages, i.e., labeling, adaptive information, prioritization, and parallel machines, in the grid environment. In [38], the authors propose a novel approach (SLBBS) for the multiple workflow allocation problem in a computational grid environment to optimize turnaround time. SLBBS divides the multiple workflows by depth level, and tasks are assigned to best-fit resources level-wise.
The Level-based Batch scheduling Strategy with Idle slot Reduction (LBSIR) [39] reduces the drawbacks of SLBBS by incorporating an idle slot reduction policy. The work reported in [40] concurrently executes multiple workflows using a rescheduling algorithm and dynamic task rearrangement to exploit task allocation flexibility under precedence constraints. In [41], the authors present a cluster-based allocation strategy for multiple workflow applications with soft deadlines to achieve schedule quality in terms of fairness and execution time. In [42], the authors present deadline- and budget-constrained workflow allocation to optimize time and cost in the cloud environment. Some work has also been proposed for multi-objective multiple workflow allocation, reducing makespan and economic cost [43, 44] and optimizing time and energy [45, 46].

In the literature, various approaches have been proposed for solving single workflow [9,10,11,12,13,14,15,16,17,18,19] and multiple workflow [35,36,37,38,39,40,41,42,43] allocation problems in heterogeneous distributed systems. However, many cloud-based workflow applications need to be processed in batch mode to enhance system performance. Therefore, developing superior multiple workflow allocation models is a requirement in various domains. Further, as seen in Table 1, a significant number of approaches [22,23,24,25,26,27,28,29,30,31,32] have also been proposed for security-aware workflow allocation, but considering only a single workflow. In the majority of these works, the task execution order is computed by a ranking method, and resources are then allocated according to that order. In this scenario, high-security-demand tasks with low rank may be allocated to untrustworthy machines, leading to more failures in the system. In the proposed work, by contrast, workflow tasks are grouped into partitions by depth level, and within each partition the task execution order is computed using the security demand levels. In this way, high-security-demand tasks are always allocated first and have a better chance of obtaining more trustworthy machines, leading to fewer failures in the system.

Table 1 A comparison of related works for workflow allocation models

Finally, in the multiple workflow scenario, the models [35,36,37,38,39,40,41,42,43,44,45,46] reported for cloud environments optimize makespan, schedule length, speedup, energy, total budget, and utilization. However, to the best of our knowledge, no work on multiple workflow allocation considers security constraints in the cloud environment. Therefore, this paper is the first attempt toward allocating a batch of workflow applications while satisfying the security requirements of the workflow tasks.

3 System model

This section presents an insight into the SPMWA model developed for multiple workflow allocation, representing each workflow by a DAG. It covers the notation used, workflow descriptions, virtual machine descriptions, the security model, parameter estimation, and the problem statement. The proposed model aims to produce an efficient schedule for the workflow tasks and to optimize the considered cloud QoS parameters.

3.1 Symbols used

The symbols and parameters involved in describing the various models are listed in Table 2 as follows:

Table 2 Definition of symbols

3.2 Multiple workflow model

In this work, we consider a set of multiple workflows, \(\mathcal{W}f=\left\{{\mathcal{W}f}_{i}:i=1,2,3,\dots ,{N}_{\mathcal{W}f}\right\}\), where each \({\mathcal{W}f}_{i}\) has a task set \({\tau }_{i}=\left\{{T}_{ij}:1\le j\le {N}_{{\mathcal{W}f}_{i}}\right\}\). Each workflow consists of tasks with parent (predecessor) and child (successor) relationships. Here, the jth task of the ith workflow (Tij) has a set of successor tasks (Succ(Tij)) and predecessor tasks (Pred(Tij)). A successor task depends on its predecessor tasks with a communication requirement termed the edge weight (eixy) between each such pair of tasks. The characteristics of the multiple workflows in this work are given below:

  • A batch of \({N}_{\mathcal{W}f}\) compute-intensive workflows, each represented by a DAG.

  • The number of tasks in each \({\mathcal{W}f}_{i}\) is \({N}_{{wf}_{i}}\).

  • Each \({T}_{ij}{\in \mathcal{W}f}_{i}\) has an associated level attribute (lij).

  • Precedence and dependency constraints have been handled through level attributes.

  • The number of depth levels (\({l}_{ {\mathcal{W}f}_{i}}\)) in an ith workflow is defined as \({l}_{ {\mathcal{W}f}_{i}}=\underset{\forall j}{\mathrm{max}} \{{l}_{ij}:\forall {T}_{ij}\in {\mathcal{W}f}_{i}\}\).

  • The depth level of a batch of workflows (\(\mathcal{W}f\)) is defined as, \(L=\underset{\forall i}{\mathit{max}}\left({l}_{ {\mathcal{W}f}_{i}}\right)\).

  • The multiple workflows are divided into L partitions (dl) as per the depth levels such that \({d}^{l}=\{{T}_{ij}: {l}_{ij}=l\}\).

  • The edge weight eixy represents the communication between Tix and Tiy and is measured in MIs.

The set of multiple workflows is presented in Fig. 1 with all associated attributes. The tasks of each partition are shown in the same color, e.g., (T11, T12, T21,…,\({T}_{{N}_{\mathcal{W}f}3}\)), (T13, T14, T22,…,\({T}_{{N}_{\mathcal{W}f}4}\)), (\({T}_{1{{N}_{\mathcal{W}f}}_{1}}\), T23, T24,…,\({T}_{{N}_{\mathcal{W}f}7}\)), and (\({T}_{2{{N}_{\mathcal{W}f}}_{2}}\),…,\({T}_{{N}_{\mathcal{W}f}{{N}_{\mathcal{W}f}}_{{N}_{\mathcal{W}f}}}\)) at depth levels 1, 2, 3, and L are depicted in sky blue, magenta, canary, and green respectively. Further, as seen in Fig. 1, T13 depends on T11 and T12 (with edge weights) to start its execution, whereas tasks at the same precedence level within the batch, for example T11, T12, T21,…,\({T}_{{N}_{\mathcal{W}f}3}\), can be executed in parallel.
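The depth-level partitioning described above can be sketched as follows. This is a minimal illustration assuming the usual level-attribute convention (entry tasks are at level 1, and every other task's level is one more than the maximum level of its predecessors); the function names and the toy batch are ours.

```python
# Sketch of level-attribute partitioning for a batch of workflow DAGs
# (assumed convention: entry tasks at level 1; l(T) = 1 + max level of preds).
def depth_levels(preds):
    """preds: dict mapping each task to its list of predecessor tasks
    (may describe one or several DAGs batched together)."""
    memo = {}
    def level(t):
        if t not in memo:
            memo[t] = 1 if not preds[t] else 1 + max(level(p) for p in preds[t])
        return memo[t]
    return {t: level(t) for t in preds}

def partitions(preds):
    """Return partitions d^1 .. d^L: tasks grouped by depth level."""
    levels = depth_levels(preds)
    L = max(levels.values())
    return [sorted(t for t, l in levels.items() if l == d) for d in range(1, L + 1)]

# Two toy workflows batched together: tasks in the same partition share a
# depth level and may be executed in parallel.
preds = {"T11": [], "T12": [], "T13": ["T11", "T12"],
         "T21": [], "T22": ["T21"], "T23": ["T22"]}
print(partitions(preds))
# → [['T11', 'T12', 'T21'], ['T13', 'T22'], ['T23']]
```

Because every predecessor of a task sits in a strictly earlier partition, allocating partitions in order automatically preserves the precedence constraints.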

Fig. 1 A sample multiple workflows application

3.3 Machine model

In this work, we consider a cloud computing environment having a set of n heterogeneous VMs, V = {Vk | 1 ≤ k ≤ n}, interconnected via a fully connected network. The computational VMs in this work have the following characteristics:

  • We consider n, number of heterogeneous VMs.

  • Each VM has a ready time \(\mathcal{R}{\fancyscript{t}}_{k}^{l}\) due to previously assigned tasks.

  • The VM distance (Vkr) between Vk and Vr is calculated as the number of hop counts between them.

  • The processing capacity (PCk) of VMs is in MIPS.

  • Once a VM has started task execution, it continues without interruptions, and after completing the execution it immediately sends the output data to all the children tasks in parallel.

  • The expected time to compute matrix entry Eijk gives the estimated execution time of task Tij on Vk.

The communication cost depends on two parameters: eixy between the tasks (i.e., Tix and Tiy) and Vkr between the VMs (i.e., Vk and Vr). Suppose two tasks Tix ∈ dl and Tiy ∈ dl−s of the workflow \({\mathcal{W}f}_{i}\) are assigned to Vk and Vr respectively; their communication cost is represented by \({CC}_{ixykr}^{l-s, l}\) and can be computed as [38, 39]:

$${CC}_{ixykr}^{l-s, l}=\pi \left({e}_{ixy}\times {V}_{kr}\right)$$
(1)

where π is taken as one for linear behaviour. \({CC}_{k}^{l}\) gives the total communication cost of the tasks assigned to Vk at depth level l that need communication with predecessor tasks allocated to other VMs. This value is zero for tasks whose predecessors are assigned to the same VM. In this model, it is taken as the maximum of all \({CC}_{ixykr}^{l-s, l}\) among tasks assigned to Vk, owing to the possibility of parallel communication, and is written as

$${CC}_{k}^{l}=\underset{\forall x,y,r}{\mathit{max}}({CC}_{ixykr}^{l-s, l})$$
(2)
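Equations (1) and (2) can be illustrated with a short sketch (the numbers are hypothetical; π = 1 for linear behaviour, as in the text):

```python
# Sketch of Eqs. (1)-(2): pairwise communication cost is edge weight times
# hop distance (pi = 1 for linear behaviour), and the per-VM cost at a depth
# level is the maximum over arriving transfers, since they can proceed in
# parallel.
def pair_cost(edge_weight, hop_distance, pi=1.0):
    return pi * edge_weight * hop_distance          # Eq. (1)

def vm_level_cost(pairs):
    """pairs: list of (edge_weight, hop_distance) tuples for predecessor
    transfers arriving at this VM; same-VM predecessors contribute nothing,
    and an empty list yields zero cost."""
    return max((pair_cost(e, v) for e, v in pairs), default=0.0)  # Eq. (2)

# Two incoming transfers with hypothetical weights/distances; the slower
# transfer dominates because both run in parallel.
print(vm_level_cost([(10, 2), (8, 3)]))
# → 24.0
```

Taking the maximum rather than the sum encodes the model's assumption that a VM receives all predecessor data streams concurrently.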

3.4 Security model

Security in cloud computing can be perceived as the protection mechanism against unauthorized access, use, and modification of cloud resources. A variety of rules, processes, controls, and technologies are used to safeguard the infrastructure of cloud-based systems from security risk. Security risk relates to vulnerabilities, failure probability, and attacks or threats. The cloud environment is risk-prone due to challenging security threats. Therefore, it is crucial for vendors to provide best-in-class security tailored to the cloud infrastructure. Thus, cloud vendors use their own security standards, methods, and models to satisfy clients' requirements. Further, cloud system security offers many benefits, including centralized security, reduced cost and administration, and reliability. During workflow execution in the IaaS cloud, a workflow management system (WMS) allocates the workflow tasks onto secure cloud resources so that they can execute without failures. A secure workflow system requires taking into account a variety of security services for modeling any security-sensitive application, such as authentication, integrity, and confidentiality, as discussed below:

  • Authentication: It refers to trustworthily confirming the identities of task execution agents. Authentication, authorization, and accounting (AAA) is a security module organizing these three functions. When a user tries to access cloud resources from a CSP, AAA verifies the user's authentication information. If the user is authenticated, AAA then authorizes the user's access to the system. Several methods are available for authenticating users, such as HMAC-MD5, HMAC-SHA-1, and CBC-MAC-AES [22, 24, 55].

  • Integrity: Integrity services guarantee that no one can tamper or modify the data and applications while executing on the IaaS cloud. When the attacker modifies the data, the integrity of the data is compromised. Integrity can be achieved using various hash functions such as Tiger, RIPEMD-160, SHA-1, etc. [28, 55, 57].

  • Confidentiality: Confidentiality is important for users storing their confidential resources in the cloud. It is the defense against eavesdropping and other passive attacks on cloud resources; a passive attacker could disclose sensitive information that is transferred insecurely or without encryption. Confidentiality can be achieved by using various encryption algorithms like IDEA, DES, etc. [22, 55, 57].

Under the authentication service, machines are authenticated and permitted to process workflow tasks during workflow execution. A workflow authorization model grants authorization in such a way that machines have access to the necessary objects, synchronizing the authorization flow with the workflow during execution. Next, the associated information of workflow tasks needs to be transferred among machines during processing, owing to communicating and dependent tasks. Hence, it is the service provider's responsibility to keep these transfers confidential and unaltered, maintaining confidentiality and integrity respectively. Task failure can result from a security threat or from inaccessibility due to a security-imposed barricade in the cloud system. In the proposed SPMWA model, the authentication service is considered during secure workflow task allocation. Authentication is the initial process, admitting only authorized machines to satisfy the security requirements for executing the workflow tasks of the various users. In this way, the cloud system can prevent task failures during workflow execution. Authentication must be validated to protect data transfer from security attacks to a certain extent according to its service-level requirement. For a task Tij having security level \({SL}_{ij}^{a}\) with respect to authentication using the mith method, an authentication method providing a higher security level must be applied if the task is outsourced to the cloud, i.e., \({SL}_{ij}^{a}\le {SL}_{{m}_{i}}^{a}\), where \({SL}_{{m}_{i}}^{a}\) represents the security level provided by the mith authentication method (mi ∈ {0, 1, 2, 3}). We also assume, without loss of generality, that \({\mu }_{{m}_{i}-1}^{a}<{\mu }_{{m}_{i}}^{a}, \forall {m}_{i}\in \{0, 1, 2, 3\}\), as shown in Table 3, where \({\mu }_{{m}_{i}}^{a}\) is the authentication time overhead of mi.
For task Tij, the optimal authentication method satisfying the security level requirement with the least time overhead is the one meeting the criterion \({SL}_{{m}_{i}-1}^{a}<{SL}_{ij}^{a}\le {SL}_{{m}_{i}}^{a}\). These authentication methods are shown in Table 3.

Table 3 Authentication methods for authentication services [55, 58]

Each authentication method is assigned a security level (\({SL}_{{m}_{i}}^{a}\)) in accordance with its performance. A value of 1 corresponds to the CBC-MAC-AES method. The SLs of the other methods can be computed as per Eq. (3), where \({\mu }_{{m}_{i}}^{a}\) is the performance of the mith authentication method, 0 ≤ mi ≤ 3.

$${\text{SL}}_{{m_{i} }}^{a} = \mu_{{m_{i} }}^{a} /163;\,0 \le m_{i} \le 3$$
(3)
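Eq. (3) normalizes each method's overhead against that of the strongest method (CBC-MAC-AES, with the value 163 appearing in the denominator of Eq. (3)). A sketch follows; the overhead values for HMAC-MD5 and HMAC-SHA-1 below are illustrative placeholders in the style of the security-aware scheduling literature [24, 55], not a reproduction of Table 3.

```python
# Sketch of Eq. (3): SL^a_{m_i} = mu^a_{m_i} / 163, normalizing each
# authentication method's overhead by that of CBC-MAC-AES.
# NOTE: the mu values for HMAC-MD5 and HMAC-SHA-1 are illustrative
# placeholders, not the paper's Table 3.
MU_A = {"HMAC-MD5": 90.0, "HMAC-SHA-1": 148.0, "CBC-MAC-AES": 163.0}

def security_level(method):
    return MU_A[method] / 163.0   # Eq. (3)

for m in MU_A:
    print(m, round(security_level(m), 2))
```

With these placeholder overheads, the levels come out to roughly 0.55, 0.91, and exactly 1.0 for CBC-MAC-AES, matching the convention that the costliest method carries the highest security level.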

Assume that SOijk is the security overhead required to fulfill the authentication service for workflow task Tij; it is a function of the task's security level. The characteristics related to the security of the multiple workflows and VMs are given below:

  • Each Tij has a SDij that needs to be satisfied on VMs.

  • SDij varies from very high to very low, representing task priority, such that SDij ∈ {0.0, 0.1, 0.2,…,1.0}. For example, SDij = 1.0, SDij = 0.5, and SDij = 0.1 denote very high, average, and low security demands respectively.

  • Each virtual machine (VMk) has a trust level (TLk) normalized in the range [0, 1].

  • Trust level of a cloud system (TLCS) is defined as the trust level of the highest trustable VM such as \({TL}_{ CS}= \underset{\forall k}{\mathit{max}}\{{ TL}_{k}\}\).

  • The failure coefficient lies in the range λk ∈ [0.1, 5.5].

Now, in the allocation process, each task from the cloud users, with its security demand SDij, is mapped onto a suitable VM. The security overhead of Tij on Vk (SOijk) given its security demand can be estimated as per Eq. (4).

$$SO_{{ijk}} = \left\{ {\begin{array}{*{20}c} {\mu _{{m_{i} }}^{a} (SD_{{ij}} - TL_{k} )/PC_{k} ;} & {\left. {SD_{{ij}} > TL_{k} } \right.} \\ {0;} & {\left. {SD_{{ij}} \le TL_{k} } \right.} \\ \end{array} } \right.$$
(4)

SDij is the security demand of Tij, specifying the security level required by the workflow task. TLk is the trust level of Vk available to fulfill the security demand of the tasks. \({\mu }_{{m}_{i}}^{a}\) is the time overhead incurred by the authentication method [24, 55, 58] to fulfill the corresponding security level. In this workflow risk analysis model, the failure probability is treated as a function of the SLs, and the number of failures in any fixed time interval is assumed to follow a Poisson distribution. Accordingly, the workflow task's failure probability for the combined security services on a particular VM can be expressed by an exponential distribution as follows [24, 28,29,30].

$$F{\fancyscript{p}}_{ijk} = \left\{ {\begin{array}{*{20}c} {1 - e^{ - \lambda _{k} \left( {SD_{ij} - TL_{k} } \right)} ;} & {SD_{ij} > TL_{k} } \\ {0;} & {SD_{ij} \le TL_{k} } \\ \end{array} } \right.$$
(5)

In cloud computing, \({\lambda }_{k}\) is taken as a failure coefficient, and it varies over VMs. The negative exponent shows that the failure probability rises with the shortfall SDij − TLk.
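Equations (4) and (5) translate directly into code. The following is a minimal Python sketch with scalar inputs; the parameter values used in the example (e.g., λk = 1.5) are illustrative, not the actual Table 4 data.

```python
import math

def security_overhead(mu_a, sd, tl, pc):
    """Eq. (4): authentication overhead of T_ij on V_k; zero when the
    VM's trust level already covers the task's security demand."""
    return mu_a * (sd - tl) / pc if sd > tl else 0.0

def failure_probability(lam, sd, tl):
    """Eq. (5): exponential failure model driven by the shortfall SD - TL."""
    return 1.0 - math.exp(-lam * (sd - tl)) if sd > tl else 0.0
```

A shortfall of 0.1 with λk = 1.5, for instance, yields a failure probability of about 0.139, while a fully satisfied demand yields exactly zero.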

3.5 Problem statement and parameter estimation

In this section, the workflow allocation problem and the parameter estimation of the performance metrics for the proposed model are discussed in detail. In the multiple workflows allocation problem, a set of workflows (Wf) needs to be mapped onto a set of n heterogeneous virtual machines (V) in an IaaS cloud computing environment so as to minimize the number of task failures and the failure probability of the workflow system while satisfying various constraints. A pictorial representation of this problem is presented in Fig. 2. Here, multiple workflows are submitted to the cloud by different users. The security prioritized workflow allocator captures all required information and performs security prioritized allocation of the workflow tasks. The VM manager manages and offers the set of VMs with their required attributes, i.e., trust level, computing capacity, etc. The physical layer comprises clusters, servers, supercomputers, and the other fundamental computing resources that make up a cloud infrastructure. Through virtualization, users can use virtualized machines hosted on the physical machines without any management overhead.

Fig. 2
figure 2

Multiple workflow allocation problem

The problem statement can be represented by the mapping function f: \(\mathcal{W}f\) × V → {1, 0}, producing an allocation schedule such that

Minimize \(F{\fancyscript{p}}\) and NTF subject to the constraints:

  1. $$\sum_{j=1}^{{N}_{{wf}_{i}}}Allocation [i][j][k]=P;\mathrm{ where, } 0\le P\le {N}_{{wf}_{i}}$$
  2. $$\sum_{k=1}^{n}Allocation [i][j][k]=1$$
  3. $$\sum_{i=1}^{{N}_{wf}}\sum_{j=1}^{{N}_{{wf}_{i}}}\sum_{k=1}^{n}Allocation [i][j][k]=\sum_{i=1}^{{N}_{\mathcal{W}f}}{N}_{{wf}_{i}}$$
  4. $${ST}_{ijk}\ge max\left\{FT\left[pred\left({T}_{ij}\right)\right]\right\}$$

The SPMWA model aims to minimize the failure probability and the number of task failures as per the requirements of the cloud users. Some of the main parameters considered in the study are given as follows:

The Expected Time to Execute (Eijk) is given by Eq. (6) as follows:

$${E}_{ijk}=\frac{{Wl}_{ij}}{{PC}_{k}}$$
(6)

Now, the tasks of the various depth levels are assigned to fit virtual machines using the proposed SPMWA model explained in Sect. 4. The Finish Time (FTijk) of Tij on each Vk is computed by using Eq. (7) as follows:

$${FT}_{ijk}={ST}_{ijk}+ {E}_{ijk}+{SO}_{ijk}$$
(7)

where STijk is the Start Time of Tij on Vk, taken as the finish time of the last task already assigned to that virtual machine. After the allocation of all tasks of the specified partition, i.e., \({\forall T}_{ij}\in {d}^{l}\), the processing time \(\left({\mathcal{P}\fancyscript{t}}_{k}^{l}\right)\) on Vk at the lth level is estimated. It is the summation of the differences between the finish and start times of the partition's tasks on the VM and can be written as

$${\mathcal{P}\fancyscript{t}}_{k}^{l}= \sum_{{V}_{k }\leftarrow \forall {T}_{ij}\in {d}^{l}}\left({FT}_{ijk}-{ST}_{ijk}\right)$$
(8)

\({\mathcal{R}\fancyscript{t}}_{k}^{l}\) is the initial ready time of each VM, arising from workloads previously assigned to it; it affects only the task allocation of the first level. The total processing time on Vk, \(\left({T\mathcal{P}\fancyscript{t}}_{k}^{l}\right)\), is the processing time plus the initial ready time at the first level, or plus the communication cost at subsequent levels, as presented in Eq. (9)

$${T\mathcal{P}\fancyscript{t}}_{k}^{l}=\left\{\begin{array}{c}{\mathcal{R}\fancyscript{t}}_{k}^{l}+ {\mathcal{P}\fancyscript{t}}_{k}^{l} ;\, if\, {T}_{ij}\in {d}^{1}\\ {CC}_{k}^{l} + {\mathcal{P}\fancyscript{t}}_{k}^{l};\, otherwise\end{array}\right.$$
(9)
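Equations (8) and (9) can be sketched in a few lines of Python; the function names and argument encoding are illustrative, not the paper's implementation (which was prototyped in MATLAB).

```python
def processing_time(intervals):
    """Eq. (8): sum of (FT - ST) over the partition's tasks placed on V_k.
    `intervals` is a list of (start_time, finish_time) pairs."""
    return sum(ft - st for st, ft in intervals)

def total_processing_time(pt, ready_time=0.0, comm_cost=0.0, first_level=False):
    """Eq. (9): the ready time is charged only at the first level;
    afterwards the inter-level communication cost is charged instead."""
    return ready_time + pt if first_level else comm_cost + pt
```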

After the allotment of \(\forall {T}_{ij}\in {d}^{l}\), some idle gaps are left on some VMs. An idle gap list \({I\fancyscript{g}}^{l}= \left\{{I\fancyscript{g}}_{k}^{l} : {\fancyscript{g}}_{k}, \forall k \right\}\) is maintained for the respective VMs at the various depth levels. The idle gap size (\({I\fancyscript{g}}_{k}^{l}\)) at depth level l on Vk can be computed as

$${I\fancyscript{g}}_{k}^{l}=\underset{\forall k}{\mathit{max}}\left({T\mathcal{P}\fancyscript{t}}_{k}^{l}\right)-{T\mathcal{P}\fancyscript{t}}_{k}^{l}$$
(10)

After this, succ(Tij) from the next depth level partition (dl+1) are selected and suitable tasks are accommodated within the idle gaps generated as per idle gap reduction method presented in Algorithm #3. Further, \({T\mathcal{P}\fancyscript{t}}_{k}^{l}\) is updated due to some additional assignments on VMs. The updated total processing time \({({T\mathcal{P}\fancyscript{t}}_{k}^{l})}^{u}\) on VM can be written as

$${({T\mathcal{P}\fancyscript{t}}_{k}^{l})}^{u}={T\mathcal{P}\fancyscript{t}}_{k}^{l}+ \sum_{\forall {T}_{ij}\in {d}^{l+1}}{E}_{ijk}^{net}$$
(11)

where \({E}_{ijk}^{net}\) is the net processing time of a successor task inserted into an idle gap, i.e., \({E}_{ijk}^{net}= {E}_{ijk}+{SO}_{ijk}+{CC}_{ixykr}^{l-s, l}\). Now, the Makespan is the total processing time taken by the submitted batch of workflows. It is estimated as the queuing time plus the summation of \({\mathrm{max}({T\mathcal{P}\fancyscript{t}}_{k}^{l})}^{u}\) over all depth levels and written as

$$Ms=Qt+\sum_{l=1}^{L}\underset{\forall k}{\mathit{max}}{\left({T\mathcal{P}\fancyscript{t}}_{k}^{l}\right)}^{u}$$
(12)

where Qt is the Queuing Time, estimated as the average waiting time of the multiple workflows in the global queue as

$$Qt=\left\{\begin{array}{c}\frac{1}{\mu -\Gamma };\, if\, \beta \le {Q}_{u}\\ \frac{1}{\mu -\Gamma }\left(\frac{\beta }{{Q}_{u}}\right);\, if\, \beta > {Q}_{u}\end{array}\right.$$
(13)

where β is the sum of the workloads of all workflow tasks. Virtual machines may become unavailable to the system when infected by malicious attacks or intrusions, and many parallel workflows are executed in this risk-prone cloud environment. Therefore, it is essential to have guaranteed security services so that the workflow tasks execute with negligible failures. If task failures do happen, their number should be insignificant for the system to be considered secure. Thus, the number of task failures (NTF) is an important QoS parameter that assesses how often tasks with a given security requirement are allocated to VMs of insufficient trust. The failure probability of the workflow system also needs to be estimated as one of the main targets. Hence, we evaluate the proposed SPMWA on security metrics, namely the number of task failures and the failure probability of the entire batch of workflows in the cloud system. The aim is to satisfy all tasks on trustable VMs; yet, in some cases, tasks may have to be assigned to insufficiently trustworthy VMs. So, we define a task failure set, Tfailure = {Tij: Vk ← Tij such that SDij > TLk}, i.e., \(T_{failure} \subset \cup \tau_{i}\), i = 1, 2,…, Nwf, containing the tasks assigned to insufficiently trustworthy VMs. The total number of task failures is then the size of this set, o(Tfailure), as per Eq. (14):

$$NTF= o({T}_{failure})$$
(14)

The failure probability is the performance metric assessing the likelihood of failure of the tasks assigned to a specified virtual machine, which could result from the inaccessibility of a security-imposed barricade or a severe attack. Thus, the task failure probability of Tij assigned on Vk can be computed as:

$$F{\fancyscript{p}}_{\left({T}_{ij}\right)}=\sum_{k=1}^{n}{z}_{ijk}* F{\fancyscript{p}}_{ijk}$$
(15)

where zijk is the assignment vector, indicating whether Tij is assigned on Vk or not:

$${z}_{ijk}=\left\{\begin{array}{c}1,\, {T}_{ij}\, is\, assigned\, to\, {V}_{k} \\ 0, \, otherwise\end{array}\right.$$
(16)

The failure probability of the cloud system (\(F\fancyscript{p}\)) for a considered batch of workflows (\(\mathcal{W}f\)) executed in a risk-prone environment (attack or failure) can be computed as:

$$F{\fancyscript{p}} = 1 - \prod\limits_{{T_{{ij}} \in \tau }} {\left( {1 - F{\fancyscript{p}}_{{\left( {T_{{ij}} } \right)}} } \right)}$$
(17)

NTF, \(F{\fancyscript{p}}\), and Ms are used as performance metrics to evaluate our proposed security prioritized multiple workflow allocation model presented in Sect. 4.
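The two security metrics of Eqs. (14) and (17) can be sketched as follows; the helper names and the sample values in the usage are illustrative, not taken from the experiments.

```python
def number_of_task_failures(assignments):
    """Eq. (14): count tasks placed on VMs whose trust level is below
    the task's security demand; `assignments` holds (SD_ij, TL_k) pairs."""
    return sum(1 for sd, tl in assignments if sd > tl)

def system_failure_probability(task_fps):
    """Eq. (17): 1 - prod(1 - Fp(T_ij)) over all tasks of the batch."""
    survive = 1.0
    for fp in task_fps:
        survive *= 1.0 - fp
    return 1.0 - survive
```

For example, two tasks with individual failure probabilities 0.1 and 0.2 give a system failure probability of 1 − 0.9 × 0.8 = 0.28.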

4 Security prioritized multiple workflow allocation model

In this section, a security prioritized multiple workflow allocation (SPMWA) model with precedence constraints is presented for a cloud computing environment. In this model, higher priority is given to workflow tasks with high security demands at allocation, and the main target is to minimize the failure probability of the IaaS cloud system during processing. Workflow tasks are allocated in accordance with their level attribute. After the allocation of each partition, idle gaps are reduced by inserting suitable tasks from the next partition. An algorithmic template for SPMWA is presented in Algorithm #1.

Firstly, the batch of multiple workflows is divided into depth levels before allocation, as mentioned in Sect. 3.2. In this phase, the workflow tasks of each partition are assigned sequentially onto the set of virtual machines. In each partition, allocation preference is given to the higher security demand tasks, followed by the lower ones, and the tasks are assigned to fit VMs. The fit VM for a specified task is the VM that satisfies its security demand with the trust level offered and also takes the least finish time to execute it. Task selection and allocation from each partition are accomplished as follows:

  • Divide the batch of multiple workflows into partitions (dl) as per depth level. In dl list, tasks have the same level attribute and can be executed in parallel.

  • In each partition, clusters are created by grouping the tasks with the same security demand (SD): \({C}_{SD}^{l}=\{{T}_{ij} : \forall {SD}_{ij}=SD\}\) where SD ∈ {0.0, 0.1, 0.2, …, 1.0}. In this way, at most eleven clusters can be formed in each partition. For allocation, the clusters are selected in decreasing order of security demand.

  • Then, sort the tasks of each cluster by workload in descending order, as shown in step 4 of Algorithm #1. Workflow tasks are selected one by one from the cluster, so the largest tasks are allocated first.

  • Select \({T}_{ij}\in {C}_{SD}^{l}\) from the sorted cluster, then find VFit by calling VM Selection () as per step 10 in Algorithm #1. Algorithm #2 selects the best fit machine, VFit, and returns the task status Fail, equal to 0 or 1. After this, NTF is updated as per step 13. Further, the set of failed tasks (Tfailure) is updated as per steps 14–17.

  • After the assignment of \(\forall {T}_{ij}\in {d}^{l}\), compute the values of \({CC}_{k}^{l}\) and \({\mathcal{P}\fancyscript{t}}_{k}^{l}\) as per Eq. (2) and Eq. (8), respectively. The communication cost is zero for first-level workflow tasks because they have no parent tasks.

  • After completion of all tasks in each dl, find the idle gap list and reduce the idle gaps by inserting suitable tasks into them via Idle Gap Reduction () in Algorithm #1. The idle gap reduction procedure is explained in Algorithm #3.

  • Finally, \({CC}_{k}^{l}\) and \({({T\mathcal{P}\fancyscript{t}}_{k}^{l})}^{u}\) are computed after the allocation of tasks into the idle gaps as per Eq. (2) and Eq. (11), respectively. A schedule is then generated and the QoS parameters are computed.
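The selection order described in the steps above (partition by depth level, cluster by security demand in decreasing order, sort each cluster by workload in decreasing order) can be sketched as follows. The task encoding used here is an assumption made for illustration, not the paper's data structure.

```python
from collections import defaultdict

def allocation_order(tasks):
    """Order tasks as SPMWA does: by depth level, then by security
    demand (high first) within the level, then by workload (large
    first) within the cluster. `tasks` maps a task id to a
    (level, SD, workload) tuple."""
    levels = defaultdict(lambda: defaultdict(list))
    for tid, (lvl, sd, wl) in tasks.items():
        levels[lvl][sd].append((tid, wl))
    order = []
    for lvl in sorted(levels):                                  # level-wise
        for sd in sorted(levels[lvl], reverse=True):            # high SD first
            cluster = sorted(levels[lvl][sd], key=lambda t: -t[1])  # large first
            order.extend(tid for tid, _ in cluster)
    return order
```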

figure d

4.1 VM selection

In this part, the VM selection procedure for the task at hand is presented. The VM selection aims to find, for each task, a fit VM that satisfies the task's security demand with the trust level offered. Initially, the security requirement of the task to be assigned (SDij) is compared with the trust level of the cloud system, i.e., TLCS = max (TLk, k = 1, 2, …, n). The VM selection procedure works in two cases as follows:

Case 1 (If SDij ≤ TLCS): The cloud system has a set of VMs whose trust levels are greater than or equal to the security demand of the task. These trustable VMs can fully satisfy the task's security demand and execute it without any risk of failure. The best fit virtual machine (VFit) is the fully trustworthy VM offering the least finish time for the task among all fit VMs (step 4, Algorithm #2). The VM selection procedure returns VFit, which can execute the task with zero failure probability since the requirement is satisfied fully. Hence, the variable Fail returns 0, indicating the task has not failed. Thus, in this case, the task is assigned to a fully trustworthy machine as per steps 2–5 of Algorithm #2.

Case 2 (If SDij > TLCS): The cloud system is generally capable of providing services on demand with elasticity, supplying the required trustable resources for processing irrespective of place, time, and amount demanded. Yet there may be rare cases when the task's security demand cannot be fully satisfied by the trust level of any available VM, i.e., every VM in the cloud system has a trust level lower than the security demand of the task. In this scenario, the SPMWA model finds the most trustworthy VM in the system (steps 6–13, Algorithm #2). For this, the procedure decrements the task's security demand by one level at a time until the updated SDij reaches TLCS, so that the task can be assigned to the most trustworthy available VM (steps 9–11, Algorithm #2). In this way, the failure probability is minimized to the lowest possible value. The task is considered partially failed (Fail is set to 1) and counted as a task failure.

Thus, the SPMWA model tries to assign each task to the most trustworthy machine in the cloud system so that the failure probability is either zero or minimal. All counted task failures (NTF) are partial failures, not full failures. The stepwise VM selection procedure is presented in Algorithm #2.
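The two-case procedure can be condensed into a short sketch. Since the stepwise decrement of SDij in Case 2 always terminates at the most trustworthy VM, the sketch collapses that loop into a direct maximum; the names and encodings are illustrative.

```python
def select_vm(sd, trust_levels, finish_time):
    """Two-case VM selection in the spirit of Algorithm #2.
    `trust_levels` maps a VM id to its trust level; `finish_time(vm)`
    gives the task's finish time on that VM. Returns (fit VM, Fail)."""
    tl_cs = max(trust_levels.values())
    if sd <= tl_cs:
        # Case 1: among fully trustworthy VMs, pick the earliest finisher.
        fit = [v for v, tl in trust_levels.items() if tl >= sd]
        return min(fit, key=finish_time), 0
    # Case 2: no VM satisfies SD; the stepwise decrement of SD ends at
    # the most trustworthy VM, so select it directly (partial failure).
    return max(trust_levels, key=trust_levels.get), 1
```

With the trust levels of the illustrative example (0.4, 0.6, 0.7), a task with SD = 0.8 lands on the most trustworthy VM with Fail = 1, while a task with SD = 0.4 goes to whichever fit VM finishes earliest.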

figure e

4.2 Idle gap reduction

The idle gap reduction begins after the allocation of all workflow tasks on fit VMs at each partition. The idle gaps on the VMs at a given depth level are reduced by inserting suitable tasks from the next level, succ(Tij). For better visualization, the process is shown in the illustration of Sect. 4.3 with Fig. 4. At first, after the allocation of all workflow tasks at each level, \(max\left({T\mathcal{P}\fancyscript{t}}_{k}^{l}\right)\in {d}^{l}\) is computed. Then the idle gap list, \({I\fancyscript{g}}^{l} = \left\{{I\fancyscript{g}}_{k}^{l} : {\fancyscript{g}}_{k}, \forall k \right\}\), is determined at each level as per Eq. (10). To insert tasks from the next level/partition, i.e., \({T}_{ij}\in {d}^{l+1}\), we need the start time of the tasks to be inserted, the finish time of their predecessors, and the communication cost between them. Two conditions must be satisfied for a task to fit an idle gap: (i) the start time of the inserted task must be greater than or equal to the finish time of its predecessors, i.e., STijk ≥ FTixk, and (ii) the gap size must be greater than or equal to the net expected time to complete the inserted task, i.e., \({\fancyscript{g}}_{k} \ge {E}_{ijk}+{SO}_{ijk}+{CC}_{ijxkr}^{l, l+1}\), as per steps 5–8 in Algorithm #3. The idle gaps (\({I\fancyscript{g}}_{k}^{l}\)) are reduced by allocating tasks from the successor level whose security demand and size allow them to be adjusted into the gaps on VFit. The workflow tasks are inserted so as to preserve the precedence constraints of the multiple workflows. For any \({I\fancyscript{g}}_{k}^{l}\), the neighboring tasks already assigned to that VM are considered so that the communication overhead introduced by the insertion is minimized. This phase avoids idle gaps on VMs as much as possible, optimizing the makespan of the system. The idle gap reduction algorithm is presented in Algorithm #3.
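The two admission conditions reduce to a simple predicate; the argument names are illustrative. In the usage below, the gap size (9.97) and net time (6.25) mirror the T14 insertion of the illustrative example, while the start and predecessor finish times are hypothetical.

```python
def fits_idle_gap(gap_size, start_time, pred_finish, e_net):
    """Admission test in the spirit of Algorithm #3: the inserted task
    may not start before its predecessor finishes, and its net
    processing time (execution + security overhead + communication)
    must fit inside the gap."""
    return start_time >= pred_finish and gap_size >= e_net
```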

figure f

4.3 An illustrative example

An illustration is demonstrated for a better understanding of the SPMWA model. We consider a virtual machine set with three instances, V = {V1, V2, V3}. The various characteristics associated with the cloud VMs, namely processing capacities, ready times, trust levels, failure coefficients, and machine distances, are presented in Table 4. The trust level of the cloud system is TLCS = max(TL1, TL2, TL3) = max(0.4, 0.6, 0.7) = 0.7.

Table 4 The heterogeneous VMs parameters

Further, we consider two workflows, \({\mathcal{W}f}_{1}\) and \({\mathcal{W}f}_{2}\), consisting of 8 and 7 tasks with depth levels 4 and 3, respectively, as shown in Fig. 3. The corresponding edge weights (inter-task communication) between tasks are shown as edge labels in Fig. 3. For example, 3.4 is the edge weight (e113) between tasks T11 and T13 of \({\mathcal{W}f}_{1}\). The workflow task attributes, namely the level attribute (lij), workload (Wlij), and security demand (SDij) of each task, are presented in Table 5. Let \({\mu }_{{m}_{i}}^{a}\) = 90 ms, service rate (\(\mu\)) = 0.05, arrival rate (Γ) = 0.03, and queue unit (Qu) = 10,000 MIs. The queuing time for the batch of workflows, computed using Eq. (13), is Qt = 50.

Fig. 3
figure 3

A sample of two multiple workflows application

Table 5 The workflow tasks information

The expected time to compute (Eijk), security overhead (SOijk), and failure probability \({(F{\fancyscript{p}}}_{ijk})\) of each task on all VMs are computed as per Eqs. (6), (4), and (5), respectively, and the resulting values are presented in Table 6.

Table 6 Computed values of Eijk, SOijk, and \({F\fancyscript{p}}_{ijk}\) for the tasks on respective VMs

At first, SPMWA divides the workflows of Fig. 3 into four partitions (dl): (T11, T12, T21, T22, T23), (T13, T14, T15, T24), (T16, T25, T26, T27), and (T17, T18), according to depth levels 1, 2, 3, and 4, respectively. Afterward, in each partition, clusters with distinct security demands are created. For example, the tasks of the first partition have three distinct security demands, i.e., 0.8, 0.7, and 0.4, so three clusters are created from the first partition (d1), namely \({C}_{0.8}^{1}, {C}_{0.7}^{1},\) and \({C}_{0.4}^{1}\), and sorted according to their workloads in descending order. The remaining partitions are treated in the same manner. During allocation, SPMWA gives higher priority to the clusters with higher security demands, so those clusters are allocated before the lower ones. Within any cluster, the allocation order follows the workload, so the larger tasks get assigned before the smaller ones. Since SPMWA follows a level/partition-wise allocation policy, the complete allocation order of the clusters within the partitions, and of the tasks within the clusters, is as follows:

$$d^{1} = \{ C_{0.8}^{1} = \{ T_{11} \} ,\,C_{0.7}^{1} = \{ T_{21} ,T_{23} \} ,\,C_{0.4}^{1} = \{ T_{22} ,\,T_{12} \} \}$$
$$d^{2} = \{ C_{0.9}^{2} = \{ T_{13} \} ,\,C_{0.6}^{2} = \{ T_{24} ,\,T_{14} \} ,\,C_{0.5}^{2} = \{ T_{15} \} \} ,$$
$$d^{3} = \{ C_{0.7}^{3} = \{ T_{27} ,\,T_{16} \} ,\,C_{0.6}^{3} = \{ T_{26} \} ,\,C_{0.4}^{3} = \{ T_{25} \} \} \,{\text{and}}$$
$$d^{4} = \{ C_{0.7}^{4} = \{ T_{18} \} ,\,C_{0.4}^{4} = \{ T_{17} \} \}$$

At first, the ready times \(({\mathcal{R}\fancyscript{t}}_{k})\), i.e., (24, 10, 20), act as the start times (STijk) on the respective VMs, as shown in Table 7. The cluster of the first partition (d1) with the highest security demand, i.e., \({C}_{0.8}^{1}\) = {T11}, containing only one task, is selected first for allocation. As shown in Table 7, the finish times of T11 on the VMs are computed using Eq. (7). T11 has security demand SD11 = 0.8, while the cloud system can offer at most TLCS = max (0.4, 0.6, 0.7) = 0.7. As SD11 > TLCS, no VM in the cloud system can completely satisfy SD11 and execute the task risk-free. Therefore, the security demand of T11 is decremented by one level to bring it down to the trust level of the most trustworthy VM in the system, i.e., TL3 = 0.7 (steps 7–9, Algorithm #2). Algorithm #2 returns VFit = V3 and Fail = 1 for T11. Although V2 offers the least finish time, FT112 = 16, SPMWA determines V3 as the fit machine for T11 by giving priority to the security requirement over the completion time (TL3 = 0.7 is higher than TL2 = 0.6). Thus, T11 is assigned on V3, starting at ST113 = 20 and finishing at FT113 = 22.60. The task failure set is updated by including T11, Tfailure = {T11} (step 15, Algorithm #1). The assignment of T11 is thus an allocation with task failure (F), having failure probability 0.1393 computed as per Eq. (5), and its start and finish times are presented without a rectangle box. In Table 7, the start and finish times on VFit (the best fit VM for allocation) of each task are shown in bold, and the values shown in rectangle boxes indicate that the corresponding VM can fully satisfy the security demand of the specified task, and vice versa.

Table 7 Illustration of allocation of tasks at depth level = 1
Table 8 Illustration of idle gap reduction for depth level = 1
Table 9 Illustration of allocation of tasks at depth level = 2
Table 10 Illustration of idle gap reduction for depth level = 2
Table 11 Illustration of allocation of tasks at depth level = 3
Table 12 Illustration of idle gap reduction for depth level = 3

Next, the cluster \({C}_{0.7}^{1}=\) {T21, T23}, having two tasks, is taken up for allocation. For T21, the finish times are again computed by Eq. (7) and presented in Table 7. Here, SD21 = TLCS, so T21 can be executed without any failure on the cloud system. As per the allocation process mentioned earlier, VFit = V3 with TL3 = 0.7 against SD21 = 0.7. So, T21 is assigned and executed on V3 without any risk of failure (i.e., \({F\fancyscript{p}}_{213}\) = 0); the status of T21 on allocation is non-failure (NF), and its start and finish times are shown in bold within a rectangle box. T23 is assigned on V3 similarly, with status NF. For the cluster \({C}_{0.4}^{1}\) = {T22, T12}, both tasks have a security demand of 0.4, and the trust levels of all VMs are greater than or equal to 0.4, so these tasks can be allocated and executed on any VM without risk. As per SPMWA, the fit machine for both T22 and T12 is VFit = V2, since V2 offers the least finish time among all, as shown in Table 7. Hence, T22 and T12 are assigned on V2 with status NF. In this way, the first-level tasks are assigned. At this level, only T11 is executed with risk, due to the unavailability of a sufficiently trustworthy VM; the rest of the tasks are executed risk-free with zero failure probability.

As shown in Table 8, the processing time and total processing time on Vk at the first depth level are computed as per Eqs. (8) and (9), respectively. The communication cost values are zero for first-level tasks on all VMs. After this, the idle gap reduction procedure (Algorithm #3) finds the idle gaps \(({I\fancyscript{g}}_{k}^{1})\) on Vk using Eq. (10), presented in Table 8. The successor tasks of d1 from the next partition (d2), namely T13, T24, T14, and T15, are then considered for accommodation into suitable idle gaps according to Algorithm #3. Consequently, only T14 ∈ d2 fits into the idle gap (\({I\fancyscript{g}}_{2}^{1}:9.97\)) on V2, because the net processing time of T14 is less than the size of this idle gap, i.e., \({E}_{142}^{net}={E}_{142}+{SO}_{142}+max({CC}_{14222}^{12})\) = 6.25 + 0 + 0 = 6.25 < 9.97. Here, \({CC}_{14222}^{12}\) is computed using Eq. (1). After that, SPMWA computes \({({T\mathcal{P}\fancyscript{t}}_{k}^{1})}^{u}\) as per Eq. (11), given in Table 8.

The level-wise task allocation and idle gap reduction, along with the ready time, communication cost, processing time, and maximum updated processing time on the virtual machines, are also presented in Fig. 4 for better clarity. The allocation of tasks and the respective idle gap reduction for the remaining partitions are presented in Tables 9, 10, 11, and 12, with the various intermediary computations following the same procedure as discussed for the first partition. In the second partition, only task T13 is allocated to its fit VM with a risk of failure; all other tasks across the partitions are assigned to their corresponding fit machines with zero failure probability.

Fig. 4
figure 4

Allocation Schedule by using the SPMWA model

Fig. 5
figure 5

Boxplots of NTF on varying number of workflows

On allocation of all workflows, the set of failed tasks is Tfailure = {T11, T13}, giving NTF = 2 as per Eq. (14). The failure probability of the set of workflows on the cloud system is \(F\fancyscript{p}\) = 0.3623, computed using Eq. (17). As observed from the above illustration, only two task failures occur, and only due to the unavailability of the demanded VM in the cloud system. As per Eq. (12), the makespan (Ms) of the set of workflows is the queuing time plus the sum of the maximum \({({T\mathcal{P}\fancyscript{t}}_{k}^{l})}^{u}\) over the depth levels presented in Tables 8, 10, 12, and Fig. 4: Ms = 50 + (24.60 + 15.30 + 21.70) = 50 + 61.60 = 111.60 units. The proposed model is therefore expected to exhibit similarly good performance behavior on larger sets of workflows and VMs.
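The makespan arithmetic of the illustration can be checked with a short sketch of Eqs. (12) and (13). Since β is not reported in the example, the value used below is only assumed to lie under Qu, as the illustration's Qt = 50 implies.

```python
def queuing_time(mu, gamma, beta, qu):
    """Eq. (13): average waiting time of the batch in the global queue."""
    base = 1.0 / (mu - gamma)
    return base if beta <= qu else base * (beta / qu)

def makespan(qt, max_tpt_per_level):
    """Eq. (12): queuing time plus the per-level maxima of the updated
    total processing times."""
    return qt + sum(max_tpt_per_level)
```

With μ = 0.05 and Γ = 0.03, Qt = 1/0.02 = 50, and the per-level maxima of Tables 8, 10, and 12 give Ms = 111.60, matching the illustration.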

4.4 Time complexity analysis

The computational time of the proposed model for parallel multiple workflows is expressed in terms of the number of workflows (\({N}_{\mathcal{W}f}\)), the depth level (L), the number of tasks \(({N}_{{wf}_{i}})\) and edge weights (eixy) in a single workflow, and the number of VMs (n). The time complexity of the SPMWA model, i.e., the computational time taken for complete execution as a function of the input parameters, is discussed in the following steps:

  • Partitioning: The \(\mathcal{W}f\) of depth level is divided into L partitions. As per the algorithmic template of SPMWA, the time complexity for this process can be computed as \(O\left(L\times {N}_{\mathcal{W}f}\times {N}_{{wf}_{i}}\right)\).

  • Sorting: The tasks of each partition are sorted by security demand in descending order with time complexity \(O\left({N}_{l}\mathit{log}{N}_{l}\right)\).

  • Allocation: The computational time of task allocation for each partition, including adjusting the idle gaps by checking for fit VMs created during the allocation phase, is of the same order. Checking the fit VM for each task with an if–else condition takes O(1) time, so allocating the tasks onto the set of VMs across all partitions takes \(O\left(L\times {N}_{l}\times n\right)\).

Hence, the total time complexity for SPMWA model is \(O\left(L\times {N}_{\mathcal{W}f}\times {N}_{{wf}_{i}}\right)+O\left({N}_{l}\mathit{log}{N}_{l}\right)+O\left(L\times {N}_{l}\times n\right)\cong O\left({N}_{\mathcal{W}f}\times {N}_{l}\mathit{log}{N}_{l}\right)\) where \({N}_{l}\) is the number of tasks in lth partition, \({N}_{l}\approx {N}_{{wf}_{i}}, {N}_{\mathcal{W}f}>n\) and \(L=\mathit{log}{N}_{l}\).

5 Performance evaluation

In this section, to evaluate the performance of SPMWA, simulation experiments have been conducted by implementing a workflow allocator prototype in MATLAB 8.5 on a single physical machine with an Intel (R) i7-8700 CPU@3.20 GHz and 64 GB RAM. We compare the performance of the SPMWA model with four standard workflow allocation models: HEFT, LBSIR, SPM1, and SPM2. HEFT [11] and LBSIR [39] are designed to generate time-effective schedules without considering the security requirements of the workflow tasks. As seen in the literature section, HEFT is a rank-based list scheduling heuristic that is still competitive for workflow allocation problems. HEFT works in two phases: in the priority phase, it determines the upward rank to maintain the precedence of the tasks, and in the resource selection phase, it selects the resource offering the earliest finish time.

Moreover, LBSIR is a level-based heuristic for the multiple workflow allocation problem that optimizes total completion time. After partitioning the workflow tasks in accordance with their level attribute, LBSIR works in two phases, viz. allocation and idle slot reduction. In the allocation phase, each task is mapped to the machine that offers the least execution time. In the second phase, best-fitted successor tasks are accommodated into the idle slots left between two tasks on the same machine during the allocation phase, yielding a better-quality schedule. In each partition, workflow tasks are selected in one of two ways, largest module selection (LMS) or smallest module selection (SMS), resulting in two variants of LBSIR, namely LBSIR with largest module selection (L-LMS) and LBSIR with smallest module selection (L-SMS), respectively. We therefore consider both variants for performance evaluation in our work.

As mentioned in Sect. 2.3 in detail, Bittencourt and Madeira [35] suggested four strategies to manage multiple workflow allocation. For performance evaluation, we have taken two of them, namely the sequential-based and merge-based multiple workflow strategies proposed in [35]. The security upward ranks are computed following the method presented in [24], and the security prioritized VM selection and allocation is done in accordance with SPHEFT [32]. In this way, two versions of the Security Prioritized Multiple workflow allocation (SPM) model, namely SPM1 (merge-based) and SPM2 (sequential-based), have been designed for comparison purposes.

5.1 Parameter setting

In this section, an experimental study is carried out for random DAGs and for real application DAGs such as Montage, CyberShake, and LIGO; the results are presented in Sect. 5.2 and Sect. 5.3, respectively. The parameters/variables associated with the workflow applications and the heterogeneous VMs in the IaaS cloud environment are randomly generated in the specified ranges using a uniform distribution. The security demand of a task and the trust level of a virtual machine lie in the normalized range [0, 1]. For reproducibility of the results, the common parameter setting for Sect. 5.2 and Sect. 5.3 is given in Table 13. The remaining parameters used in the various experiments are stated clearly in the corresponding cases.
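The sampling just described can be sketched as follows; the [0, 1] normalized ranges for security demand and trust level come from the text, while the function name and layout are illustrative assumptions.

```python
import random

def gen_environment(n_vms, n_tasks, seed=None):
    """Uniformly draw task security demands and VM trust levels in [0, 1],
    mirroring the normalized ranges stated in the paper (sketch only)."""
    rng = random.Random(seed)           # seeded for reproducible experiments
    sd = [rng.uniform(0.0, 1.0) for _ in range(n_tasks)]   # security demand per task
    tl = [rng.uniform(0.0, 1.0) for _ in range(n_vms)]     # trust level per VM
    return sd, tl
```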

Table 13 Common input parameters for the experiments
Table 14 Computed NTF of LBSIR, HEFT, SPM1, SPM2, and SPMWA for varying workflows
  (i) Parameter setting for random workflows

    Case 1: Experiments on varying number of workflows

    In this case, the number of workflows is varied from 16 to 512, and the results are shown in Figs. 5, 6, 7 and Tables 14, 15, 16 in Sect. 5.2. The fixed input parameters used in these experiments are as follows:

    Number of VMs (n) = 64, Number of depth levels (dmax) = 8, Number of tasks in each depth level \(({N}_{{wf}_{i}}^{l})\) = 16, Number of tasks (\({N}_{{wf}_{i}}\)) in a workflow = 128.

    Case 2: Experiments on varying number of VMs

    In this case, the number of VMs is varied from 16 to 512, and the results are shown in Figs. 8, 9, 10 and Tables 17, 18, 19 in Sect. 5.2. The fixed input parameters used in these experiments are as follows:

    Number of workflows (\({N}_{\mathcal{W}f}\)) = 64, Number of depth levels (dmax) = 16, Number of tasks in each depth level \(({N}_{{wf}_{i}}^{l})\) = 32, Number of tasks (\({N}_{{wf}_{i}}\)) in a workflow = 512.

  (ii) Parameter setting for real application workflows

    In this case, the number of real application workflows is varied from 16 to 512, and the results are shown in Figs. 12, 13, 14 in Sect. 5.3. Due to their fixed structure, the numbers of tasks in Montage, CyberShake, and LIGO in this study are 25, 30, and 40, and the depth levels are 9, 5, and 6, respectively. The fixed input parameters in these experiments are as follows: Number of VMs (n) = 32, Number of depth levels (dmax) = 9 (Montage)/5 (CyberShake)/6 (LIGO).

All the experiments are repeated 20 times to obtain representative values of the objective parameters and to account for the randomness of the experiments. Tables 14, 15, 16, 17, 18, 19 present the minimum (Min), maximum (Max), average (Avg), and standard deviation (Std) for all the cases. The superior values are shown in bold in all the tables.
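The aggregation of the 20 repeated runs into the reported statistics can be sketched as below; whether the paper uses the population or the sample standard deviation is not stated, so that choice is an assumption.

```python
import statistics

def summarize(runs):
    """Representative values over repeated runs, as reported in Tables 14-19."""
    return {
        "Min": min(runs),
        "Max": max(runs),
        "Avg": statistics.mean(runs),
        "Std": statistics.pstdev(runs),  # population std; stdev() is the sample variant
    }
```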

5.2 Experimental results for random workflows

In this section, experiments have been conducted for randomly generated workflows for the first two cases, i.e., a varying number of workflows and a varying number of VMs. For this, the Eijk values for the VMs and workflow tasks have been generated using the standard ETC simulation benchmark model [59] with high machine and high task heterogeneity in an inconsistent environment. Hence, the range of Eijk is 1–300,000. As per Eq. (2), \({CC}_{k}^{l}\) falls in the range 1–300,000, which is equal to the range of Eijk.
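The range-based ETC generation can be sketched as follows: the classical method draws a baseline value per task and multiplies it by a per-entry machine factor, so task bounds of [1, 3000] and machine bounds of [1, 100] reproduce the 1–300,000 range stated above. The exact bounds used by the authors are an assumption here.

```python
import random

def etc_matrix(n_tasks, n_machines, r_task=3000, r_mach=100, seed=None):
    """Range-based ETC generation for a high-task/high-machine heterogeneity,
    inconsistent environment: e[i][j] = U(1, r_task) * U(1, r_mach)."""
    rng = random.Random(seed)
    tau = [rng.uniform(1, r_task) for _ in range(n_tasks)]   # baseline task vector
    # multiplying by an independent machine factor per entry makes the
    # matrix inconsistent: no machine dominates for every task
    return [[tau[i] * rng.uniform(1, r_mach) for _ in range(n_machines)]
            for i in range(n_tasks)]
```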

Case 1: Varying Number of Workflows

For the first case, the experimental results for a varying number of workflows for L-LMS, L-SMS, HEFT, SPM1, SPM2, and SPMWA are shown in Figs. 5, 6, 7, using boxplots for better representation and graphical self-interpretation, and in Tables 14, 15, 16.

As expected, the number of task failures (NTF) increases with the number of workflows for a fixed number of VMs in all considered models, as shown in Fig. 5(a–f) and Table 14. The performance of the SPMWA model is observed to be superior among all the models for all batches of workflows. The reason is that SPMWA strictly satisfies the security demand of a task on allocation whenever a VM with sufficient trust is available, which reduces the number of task failures on the allocated VMs. As shown in Fig. 5(a–f) and Table 14, SPMWA clearly exhibits superior best, average, worst, and standard deviation values of NTF in comparison to the LBSIR variants, HEFT, SPM1, and SPM2 for a varying number of workflows from 16 to 512. In this case, the average performance gain of SPMWA over LBSIR, HEFT, SPM1, and SPM2 in terms of NTF is approximately 43%. As expected, the performance order on NTF is SPMWA, SPM1, SPM2, LBSIR, and HEFT.

As shown in Fig. 6(a–f) and Table 15, SPMWA outperforms all considered models in terms of failure probability. This is due to the allocation of high-security-demand tasks in each partition to more trustworthy VMs that fully satisfy their requirements; in this case, the failure probability of such tasks becomes zero. Moreover, if the security demand of a task cannot be satisfied by the trust level of any VM, the task is assigned to the VM that offers the least risk of failure. Hence, the Min and Max failure probabilities of SPMWA stay within 7–20% in all cases, while the other models give a minimum (Min) failure probability of 18% and a maximum (Max) failure probability of 99%, as shown in Table 15. In Fig. 6(a–f), SPMWA clearly exhibits its superior performance in terms of best, average, standard deviation, and worst values of failure probability in comparison to the LBSIR variants, HEFT, SPM1, and SPM2 for all batch sizes from 16 to 512. It is thus evident that SPMWA outperforms all algorithms on failure probability. In this case, the average performance gain in failure probability of SPMWA over SPM1, SPM2, LBSIR, and HEFT is approximately 73%. The performance order on failure probability is the same as before.
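The allocation rule described in this paragraph, i.e., prefer a VM whose trust level covers the task's security demand and otherwise fall back to the least-risk VM, can be sketched as below. The exponential risk model is a common choice in the security-aware scheduling literature and is an assumption here, as is every identifier.

```python
import math

def failure_prob(sd, tl, lam=1.0):
    """Assumed exponential risk model: zero risk when trust covers demand,
    otherwise risk grows with the shortfall (sd - tl)."""
    return 0.0 if tl >= sd else 1.0 - math.exp(-lam * (sd - tl))

def pick_vm(sd, trust_levels):
    """SPMWA-style rule sketched from the text: prefer any VM whose trust
    satisfies the demand; otherwise take the VM with the least failure risk."""
    ok = [m for m, tl in enumerate(trust_levels) if tl >= sd]
    if ok:
        return ok[0]   # ties could be broken by earliest finish time in a full model
    return min(range(len(trust_levels)),
               key=lambda m: failure_prob(sd, trust_levels[m]))
```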

Fig. 6

Boxplots of failure probability on varying number of workflows

Table 15 Computed failure probability of LBSIR, HEFT, SPM1, SPM2, and SPMWA for varying workflows

In Fig. 7(a–f) and Table 16, the makespan values (Min, Avg, Max) increase with the number of workflows, as expected. Both variants of LBSIR clearly show superior performance on makespan for all batch sizes. However, SPMWA shows better makespan results than all other models excluding the LBSIR variants. The reason for the superior makespan results of the LBSIR variants is that they target makespan minimization without considering the security demands of the tasks. The average improvement of the LBSIR variants over SPMWA is approximately 21% in terms of makespan. However, SPMWA performs better than HEFT and the SPM variants; the performance improvement over HEFT, SPM1, and SPM2 is approximately 89%, 90%, and 97%, respectively, with the performance order LBSIR, SPMWA, HEFT, SPM1, and SPM2.

Fig. 7

Boxplots of Makespan on varying number of workflows

Table 16 Computed makespan of LBSIR, HEFT, SPM1, SPM2, and SPMWA for varying workflows

Case 2: Varying number of VMs

For the second case, the experimental results for a varying number of VMs (n = 16 to n = 512) for L-LMS, L-SMS, HEFT, SPM1, SPM2, and SPMWA are shown in Figs. 8, 9, 10 and Tables 17, 18, 19. The best, worst, and average values of the number of task failures, failure probability, and makespan of the obtained solutions are reported in Tables 17, 18, 19, respectively.

Fig. 8

Boxplots of NTF on varying number of VMs

Fig. 9

Boxplots of failure probability on varying number of VMs

Fig. 10

Boxplots of Makespan on varying number of VMs

Table 17 Computed NTF of LBSIR, HEFT, SPM1, SPM2, and SPMWA for varying VMs
Table 18 Computed failure probability of LBSIR, HEFT, SPM1, SPM2, and SPMWA for varying VMs
Table 19 Computed makespan of LBSIR, HEFT, SPM1, SPM2, and SPMWA for varying VMs

As presented in Fig. 8(a–f) and Table 17, the number of task failures decreases as the number of VMs increases while the batch of workflows is kept fixed, for all considered models. This is because of the better exploitation of the parallelism available in the workflows. Across the various runs, the Min and Max NTF for SPMWA are 9975 and 13,231, while for the other models these values are 12,129 and 24,608, respectively. As seen in Fig. 8(e–f), when the number of VMs increases from 16 to 512, the average number of task failures for SPMWA decreases; with more VMs available, the chances of finding more trustworthy machines increase. For this metric, SPMWA outperforms all other models in the study. In this case, the average performance gain of SPMWA on NTF over SPM1, SPM2, LBSIR, and HEFT is approximately in the range of 37–40%. The performance order is SPMWA, SPM1, SPM2, LBSIR, and HEFT.

As can be seen in Fig. 9(a–f) and Table 18, the failure probability decreases slightly as the number of VMs increases. The average failure probability of SPMWA is below 18% in all cases, while the other algorithms attain a minimum failure probability of 44%. As shown in Fig. 9(d–f), when the SPMWA model is tested on a large number of VMs, its average failure probability becomes lower, e.g., \(F\fancyscript{p}\) drops from 15 to 12% as the number of VMs grows from 256 to 512, because with more machines the chances of assigning a task to a more trustworthy machine increase. It is thus evident that SPMWA outperforms all algorithms on failure probability. In this case, the average performance gain in failure probability of SPMWA over SPM1, SPM2, LBSIR, and HEFT is approximately 77%. The performance order is SPMWA, SPM1, SPM2, LBSIR, and HEFT.

When the number of VMs is increased, the best, worst, and average values of the makespan follow the same trend as the number of task failures, as shown in Fig. 10(a–f) and Table 19. In this case, the LBSIR variants clearly show superior performance on makespan from 16 to 256 VMs; the reason for their superior makespan results is the same as mentioned earlier. SPMWA shows the best makespan results among all the strategies except the LBSIR variants. However, for a large number of VMs, the makespan of LBSIR gets closer to that of SPMWA, as can be seen in Fig. 10(d–e). SPMWA also tries to fill the idle gaps produced by the size differences between tasks at the same depth level by accommodating the best-fitted task, which in turn ensures a better makespan. Based on the outcomes of Table 19 and Fig. 10a, HEFT and SPM1 perform better than SPMWA on makespan for a very small number of machines (e.g., n = 16). However, as the number of VMs increases, the proposed model clearly shows better makespan performance than HEFT and SPM1. The performance gain on makespan of the proposed model over HEFT, SPM1, and SPM2 is approximately 48%, 49%, and 98%, respectively, while it lags behind the LBSIR variants by 27%.

5.3 Experimental results for real application workflows

In this section, the experiments have been conducted for real application workflows (Montage_25, Cybershake_30, and LIGO_40) on a varying number of workflows. The parameters taken in this study are as mentioned earlier in Sect. 5.1.

The Montage_25, Cybershake_30, and LIGO_40 workflows, having 25, 30, and 40 tasks respectively, are taken for the experiments from the Pegasus website [60], as shown in Fig. 11a, b, and c. The Montage workflow comes from astronomy: it creates custom sky mosaics from a set of input images in the Flexible Image Transport System (FITS) format, and the majority of its tasks are simple but I/O-intensive, with modest processing requirements. CyberShake is employed to characterize earthquake hazards using synthetic seismograms. The Laser Interferometer Gravitational-Wave Observatory (LIGO) detects gravitational waves; its workflow implements a scientific methodology to identify gravitational waves produced by various cosmic events [50, 51]. These real workflows are frequently used to examine the quality of the schedules generated by the various models proposed in the literature. Here, the SPMWA, L-LMS, L-SMS, HEFT, SPM1, and SPM2 models are evaluated to see how failure probability affects schedule quality.

Fig. 11

Real Workflow Applications a Montage b CyberShake c LIGO

Figures 12, 13, 14 present the number of task failures, failure probability, and makespan for Montage, CyberShake, and LIGO when the number of real workflows varies from 16 to 512. SPMWA outperforms all other models for all batch sizes of real workflows in terms of NTF, as presented in Fig. 12. The number of task failures increases with the number of workflows (Montage_25, Cybershake_30, and LIGO_40) for all considered models, keeping all other input parameters fixed. Again, the performance order is SPMWA, SPM1, SPM2, LBSIR, and HEFT. The performance gain of SPMWA over SPM1, SPM2, LBSIR, and HEFT (averaged over all three sets of workflows) is in the range of 30–36% approximately.

Fig. 12

NTF on varying Montage, CyberShake, and LIGO workflows

Fig. 13

Failure probability on varying Montage, CyberShake, and LIGO workflows

Fig. 14

Makespan on varying Montage, CyberShake, and LIGO workflows

As shown in Fig. 13, SPMWA outperforms all other models for every batch size, from small to large real workflows, in terms of failure probability. The failure probability of SPMWA keeps slightly increasing with the number of real workflows for Montage_25, Cybershake_30, and LIGO_40, keeping all other input parameters fixed. The performance order is SPMWA, SPM1, SPM2, LBSIR, and HEFT, as expected. The performance gain of SPMWA over SPM1, SPM2, LBSIR, and HEFT, averaged over all three workflows, is in the range of 47–55% approximately.

Makespan shows an increasing trend as the batch size increases, as shown in Fig. 14. Both variants of LBSIR are clearly superior in terms of makespan on all scientific workflows, from smaller to larger batch sizes. However, for a large number of each real workflow, the makespan of SPMWA gets closer to that of LBSIR, as can be seen in Fig. 14. The overall performance gain of LBSIR over SPMWA is approximately 19%, 20%, and 38% on the Montage, CyberShake, and LIGO workflows, respectively. However, SPMWA performs better than HEFT, SPM1, and SPM2 in terms of makespan, with performance gains of 18%, 20%, and 33% on the Montage, CyberShake, and LIGO scientific workflows for workflow sets of 16 to 512. The performance order on makespan is LBSIR, SPMWA, HEFT, SPM1, and SPM2. The worst performance is observed for SPM2 because workflows are assigned one after another sequentially, exploiting only the task-level parallelism within each workflow.

In summary of the experimental results of Sects. 5.2 and 5.3, SPMWA performs best among all considered models in terms of failure probability, and hence the number of task failures, achieving the best objective values in both cases. The performance order is SPMWA, SPM1, SPM2, LBSIR, and HEFT. This is because SPMWA is a security-adaptive workflow allocation method: it ensures the allocation of high-security-demand tasks in each partition to more trustworthy VMs that fully satisfy their requirements, and successful task execution with the least failure probability is an inevitable need in the IaaS cloud. On makespan, the LBSIR variants are better; however, for a large number of VMs, SPMWA becomes closer to them. For real application workflows such as Montage, CyberShake, and LIGO, the proposed SPMWA model also exhibits the best performance on failure probability and the number of task failures for all considered real workflow sets, with the same performance trend as for random workflows. SPMWA is also very flexible from a programming point of view, which is an important concern for designers of security-prioritized workflow allocation applications. Thus, we can argue that SPMWA may greatly contribute to secure and robust system transactions while satisfying the desired constraints.

6 Conclusion and future work

Nowadays, cloud computing is adopted in many emerging application areas such as e-commerce services, medical services, transaction processing systems, scientific workflow processing, and many more. Workflow allocation satisfying security requirements in the cloud system is one of the core issues. In this paper, the Security Prioritized Multiple Workflow Allocation (SPMWA) model is proposed, which integrates security constraints into the allocation to minimize the failure probability of workflow execution in the cloud computing environment. SPMWA gives high-security-demand tasks a greater chance of being allocated to more trustworthy VMs. The performance of SPMWA has been compared with standard algorithms, namely HEFT, LBSIR, SPM1, and SPM2. Our experimental results reveal that SPMWA outperforms LBSIR, HEFT, SPM1, and SPM2 in terms of the number of task failures and failure probability. For makespan, SPMWA is better than HEFT, SPM1, and SPM2, while lagging behind LBSIR.

There are still some limitations of the proposed SPMWA model for security-sensitive workflow applications which need to be addressed as potential future work as follows:

  • To extend the model into a more robust and efficient one considering all three security services, namely authentication, integrity, and confidentiality.

  • To include further cloud system constraints in the model, such as deadline, budget, etc.

  • To extend the model to a multi-objective formulation considering QoS parameters such as time, energy, and economic cost.