An Science Gateway with Cost Adaptive Resource Management Schemes

Kim, Woojoong; Kim, Seung-Hwan; Youn, Chan-Hyun

doi:10.1007/978-3-319-38904-2_14

Woojoong Kim¹⁸,
Seung-Hwan Kim¹⁸ &
Chan-Hyun Youn¹⁸

Part of the book series: Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering ((LNICST,volume 167))

Included in the following conference series:

International Conference on Cloud Computing

1514 Accesses

Abstract

In order to process heavy data and computation applications such as scientific application in cloud computing environment, it is important to do an efficient resource schedule that decrease the resource usage cost while guaranteeing users’ Service Level Agreement. To resolve this issue, we propose and implement Science Gateway, which execute the scientific application efficiently on heterogeneous cloud service providers instead of users. Especially, we propose a cost-adaptive resource management schemes (i.e. VM pool management scheme that decreases the resource management cost significantly based on the long-term payment plans of cloud resource providers and VM placement management scheme that guarantee the performance of communication between VMs). Finally, we demonstrate that our proposed system improves the performance of existing cloud systems through some experimental and simulation results.

Access provided by Autonomous University of Puebla. Download conference paper PDF

Cost Adaptive VM Management for Scientific Workflow Application in Mobile Cloud

Article 17 April 2015

Scheduling multiple scientific workflows using containers on IaaS cloud

Article 26 August 2020

VLAB-C: Collaborative Virtual Laboratory in Cloud Computing and Its Applications

Keywords

1 Introduction

Heavy data and computation applications usually require the huge amount of computing and storage resource. With the emergence of cloud computing, users can utilize the resource as a cloud service based on pay-per-usage base to execute their own heavy applications [1]. However, it is still difficult for users to decide how much cloud service should be leased in order to minimize the amount of cloud resource while guaranteeing the certain performance of the application. In addition, there are a lot of cloud services from various cloud providers and they have various policies on cloud service and service charge. In this case, users can make non-optimal decision with the limited information on cloud services and waste the execution time and cost because of the inefficient decision. Thus, third party entity (i.e. cloud broker) presented between user and Cloud Service Provider (CSP) is defined to make an optimal decision with rich information on cloud services instead of users.

The key issue of the cloud broker is to decrease the resource usage cost from cloud environment in order to increase the profit while guaranteeing the certain performance of cloud service required by user. In this paper, to resolve this problem, we propose and implement the cloud broker called Science Gateway with the cost adaptive resource schemes including VM pool management scheme and VM placement management scheme.

The VM pool management scheme utilizes the reserved VM payment policy(RVM) which reserve some virtual machines(VM) on long-term in a discount price. Organizing a RVM pool by estimating the amount of resource demand on the future with this scheme, the cloud broker decreases the resource usage cost by providing the available RVM in the RVM pool instead of On-demand VM(OVM).

The VM placement management scheme decides a certain physical node within the cloud server structure to lease new VM instance in order to guarantee the network capacity between leased VMs and decrease the data transmission delay between VMs.

2 A Model Description of the Science Gateway

The object of science gateway is to minimize a resource usage cost while satisfying user’s SLA for cloud resource providers instead of users. In our model, the science gateway concentrates on scientific applications and users only request for their application execution with their SLA to the science gateway.

The scientific application can be represented as a workflow $ A\left( {T,E, D} \right) $ where A is the set of tasks, E is the set of edges and D is deadline. We assume that the total execution time of each tasks in A are known. By the science gateway, each task is allocated to their proper VM instance and processed in order of their starting time decided by SLA constraints such as deadline D.

Different types of VM instances have the different computing performance. We assume that CSP provides only three types of cloud resource service: small, medium, large. Each type of VM is charged in proportion to their capacity with Billing Time Unit (BTU). BTU is the base time unit to charge for resource usage time and partial-BTU resource usage time is rounded up to one BTU. A VM type is represented as $ VT_{i} = \left\{ {VT_{{c_{i} }} , VT_{{m_{i} }} , VT_{{s_{i} }} } \right\}, i \in \left\{ {1,2, \ldots ,K} \right\} $: number of VCPU (#) $ VT_{{c_{i} }} $ , memory size (GBs) $ VT_{{m_{i} }} $ , storage space (GBs) $ VT_{{s_{i} }} $. CSP provides OVM and RVM payment plans [2]. By RVM plan, VM instance can be leased for long BTU (e.g., monthly or yearly) with the discount price.

The science gateway is described in Fig. 1. It receives the scientific application request from users through GUI based workflow designer. Workflow management schedules the resource plan for each task within the requested application and processes all tasks on the VM instances provided by VM Placement Management Scheme and VM Pool Management Scheme.

3 Cost Adaptive Cloud Resource Management Schemes

VM Pool Management Scheme is shown in Algorithm 1. This scheme decides the appropriate amount of RVMs to be leased in the heuristic way in order to decrease the resource usage cost. This scheme works in the period of time interval T. The amount of RVMs is dependent on the resource demand. The historical data on all the executed tasks and their allocated VM types during the previous time interval $ T^{{\prime }} $ is used for this scheme. We assume that the request trend in the current time interval T will be similar with the one in $ T^{{\prime }} $. As a result, we can decide the appropriate amount of i-type RVM, N _i for current time interval T. First, we organize clusters for each task within A based on their assigned VM instance type VT _i. Eventually, all the tasks within A are classified into some clusters $ Cl_{{VT_{i} }} $. On each cluster $ Cl_{{VT_{i} }} $, we organize groups g _m. It is a batch of non-overlapped tasks. After the tasks of $ Cl_{{VT_{i} }} $ are sorted in order of their starting time, each task of $ Cl_{{VT_{i} }} $ is picked into g _m in sequence if its start time st is later than the finish time ft of last task in g _m. This procedure is repeated until there is no available $ {\text{task}} $ in $ Cl_{{VT_{i} }} $ anymore. Finally, with gct(g _m) total execution time of g _m and VT(g _m) allocated VM instance type of g _m, we derive a RVM lease duration $ RVM_{{VT\left( {g_{m} } \right),gct\left( {g_{m} } \right)}} $ by finding the size of BTU closest to the gct(g _m).

We determine whether to lease $ RVM_{{VT\left( {g_{m} } \right),gct\left( {g_{m} } \right)}} $ from CSP or not with the following condition.

$$ \frac{{C\left( {RVM_{{VT\left( {g_{m} } \right),gct\left( {g_{m} } \right)}} } \right)}}{{\mathop \sum \nolimits_{{\forall t \in g_{m} }}^{{}} et\left( t \right) \cdot C\left( {OVM_{VT\left( t \right)} } \right)}} < 1 $$

(1)

The denominator of Eq. (1) is the total resource usage cost on OVMs for the tasks having the execution time et in g _m. The numerator of Eq. (1) is the total resource usage cost of RVM for g _m. If Eq. (1) is satisfied, it means that RVM is more efficient on cost than OVMs for g _m. As the value of Eq. (1) is decreased, leasing RVM is more efficient.

VM Placement Management Scheme guarantees the network capacity for the continuous created VM considering the network interference occurred between co-location VMs, so that reduce the occurred data transmission delay between VMs for executing the scientific application efficiently. To do this, the science gateway manages the last used resource table LastUsedResourceTable for each user in order to save the physical node used lastly to create the VM. When a user requests a VM in the first time, there is no available last used resource information. In this case, the VM is created in a physical node maxCapacityPN having maximum free resource capacity within cloud server structure in order to increase the probability that the physical node keep a free resource capacity for the next VM creation of the user. After creating the VM, the information on the physical node of the created VM with the user id uid are saved to the last used resource table. When the user requests again, the last used resource information is available in the last used resource table. Therefore, a VM is created in the same physical node with the last used resource if possible in order to guarantee the maximum network bandwidth. In the case that the resource capacity of this physical node is not enough for the flavor type f requested by the user, available physical nodes availablePNs are sorted in the closest order from the physical node of the last used resource. The new physical node close to the physical node of the last used resource while having enough free resource capacity for flavor type f and having the smallest network traffic is chosen to reduce the network interference with other VMs on the physical node. The network traffic of each physical node is monitored in real-time. After finding the new physical node and creating the VM, the resource information on the new physical node is updated to the last used resource table for the corresponding user. Algorithm 2 shows the procedure of the proposed network aware VM placement algorithm.

4 Test Environments and Performance Evaluation

For the evaluation, we built a science gateway, which is a solution platform to execute workflow typed scientific applications for solving complicated scientific problems by organizing the distributed heterogeneous cloud resources in this paper. The request from the user is dealt through the workflow designer as scientific application service provider. The workflow engine demands cloud resources to resource provisioning manager and process the scientific application. We applied a phased workflow scheduling scheme with division policy [3] as scheduler. Also, we organized Next Generation Sequencing (NGS) with Burrows-Wheeler Aligner (BWA) as a scientific application [4].

To evaluate the performance of the proposed schemes, we built the test environment with the science gateway as shown in Fig. 2 with the specific configuration on testbed platform shown in Table 1. We organize the cloud platforms with 5 computing nodes on the OpenStack and 4 computing nodes on the CloudStack respectively [5, 6] to compose the heterogeneous resource.

Table 1. Specific configurations on testbed environment

Full size table

We applied relative cost in order to evaluate the proposed scheme on cost performance in cloud environment. The definition of relative cost on the i ^th cloud service is described as Eq. (2) where c ⁱ_u is unit time cost and $ {\text{t}}_{\text{u}}^{\text{i}} $ is resource usage time.

For the convenience of this experiment, we assign unit time as a second for OVM and a week for RVM. The unit time cost on resource contract c ⁱ_ur is calculated as Eq. (3) where the weight vector $ \vec{w} = \left[ {w_{c} ,w_{m} } \right] $ means the effectiveness of each element such as CPU core and memory respectively. The weight vector for RVM is applied to half of OVM’s with the reference of CSP GoGrid (on annual case) [7].

$$ {\text{RC}}^{\text{i}} = {\text{c}}_{\text{u}}^{\text{i}} \cdot t_{u}^{i} $$

(2)

$$ c_{ur}^{i} = w_{c} \cdot r_{c}^{i} + w_{m} \cdot r_{m}^{i} $$

(3)

In the experiment of the VM pool management scheme, we measured relative cost on the sum of OVM leasing cost and RVM leasing cost between the case without VM pool management, the case of the VM pool management scheme with static number of RVM (1, 2 respectively) and the case of the proposed scheme on various average interarrival time of workflow request in exponential distribution. This experiment is executed in scale-downed 4 weeks.

As shown in Fig. 3, the measured VM leasing costs for three cases are expressed in log scale. Our proposed scheme shows good adaptivity and cost efficiency on the resource management compared to the static pool management for all environment case. Therefore, we can decrease the VM leasing cost by the proposed scheme.

In the experiment of the VM placement management scheme, we evaluate the performance of proposed scheme compared to the existing VM placement scheme proposed by Alicherry and Lakshman [8].

The proposed scheme has the smaller total data transmission delay over the entire request interarrival time compared to the existing scheme as shown in Fig. 4. The existing scheme cannot guarantees the network capacity for the continuous created VM on BWA service compared to the proposed scheme because it is only focused on the placement for a VM creation request and not considers the interference occurred by the traffic congestion.

5 Conclusion

In this paper, to decrease the resource usage cost while guaranteeing user’s SLA on executing the scientific application in cloud environment, we propose and implement the science gateway with the cost adaptive resource management schemes. Experiment shows the proposed schemes decrease the expenditure of leasing VM instances and also guarantee the good performance. Finally, we demonstrate that our proposed schemes shows good adaptivity and cost efficiency compared to the existing schemes.

References

Zhang, Q., Cheng, L., Boutaba, R.: Cloud computing: state-of-the art and research challenges. J. Internet Serv. Appl. 1(1) (2010)
Google Scholar
Chaisiri, S., Lee, B.S., Niyato, D.: Optimization of resource provisioning cost in cloud computing. IEEE Trans. Serv. Comput. 5(2), 164–177 (2012)
Article Google Scholar
Kim, S.-H., Joo, K.-N., Ha, Y.-G., Choi, G.-B., Youn, C.-H.: A phased workflow scheduling scheme with task division policy in cloud broker. In: Leung, V.C.M., Lai, R., Chen, M., Wan, J. (eds.) CloudComp 2014. LNICST, vol. 142, pp. 76–86. Springer, Heidelberg (2015)
Google Scholar
BWA. http://bio-bwa.sourceforge.net
Openstack. http://www.openstack.org
Cloudstack. http://cloudstack.apache.org
GoGrid. http://www.gogrid.com/
Mansoor, A., Lakshman. T.V.: Network aware resource allocation in distributed clouds. In: 2012 Proceedings IEEE, INFOCOM. IEEE (2012)
Google Scholar

Download references

Acknowledgments

This work was supported by ‘The Cross-Ministry Giga KOREA Project’ grant from the Ministry of Science, ICT and Future Planning, Korea and the MSIP (Ministry of Science, ICT & Future Planning), Korea in the ICT R&D Program 2014.

Author information

Authors and Affiliations

Department of Electrical Engineering, KAIST, Daejeon, Korea
Woojoong Kim, Seung-Hwan Kim & Chan-Hyun Youn

Authors

Woojoong Kim
View author publications
You can also search for this author in PubMed Google Scholar
Seung-Hwan Kim
View author publications
You can also search for this author in PubMed Google Scholar
Chan-Hyun Youn
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Woojoong Kim .

Editor information

Editors and Affiliations

Zhongnan University of Economics and Law, Wuhan, China
Yin Zhang
Ajou University, Suwon, Korea (Republic of)
Limei Peng
Department of Electrical Engineering, KAIST, Daejeon, Korea (Republic of)
Chan-Hyun Youn

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kim, W., Kim, SH., Youn, CH. (2016). An Science Gateway with Cost Adaptive Resource Management Schemes. In: Zhang, Y., Peng, L., Youn, CH. (eds) Cloud Computing. CloudComp 2015. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 167. Springer, Cham. https://doi.org/10.1007/978-3-319-38904-2_14

Download citation

DOI: https://doi.org/10.1007/978-3-319-38904-2_14
Published: 06 May 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-38903-5
Online ISBN: 978-3-319-38904-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

An Science Gateway with Cost Adaptive Resource Management Schemes

Abstract

Similar content being viewed by others

Cost Adaptive VM Management for Scientific Workflow Application in Mobile Cloud

Scheduling multiple scientific workflows using containers on IaaS cloud

VLAB-C: Collaborative Virtual Laboratory in Cloud Computing and Its Applications

Keywords

1 Introduction

2 A Model Description of the Science Gateway

3 Cost Adaptive Cloud Resource Management Schemes

4 Test Environments and Performance Evaluation

5 Conclusion

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

An Science Gateway with Cost Adaptive Resource Management Schemes

Abstract

Similar content being viewed by others

Cost Adaptive VM Management for Scientific Workflow Application in Mobile Cloud

Scheduling multiple scientific workflows using containers on IaaS cloud

VLAB-C: Collaborative Virtual Laboratory in Cloud Computing and Its Applications

Keywords

1 Introduction

2 A Model Description of the Science Gateway

3 Cost Adaptive Cloud Resource Management Schemes

4 Test Environments and Performance Evaluation

5 Conclusion

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation