Abstract
We present GRUBER, a Grid Resource Usage service level agreement (uSLA) based BrokERing infrastructure, aimed at addressing the challenging issues that can arise within virtual organizations (VOs) that integrate participants and resources spanning multiple physical administrative domains. In such environments, participants delegate to one or more VOs the right to use certain resources subject to local policy and service level agreements; each VO then uses those resources subject to VO policy. GRUBER supports the explicit representation, enforcement, and management of service level agreements (SLAs) concerning resource usage (uSLAs) that can serve as an objective organizing principle for controlled resource sharing in distributed systems. uSLAs express how resources must be used over various time intervals and represent a novelty for the Grid domain. This paper provides a detailed overview of the GRUBER infrastructure, the evolution of its design to improve scalability, specifically the distribution of the resource brokering service, and the extended support for dynamic environments. We also present various results achieved over time that demonstrate both the utility and performance of GRUBER under various application workloads and scenarios.
Article PDF
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
Abbreviations
- uSLA:
-
usage Service Level Agreement
- VO:
-
Virtual Organization
- RM:
-
Resource Manager
References
Dumitrescu, C., Wilde, M., Foster, I.: A model for usage policy-based resource allocation in Grids. In: Proceedings of the 6th IEEE International Workshop on Policies for Distributed Systems and Networks (POLICY 2005), Stockholm, Sweden, pp. 191–200 (2005) (ISSN: 0-7695-2265-3)
Foster, I., et al.: The Grid 2003 production Grid: principles and practice. In: Proceedings of the 13th International Symposium on High Performance Distributed Computing (HPDC), pp. 236–245 (2004) (ISSN: 1082–8907)
Foster, I.: Grid computing. In: Proceedings of the Advanced Computing and Analysis Techniques in Physics Research (ACAT). AIP Conference Proceedings, Chicago, IL, vol. 583, pp. 51–56 (2000)
Foster, I., Kesselman, C., Tuecke, S.: The anatomy of the Grid: enabling scalable virtual organizations. Int. J. Supercomput. Appl. 2150, 200–222 (2001) (ISBN: 3-540-42495-4)
Dan, A., Davis, D., Kearney, R., Keller, A., King, R., Kuebler, D., Ludwig, H., Polan, M., Spreitzer, M., Youssef, A.: Web services on demand: WSLA-driven automated management. IBM Syst. J. 43, 136 (2004)
Czajkowski, K., Foster, I., Kesselman, C., Sander, V., Tuecke, S.: SNAP: a protocol for negotiating service level agreements and coordinating resource management in distributed systems. In: Proceedings of the 8th Workshop on Job Scheduling Strategies for Parallel Processing, Edinburgh, Scotland (2002)
Czajkowski, K., Dan, A., Rofrano, J., Tuecke, S., Xu, M.: WS-agreement: agreement-based Grid service management (OGSI-Agreement), Version 0. [Online: http://forge.gridforum.org/projects/graap-wg/document/Draft_OGSI-agreement_Specification/en/1/Draft_OGSI-Agreement_ Specification.doc]
Gimpel, H., Ludwig, H., Dan, A., Kearney, R.: PANDA: specifying policies for automated negotiations of service contracts. In: Proceedings of the 1st International Conference on Service Oriented Computing, pp. 287–302. Trento, Italy (2003)
Verma, D.C.: Policy Based Networking, Architecture and Algorithm. New Riders, Indianapolis, IN (2000 November)
Dumitrescu, C.: INTCTD: a peer-to-peer approach for intrusion detection. In: Proceedings of the 6th IEEE International Symposium on Cluster Computing and the Grid (CCGrid’06), Singapore (2006) (ISBN: 0-7695-2585-7)
Verma, D.C.: Simplifying Network Administration using Policy based Management. IBM, UK (2004)
Lamanna, D., Skene, J., Emmerich, W.: SLang: a language for defining service level agreements. In: Proceedings of the 9th IEEE Workshop on Future Trends in Distributed Computing Systems, Puerto Rico, pp. 100–106. IEEE-CS Press (2003 May)
LHC Computing Project (2004)
Ranganathan, K., Foster, I.: Decoupling computation and data scheduling in distributed data-intensive applications. In: Proceedings of the 11th IEEE International Symposium on High Performance Distributed Computing, pp. 352. Edinburgh, Scotland (2002) (ISSN: 1082-8907)
Mambelli M.: Capone and VDS: The University of Chicago and Argonne National Laboratory: Chicago [Online: http://griddev.uchicago.edu/swhome/atgce/] (2005)
Annis, J., Kent, S., Szalay, A.: The SDSS-GriPhyN Challenge Problems: Cluster Finding, Correlation Functions and Weak Lensing. FermiLab, Batavia, IL (2001)
Maltsev, N., Sulakhe, D., D’Souza, M.J., Glass, E., Rodriguez, A., Syed, M., Zhang, Y.: GNARE: Genome Analysis Research Environment. 2005, Argonne National Laboratory/Chicago [Online: http://compbio.mcs.anl.gov/gnare/gnare_home.cgi] (2006)
Foster, I., Voeckler, J., Wilde, M., Zhao, Y.: Chimera: a virtual data system for representing, querying, and automating data derivation. In: Proceedings of the Global and Peer-to-Peer Computing on Large Scale Distributed Systems Workshop. IEEE Computer Society, Washington, DC (1995 May)
Iosup, A., Dumitrescu, C., Epema, D., Liu, H., Wolters, L.: An analysis of four long-term Grid traces. Technical University of Delft: Delft, Netherlands [Online: http://pds.twi.tudelft.nl/reports/2006/PDS-2006-003/PDS-2006-003.pdf] (2006)
Open Science Grid (OSG) [Online: http://www.opensciencegrid.org/] (2004)
Dan, A., Dumitrescu, C., Ripeanu, M.: Connecting client objectives with resource capabilities: an essential component for Grid service management infrastructures. In: Proceedings of the 2nd ACM International Conference on Service Oriented Computing (ICSOC’04), pp. 57–64. New York, NY (2004) (ISSN 1-58113-871-7)
Legrand, I., Newman, H., Galvez, P., Voicu, E., Cirstoiu, C.: MonALISA: a distributed monitoring service architecture in computing. In: Proceedings of the High Energy Physics (HEP), La Jolla, CA (2003)
DOE Science Grid PKI Certificate Policy and Certification Practice Statement (2002)
Dumitrescu, C.: ARESRAN: A WSRF-based resource reservation service for Grid service. [Online: http://peopellcs.uchicago.edu/~cldumitr/ARESRAN] (2005)
Pearlman, L., Welch, V., Foster, I., Kesselman, C., Tuecke, S.: A community authorization service for group collaboration. In: Proceedings of the IEEE 3rd International Workshop on Policies for Distributed Systems and Networks, pp 55–59. Monterey, CA (2002) (ISBN: 0-7695-1611-4)
The Globus Project Team: CAS – community authorization service. [Online: http://www.nsf-middleware.org/Lists/Products/DispForm.aspx?ID=47] (2006)
Zhao, T., Karamcheti, V.: Expressing and enforcing distributed resource agreements. In: Proceedings of High Performance Networking and Computing Conference (SC’2000), pp. 62. Dallas, Texas (2000) (ISSN 0-7803-9802-5)
Raman, R.: Matchmaking Frameworks for Distributed Resource Management. PhD Thesis, University of Wisconsin (2000)
Foster, I., Roy, A., Sander, V., Winkler, L.: End-to-End Quality of Service for High-end Applications. Computer Communications 27(14). Kluwer, Norwell, MA (2004) (ISBN: 1375–1388)
Thain, D., Tannenbaum, T., Livny, M.: Condor and the Grid. In: Berman, F., Hey, A.J.G., Fox , G. (eds.) Grid Computing: Making The Global Infrastructure a Reality. Wiley, New York, NY (2003) (ISBN: 0-470-85319-0)
Ludwig, H., Dan, A., Kearney, B.: Cremona: an architecture and library for creation and monitoring WS-Agreements. In: Proceedings of the ACM International Conference on Service Oriented Computing (ICSOC’04), New York, NY (2004)
In, J., Avery, P., Cavanaugh, R., Ranka, S.: Policy based scheduling for simple quality of service in Grid computing. In: Proceedings of the International Parallel & Distributed Processing Symposium (IPDPS), p. 23. Santa Fe, New Mexico (2004) (ISBN: 0-7695-2132-0)
Buyya, R.: GridBus: A Economy-based Grid Resource Broker. The University of Melbourne, Melbourne, Australia (2004)
Foster, I.: The Grid: a new infrastructure for 21st century science. Phys. Today 55(2), 42–47 (2002)
Mueller, E.T., Moore, J.D., Popek, G.J.: A nested transaction mechanism for LOCUS. In: Proceedings of the 9th ACM Symposium on Operating System Principles (SOSP), Bretton Woods, New Hampshire (1983)
Stonebraker, M., et al.: Mariposa: a wide-area distributed database system. VLDB J. 5(1), 48–63 (1996)
Foster, I., Kesselman, C., Nick, J., Tuecke, S.: Grid Services for Distributed Systems Integration. IEEE Computer 35(6), 37–46 (2002)
Humphrey, M., Wasson, G., Jackson, K., Boverhof, J., Rodriguez, M., Bester, J., Gawor, J., Lang, S., Foster, I., Meder, S., Pickles, S., McKeown, M.: State and events for web services: a comparison of five WS-resource framework and WS-notification implementations. In: Proceedings of the 4th IEEE International Symposium on High Performance Distributed Computing (HPDC-14), Research Triangle Park, NC, 24–27 July 2005
Czajkowski, K., et al.: Grid information services for distributed resource sharing. In: Proceedings of the 10th IEEE International Symposium on High Performance Distributed Computing. San Francisco, IEEE Computer Society, Los Alamitos, CA (2001)
Thompson, M.R., Essiari, A., Mudumbai, S.: Certificate-based authorization policy in a PKI environment. ACM Trans. Inf. Syst. Secur. 6(4), 566–588 (2003)
Lupu, E.: A role-based framework for distributed systems management, in Department of Computing. PhD thesis, University of London, London (1998)
Rowstron, A., Druschel, P.: Pastry: scalable, distributed object location and routing for large-scale peer-to-peer systems. Lect. Notes Comput. Sci. 2218, 329–350 (2001)
Dumitrescu, C., Foster, I.: GRUBER: a Grid resource SLA broker. In: Proceedings of the 11th International Euro-Par Conference, pp. 465. Portugal (2005) (ISBN: 3-540-28700-0)
The University of Wisconsin: UWMadisonCMS Open Science Grid Site Policy Page. University of Wisconsin, Madison, WI (2006)
FNAL: FNAL: GPFARM Site Policy for OSG. FNAL (2006)
USCMS: USCMS: OSG Policy Pages. USCMS (2006)
Keahey, K., Araki, T., Lane, P.: Agreement-based interactions for experimental science. In: Proceedings of the 10th International Euro-Par Conference, p. 399. Italy (2004) (ISBN: 3-540-22924-8)
Kay, J., Lauder, P.: A Fair Share Scheduler. University of Sydney, AT&T Bell Labs (1998)
Epema, D.H.J., Livny, M., van Dantzig, R., Evers, X., Pruyne, J.: A worldwide flock of condors: load sharing among workstation clusters. Future Gener. Comput. Systs. 12, 53–65 (1996) (ISSN: 0167-739X)
Wolf, L.C., Steinmetz, R.: Concepts for reservation in Advance. Multimed. Tools Appl. 4(3), 255–278 (1997)(ISSN 1380–7501) (Kluwer)
Maui Team: Maui Scheduler. Center for HPC Cluster Resource Management and Scheduling. [Online: http://www.clusterresources.com/pages/products/maui-cluster-scheduler.php]
Keller, A., Ludwig, H.: The WSLA framework: specifying and monitoring service level agreements for web services. J. Netw. Syst. Manag. 11(1), 57–81 (2003)(Plenum)
Litzkow, M.J., Livny, M., Mutka, M.W.: Condor – a hunter of idle workstations. In: Proceedings of the 8th International Conference on Distributed Computing Systems, pp. 104–111. San Jose, CA (1998) (ISBN: 0-8186-0865-X)
Altair Grid Technologies. OpenPBS (Portable Batch System) (2004) [Online: http://www.openpbs.org/]
Foster, I., Kesselman, C.: Globus: a toolkit-based Grid architecture. In: The Grid: Blueprint for a Future Computing Infrastructure, pp. 259–278. Morgan Kaufmann, San Mateo, CA (1998)
Dumitrescu, C., Foster, I.: Usage policy-based CPU sharing in virtual organizations. In: Proceedings of the 5th International Workshop on Grid Computing, pp.53–60. Pittsburgh, PA (2004) (ISSN: 1550-5510)
Wolf, L.C., Steinmetz, R.: Concepts for resource reservation in advance. Multimed. Tools Appl. 4(3): 255–278
Tannenbaum, T., Wright, D., Miller, K., Livny, M.: Condor – a distributed job scheduler. In: Berman, F., Hey, A.J.G., Fox, G. (eds.) Grid Computing: Making the Global Infrastructure a Reality. Wiley, New York, NY (2003) (ISBN: 0-470-85319-0)
Henderson, R., Tweten, D.: Portable batch system: external reference specification. Technical report, NASA, Ames Research Center (1996)
Platform, User’s Guide: (2006)[Online: http://www.platform.com/Products/Platform.LSF.Family/]
Dumitrescu, C.: Policy Research for iVDGL. 2004, The University of Chicago/GriPhyN Project NSF Review 2004. Chicago, USA (2004) [Online: http://poeple.cs.uchicago.edu/~cldumitr/]
Dumitrescu, C., Wilde, M., Foster, I.: Usage policies at the site level in Grid. iVDGL/GriPhyN Project: The University of Chicago (2006) [Online: http://poeple.cs.uchicago.edu/~cldumitr/]
Constandache, I.: Policy based dynamic negotiation for Grid services authorization. In: L3S Research Center. University of Hannover, Hannover, Germany (2005)
Dumitrescu, C., Raicu, I., Foster, I.: DI-GRUBER: a distributed approach for Grid resource brokering. In: Proceedings of the Proceedings of the 2005 ACM/IEEE Conference on Supercomputing (SC’2005), p. 38. Seattle, WA (2005) (ISBN: 1-59593-061-2)
Vöckler, J.-S., Wilde, M., Foster, I.: The GriPhyN Virtual Data System. GriPhyN Technical Report, The University of Chicago (2002) [Online: http://www.griphyn.org/]
Mohamed, H.H., Epema, D.H.J.: Experiences with the KOALA co-allocating scheduler in multiclusters. In: Proceedings of the 5th IEEE/ACM Int’l Symp. on Cluster Computing and the GRID (CCGrid2005), Cardiff, UK (2005 May)
SUN: Sun Grid Engine. (2004) [Online: http://www.sun.com]
LCG: LHC – The Large Hadron Collider Project [Online: http://lcg.web.cern.ch/LCG/] (2006)
Chervenak, A., Foster, I., Kesselman, C., Salisbury, C., Tuecke, S.: The Data Grid: towards an architecture for the distributed management and analysis of large scientific data sets. J. Netw. Comput. Appl. 23, 187–200 (2001) [Online: http://www.globus.org/]
Chun, B., Culler, D., Roscoe, T., Bavier, A., Peterson, L., Wawrzoniak, M., Bowman, M.: PlanetLab: an overlay testbed for broad-coverage services. ACM SIGCOMM Comput. Commun. Rev. 33(3), 3–12 (2003) (ISSN: 0146-4833)
Dumitrescu, C., Raicu, I., Foster, I.: Experiences in running workloads over Grid3. In: Proceedings of the Grid and Cooperative Computing (GCC2005), pp. 274–286. Beijing, China (2005) (ISBN: 3-540-30510-6)
Dumitrescu, C., Raicu, I., Ripeanu, M., Foster, I.: DiPerF: automated distributed performance testing framework. In: Proceedings of the 5th IEEE/ACM International Workshop in Grid Computing (Grid’04), pp. 289–296. IEEE Computer Society, Los Alamitos, CA (2004) (ISBN: 0-7695-2256-4)
Author information
Authors and Affiliations
Corresponding author
Additional information
This work was carried out for CoreGrid IST project n°004265, funded by the European Commission.
Rights and permissions
About this article
Cite this article
Dumitrescu, C.L., Raicu, I. & Foster, I. The Design, Usage, and Performance of GRUBER: A Grid Usage Service Level Agreement based BrokERing Infrastructure. J Grid Computing 5, 99–126 (2007). https://doi.org/10.1007/s10723-006-9060-6
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10723-006-9060-6