Abstract
The Large Hadron Collider (LHC) is preparing for data taking at the end of 2009. The Worldwide LHC Computing Grid (WLCG) provides data storage and computational resources for the high energy physics community. Operating the heterogeneous WLCG infrastructure, which integrates 140 computing centers in 33 countries all over the world, is a complicated task. Reliable monitoring is one of the crucial components of the WLCG for providing the functionality and performance that is required by the LHC experiments. The Experiment Dashboard system provides monitoring of the WLCG infrastructure from the perspective of the LHC experiments and covers the complete range of their computing activities. This work describes the architecture of the Experiment Dashboard system and its main monitoring applications and summarizes current experiences by the LHC experiments, in particular during service challenges performed on the WLCG over the last years.
Article PDF
Similar content being viewed by others
Avoid common mistakes on your manuscript.
References
Evans, L., Bryant, P. (eds): LHC machine. J. Instr. 3:S08001. doi:10.1088/1748-0221/3/08/S08001 (2008)
The ATLAS Computing Group: ATLAS computing technical design report. ATLAS-TDR-017, CERN-LHCC-2005-022 (2005)
The CMS Computing Group: CMS computing technical design report. CMS-TDR-007, CERN-LHCC-2005-023 (2005)
The ALICE Computing Group: ALICE computing technical design report. CMS-TDR-012, CERN-LHCC-2005-020 (2005)
The LHCb Computing Group: LHCb computing technical design report. CMS-TDR-0011, CERN-LHCC-2005-019 (2005)
European Organization for Nuclear Research, CERN. http://public.web.cern.ch/public/. Accessed 21 April 2010
Shiers, J.: The Worldwide LHC Computing Grid (worldwide LCG), Computer Physics communications, vol. 177, issues 1–2, July 2007, pages 219–223, Proceedings of the Conference on Computational Physics 2006 - CCP (2006)
Rehn, J., et al.: PhEDEx high-throughput data transfer management system. In: CHEP06, Conference on Computing in High Energy and Nuclear Physics Proceedings, Mumbai, India, (2006)
Tsaregorodsev A., et al.: Dirac: A community Grid solution. In: CHEP07, Conference on Computing in High Energy and Nuclear Physics Proceedings, Victoria, BC, Canada (2007)
Nilsson, P.: PanDA system in ATLAS Experiment. ACAT’08, Italy (2008)
Saiz, P., et al.: AliEn—ALICE environment on the GRID. Nucl. Instrum. Methods A502, 437–440 (2003)
http://www.hpcwire.com/offthewire/STEP09-Demonstrates-LHC-Readiness-49631242.html. Accessed 21 April 2010
Laure, E., Fisher, S.M., Frohner, A., Grandi, C., Kunszt, P., et al.: Programming theGrid with gLite. Comput. Methods Sci. Technol. 12(1), 33–45 (2006)
Pordes, R., et al.: The open science Grid. J. Phys. Conf. Ser. 78, 012057 (2007)
Eerola, P., et al.: Roadmap for the ARC Grid middleware. PARA LNCS 4699 (2006)
CMS Dashboard stats. http://lxarda18.cern.ch/awstats/awstats.pl?config=lxarda18.cern.ch. Accessed 21 April 2010
Karmady, R., et al.: GridView: a monitoring and visualization tool. In: CHEP06, Conference on Computing in High Energy and Nuclear Physics Proceedings, Mumbai, India (2006)
Martyniak, J., et al.: Real time monitor of Grid job executions. In: CHEP09, Conference on Computing in High Energy and Nuclear Physics Proceedings, Prague, Chech Republic (2009)
Ruda, M.: A uniform job monitoring service in multiple job universes. High Performance Distributed Computing, Proceedings of the 2007 workshop on Grid monitoring Monterey, California USA (2007)
Collados, D.: Evolution of SAM in an enhanced model for monitoring WLCG services. In: CHEP09, Conference on Computing in High Energy and Nuclear Physics Proceedings, Pargue, Chech Republic (2009)
Aiftimiei, C., P, et al.: Using CREAM and CEMON for job submission and management in the gLite middleware. In: CHEP09 Conference Proceedings, Pargue, Chech Republic (2009)
Moscicki, J., et al.: Ganga: a tool for computational-task management and easy access to Grid resources. Comput. Phys. Commun. arXiv:0902.2685v1
Spiga, D., et al.: The CMS remote analysis builder (CRAB). Lect. Notes Comput. Sci. 4873, 580–586 (2007)
Legrand, I., Newman, H., Cirstoiu, C., Grigoras, C., Toarta, M., Dobre, C.: MonALISA: an agent based, dynamic service system to monitor, controland optimize Grid based applications. In: Proceedings of Computing for High Energy Physics, Interlaken, Switzerland (2004)
Casey, J., Rodrigues, D., Schwickerath, U., Silva, R.: Monitoring the efficiency of user jobs. In: CHEP’09: 17th International Conference on Computing in High Energy and Nuclear Physics, Prague, Czech Republic (2009)
Metson, S., et al.: CMS offline webtools. In: CHEP’07: Conference on Computing in High Energy and Nuclear Physics Procedings, Victoria, BC, Canada (2007)
Schulz, M., et al.: Building the WLCG file transfer service. In: CHEP07: Conference on Computing in High Energy and Nuclear Physics Proceedings, Victoria, BC, Canada (2007)
Cooke, W., et al.: The relational Grid monitoring architecture: mediating information about Grid. J. Grid Comput. 2(4), 1–17 (2004)
Thain, D., Tannenbaum, T., Livny, M.: Distributed computing in practice: the condor experience. Concurrency Comput. Pract. Ex. 17(2–4), 323–356 (2005)
Karavkis, E., et al.: CMS dashboard for monitoring of the user analysis activities. In: CHEP’09: 17th International Conference on Computing in High Energy and Nuclear Physics, Prague, Czech Republic (2009)
Agrawal, R., Srikant, R.: Fast algorithms for mining association rules in large databases. In: Proceedings of the 20th International Conference on Very Large Data Bases, VLDB, Santiago, Chile, 487–499 (1994)
Belforte, S., et al.: The commissioning of CMS sites: improving the site reliability. In: 17th International Conference on Computing in High Energy and Nuclear Physics Proceedings, Prague, Czech Republic (2009)
GridMap visualization, http://www.isgtw.org/?pid=1000728. Accessed 21 April 2010
Andreozzi, S., et al.: GridICE: a monitoring service for Grid systems. Future Gener. Comput. Syst. 21(4), 559–571 (2005)
Smallen, S., et al.: User-level Grid monitoring with Inca 2. In: High Performance Distributed Computing, Proceedings of the 2007 workshop on Grid monitoring Monterey, California, USA (2007)
Andreeva, J., et al.: The experiment dsahboard for medical applications. In: 3rd EGEE User Forum, Clermont-Ferrand, France (2008)
Oracle Partitioning. http://www.oracle.com/us/products/database/options/partitioning/index.htm. Accessed 21 April 2010
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Andreeva, J., Boehm, M., Gaidioz, B. et al. Experiment Dashboard for Monitoring Computing Activities of the LHC Virtual Organizations. J Grid Computing 8, 323–339 (2010). https://doi.org/10.1007/s10723-010-9148-x
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10723-010-9148-x