Abstract
Performance analysis of applications on large clusters of SMPs requires a monitoring approach that supports tools realizing concepts like automation, distribution and on-line operations. Key goals are a minimization of the perturbation of the target application and flexibility and efficiency with respect to data pre-processing and filtering. To achieve these goals, our approach separates the monitor into a passive monitoring library linked to the application and an active ‘runtime information producer’ (RIP) which handles monitoring requests and performs pre-processing (e.g., aggregation) of performance data for individual cluster nodes. A directory service can be queried to discover which RIPs handle which nodes.
Part of this work is funded by the Competence Network for High-Performance Computing in Bavaria KONWIHR (http://konwihr.in.tum.de) and by the European Commission via the APART working group (http://www.fz-juelich.de/apart).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Balis, B., Bubak, M., Funika, W., Szepieniec, T., Wismller, R.: Monitoring of Interactive Grid Applications. In: To appear in Proceedings of Dagstuhl Seminar 02341 on Performance Analysis and Distributed Computing, Kluwer Academic Publishers, Dordrecht (2003)
Mohr, B., Malony, A.D., Shende, S., Wolf, F.: Towards a Performance Tool Interface for OpenMP: An Approach Based on Directive Rewriting. In: EWOMP 2001 Third European Workshop on OpenMP (September 2001)
The Top 500 Supercomputer Sites, http://www.top500.org
Gerndt, M., Frlinger, K.: Towards Automatic Performance Analysis for Large Scale Systems. In: At the 10th International Workshop on Compilers for Parallel Computers (CPC 2003), Amsterdam, The Netherlands (January 2003)
The Hitachi Performance Monitor Function (Hitachi Confidential)
Browne, S., Dongarra, J., Garner, N., London, K., Mucci, P.: A Scalable Cross-Platform Infrastructure for Application Performance Tuning Using Hardware Counters. In: Proc. SC 2000 (November 2000)
Fahringer, T., Gerndt, M., Riley, G., Trff, J.L.: Formalizing OpenMP Performance Properties with the APART Specification Language (ASL). In: International Workshop on OpenMP: Experiences and Implementation. LNCS, pp. 428–439. Springer, Tokyo (2000)
Fahringer, T., Gerndt, M., Riley, G., Trff, J.L.: Knowledge Specification for Automatic Performance Analysis. APART Technical Report (2001), http://www.fz-juelich.de/apart
CrossGrid Project, http://www.eu-crossgrid.org
Ludwig, T., Wismller, R., Sunderam, V., Bode, A.: OMIS – On-line Monitoring Interface Specification (Version 2.0). LRR-TUM Research Report Series, vol. 9. Shaker Verlag, Aachen (1997), http://wwwbode.in.tum.de/omis/OMIS/Version-2.0/version-2.0.ps.gz
Dynamic Probe Class Library, http://oss.software.ibm.com/dpcl/
Dyninst. An Application Program Interface (API) for Runtime Code Generation, http://www.dyninst.org
Thiffault, C., Voss, M., Healey, S.T., Kim, S.W.: Dynamic Instrumentation of Large-Scale MPI/OpenMP Applications. To appear in Proc. of IPDPS 2003: International Parallel and Distrubuted Processing Symposium, Nice, France (April 2003)
Tierney, B., Aydt, R., Gunter, D., Smith, W., Swany, M., Taylor, V., Wolski, R.: A Grid Monitoring Architecture, http://www-didc.lbl.gov/GGF-PERF/GMA-WG/papers/GWD-GP-16-2.pdf
Nagel, W.E., Arnold, A., Weber, M., Hoppe, H.C., Solchenbach, K.: VAMPIR: Visualization and analysis of MPI resources. Supercomputer 12(1), 69–80 (1996), http://www.pallas.com/e/products/vampir/index.htm
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Fürlinger, K., Gerndt, M. (2003). Distributed Configurable Application Monitoring on SMP Clusters. In: Dongarra, J., Laforenza, D., Orlando, S. (eds) Recent Advances in Parallel Virtual Machine and Message Passing Interface. EuroPVM/MPI 2003. Lecture Notes in Computer Science, vol 2840. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-39924-7_58
Download citation
DOI: https://doi.org/10.1007/978-3-540-39924-7_58
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-20149-6
Online ISBN: 978-3-540-39924-7
eBook Packages: Springer Book Archive