ScalaJack: Customized Scalable Tracing with In-situ Data Analysis

Ananthakrishnan, Srinath Krishna; Mueller, Frank

doi:10.1007/978-3-319-09873-9_2

Srinath Krishna Ananthakrishnan¹⁶ &
Frank Mueller¹⁶

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 8632))

Included in the following conference series:

European Conference on Parallel Processing

2804 Accesses
1 Citations

Abstract

Root cause diagnosis of large-scale HPC applications often fails because tools, specifically trace-based ones, can no longer record all metrics they measure. We address this problems by combining customized tracing and providing support for in-situ data analysis via ScalaJack, a framework with customizable instrumentation and pluggable extension capabilities for problem directed instrumentation and in-situ data analysis. We further eliminate cross cutting concerns by code refactoring for aspect orientation and evaluate these capabilities in case studies within and beyond the scope of tracing.

This work was supported in part by NSF grants 1217748, 0958311, and 0937908.

Download to read the full chapter text

Chapter PDF

Lightweight control-flow instrumentation and postmortem analysis in support of debugging

Article 10 March 2016

ParLoT: Efficient Whole-Program Call Tracing for HPC Applications

ONE View: A Fully Automatic Method for Aggregating Key Performance Metrics and Providing Users with a Synthetic View of HPC Applications

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Adhianto, L., Banerjee, S., Fagan, M., Krentel, M., Marin, G., Mellor-Crummey, J., Tallent, N.: HPCToolkit: Tools for performance analysis of optimized parallel programs. Concurrency & Comp. Practice and Experience 22(6), 685–701 (2010)
Google Scholar
Arnold, D.C., Ahn, D.H., de Supinski, B.R., Lee, G.L., Miller, B.P., Schulz, M.: Stack trace analysis for large scale debugging. In: International Parallel and Distributed Processing Symposium (2007)
Google Scholar
Aspect, C.: AspectC: AOP for C. (2004)
Google Scholar
Brunst, H., Hackenberg, D., Juckeland, G., Rohling, H.: Comprehensive performance tracking with Vampir 7. In: Tools for HPC 2009, pp. 17–29 (2010)
Google Scholar
Buck, B., Hollingsworth, J.: An API for runtime code patching. International Journal of High Performance Computing Applications 14(4), 317–329 (2000)
Article Google Scholar
De Rose, L., Hollingsworth, J., Hoover, T.: The dynamic probe class library – an infrastructure for developing instrumentation for performance tools. In: International Parallel and Distributed Processing Symposium (April 2001)
Google Scholar
Eaddy, M., Zimmermann, T., Sherwood, K., Garg, V., Murphy, G., Nagappan, N., Aho, A.: Do crosscutting concerns cause defects? IEEE Transactions on Software Engineering 34(4), 497–515 (2008)
Article Google Scholar
Eaddy, M., Aho, A., Murphy, G.C.: Identifying, assigning, and quantifying crosscutting concerns. In: Workshop on Assessment of Contemporary Modularization Techniques, pp. 2–2 (2007)
Google Scholar
Geimer, M., Wolf, F., Wylie, B.J.N., Abraham, E., Becker, D., Mohr, B.: The scalasca performance toolset architecture. In: International Workshop on Scalable Tools for High-End Computing (June 2008)
Google Scholar
Graham, S.L., Kessler, P.B., Mckusick, M.K.: Gprof: A call graph execution profiler. ACM Sigplan Notices 17(6), 120–126 (1982)
Article Google Scholar
Kiczales, G., Hilsdale, E.: Aspect-oriented programming. In: ACM SIGSOFT Software Engineering Notes, vol. 26, p. 313 (2001)
Google Scholar
Kiczales, G., Hilsdale, E., Hugunin, J., Kersten, M., Palm, J., Griswold, W.G.: An overview of AspectJ. In: Lindskov Knudsen, J. (ed.) ECOOP 2001. LNCS, vol. 2072, pp. 327–354. Springer, Heidelberg (2001)
Chapter Google Scholar
Laboratory, L.A.N.: Cell-based adaptive mesh refinement using MPI and OpenCL GPU code, https://github.com/losalamos/CLAMR
Marathe, J., Mueller, F., Mohan, T., de Supinski, B.R., McKee, S.A., Yoo, A.: METRIC: Tracking down inefficiencies in the memory hierarchy via binary rewriting. In: Int’l Symp. on Code Generation and Optimization, pp. 289–300 (March 2003)
Google Scholar
Mucci, P.J., Browne, S., Deane, C., Ho, G.: PAPI: A portable interface to hardware performance counters. In: HPCMP Users Group Conference (1999)
Google Scholar
Nagel, W.E., Arnold, A., Weber, M., Hoppe, H.C., Solchenbach, K.: VAMPIR: Visualization and analysis of MPI resources. Supercomputer 12(1), 69–80 (1996)
Google Scholar
Noeth, M., Ratn, P., Mueller, F., Schulz, M., de Supinski, B.R.: ScalaTrace: Scalable compression and replay of communication traces for high-performance computing. Journal of Parallel Distributed Computing 69(8), 696–710 (2009)
Article Google Scholar
of Dresden, T.U.: Score-p: Application instrumentation, https://silc.zih.tu-dresden.de/scorep-current/html
Pillet, V., Labarta, J., Cortes, T., Girona, S.: PARAVER: A tool to visualise and analyze parallel code. In: WoTUG-18: Transputer and occam Developments.Transputer and Occam Engineering, vol. 44, pp. 17–31 ( April 1995)
Google Scholar
Rajaraman, A., Ullman, J.: Mining of Massive Datasets. Cambridge Press (2011)
Google Scholar
Roth, P., Arnold, D., Miller, B.: MRNet: A software-based multicast/reduction network for scalable tools. Supercomputing, 21–36 (2003)
Google Scholar
Shende, S.S., Malony, A.D.: The tau parallel performance system. Int. J. High Perform. Comput. Appl. 20(2), 287–311 (2006)
Article Google Scholar
Vetter, J., Chambreau, C.: mpiP: Lightweight, scalable MPI profiling. CASC/mpip (2005), http://mpip.sourceforge.net/
Wu, X., Mueller, F.: Elastic and scalable tracing and accurate replay of non-deterministic events. In: Int’l Conference on Supercomputing, pp. 59–68 (June 2013)
Google Scholar
Wu, X., Deshpande, V., Mueller, F.: ScalaBenchGen: Auto-generation of communication benchmarks traces. In: International Parallel and Distributed Processing Symposium, pp. 1250–1260 (2012)
Google Scholar

Download references

Author information

Authors and Affiliations

North Carolina State University, USA
Srinath Krishna Ananthakrishnan & Frank Mueller

Authors

Srinath Krishna Ananthakrishnan
View author publications
You can also search for this author in PubMed Google Scholar
Frank Mueller
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

CRACS/INESC-TEC and FCUP, Universidade do Porto, Rua do Campo Alegre, 1021, 4169-007, Porto, Portugal
Fernando Silva , Inês Dutra & Vítor Santos Costa , &

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ananthakrishnan, S.K., Mueller, F. (2014). ScalaJack: Customized Scalable Tracing with In-situ Data Analysis. In: Silva, F., Dutra, I., Santos Costa, V. (eds) Euro-Par 2014 Parallel Processing. Euro-Par 2014. Lecture Notes in Computer Science, vol 8632. Springer, Cham. https://doi.org/10.1007/978-3-319-09873-9_2

Download citation

DOI: https://doi.org/10.1007/978-3-319-09873-9_2
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-09872-2
Online ISBN: 978-3-319-09873-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

ScalaJack: Customized Scalable Tracing with In-situ Data Analysis

Abstract

Chapter PDF

Similar content being viewed by others

Lightweight control-flow instrumentation and postmortem analysis in support of debugging

ParLoT: Efficient Whole-Program Call Tracing for HPC Applications

ONE View: A Fully Automatic Method for Aggregating Key Performance Metrics and Providing Users with a Synthetic View of HPC Applications

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

ScalaJack: Customized Scalable Tracing with In-situ Data Analysis

Abstract

Chapter PDF

Similar content being viewed by others

Lightweight control-flow instrumentation and postmortem analysis in support of debugging

ParLoT: Efficient Whole-Program Call Tracing for HPC Applications

ONE View: A Fully Automatic Method for Aggregating Key Performance Metrics and Providing Users with a Synthetic View of HPC Applications

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation