Susan L. Graham, Peter B. Kessler and Marshall K. McKusick.
gprof: a call graph execution profiler.
In Best of PLDI 1979-1999.
ACM SIGPLAN Notices, 39(4):49-57, April 2004.
(Originally appeared in the SIGPLAN '82 Symposium on Compiler Construction, 1982.)
Jennifer M. Anderson, Lance M. Berc, Jeffrey Dean, Sanjay Ghemawat, Monika R. Henzinger, Shun-Tak A. Leung, Richard L. Sites, Mark T. Vandevoorde, Carl A. Waldspurger and William E. Weihl.
Continuous profiling: Where have all the cycles gone?
ACM Transactions on Computer Systems (TOCS), 15(4):357-390,
November 1997.
Tom Anderson and Ed Lazowska.
Quartz: a tool for tuning parallel program performance.
In Proceedings of the 1990 ACM SIGMETRICS Conference on Measurement and Modeling of Computer Systems, Boulder, Colorado, United States, pages 115-125, 1990.
Holger Brunst, Hans-Christian Hoppe, Wolfgang E. Nagel and Manuela Winkler.
Performance Optimization for Large Scale Computing: The Scalable VAMPIR Approach.
In Proceedings of the International Conference on Computational Science (ICCS),
San Francisco, CA, USA, May 28-30, 2001. Published in LNCS 2074
(V.N. Alexandrov, J.J. Dongarra, B.A. Juliano, R.S. Renner and C.J. Kenneth Tan, editors), Springer, 2001.
David Culler, Richard Karp, David Patterson, Abhijit Sahay, Eunice Santos,
Klaus Erik Schauser, Ramesh Subramonian, and Thorsten von Eicken.
LogP: A Practical Model of Parallel Computation.
Communications of the ACM, 39(11):79-85, November 1996.
A. Alexandrov, M. F. Ionescu, K. E. Schauser and C. Scheiman.
LogGP: Incorporating long messages into the LogP model.
Journal of Parallel and Distributed Computing (JPDC), 44(1):71-79, 1997.
Using LogP/LogGP
C. Bell, D. Bonachea, Y. Cote, J. Duell, P. Hargrove, P. Husbands, C. Iancu, M. Welcome and K. Yelick.
An Evaluation of Current High Performance Networks.
IPDPS, Nice, France, April 22-26, 2003.