logo Technical Papers


The technical papers listed below are available in their entirety in PostScript form for viewing, printing, and retrieving. Note that some papers are listed under more than one subject.


- next subject - list of subjects -

High Performance Fortran (HPF)

  • High Performance Fortran Forum. High Performance Fortran Language Specification, Version 1.0. Scientific Programming, 2(1 & 2), 1993. (also available as CRPC-TR92225).

  • C. Koelbel, D. Loveman, R. Schreiber, G. Steele, and M. Zosel. The High Performance Fortran Handbook. MIT Press, Cambridge, MA, 1993.

  • "High Performance Fortran in Practice" A tutorial by Chuck Koelbel.

  • C. Koelbel. High Performance Fortran Journal of Development. Also available as CRPC-TR93300, Center for Research on Parallel Computation, 1993.


    - previous subject - next subject - list of subjects -

    The dHPF Compiler

  • High Performance Fortran Compilation Techniques for Parallelizing Scientific Codes, Vikram Adve, Guohua Jin, John Mellor-Crummey and Qing Yi, Supercomputing '98. November 1998 (to appear). (HTML) (Postscript)

  • Using Integer Sets for Data-Parallel Program Analysis and Optimization, Vikram Adve and John Mellor-Crummey, Proceedings of the SIGPLAN'98 Conference on Programming Language Design and Implementation (PLDI), Montreal, CA (June 1998).

  • Advanced Code Generation for High Performance Fortran. Vikram Adve and John Mellor-Crummey. In Languages, Compilation Techniques and Run Time Systems for Scalable Parallel Systems, Chapter 18, Lecture Notes in Computer Science Series, Springer Verlag, 1997.

  • A Control-Flow Simplification Algorithm for Optimizing Compiler-Generated Parallel Code. John Mellor-Crummey and Vikram Adve. 10th International Workshop on Languages and Compilers for Parallel Computing, August 1997.

  • A Model and Compilation Strategy for Out-of-Core Data Parallel Programs. R.Bordawekar, A.Choudhary, K.Kennedy, K.Koelbel, and M.Paleczny. Proceedings of the ACM Symposium on Principles and Practice of Parallel Programming, July 1995.

  • Compiler Support for Out-of-Core Arrays on Data Parallel Machines. M.Paleczny, K.Kennedy, and K.Koelbel. Proceedings of the IEEE Symposium on The Frontiers of Massively Parallel Computation, February 1995.

  • Integrated Support for Task and Data Parallelism. K. M. Chandy, I. Foster, K. Kennedy, C. Koelbel, and C.-W. Tseng. Journal of Supercomputing Applications, 8(2), 1994.

  • K. Kennedy, C. Koelbel, and M. Paleczny. Scalable I/O for Out-of-Core Structures. Technical Report CRPC-TR93357-S, Center for Research on Parallel Computation, 1993.


    - previous subject - next subject - list of subjects -

    Programming Environments and Performance Analysis

  • An Integrated Compilation and Performance Analysis Environment for Data Parallel Programs. V. Adve, J-C. Wang, J. Mellor-Crummey, D. Reed, M. Anderson, and K. Kennedy. SUPERCOMPUTING'95, San Diego, CA, December 1995. (PostScript Version - 6.5M)

  • The D Editor: A New Interactive Parallel Programming Tool. S. Hiranandani, K. Kennedy, C.-W. Tseng, and S. Warren. Proceedings of Supercomputing '94, Washington, DC, November 1994.

  • Requirements for Data-Parallel Programming Environments. V. Adve, A. Carle, E. Granston, S. Hiranandani, K. Kennedy, C. Koelbel, U. Kremer, J. Mellor-Crummey, C.-W. Tseng, and S. Warren. IEEE Transactions on Parallel and Distributed Technology, 2(3):48-58, Fall 1994.

  • Compiler Support for Analysis and Tuning of Data Parallel Programs Vikram S. Adve, Charles Koelbel, and John Mellor-Crummey, Proceedings of the 1994 Workshop on Parallel Processing Tools and Environments (May 1994).

  • M. W. Hall, J. Mellor-Crummey, A. Carle, and R. Rodriguez. FIAT: A Framework for Interprocedural Analysis and Transformation. In Proceedings of the Workshop on Compilers for Parallel Processing, Portland, August 1993.


    - previous subject - next subject - list of subjects -

    Automatic Data Layout Selection

  • R. Bixby, K. Kennedy, and U. Kremer. Automatic Data Layout Using 0-1 Integer Programming. In PACT94, Montreal, Canada, August 1994. (also available as CRPC-TR93349-S).

  • K. Kennedy and U. Kremer. Initial Framework for Automatic Data Layout in Fortran D: A Short Update on a Case Study. Technical Report CRPC-TR93324-S, Center for Research on Parallel Computation, 1993.

  • U. Kremer. Automatic Data Layout for Distributed-Memory Machines. Technical Report CRPC-TR93299-S, Center for Research on Parallel Computation, 1993. (thesis proposal).

  • U. Kremer. NP-completeness of Dynamic Remapping. In Proceedings of the Fourth Workshop on Compilers for Parallel Computers, Delft, The Netherlands, 1993. (also available as CRPC-TR93330-S).

  • U. Kremer, J. Mellor-Crummey, K. Kennedy, and A. Carle. Automatic Data Layout for Distributed-Memory Machines in the D Programming Environment. In C. W. Kessler, editor, Automatic Parallelization _ New Approaches to Code Generation, Data Distribution, and Performance Prediction, pages 136-152. Vieweg Advanced Studies in Computer Science, Verlag Vieweg, Wiesbaden, Germany, 1993. (also available as CRPC-TR93298-S).

  • K. Kennedy and U. Kremer. Automatic Data Alignment and Distribution of Loosely Synchronous Problems in an Interactive Programming Environment. Technical Report CRPC-TR91205-S, Center for Research on Parallel Computation, 1992.

  • K. Kennedy, N. McIntosh, and K. McKinley. Static Performance Estimation in a Parallelizing Compiler. Technical Report CRPC-TR92204-S, Center for Research on Parallel Computation, 1992. (formerly "Static Performance Estimation").

  • G. Fox, K. Kennedy, U. Kremer, and V. Balasundaram. A Static Performance Estimator to Guide Data Partitioning Decisions. Proceedings of the Third ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, SIGPLAN Notices, 26(7):213-223, July 1991.


    - previous subject - next subject - list of subjects -

    Fortran90 Analysis and Optimization

  • G. Roth and K. Kennedy. Loop Fusion in High Performance Fortran. In Proceedings of the ACM International Conference on Supercomputing (ICS'98), Melbourne, Australia, July 1998. (Also available as CRPC-TR98745).

  • G. Roth, J. Mellor-Crummey, K. Kennedy and R. G. Brickner. Compiling Stencils in High Performance Fortran. In Proceedings of SC'97: High Performance Networking and Computing, San Jose, CA, November 1997. (Also available as an HTML document).

  • G. Roth. Optimizing Fortran90D/HPF for Distributed-Memory Computers. PhD thesis, Rice University, Houston, TX, April 1997. (Also available as CRPC-TR97695-S).

  • G. Roth and K. Kennedy. Dependence Analysis of Fortran90 Array Syntax. In Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications (PDPTA'96), Sunnyvale, CA, August 1996. (Also available as CRPC-TR96653-S).

  • K. Kennedy, J. Mellor-Crummey and G. Roth. Optimizing Fortran90 Shift Operations on Distributed-Memory Multicomputers. In Proceedings of the 8th International Workshop on Languages and Compilers for Parallel Computing (LCPC'95), Columbus, OH, August 1995. (Also available as CRPC-TR95558-S).


    - previous subject - next subject - list of subjects -

    The Fortran D Language

  • G.C. Fox, S. Hiranandani, K. Kennedy, C. Koelbel, U. Kremer, C.-W. Tseng, M. Wu. The Fortran D Language Specification. December 1990 (revised April 1991). (also available as CRPC-TR90079).

    See also High Performance Fortran.


    - previous subject - next subject - list of subjects -

    The Fortran D Compiler: Regular Problems

  • K. Kennedy, N. Nedeljkovic, and A. Sethi. A Linear Time Algorithm for Computing the Memory Access Sequence in Data-Parallel Programs. Technical Report CRPC-TR94485-S, Center for Research on Parallel Computation, October 1994.

  • S. Hiranandani, K. Kennedy, J. Mellor-Crummey, and A. Sethi. Compilation Techniques for Block-Cyclic Distributions. In ICS94, Manchester, England, July 1994.

  • S. Hiranandani, K. Kennedy, and C.-W. Tseng. Evaluating Compiler Optimizations for Fortran D. Journal of Parallel and Distributed Computing, 21(1):27-45, April 1994.

  • L. Liebrock and K. Kennedy. Parallelization of Linearized Applications in Fortran D. In International Parallel Processing Symposium, 1994, Washington, D. C., April 1994. (also available as CRPC-TR93342-S).

  • K. Kennedy. Compiler Technology for Machine-Independent Parallel Programming. International Journal of Parallel Programming, 22(1):79-97, January 1994. (also available as CRPC-TR93364).

  • K. Kennedy and G. Roth. Context Optimization for SIMD Execution. In Proceedings of the Scalable High Performance Computing Conference (SHPCC'94), Knoxville, TN, 1994. (conference version of TR93306-S, also available as CRPC-TR94455-S).

  • A. Choudhary, G. Fox and S. Hiranandani, K. Kennedy, C. Koelbel, S. Ranka, and C.-W. Tseng. Unified Compilation of Fortran 77D and 90D. ACM Letters on Programming Languages and Systems, 2(1-4), Mar-Dec 1993.

  • S. Hiranandani, K. Kennedy, J. Mellor-Crummey, and A. Sethi. Advanced Compilation Techniques for Fortran D. Technical Report CRPC-TR93338, Center for Research on Parallel Computation, October 1993.

  • S. Hiranandani, K. Kennedy, and C.-W. Tseng. Preliminary Experiences with the Fortran D Compiler. In Proceedings of Supercomputing '93, Portland, OR, April 1993. (also available as CRPC-TR93307).

  • C.-W. Tseng. An Optimizing FORTRAN D Compiler for MIMD Distributed-Memory Machines. PhD thesis, Rice University, Houston, TX, January 1993. (also available as CRPC-TR93291-S).

  • K. Kennedy and G. Roth. Context Optimization for SIMD Execution. Technical Report CRPC-TR93306-S, Center for Research on Parallel Computation, 1993.

  • S. Hiranandani, K. Kennedy, C. Koelbel, U. Kremer, and C.-W. Tseng. An Overview of the Fortran D Programming System. In U. Banerjee, D. Gelernter, A. Nicolau, and D. Padua, editors, Language and Compilers for Parallel Computing, Lecture Notes in Computer Science #589, pages 18-34. Springer-Verlag, Berlin, August 1992.

  • A. Choudhary, G. Fox, S. Hiranandani, K. Kennedy, C. Koelbel, S. Ranka, and C.-W. Tseng. Compiling Fortran 77D and 90D for MIMD Distributed-Memory Machines. In Frontiers '92, McLean, VA, October 1992. (also available as CRPC-TR92203).

  • M. W. Hall, S. Hiranandani, K. Kennedy, and C.-W. Tseng. Interprocedural Compilation of Fortran D for MIMD Distributed-Memory Machines. In Proceedings of Supercomputing '92, Minneapolis, MN, November 1992. (also available as CPRC-TR91195).

  • S. Hiranandani, K. Kennedy, and C.-W. Tseng. Compiling Fortran D for MIMD Distributed-Memory Machines. Communications of the ACM, 35(8):66-80, August 1992.

  • S. Hiranandani, K. Kennedy, and C.-W. Tseng. Evaluation of Compiler Optimizations for Fortran D on MIMD Distributed-Memory Machines. In Proceedings of the 1992 ACM International Conference on Supercomputing, pages 1-14, Washington, D. C., July 1992. (also available as CRPC-TR91196).

  • S. Hiranandani, K. Kennedy, and C.-W. Tseng. Compiler Support for Machine-Independent Parallel Programming in Fortran D. In J. Saltz and P. Mehrotra, editors, Languages, Compilers and Run-Time Environments for Distributed Memory Machines. North-Holland, Amsterdam, The Netherlands, 1992.

  • S. Hiranandani, K. Kennedy, and C.-W. Tseng. Compiler Optimizations for Fortran D on MIMD Distibuted-Memory Machines. In Proceedings of Supercomputing '91, pages 86-100, Albuquerque, NM, November 1991. (also available as CRPC-TR91162).

  • C. Koelbel. Compile-Time Generation of Regular Communications Patterns. In Proceedings of Supercomputing '91, pages 101-110, Albuquerque, NM, November 1991.


    - previous subject - next subject - list of subjects -

    The Fortran D Compiler: Irregular Problems

  • R. von Hanxleden and K. Kennedy. Give-N-Take -- A Balanced Code Placement Framework. In Proceedings of the ACM SIGPLAN '94 Conference on Program Language Design and Implementation, 1994. (also available as CRPC-TR94388-S).

  • L. Liebrock and K. Kennedy. Parallelization of Linearized Applications in Fortran D. In International Parallel Processing Symposium, 1994, Cancun, Mexico, April 1994. (also available as CRPC-TR93342-S).

  • A. Choudhary, G. Fox and S. Ranka, S. Hiranandani, K. Kennedy, C. Koelbel, and J. Saltz. Software Support for Irregular and Loosely Synchronous Problems. International Journal of Computing Systems in Engineering, 3(4):43-52, 1993. (also available offline as CRPC-TR92258).

  • T. Clark, R. von Hanxleden, J. A. McCammon, and L. R. Scott. Parallelism Using Spatial Decomposition for Molecular Dynamics. Technical Report CRPC-TR93356-S, Center for Research on Parallel Computation, 1993.

  • R. von Hanxleden. Handling Irregular Problems with Fortran D-A Preliminary Report. Technical Report CRPC-TR93339-S, Center for Research on Parallel Computation, 1993.

  • R. Das, J. Saltz, and R. von Hanxleden. Slicing Analysis and Indirect Access to Distributed Arrays. Technical Report CRPC-TR93319-S, Center for Research on Parallel Computation, June 1993.

  • R. Das, R. von Hanxleden, K. Kennedy, C. Koelbel, and J. Saltz. Compiler Analysis for Irregular Problems in Fortran D. In Proceedings of the Fifth Workshop on Languages and Compilers for Parallel Computing, New Haven, CT, August 1992. (also available as CRPC-TR92287-S, revised January 1993).

  • K. Kennedy, J. Saltz, and R. von Hanxleden. Value-based Distributions in Fortran D - A Preliminary Report. Technical Report CRPC-TR93365-S, Center for Research on Parallel Computation, 1993. (to appear in Journal of Programming Languages, Special Issue on Compiling and Run-Time Issues for Distributed Address Space Machines).

  • K. Kennedy and R. von Hanxleden. A Code Placement Framework and Its Application to Communication Generation. Technical Report CRPC-TR93337-S, Center for Research on Parallel Computation, 1993.

  • T. Clark, R. von Hanxleden, K. Kennedy, C. Koelbel, and L. R. Scott. Evaluating Parallel Languages for Molecular Dynamics Computations. In Proceedings of 1992 Scalable High Performance Computing Conference, pages 98-105, Williamsburg, VA, April 1992. (also available as CRPC-TR92202-S).

  • R. von Hanxleden and K. Kennedy. Relaxing SIMD Control Flow constraints Using Loop Transformations. Proceedings of the SIGPLAN '92 Conference on Program Language Design and Implementation, SIGPLAN Notices, 27(7), July 1992. (also available as CRPC-TR92207-S).

  • T. Clark, R. von Hanxleden, J. A. McCammon, and L. R. Scott. Parallelization Strategies for a Molecular Dynamics Program. In Intel Supercomputer University Partners Conference, Mt. Hood, OR, April 1992.

  • R. von Hanxleden. Compiler Support for Machine Independent Parallelization of Irregular Problems. Technical Report CRPC-TR92301-S, Center for Research on Parallel Computation, 1992. (thesis proposal).

  • L. Liebrock, D. Hicks, K. Kennedy, and J. Dongarra. Using Problem and Algorithm Topology for Parallelization. Technical Report CRPC-TR91166-S, Center for Research on Parallel Computation, 1991.


    - previous subject - next subject - list of subjects -

    Program Analysis & Transformation

  • P. Havlak. Interprocedural Symbolic Analysis. PhD thesis, Rice University, May 1994. (also available as CRPC-TR94451-S).

  • K. Kennedy and K. McKinley. Typed Fusion with Applications to Parallel and Sequential Code Generation. (also available offline as Technical Report CRPC-TR94370, Center for Research on Parallel Computation, 1994.

  • P. Havlak. Construction of Thinned Gated Single-Assignment Form. In Proceedings of the Sixth Workshop on Languages and Compilers for Parallel Computing, volume 768 of Lecture Notes in Computer Science, pages 477-499, Portland, OR, August 1993. Springer-Verlag.

  • K. Kennedy and K. McKinley. Maximizing Loop Parallelism and Improving Data Locality via Loop Fusion and Distribution. In U. Banerjee, D. Gelernter, A. Nicolau, and D. Padua, editors, Languages and Compilers for Parallel Computing, pages 301-320. Springer-Verlag, 1993.

  • M. W. Hall and K. Kennedy. Efficient Call Graph Analysis. In ACM Letters on Programming Languages and Systems, pages 227-242, September 1992. (also available offline as CRPC-TR92223).

  • K. D. Cooper, M. W. Hall, and L. Torczon. Unexpected Side Effects of Inline Substitution. ACM Letters on Programming Languages and Systems, 1(1):22-32, March 1992. (also available offline as CRPC-TR90065).

  • P. Havlak and K. Kennedy. An Implementation of Interprocedural Bounded Regular Section Analysis. IEEE Transactions on Parallel and Distributed Systems, 2(3):350-360, July 1991.

  • K. Kennedy and P. Havlak. Experience with Interprocedural Analysis of Array Side Effects. IEEE Transactions on Parallel and Distributed Systems, 2(3), 1991. (also available offline as CRPC-TR90063-S).


    - previous subject - next subject - list of subjects -

    ParaScope

  • K. Kennedy, K. McKinley, and C.-W. Tseng. Analysis and Transformation in an Interactive Parallel Programming Tool. Concurrency: Practice and Experience, 5(7):575-602, October 1993.

  • M. W. Hall, T. Harvey, K. Kennedy, N. McIntosh, K. McKinley, J. Oldham, M. Palenczny, and G. Roth. Experiences Using the ParaScope Editor: An Interactive Parallel Programming Tool. In Proceedings of the Symposium on Principles and Practice of Parallel Programming, San Diego, CA, May 1993. (also available as CRPC-TR93297).

  • K. D. Cooper, M. W. Hall, R. Hood, K. Kennedy, K. McKinley, J. Mellor-Crummey, L. Torczon, and S. Warren. The ParaScope Parallel Programming Environment. In Proceedings of the IEEE, pages 244-263, February 1993.

  • M. W. Hall, T. Harvey, K. Kennedy, N. McIntosh, K. McKinley, J. Oldham, M. Paleczny, and G. Roth. Experiences Using the ParaScope Editor. No longer available; replaced by CRPC-TR93297, Center for Research on Parallel Computation, 1991.

  • K. Kennedy, K. McKinley, and C.-W. Tseng. Interactive Parallel Programming Using the ParaScope Editor. IEEE Transactions on Parallel and Distributed Systems, 2(3):329-341, July 1991. (also available as CRPC-TR90096).


    - previous subject - list of subjects -

    Miscellaneous

  • R. Mirchandaney, S. Hiranandani, and A. Sethi. Improving the Performance of DSM Systems via Compiler Involvement. In SC94, DC, November 1994.

  • G. Roth. Optimizing Fortran 90D Programs for SIMD Execution. Technical Report CRPC-TR93341-S, Center for Research on Parallel Computation, 1993.


    Please direct questions about the D System to dsystem-info@cs.rice.edu.

    D System Home Page - Software & Documentation
    The Fortran Tools Glossary - The D Team

    Updated 9/30/97.
    http://www.cs.rice.edu/~dsystem/techPapers.html


    ftp Instructions

    The following instructions are provided for readers who do not have WWW access.

    To retrieve a CRPC technical report (CRPC-TR) via ftp, first ftp to softlib.rice.edu, log in as anonymous, and give your email address as the password. Next, enter the "binary" command, then cd into pub/CRPC-TRs/reports. Finally, using the CRPC-TR number -- for example, CRPC-TR93341-S -- get the ps file that matches your CRPC-TR number.

    If the report you want is not listed in Softlib, please contact dsystem-info@cs.rice.edu.