ISBN: 3-540-67858-1
TITLE: Languages and Compilers for Parallel Computing
AUTHOR: Carter, Larry; Ferrante, Jeanne (Eds.)
TOC:

Java
High Performance Numerical Computing in Java: Language and Compiler Issues 1
Pedro V. Artigas, Manish Gupta, Samuel P. Midkiff, and Jos E. Moreira
Instruction Scheduling in the Presence of Java's Runtime Exceptions 18
Matthew Arnold, Michael Hsiao, Ulrich Kremer, and Barbara Ryder
Dependence Analysis for Java 35
Craig Chambers, Igor Pechtchanski, Vivek Sarkar, Mauricio J. Serrano, and Harini Srinivasan
Low-Level Transformations A
Comprehensive Redundant Load Elimination for the IA-64 Architecture 53
Youngfeng Wu and Yong-fong Lee
Minimum Register Instruction Scheduling: A New Approach for Dynamic Instruction Issue Processors 70
R. Govindarajan, Chihong Zhang, and Guang R. Gao
Unroll-Based Copy Elimination for Enhanced Pipeline Scheduling 85
Suhyun Kim, Soo-Mook Moon, Jinpyo Park, and HanSaem Yun
Data Distribution
A Linear Algebra Formulation for Optimising Replication in Data Parallel Programs 100
Olav Beckmann and Paul H.J. Kelly
Accurate Data and Context Management in Message-Passing Programs 117
Dhruva R. Chakrabarti and Prithviraj Banerjee
An Automatic Iteration/Data Distribution Method Based on Access Descriptors for DSMM 133
Angeles G. Navarro and Emilio L. Zapata
High-Level Transformations
Inter-array Data Regrouping 149
Chen Ding and Ken Kennedy
Iteration Space Slicing for Locality 164
William Pugh and Evan Rosser
A Compiler Framework for Tiling Imperfectly-Nested Loops 185
Yonghong Song and Zhiyuan Li
Models
Parallel Programming with Interacting Processes 201
Peiyi Tang and Yoichi Muraoka
Application of the Polytope Model to Functional Programs 219
Nils Ellmenreich, Christian Lengauer, and Martin Griebl
Multilingual Debugging Support for Data-Driven and Thread-Based Parallel Languages 236
Parthasarathy Ramachandran and Laxmikant V. Kale'
Array Analysis
An Analytical Comparison of the I-Test and Omega Test 251
David Niedzielski and Kleanthis Psarris
The Access Region Test 271
Jay Hoeflinger and Yunheung Paek
A Precise Fixpoint Reaching Definition Analysis for Arrays 286
Jean-Franois Collard and Martin Griebl
Demand-Driven Interprocedural Array Property Analysis 303
Yuan Lin and David Padua
Language Support
Language Support for Pipelining Wavefront Computations 318
Bradford L. Chamberlain, E. Christopher Lewis, and Lawrence Snyder
The Data Mover: A Machine-Independent Abstraction for Managing Customized Data Motion 333
Scott B. Baden and Stephen J. Fink
Optimization of Memory Usage Requirement for a Class of Loops Implementing Multi-dimensional Integrals 350
Chi-Chung Lam, Daniel Cociorva, Gerald Baumgartner, and P. Sadayappan
Compiler Design and Cost Analysis
Compile-Time Based Performance Prediction 365
Calin Cascaval, Luiz DeRose, David A. Padua, and Daniel A. Reed
Designing the Agassiz Compiler for Concurrent Multithreaded Architectures 380
B. Zheng, J. Y. Tsai, B. Y. Zang, T. Chen, B. Huang, J. H. Li, Y. H. Ding, J. Liang, Y. Zhen, P. C. Yew, and C. Q. Zhu
The Scc Compiler: SWARing at MMX and 3DNow! 399
Randall J. Fisher and Henry G. Dietz
Low-Level Transformation B
Loop Shifting for Loop Compaction 415
Alain Darte and Guillaume Huard
Speculative Predication Across Arbitrary Interprocedural Control Flow 432
Hank G. Dietz
Posters
Porting an Ocean Code to MPI Using tsf 447
F. Bodin, Y. Mvel, S. Chauveau, and E. Rohou
A Geometric Semantics for Program Representation in the Polytope Model 451
Brian J. d'Auriol
Compiler and Run-Time Support for Improving Locality in Scientific Codes 455
Hwansoo Han, Gabriel Rivera, and Chau-Wen Tseng
Code Restructuring for Improving Real Time Response through Code Speed, Size Trade-offs on Limited Memory Embedded DSPs 459
Vipin Jain, Siddharth Rele, Santosh Pande, and J. Ramanujam
Compiling for Speculative Architectures 464
Seon Wook Kim and Rudolf Eigenmann
Symbolic Analysis in the PROMIS Compiler 468
Nicholas Stavrakos, Steven Carroll, Hideki Saito, Constantine Polychronopoulos, and Alex Nicolau
Data I/O Minimization for Loops on Limited Onchip Memory Processors 472
Lei Wang and Santosh Pande
Time Skewing for Parallel Computers 477
David Wonnacott
Run-Time Parallelization Optimization Techniques 481
Hao Yu and Lawrence Rauchwerger
Thresholding for Work Distribution of Recursive, Multithreaded Functions 485
Gary M. Zoppetti, Gagan Agrawal, and Lori L. Pollock
An Empirical Study of Function Pointers Using SPEC Benchmarks 490
Ben-Chung Cheng and Wen-mei W. Hwu
Data Driven Graph: A Parallel Program Model for Scheduling 494
V. D. Tran, L. Hluchy, and G. T. Nguyen
Author Index 499
END
