ISBN: 3-540-41128-3
TITLE: High Performance Computing
AUTHOR: Valero, Mateo; Joe, Kazuki; Kitsuregawa, Masaru; Tanaka, Hidehiko (Eds.)
TOC:

I. Invited Papers 
Instruction Level Distributed Processing: Adapting to Future Technology 1 
J. E. Smith 
Macroservers: An Object-Based Programming and 
Execution Model for Processor-in-Memory Arrays 7 
Hans P. Zima and Thomas L. Sterling 
The New DRAM Interfaces: SDRAM, RDRAM and Variants 26 
Brian Davis, Bruce Jacob and Trevor Mudge 
Blue Gene 32 
Henry S. Warren, Jr. 
Earth Simulator Project in Japan - Seeking a Guide Line for
the Symbiosis between the Earth and Human Beings - Visualizing
an Aspect of the Future of the Earth by a Supercomputer - 33
Keiji Tani 
II. Compilers, Architectures and Evaluation 
Limits of Task-Based Parallelism in Irregular Applications 43 
Barbara Kreaseck, Dean Tullsen and Brad Calder 
The Case for Speculative Multithreading on SMT Processors 59 
Haitham Akkary and Sébastien Hily
Loop Termination Prediction 73 
Timothy Sherwood and Brad Calder 
Compiler-Directed Cache Assist Adaptivity 88
Xiaomei Ji, Dan Nicolaescu, Alexander Veidenbaum, 
Alexandru Nicolau and Rajesh Gupta 
Skewed Data Partition and Alignment Techniques 
for Compiling Programs on Distributed Memory Multicomputers 105 
Tzung-Shi Chen and Chih-Yung Chang 
Processor Mechanisms for Software Shared Memory 120 
Nicholas P. Carter, William J. Dally, Whay S. Lee, 
Stephen W. Keckler and Andrew Chang 
An Evaluation of Page Aggregation Technique 
on Different DSM Systems 134 
Mario Donato Marino and Geraldo Lino de Campos 
Nanothreads vs. Fibers for the Support 
of Fine Grain Parallelism on Windows NT/2000 Platforms 146 
Vasileios K. Barekas, Panagiotis E. Hadjidoukas, 
Eleftherios D. Polychronopoulos and Theodore S. Papatheodorou 
III. Algorithms, Models and Applications 
Partitioned Parallel Radix Sort 160 
Shin-Jae Lee, Minsoo Jeon, Andrew Sohn and Dongseung Kim 
Transonic Wing Shape Optimization Based on Evolutionary Algorithms 172 
Shigeru Obayashi, Akira Oyama and Takashi Nakamura 
A Common CFD Platform UPACS 182 
Hiroyuki Yamazaki, Shunji Enomoto and Kazuomi Yamamoto 
On Performance Modeling for HPF Applications with ASL 191 
Thomas Fahringer, Michael Gerndt, Graham Riley and 
Jesper Larsson Träff
A "Generalized k-Tree-Based Model to Sub-system Allocation" 
for Partitionable Multi-dimensional Mesh-Connected Architectures 205 
Jeeraporn Srisawat and Nikitas A. Alexandridis 
An Analytic Model for Communication Latency in Wormhole-Switched 
k-Ary n-Cube Interconnection Networks with Digit-Reversal Traffic 218 
H. Sarbazi-Azad, L. M. Mackenzie and M. Ould-Khaoua 
Performance Sensitivity of Routing Algorithms to Failures 
in Networks of Workstations 230 
Xavier Molero, Federico Silla, Vicente Santonja and José Duato
IV. Short Papers 
Decentralized Load Balancing in Multi-node Broadcast Schemes 
for Hypercubes 243 
Satoshi Fujita and Yuji Kashima 
Design and Implementation of an Efficient Thread Partitioning 
Algorithm 252 
José Nelson Amaral, Guang Gao, Erturk Dogan Kocalar,
Patrick O'Neill and Xinan Tang 
A Flexible Routing Scheme for Networks of Workstations 260 
José Carlos Sancho, Antonio Robles and José Duato
Java Bytecode Optimization with Advanced Instruction 
Folding Mechanism 268 
Austin Kim and Morris Chang 
Performance Evaluation of a Java Based Chat System 276 
Fabian Breg, Mike Lew and Harry A. G. Wijshoff 
Multi-node Broadcasting in All-Ported 3-D Wormhole-Routed Torus 
Using Aggregation-then-Distribution Strategy 284 
Yuh-Shyan Chen, Che-Yi Chen and Yu-Chee Tseng 
On the Influence of the Selection Function on the Performance 
of Networks of Workstations 292 
J. C. Martínez, F. Silla, P. López and J. Duato
Combining In-Transit Buffers with Optimized Routing Schemes 
to Boost the Performance of Networks with Source Routing 300 
Jose Flich, Pedro López, Manuel P. Malumbres, José Duato and Tom Rokicki
A Comparison of Locality-Based and Recency-Based 
Replacement Policies 310 
Hans Vandierendonck and Koen De Bosschere 
The Filter Data Cache: A Tour Management Comparison 
with Related Split Data Cache Schemes Sensitive to Data Localities 319 
Julio Sahuquillo, Ana Pont and Veljko Milutinovic 
Global Magneto-Hydrodynamic Simulations of Differentially 
Rotating Accretion Disk by Astrophysical Rotational Plasma Simulator 328 
Mami Machida, Ryoji Matsumoto, Shigeki Miyaji, 
Kenji E. Nakamura and Hideaki Tonooka 
Exploring Multi-level Parallelism in Cellular Automata Networks 336 
Claudia Roberta Calidonna, Claudia Di Napoli, Maurizio Giordano 
and Mario Mango Furnari 
Orgel: A Parallel Programming Language with Declarative
Communication Streams 344 
Kazuhiko Ohno, Shigehiro Yamamoto, Takanori Okano 
and Hiroshi Nakashima 
BS lambda_p: Functional BSP Programs on Enumerated Vectors 355 
Frédéric Loulergue
Ability of Classes of Dataflow Schemata with Timing Dependency 364 
Yasuo Matsubara and Hiroyuki Miyagawa 
A New Model of Parallel Distributed Genetic Algorithms
for Cluster Systems: Dual Individual DGAs 374 
Tomoyuki Hiroyasu, Mitsunori Miki, Masahiro Hamasaki and 
Yusuke Tanimura 
V. International Workshop on OpenMP: Experiences and Implementations (WOMPEI) 
An Introduction to OpenMP 2.0 384
Timothy G. Mattson 
Implementation and Evaluation of OpenMP for Hitachi SR8000 391 
Yasunori Nishitani, Kiyoshi Negishi, Hiroshi Ohta and Eiji Nunohiro 
Performance Evaluation of the Omni OpenMP Compiler 403 
Kazuhiro Kusano, Shigehisa Satoh and Mitsuhisa Sato 
Leveraging Transparent Data Distribution in OpenMP 
via User-Level Dynamic Page Migration 415 
Dimitrios S. Nikolopoulos, Theodore S. Papatheodorou, 
Constantine D. Polychronopoulos, Jesús Labarta and Eduard Ayguadé
Formalizing OpenMP Performance Properties with ASL 428 
Thomas Fahringer, Michael Gerndt, Graham Riley and 
Jesper Larsson Träff
Automatic Generation of OpenMP Directives and Its Application 
to Computational Fluid Dynamics Codes 440 
Haoqiang Jin, Michael Frumkin and Jerry Yan 
Coarse-Grain Task Parallel Processing Using the OpenMP Backend 
of the OSCAR Multigrain Parallelizing Compiler 457 
Kazuhisa Ishizaka, Motoki Obata and Hironori Kasahara 
Impact of OpenMP Optimizations for the MGCG Method 471 
Osamu Tatebe, Mitsuhisa Sato and Satoshi Sekiguchi 
Quantifying Differences between OpenMP and MPI Using 
a Large-Scale Application Suite 482 
Brian Armstrong, Seon Wook Kim and Rudolf Eigenmann 
VI. International Workshop on Simulation and Visualization (IWSV) 
Large Scale Parallel Direct Numerical Simulation of 
a Separating Turbulent Boundary Layer Flow over a Flat Plate 
Using NAL Numerical Wind Tunnel 494 
Naoki Hirose, Yuichi Matsuo, Takashi Nakamura, Martin Skote and 
Dan Henningson 
Characterization of Disordered Networks in Vitreous SiO2 and
Its Rigidity by Molecular-Dynamics Simulations on Parallel Computers 501 
Hajime Kimizuka, Hideo Kaburaki, Futoshi Shimizu and Yoshiaki Kogure 
Direct Numerical Simulation of Coherent Structure in Turbulent 
Open-Channel Flows with Heat Transfer 502 
Yoshinobu Yamamoto, Tomoaki Kunugi and Akimi Serizawa 
High Reynolds Number Computation for Turbulent Heat Transfer 
in a Pipe Flow 514 
Shin-ichi Satake, Tomoaki Kunugi and Ryutaro Himeno 
Large-Scale Simulation System and Advanced Photon Research 524 
Yutaka Ueshima and Yasuaki Kishimoto 
Parallelization, Vectorization and Visualization of 
Large Scale Plasma Particle Simulations and Its Application 
to Studies of Intense Laser Interactions 535 
Katsunobu Nishihara, Hirokazu Amitani, Yuko Fukuda, Tetsuya Honda, 
Y. Kawata, Yuko Ohashi, Hitoshi Sakagami and Yoshitaka Sizuki 
Fast LIC Image Generation Based on Significance Map 537 
Li Chen, Issei Fujishiro and Qunsheng Peng 
Fast Isosurface Generation Using the Cell-Edge Centered 
Propagation Algorithm 547 
Takayuki Itoh, Yasushi Yamaguchi and Koji Koyamada 
Fast Ray-Casting for Irregular Volumes 557 
Koji Koyamada 
A Study on the Effect of Air on the Dynamic Motion 
of a MEMS Device and Its Shape Optimization 573 
Hidetoshi Kotera, Taku Hirasawa, Sasatoshi Senga and Susumu Shima 
A Distributed Rendering System "On Demand Rendering System" 585 
Hideo Miyachi, Toshihiko Kobayashi, Yasuhiro Takeda, 
Hiroshi Hoshino and Xiuyi Jin 
Author Index 593 
END
