ISBN: 3-540-66443-2
TITLE: Euro-Par'99 Parallel Processing
AUTHOR: Amestoy, Patrick; Berger, Philippe; Dayde, Michel; Duff, Iain; Fraysse, Valerie; Giraud, Luc; Ruiz, Daniel (Eds.)
TOC:

Invited Talks
Some Parallel Algorithms for Integer Factorisation 1
Richard P. Brent
MERCATOR, the Mission 23
Philippe Courtier
Adaptive Scheduling for Task Farming with Grid Middleware 30
Henri Casanova, MyungHo Kim, James S. Plank, Jack J. Dongarra
Applying Human Factors to the Design of Performance Tools 44
Cherri M. Pancake
Building the Teraflops/Petabytes Production Supercomputing Center 61
Horst D. Simon, William T.C. Kramer, Robert F. Lucas
A Coming of Age for Beowulf-Class Computing 78
Thomas Sterling, Daniel Savarese
Topic 01
Support Tools and Environments 89
Frdric Desprez
Systematic Debugging of Parallel Programs in DIWIDE Based on Collective Breakpoints and Macrosteps 90
P. Kacsuk, R. Lovas, J. Kovcs
Project Workspaces for Parallel Computing - The TRAPPER Approach 98
Dino Ahr, Andreas Bcker
PVMbuilder - A Tool for Parallel Programming 108
Jan B. Pedersen, Alan Wagner
Message-Passing Specification in a CORBA Environment 113
T. Es-sqalli, E. Fleury, E. Dillon, J. Guyard
Using Preemptive Thread Migration to Load-Balance Data-Parallel Applications 117
Gabriel Antoniu, Christian Perez
FITS-A Light-Weight Integrated Programming Environment 125
B. Chapman, F. Bodin, L. Hill, J. Merlin, G. Viland, F. Wollenweber
INTERLACE: An Interoperation and Linking Architecture for Computational Engines 135
Matthew J. Sottile, Allen D. Malony
Multi-protocol Communications and High Speed Networks 139
Benoit Planquelle, Jean-Franois Mhaut, Nathalie Revol
An Online Algorithm for Dimension-Bound Analysis 144
Paul A.S. Ward
Correction of Monitor Intrusion for Testing Nondeterministic MPI-Programs 154
D. Kranzlmller, J. Chassin de Kergommeaux, Ch. Schaubschlger
Improving the Performance of Distributed Shared Memory Environments on Grid Multiprocessors 159
Dimitris Dimitrelos, Constantine Halatsis
Topic 02
Performance Evaluation and Prediction 163
Jean-Marc Vincent
Performance Analysis of Wormhole Switching with Adaptive Routing in a Two-Dimensional Torus 165
M. Colajanni, B. Ciciani, F. Quaglia
Message Passing Evaluation and Analysis on Cray T3E and SGI Origin 2000 Systems 173
M. Prieto, D. Espadas, I.M. Llorente, F. Tirado
Performance Evaluation and Modeling of the Fujitsu AP3000 Message-Passing Libraries 183
Juan Tourio, Ramon Doallo
Improving Communication Support for Parallel Applications 188
Joerg Cordsen, Marco Dimas Gubitoso
A Performance Estimator for Parallel Programs 193
Jeff Reeve
Min-Cut Methods for Mapping Dataflow Graphs 203
Volker Elling, Karsten Schwan
Influence of Variable Time Operations in Static Instruction Scheduling 213
Patricia Borensztejn, Cristina Barrado, Jesus Labarta
Evaluation of LH*lh for a Multicomputer Architecture* 217
Andy D. Pimentel, Louis O. Hertzberger
Set Associative Cache Behavior Optimization 229
Ramn Doallo, Basilio B. Fraguela, Emilio L. Zapata
A Performance Study of Modern Web Server Applications 239
Ramesh Radhakrishnan, Lizy Kurian John
An Evaluation of High Performance Fortran Compilers Using the HPFBench Benchmark Suite 248
Guohua Jin, Y. Charlie Hu
Performance Evaluation of Object Oriented Middleware 258
Lszlo Bszormnyi, Andreas Wickner, Harald Wolf
PopSPY: A PowerPC Instrumentation Tool for Multiprocessor Simulation 262
C. Limousin, A. Vartanian, J-L. Bchennec
Performance Evaluation and Benchmarking of Native Signal Processing 266
Deependra Talla, Lizy Kurian John
Topic 03
Scheduling and Load Balancing 271
Jean-Marc Geib, Bruce Hendrickson, Pierre Manneback, Jean Roman
A Polynomial-Time Branching Procedure for the Multiprocessor Scheduling Problem 272
Ricardo C. Corr ea, Afonso Ferreira
Optimal and Alternating-Direction Load Balancing Schemes 280
Robert Elssser, Andreas Frommer, Burkhard Monien, Robert Preis
Process Mapping Given by Processor and Network Dynamic Load Prediction 291
Jean-Marie Garcia, David Gauchard, Thierry Monteil, Olivier Brun
Ordering Unsymmetric Matrices into Bordered Block Diagonal Form for Parallel Processing 295
Y.F. Hu, K.C.F. Maguire, R.J. Blake
Dynamic Load Balancing for Ocean Circulation Model with Adaptive Meshing 303
Eric Blayo, Laurent Debreu, Grgory Mounie, Denis Trystram
DRAMA: A Library for Parallel Dynamic Load Balancing of Finite Element Applications 313
Bart Maerten, Dirk Roose, Achim Basermann, Jochen Fingberg, Guy Lonsdale
Job Scheduling in a Multi-layer Vision System 317
M. Fikret Ercan, Ceyda O guz, Yu-Fai Fung
A New Algorithm for Multi-objective Graph Partitioning 322
Kirk Schloegel, George Karypis, Vipin Kumar
Scheduling Iterative Programs onto LogPMachine 332
Welf Lwe, Wolf Zimmermann
Scheduling Arbitrary Task Graphs on LogP Machines 340
Cristina Boeres, Aline Nascimento, Vinod E.F. Rebello
Scheduling with Communication Delays and On-Line Disturbances 350
Aziz Moukrim, Eric Sanlaville, Frdric Guinand
Scheduling User-Level Threads on Distributed Shared-Memory Multiprocessors 358
Eleftherios D. Polychronopoulos, Theodore S. Papatheodorou
Using Duplication for the Multiprocessor Scheduling Problem with Hierarchical Communications 369
Evripidis Bampis, Rodolphe Giroudeau, Jean-Claude Knig
Topic 04
Compilers for High Performance Systems 373
Barbara Chapman
Storage Mapping Optimization for Parallel Programs 375
Albert Cohen, Vincent Lefebvre
Array SSA for Explicitly Parallel Programs 383
Jean-Francois Collard
Parallel Data-Flow Analysis of Explicitly Parallel Programs 391
Jens Knoop
Localization of Data Transfer in Processor Arrays 401
Dirk Fimmel, Renate Merker
Scheduling Structured Systems 409
Jason B. Crop, Doran K. Wilde
Compiling Data Parallel Tasks for Coordinated Execution 413
Erwin Laure, Matthew Haines, Piyush Mehrotra, Hans Zima
Flexible Data Distribution in PGHPF 418
Mark Leair, Douglas Miles, Vincent Schuster, Michael Wolfe
On Automatic Parallelization of Irregular Reductions on Scalable Shared Memory Systems 422
E. Gutirrez, O. Plata, E.L. Zapata
I/O-Conscious Tiling Strategy for Disk-Resident Data 430
Mahmut Kandemir, Alok Choudhary, J. Ramanujam
Post-Scheduling Optimization of Parallel Programs 440
Stephen Shafer, Kanad Ghose
Piecewise Execution of Nested Parallel Programs - A Thread-Based Approach 445
W. Pfannenstiel
Topic 05
Parallel and Distributed Databases 449
Burkhard Freitag, Kader Hameurlain
Distributed Database Checkpointing450
Roberto Baldoni, Francesco Quaglia, Michel Raynal
A Generalized Transaction Theory for Database and Non-database Tasks 459
Armin Feler, Hans-Jrg Schek
On Disk Allocation of Intermediate Query Results in Parallel Database Systems 469
Holger Mrtens
Highly Concurrent Locking in Shared Memory Database Systems 477
Christian Jacobi, Cdric Lichtenau
Parallel Processing of Multiple Text Queries on Hypercube Interconnection Networks 482
Basilis Mamalis, Paul Spirakis, Basil Tampakas
Topic 06 + 20
Fault Avoidance and Fault Removal in Real-Time Systems & Fault-Tolerant Computing 487
Gilles Motet, David Powell
Quality of Service Management in Distributed Asynchronous Real-Time Systems 489
Binoy Ravindran
Multiprocessor Scheduling of Real-Time Tasks with Resource Requirements 497
Costas Mourlas
Designing Multiprocessor/Distributed Real-Time Systems Using the ASSERTS Toolkit 505
Kanad Ghose, Sudhir Aggarwal, Abhrajit Ghosh, David Goldman, Peter Sulatycke, Pavel Vasek, David R. Vogel
UML Framework for the Design of Real-Time Robot Controllers 511
L. Carroll, B. Tondu, C. Baron, J.C. Geffroy
Software Implemented Fault Tolerance in Hypercube 515
D.R. Avresky, S. Geoghegan
Managing Fault Tolerance Transparently Using CORBA Services 519
Ren Meier, Paddy Nixon
Topic 07
Theory and Models for Parallel Computation 523
Michel Cosnard
Parallel Algorithms for Grounded Range Search and Applications 525
Michael G. Lamoureux, Andrew Rau-Chaplin
Multi-level Cooperative Search: A New Paradigm for Combinatorial Optimization and an Application to Graph Partitioning. 533
Michel Toulouse, Krishnaiyan Thulasiraman, Fred Glover
A Quantitative Measure of Portability with Application to Bandwidth-Latency Models for Parallel Computing. 543
Gianfranco Bilardi, Andrea Pietracaprina, Geppino Pucci
A Cost Model for Asynchronous and Structured Message Passing 552
Emmanuel Melin, Bruno Raffin, Xavier Rebeuf, Bernard Virot
A Parallel Simulation of Cellular Automata by Spatial Machines 557
Bruno Martin
Topic 08
High-Performance Computing and Applications 561
Wolfgang Gentzsch
Null Messages Cancellation Through Load Balancing in Distributed Simulations 562
Azzedine Boukerche, Sajal K. Das
Efficient Load-Balancing and Communication Overlap in Parallel Shear-Warp Algorithm on a Cluster of PCs 570
Frderique Chaussumier, Frederic Desprez, Michel Loi
A Hierarchical Approach for Parallelization of a Global Optimization Method for Protein Structure Prediction 578
S. Crivelli, T. Head-Gordon, R. Byrd, E. Eskow, R. Schnabel
Parallelization of a Compositional Simulator with a Galerkin Coarse/Fine Method 586
Geir
ge ye, Hilde Reme
Some Investigations of Domain Decomposition Techniques in Parallel CFD 595
F. Chalot, G. Chevalier, Q.V. Dinh, L. Giraud
A Parallel Ocean Model for High Resolution Studies 603
Marc Guyon, Gurvan Madec, Francois-Xavier Roux, Maurice Imbard
Nonoverlapping Domain Decomposition Applied to a Computational Fluid Mechanics Code 608
Paulo B. Vasconcelos, Filomena D. d'Almeida
A PC Cluster with Application-Quality MPI 613
M Golebiewski, A. Basermann, M. Baum, R. Hempel, H. Ritzdorf, J.L. Trff
Using Network of Workstations to Support a Web-Based Visualization Service 624
Wilfrid Lefer, Jean-Marc Pierson
High-Speed LANs: New Environments for Parallel and Distributed Applications 633
Patrick Geoffray, Laurent Lef evre, CongDuc Pham, Loc Prylli, Olivier Reymann, Bernard Tourancheau, Roland Westrelin
Consequences of Modern Hardware Design for Numerical Simulations and Their Realization in FEAST 643
Ch. Becker, S. Kilian, S. Turek, the FEAST Group
A Structured SADT Approach to the Support of a Parallel Adaptive 3D CFD Code 651
Jonathan Nash, Martin Berzins, Paul Selwood
A Parallel Algorithm for 3D Geometry Transformations in OpenGL 659
J. Sbot Julien, A. Vartanian, J-L. Bechennec, N. Drach-Temam
Parallel Implementation in a Industrial Framework of Statistical Tolerancing Analysis in Microelectronics 663
Salvatore Rinaudo, Francesco Moschella, Marcello A. Anile
Interaction Between Data Parallel Compilation and Data Transfer and Storage Cost Minimization for Multimedia Applications 668
Chidamber Kulkarni, Koen Danckaert, Francky Catthoor, Manish Gupta
Parallel Numerical Simulation of a Marine Host-Parasite System 677
Michel Langlais, Guillaume Latu, Jean Roman, Patrick Silan
Parallel Methods of Training for Multilayer Neural Network 686
El Mostafa Daoudi, El Miloud Jara
Partitioning of Vector-Topological Data for Parallel GIS Operations:
Assessment and Performance Analysis 691
Terence M. Sloan, Michael J. Mineter, Steve Dowers, Connor Mulholland, Gordon Darling, Bruce M. Gittings
Topic 09
Parallel Computer Architecture - What Is Its Future? 695
Chris Jesshope
The Algebraic Path Problem Revisited 698
Sanjay Rajopadhye, Claude Tadonki, Tanguy Risset
Vector ISA Extension for Sparse Matrix-Vector Multiplication 708
Stamatis Vassiliadis, Sorin Cotofana, Pyrrhos Stathis
A Study of a Simultaneous Multithreaded Processor Implementation 716
Dominik Madon, Eduardo Snchez, Stefan Monnier
The MorphoSys Parallel Reconfigurable System 727
Guangming Lu, Hartej Singh, Ming-hau Lee, Nader Bagherzadeh, Fadi Kurdahi, Eliseu M.C. Filho
A Graph-Oriented Task Manager for Small Multiprocessor Systems 735
Xavier Verians, Jean-Didier Legat, Jean-Jacques Quisquater, Benoit Macq
Implementing Snoop-Coherence Protocol for Future SMP Architectures 745
Wissam Hlayhel, Jacques Collet, Laurent Fesquet
An Adaptive Limited Pointers Directory Scheme for Cache Coherence of Scalable Multiprocessors 753
Cheol Ho Park, Jong Hyuk Choi, Kyu Ho Park, Daeyeon Park
Two Schemes to Improve the Performance of a Sort-Last 3D Parallel Rendering Machine with Texture Caches 757
Alexis Vartanian, Jean-Luc Bchennec, Nathalie Drach-Temam
ManArray Processor Interconnection Network: An Introduction 761
Gerald G. Pechanek, Stamatis Vassiliadis, Nikos Pitsianis
Topic 10
Distributed Systems and Algorithms 767
Grard Padiou, Andr Schiper
A Cooperation Service for CORBA Objects. From the Model to the Applications 769
Khalil Drira, Frdric Gouzec, Michel Diaz
Symphony: Managing Virtual Servers in the Global Village 777
Roy Friedman, Assaf Schuster, Ayal Itzkovitz, Eli Biham, Erez Hadad, Vladislav Kalinovsky, Sergey Kleyman, Roman Vitenberg
pidaure: A Java Distributed Tool for Building DAI Applications 785
Djamel Fezzani, Jocelyn Desbiens
A Client/Broker/Server Substrate with 50s Round-Trip Overhead 790
Olivier Richard, Franck Cappello
Universal Constructs in Distributed Computations 795
Ajay D. Kshemkalyani, Mukesh Singhal
Illustrating the Use of Vector Clocks in Property Detection: An Example and a Counter-Example 806
Michel Raynal
A Node Count-Independent Logical Clock for Scaling Lazy Release Consistency Protocol 815
Luciana Bezerra Arantes, Bertil Folliot, Pierre Sens
Mutual Exclusion Between Neighboring Nodes in an Arbitrary System Graph Tree That Stabilizes Using Read/Write Atomicity 823
Gheorghe Antonoiu, Pradip K. Srimani
Topic 11
Parallel Programming: Models, Methods and Languages 831
Luc Boug, Bill McColl, Mamoun Filali, Henk Sips
Exploiting Advanced Task Parallelism in High Performance Fortran via a Task Library* 833
Thomas Brandes
A Run-Time System for Dynamic Grain Packing 845
Jo ao Lus Sobral, Alberto Jos Proena
Optimising Skeletal-Stream Parallelism on a BSP Computer 853
Andrea Zavanella
Parallel Programming by Transformation 858
Noel Winstanley
Condensed Graphs: A Multi-level, Parallel, Intermediate Representation 866
John P. Morrison, Niall J. Dalton
A Skeleton for Parallel Dynamic Programming 877
D. Morales, F. Almeida, F. Garcia, J. Gonzalez, J. Roda, C. Rodriguez
Programming Effort vs. Performance with a Hybrid Programming Model for Distributed Memory Parallel Architectures 888
Andreas Rodman, Mats Brorsson
DAOS - Scalable And-Or Parallelism 899
Lus Fernando Castro, Vtor Santos Costa, Cludio F.R. Geyer, Fernando Silva, Patrcia Kayser Vargas, Manuel E. Correia
Write Detection in Home-Based Software DSMs 909
Weiwu Hu, Weisong Shi, Zhimin Tang
D'Caml: Native Support for Distributed ML Programming in Heterogeneous Environment 914
Ken Wakita, Takashi Asano, Masataka Sassa
ParBlocks - A New Methodology for Specifying Concurrent Method Executions in Opus 925
Erwin Laure
Static Parallelization of Functional Programs: Elimination of Higher-Order Functions & Optimized Inlining 930
Christoph A. Herrmann, Jan Laitenberger, Christian Lengauer, Christian Schaller
A Library to Implement Neural Networks on MIMD Machines 935
Yann Boniface, Frderic Alexandre, Stphane Vialle
Topic 12
Architectures and Algorithms for Vision and Other Senses 939
Alain Ayache, Virginio Cantoni, Concettina Guerra, Pieter Jonker
LUX: An Heterogeneous Function Composition Parallel Computer for Graphics 940
Stphane Mancini, Renaud Pacalet
A Parallel Accelerator Architecture for Multimedia Video Compression 950
Bertil Schmidt, Manfred Schimmler
A Parallel Architecture for Stereoscopic Processing 961
Milton Romero, Bruno Ciciani
A Robust Neural Network Based Object Recognition System and Its SIMD Implementation 969
Alfredo Petrosino, Giuseppe Salvi
Multimedia Extensions and Sub-word Parallelism in Image Processing: Preliminary Results 977
Marco Ferretti, Davide Rizzo
Vanishing Point Detection in the Hough Transform Space 987
Andrea Matessi, Luca Lombardi
Parallel Structure in an Integrated Speech-Recognition Network 995
M. Fleury, A.C. Downton, A.F. Clark
3D Optoelectronic Fix Point Unit and Its Advantages Processing 3D Data 1005
B. Kasche, D. Fey, T. Hhn, W. Erhard
Parallel Wavelet Transforms on Multiprocessors 1013
Manfred Feil, Rade Kutil, Andreas Uhl
Vector Quantization-Fractal Image Coding Algorithm Based on Delaunay
Triangulation 1018
Zahia Brahimi, Karima Ait Saadi, Noria Baraka
Topic 13+19
Numerical Algorithms for Linear and Nonlinear Algebra 1023
mpC + ScaLAPACK = Efficient Solving Linear Algebra Problems on
Heterogeneous Networks 1024
Alexey Kalinov, Alexey Lastovetsky
Parallel Subdomain-Based Preconditioner for the Schur Complement 1032
Luiz M. Carvalho, Luc Giraud
A Preconditioner for Improved Fermion Actions 1040
Wolfgang Bietenholz, Norbert Eicker, Andreas Frommer,
Thomas Lippert, Bjrn Medeke, Klaus Schilling
Application of a Class of Preconditioners to Large Scale Linear
Programming Problems 1044
Venansius Baryamureeba, Trond Steihaug, Yin Zhang
Estimating Computer Performance for Parallel Sparse QR Factorisation 1049
David J. Miron, Patrick M. Lenders
A Mapping and Scheduling Algorithm for Parallel Sparse Fan-In Numerical
Factorization 1059
Pascal Hnon, Pierre Ramet, Jean Roman
Scheduling of Algorithms Based on Elimination Trees on NUMA Systems 1068
Mara J. Martin, Inmaculada Pardines, Francisco F. Rivera
Block-Striped Partitioning and Neville Elimination 1073
P. Alonso, R. Cortina, J. Ranilla
A Comparison of Parallel Solvers for Diagonally Dominant and General Narrow-Banded Linear Systems II 1078
Peter Arbenz, Andrew Cleary, Jack Dongarra, Markus Hegland
Using Pentangular Factorizations for the Reduction to Banded Form 1088
B. Groer, B. Lang
Experience with a Recursive Perturbation Based Algorithm for Symmetric Indefinite Linear Systems 1096
Anshul Gupta, Fred Gustavson, Alexander Karaivanov,
Jerzy Wasniewski, Plamen Yalamov
Parallel Cyclic Wavefront Algorithms for Solving Semidefinite Lyapunov Equations 1104
Jos M. Claver, Vicente Hernndez, Enrique S. Quintana-Ort
Parallel Constrained Optimization via Distribution of Variables 1112
Claudia A. Sagastizbal, Mikhail V. Solodov
Solving Stable Stein Equations on Distributed Memory Computers 1120
Peter Benner, Enrique S. Quintana-Ort, Gregorio Quintana-Ort
Convergence Acceleration for the Euler Equations Using a Parallel Semi-Toeplitz Preconditioner 1124
Andreas Khari, Samuel Sundberg
A Stable and Efficient Parallel Block Gram-Schmidt Algorithm 1128
Denis Vanderstraeten
On the Extension of the Code GAM for Parallel Computing 1136
Felice Iavernaro, Francesca Mazzia
PAMIHR. A Parallel FORTRAN Program for Multidimensional Quadrature on Distributed Memory Architectures 1144
G. Laccetti, M. Lapegna
Stability Issues of the Wang's Partitioning Algorithm for Banded and Tridiagonal Linear Systems 1149
Velisar Pavlov, Plamen Yalamov
Topic 14
Emerging Topics in Advanced Computing in Europe 1153
Renato Campo, Luc Giraud
The HPF+ Project: Supporting HPF for Advanced Industrial Applications 1155
Siegfried Benkner, Guy Lonsdale, Hans Zima
TIRAN: Flexible and Portable Fault Tolerance Solutions for Cost Effective Dependable Applications 1166
O. Botti, V. De Florio, G. Deconinck, F. Cassinari, S. Donatelli, A. Bobbio, A. Klein, H. Kufner, R. Lauwereins, E. Thurner, E. Verhulst
OCEANS  Optimising Compilers for Embedded Applications 1171
Michel Barreteau, Francois Bodin, Zbigniew Chamski, Henri-Pierre Charles, Christine Eisenbeis, John Gurd, Jan Hoogerbrugge, Ping Hu, William Jalby, Toru Kisuki, Peter M.W. Knijnenburg, Paul van der Mark, Andy Nisbet, Michael F.P. O'Boyle, Erven Rohou, Andr Seznec, Elena A. Sthr, Menno Treffers, Harry A.G. Wijshoff
Cray T3E Performances of a Parallel Code for a Stochastic Dynamic Assets and Liabilities Management Model 1176
G. Zanghirati, F. Cocco, F. Taddei, G. Paruolo
Parametric Simulation of Multi-body Systems on Networks of Heterogeneous Computers 1187
Javier G. Izaguirre, Jos M. Jimenez, Unai Martn, Bruno Thomas, Alberto Larzbal, Luis M. Matey
Parallel Data Mining in the HYPERBANK Project 1195
S. Fotis, J. A. Keane, R. I. Scott
High Performance Computing for Optimum Design of Multi-body Systems 1199
Jos M. Jimenez, Nassouh A. Chehayeb, Javier G. Izaguirre, Beidi Hamma, Yan Thiaudire
Topic 15
Routing and Communication in Interconnection Networks 1203
Optimizing Message Delivery in Asynchronous Distributed Applications 1204
Girindra D. Sharma, Nael B. Abu-Ghazaleh, Umesh Kumar V. Rajasekaran, Philip A. Wilsey
Circuit-Switched Broadcasting in Multi-port Multi-dimensional Torus Networks* 1209
San-Yuan Wang, Yu-Chee Tseng, Sze-Yao Ni, Jang-Ping Sheu 

Impact of the Head-of-Line Blocking on Parallel Computer Networks: Hardware to Applications 1222
V. Puente, J.A. Gregorio, C. Izu, R. Beivide
Interval Routing on Layered Cross Product of Trees and Cycles 1231
R. Krlovic, B. Rovan, P. Ruicka
Topic 16
Instruction-Level Parallelism and Uniprocessor Architecture 1241
Pascal Sainrat, Mateo Valero
Design Considerations of High Performance Data Cache with Prefetching 1243
Chi-Hung Chi, Jun-Li Yuan
Annotated Memory References: A Mechanism for Informed Cache Management 1251
Alvin R. Lebeck, David R. Raymond, Chia-Lin Yang, Mithuna S. Thottethodi
Understanding and Improving Register Assignment 1255
Cindy Norris, James B. Fenwick, Jr.
Compiler-Directed Reordering of Data by Cyclic Graph Coloring 1260
Daniela Genius, Sylvain Lelait
Code Cloning Tracing: A "Pay per Trace" Approach 1265
Thierry Lafage, Andr Seznec, Erven Rohou, Francois Bodin
Execution-Based Scheduling for VLIW Architectures 1269
Kemal Ebcio glu, Erik R. Altman, Sumedh Sathaye, Michael Gschwind
Decoupling Recovery Mechanism for Data Speculation from Dynamic Instruction Scheduling Structure 1281
Toshinori Sato
Implementation of Hybrid Context Based Value Predictors Using Value Sequence Classification 1291
Luis Pi nuel, Rafael A. Moreno, Francisco Tirado
Heterogeneous Clustered Processors: Organization and Design 1296
Francesco Pessolano
An Architecture Framework for Introducing Predicated Execution into Embedded Microprocessors 1301
Daniel A. Connors, Jean-Michel Puiatti, David I. August, Kevin M. Crozier, Wen-mei W. Hwu
Multi-stage Cascaded Prediction 1312
Karel Driesen, Urs Hlzle
Mispredicted Path Cache Effects 1322
Jonathan Combs, Candice Bechem Combs, John Paul Shen
Topic 17
Concurrent and Distributed Programming with Objects 1333
Patrick Sall, Marc Pantel
Non-regular Process Types 1334
Franz Puntigam
Decision Procedure for Temporal Logic of Concurrent Objects 1344
Jean-Paul Bahsoun, Rami El-Bada, Hugues-Olivier Yar
Aliasing Models for Object Migration* 1353
Uwe Nestmann, Hans Httel, Josva Kleist, Massimo Merro
Dynamic Extension of CORBA Servers 1369
Marco Catunda, Noemi Rodriguez, Roberto Ierusalimschy
On the Concurrent Object Model of UML 1377
Iulian Ober, Ileana Stan
Object Oriented Design for Reusable Parallel Linear Algebra Software 1385
Eric Noulard, Nahid Emad
Topic 18
Global Environment Modelling 1393
Michel Dque
The Parallelization of the Princeton Ocean Model 1395
L.A. Boukas, N.Th. Mimikou, N.M. Missirlis, G.L. Mellor, A. Lascaratos, G. Korres
Modular Fortran 90 Implementation of a Parallel Atmospheric General Circulation Model 1403
William Sawyer, Lawrence Takacs, Andrea Molod, Robert Lucchesi
Implementation of the Limited-Area Numerical Weather Prediction Model Aladin in Distributed Memory 1411
Claude Fischer, Jean-Francois Estrade, Jure Jerman
Parallelization of the French Meteorological Mesoscale Model MsoNH 1417
Patrick Jabouille, Ronan Guivarch, Philippe Kloos, Didier Gazen, Nicolas Gicquel, Luc Giraud, Nicole Asencio, Veronique Ducrocq, Juan Escobar, Jean-Luc Redelsperger, Jol Stein, Jean-Pierre Pinty
The PALM Project: MPMD Paradigm for an Oceanic Data Assimilation Software 1423
A. Fouilloux, A. Piacentini
A Parallel Distributed Fast 3D Poisson Solver for Mso-NH 1431
Luc Giraud, Ronan Guivarch, Joel Stein
Porting a Limited Area Numerical Weather Forecasting Model on a Scalable Shared Memory Parallel Computer 1435
Roberto Ansaloni, Paolo Malfetti, Tiziana Paccagnella
Topic 22
High-Performance Data Mining and Knowledge Discovery 1439
David Skillicorn, Domenico Talia
Mining of Association Rules in Very Large Databases: A Structured Parallel Approach 1441
P. Becuzzi, M. Coppola, M. Vanneschi
Parallel k/h-Means Clustering for Large Data Sets 1451
Kilian Stoffel, Abdelkader Belkoniene
Performance Analysis for Parallel Generalized Association Rule Mining on a Large Scale PC Cluster 1455
Takahiko Shintani, Masato Oguchi, Masaru Kitsuregawa
Inducing Load Balancing and Efficient Data Distribution Prior to Association Rule Discovery in a Parallel Environment 1460
Anna M. Manning, John A. Keane
Topic 23
Symbolic Computation 1465
Mike Dewar
Parallelism in Aldor - The Communication Library it for Parallel, Distributed Computation 1466
Thierry Gautier, Niklaus Mannhart
A Library for Parallel Modular Arithmetic 1476
David Power, Russell Bradford
Performance Evaluation of Or-Parallel Logic Programming Systems on Distributed Shared-Memory Architectures 1484
Vanusa Menditi Calegario, Ines de Castro Dutra
A Parallel Symbolic Computation Environment: Structures and Mechanics 1492
Mantsika Matooane, Arthur Norman
Index of Authors 1497
END
