Euro-Par – the European Conference on Parallel Computing – is an international conference series dedicated to the promotion and advancement of all aspects of parallel computing. The major themes can be divided into the broad categories of hardware, software, algorithms, and applications for parallel computing. The objective of Euro-Par is to provide a forum within which to promote the dev- opment of parallel computing both as an industrial technique and an academic discipline, extending the frontier of both the state of the art and the state of the practice. This is particularlyimportant at a time when parallel computing is - dergoing strong and sustained development and experiencing real industrial take up. The main audience for and participants of Euro-Par are seen as researchers in academic departments, government laboratories, and industrial organisations. Euro-Par’s objective is to become the primarychoice of such professionals for the presentation of new results in their speci?c areas. Euro-Par is also interested in applications that demonstrate the e?ectiveness of the main Euro-Par themes. Euro-Par now has its own Internet domain with a permanent Web site where the historyof the conference series is described: http://www. euro-par. org. The Euro-Par conference series is sponsored bythe Association of Computer Machineryand the International Federation of Information Processing.
Author(s): David E. Keyes (auth.), Arndt Bode, Thomas Ludwig, Wolfgang Karl, Roland Wismüller (eds.)
Series: Lecture Notes in Computer Science 1900
Edition: 1
Publisher: Springer-Verlag Berlin Heidelberg
Year: 2000
Language: English
Pages: 1372
Tags: Computer Systems Organization and Communication Networks; Software Engineering/Programming and Operating Systems; Theory of Computation; Mathematics of Computing; Computational Mathematics and Numerical Analysis
Four Horizons for Enhancing the Performance of Parallel Simulations Based on Partial Differential Equations....Pages 1-17
E2K Technology and Implementation....Pages 18-21
Grid-Based Asynchronous Migration of Execution Context in Java Virtual Machines....Pages 22-34
Logical Instantaneity and Causal Order: Two “First Class” Communication Modes for Parallel Computing....Pages 35-42
The TOP500 Project of the Universities Mannheim and Tennessee....Pages 43-43
Support Tools and Environments....Pages 45-46
Visualization and Computational Steering in Heterogeneous Computing Environments....Pages 47-56
A Web-Based Finite Element Meshes Partitioner and Load Balancer....Pages 57-64
A Framework for an Interoperable Tool Environment....Pages 65-69
ToolBlocks: An Infrastructure for the Construction of Memory Hierarchy Analysis Tools....Pages 70-74
A Preliminary Evaluation of Finesse , a Feedback-Guided Performance Enhancement System....Pages 75-85
On Combining Computational Differentiation and Toolkits for Parallel Scientific Computing....Pages 86-94
Generating Parallel Program Frameworks from Parallel Design Patterns....Pages 95-104
Performance Evaluation and Prediction....Pages 105-107
A Callgraph-Based Search Strategy for Automated Performance Diagnosis....Pages 108-122
Automatic Performance Analysis of MPI Applications Based on Event Traces....Pages 123-132
Pajé: An Extensible Environment for Visualizing Multi-threaded Programs Executions....Pages 133-140
A Statistical-Empirical Hybrid Approach to Hierarchical Memory Analysis....Pages 141-148
Use of Performance Technology for the Management of Distributed Systems....Pages 149-159
Delay Behavior in Domain Decomposition Applications....Pages 160-167
Automating Performance Analysis from UML Design Patterns....Pages 168-172
Integrating Automatic Techniques in a Performance Analysis Session....Pages 173-177
Combining Light Static Code Annotation and Instruction-Set Emulation for Flexible and Efficient On-the-Fly Simulation....Pages 178-182
SCOPE - The Specific Cluster Operation and Performance Evaluation Benchmark Suite....Pages 183-188
Implementation Lessons of Performance Prediction Tool for Parallel Conservative Simulation....Pages 189-193
A Fast and Accurate Approach to Analyze Cache Memory Behavior....Pages 194-198
Impact of PE Mapping on Cray T3E Message-Passing Performance....Pages 199-207
Performance Prediction of an NAS Benchmark Program with ChronosMix Environment....Pages 208-216
Scheduling and Load Balancing....Pages 217-217
A Hierarchical Approach to Irregular Problems....Pages 218-222
Load Scheduling with Profile Information....Pages 223-233
Neighbourhood Preserving Load Balancing: A Self-Organizing Approach....Pages 234-241
The Impact of Migration on Parallel Job Scheduling for Distributed Systems....Pages 242-251
Memory Management Techniques for Gang Scheduling....Pages 252-261
Exploiting Knowledge of Temporal Behaviour in Parallel Programs for Improving Distributed Mapping....Pages 262-271
Preemptive Task Scheduling for Distributed Systems....Pages 272-276
Towards Optimal Load Balancing Topologies....Pages 277-287
Scheduling Trees with Large Communication Delays on Two Identical Processors....Pages 288-295
Parallel Multilevel Algorithms for Multi-constraint Graph Partitioning....Pages 296-310
Experiments with Scheduling Divisible Tasks in Clusters of Workstations....Pages 311-319
Optimal Mapping of Pipeline Algorithms....Pages 320-324
Dynamic Load Balancing for Parallel Adaptive Multigrid Solvers with Algorithmic Skeletons....Pages 325-328
Compilers for High Performance....Pages 329-330
Improving the Sparse Parallelization Using Semantical Information at Compile-Time....Pages 331-339
Automatic Parallelization of Sparse Matrix Computations: A Static Analysis....Pages 340-348
Automatic SIMD Parallelization of Embedded Applications Based on Pattern Recognition....Pages 349-356
Temporary Arrays for Distribution of Loops with Control Dependences....Pages 357-367
Automatic Generation of Block-Recursive Codes....Pages 368-378
Left-Looking to Right-Looking and Vice Versa: An Application of Fractal Symbolic Analysis to Linear Algebra Code Restructuring....Pages 379-388
Identifying and Validating Irregular Mutual Exclusion Synchronization in Explicitly Parallel Programs....Pages 389-394
Exact Distributed Invalidation....Pages 395-404
Scheduling the Computations of a Loop Nest with Respect to a Given Mapping....Pages 405-414
Volume Driven Data Distribution for NUMA-Machines....Pages 415-424
Parallel and Distributed Databases and Applications....Pages 425-426
Database Replication Using Epidemic Communication....Pages 427-434
Evaluating the Coordination Overhead of Replica Maintenance in a Cluster of Databases....Pages 435-444
A Communication Infrastructure for a Distributed RDBMS....Pages 445-450
Distribution, Replication, Parallelism, and Efficiency Issues in a Large-Scale Online/Real-Time Information System for Foreign Exchange Trading....Pages 451-454
Complexity Theory and Algorithms....Pages 455-455
Positive Linear Programming Extensions: Parallel Complexity and Applications....Pages 456-460
Parallel Shortest Path for Arbitrary Graphs....Pages 461-470
Periodic Correction Networks....Pages 471-478
Applications on High-Performance Computers....Pages 479-480
An Efficient Algorithm for Parallel 3D Reconstruction of Asymmetric Objects from Electron Micrographs....Pages 481-490
Fast Cloth Simulation with Parallel Computers....Pages 491-499
The Input, Preparation, and Distribution of Data for Parallel GIS Operations....Pages 500-505
Study of the Load Balancing in the Parallel Training for Automatic Speech Recognition....Pages 506-510
Pfortran and Co-Array Fortran as Tools for Parallelization of a Large-Scale Scientific Application....Pages 511-518
Sparse Matrix Structure for Dynamic Parallelisation Efficiency....Pages 519-526
A Multi-color Inverse Iteration for a High Performance Real Symmetric Eigensolver....Pages 527-531
Parallel Implementation of Fast Hartley Transform (FHT) in Multiprocessor Systems....Pages 532-535
Parallel Computer Architecture....Pages 537-538
Coherency Behavior on DSM: A Case Study....Pages 539-544
Hardware Migratable Channels....Pages 545-549
Reducing the Replacement Overhead on COMA Protocols for Workstation-Based Architectures....Pages 550-557
Cache Injection: A Novel Technique for Tolerating Memory Latency in Bus-Based SMPs....Pages 558-566
Adaptive Proxies: Handling Widely-Shared Data in Shared-Memory Multiprocessors....Pages 567-572
Distributed Systems and Algorithms....Pages 573-574
A Combinatorial Characterization of Properties Preserved by Antitokens....Pages 575-582
Searching with Mobile Agents in Networks with Liars....Pages 583-590
Complete Exchange Algorithms for Meshes and Tori Using a Systematic Approach....Pages 591-594
Algorithms for Routing AGVs on a Mesh Topology....Pages 595-599
Self-Stabilizing Protocol for Shortest Path Tree for Multi-cast Routing in Mobile Networks....Pages 600-604
Quorum-Based Replication in Asynchronous Crash-Recovery Distributed Systems....Pages 605-608
Timestamping Algorithms: A Characterization and a Few Properties....Pages 609-616
Programming Languages, Models, and Methods....Pages 617-619
HPF vs. SAC — A Case Study....Pages 620-624
Developing a Communication Intensive Application on the EARTH Multithreaded Architecture....Pages 625-637
On the Predictive Quality of BSP-like Cost Functions for NOWs....Pages 638-646
Exploiting Data Locality on Scalable Shared Memory Machines with Data Parallel Programs....Pages 647-657
The Skel-BSP Global Optimizer: Enhancing Performance Portability in Parallel Programming....Pages 658-667
A Theoretical Framework of Data Parallelism and Its Operational Semantics....Pages 668-677
A Pattern Language for Parallel Application Programs....Pages 678-681
Oblivious BSP....Pages 682-685
A Software Architecture for HPC Grid Applications....Pages 686-689
Satin: Efficient Parallel Divide-and-Conquer in Java....Pages 690-699
Implementing Declarative Concurrency in Java....Pages 700-708
Building Distributed Applications Using Multiple, Heterogeneous Environments....Pages 709-717
A Multiprotocol Communication Support for the Global Address Space Programming Model on the IBM SP....Pages 718-728
A Comparison of Concurrent Programming and Cooperative Multithreading....Pages 729-738
The Multi-architecture Performance of the Parallel Functional Language GpH ....Pages 739-743
Novel Models for Or-Parallel Logic Programs: A Performance Analysis....Pages 744-753
Executable Specification Language for Parallel Symbolic Computation....Pages 754-757
Efficient Parallelisation of Recursive Problems Using Constructive Recursion....Pages 758-761
Development of Parallel Algorithms in Data Field Haskell....Pages 762-766
The ParCeL-2 Programming Language....Pages 767-770
Numerical Algorithms for Linear and Non linear Algebra....Pages 771-773
Ahnentafel Indexing into Morton-Ordered Arrays, or Matrix Locality for Free....Pages 774-783
An Efficient Parallel Linear Solver with a Cascadic Conjugate Gradient Method: Experience with Reality....Pages 784-794
A Fast Solver for Convection Diffusion Equations Based on Nested Dissection with Incomplete Elimination....Pages 795-805
Low Communication Parallel Multigrid....Pages 806-814
Parallelizing an Unstructured Grid Generator with a Space-Filling Curve Approach....Pages 815-823
Solving Discrete-Time Periodic Riccati Equations on a Cluster....Pages 824-828
A Parallel Optimization Scheme for Parameter Estimation in Motor Vehicle Dynamics....Pages 829-834
Sliding-Window Compression on the Hypercube....Pages 835-838
A Parallel Implementation of a Potential Reduction Algorithm for Box-Constrained Quadratic Programming....Pages 839-848
European Projects....Pages 849-850
NEPHEW: Applying a Toolset for the Efficient Deployment of a Medical Image Application on SCI—Based Clusters....Pages 851-860
SEEDS: Airport Management Database System....Pages 861-868
HIPERTRANS: High Performance Transport Network Modelling and Simulation....Pages 869-874
Routing and Communication in Interconnection Networks....Pages 875-876
Experimental Evaluation of Hot—Potato Routing Algorithms on 2—Dimensional Processor Arrays....Pages 877-881
Improving the Up*/Down* Routing Scheme for Networks of Workstations....Pages 882-889
Deadlock Avoidance for Wormhole Based Switches....Pages 890-899
An Analytical Model of Adaptive Wormhole Routing with Deadlock Recovery....Pages 900-903
Analysis of Pipelined Circuit Switching in Cube Networks....Pages 904-908
A New Reliability Model for Interconnection Networks....Pages 909-917
A Bandwidth Latency Tradeoff for Broadcast and Reduction....Pages 918-926
Optimal Broadcasting in Even Tori with Dynamic Faults....Pages 927-930
Broadcasting in All-Port Wormhole 3-D Meshes of Trees....Pages 931-934
Probability-Based Fault-Tolerant Routing in Hypercubes....Pages 935-938
Instruction-Level Parallelism and Processor Architecture....Pages 939-939
On the Performance of Fetch Engines Running DSS Workloads....Pages 940-949
Cost-Efficient Branch Target Buffers....Pages 950-959
Two-Level Address Storage and Address Prediction....Pages 960-964
Hashed Addressed Caches for Embedded Pointer Based Codes....Pages 965-968
BitValue Inference: Detecting and Exploiting Narrow Bitwidth Computations....Pages 969-979
General Matrix-Matrix Multiplication Using SIMD Features of the PIII....Pages 980-983
Redundant Arithmetic Optimizations....Pages 984-988
The Decoupled-Style Prefetch Architecture....Pages 989-993
Exploiting Java Bytecode Parallelism by Enhanced POC Folding Model....Pages 994-997
Cache Remapping to Improve the Performance of Tiled Algorithms....Pages 998-1007
Code Partitioning in Decoupled Compilers....Pages 1008-1017
Limits and Graph Structure of Available Instruction-Level Parallelism....Pages 1018-1022
Pseudo-vectorizing Compiler for the SR8000....Pages 1023-1027
Object Oriented Architectures, Tools, and Applications....Pages 1029-1030
Debugging by Remote Reflection....Pages 1031-1038
Compiling Multithreaded Java Bytecode for Distributed Execution....Pages 1039-1052
A More Expressive Monitor for Concurrent Java Programming....Pages 1053-1060
An Object-Oriented Software Framework for Large-Scale Networked Virtual Environments....Pages 1061-1070
TACO — Dynamic Distributed Collections with Templates and Topologies....Pages 1071-1080
Object-Oriented Message-Passing with TPO++....Pages 1081-1084
Architectures and Algorithms for Multimedia Applications....Pages 1085-1085
Design of Multi-dimensional DCT Array Processors for Video Applications....Pages 1086-1094
Design of a Parallel Accelerator for Volume Rendering....Pages 1095-1104
Automated Design of an ASIP for Image Processing Applications....Pages 1105-1109
A Distributed Storage System for a Video-on-Demand Server....Pages 1110-1114
Cluster Computing....Pages 1115-1117
Partition Cast — Modelling and Optimizing the Distribution of Large Data Sets in PC Clusters....Pages 1118-1131
A New Home-Based Software DSM Protocol for SMP Clusters....Pages 1132-1142
Encouraging the Unexpected: Cluster Management for OS and Systems Research....Pages 1143-1147
Flow Control in ServerNet R Clusters....Pages 1148-1156
The WMPI Library Evolution: Experience with MPI Development for Windows Environments....Pages 1157-1164
Implementing Explicit and Implicit Coscheduling in a PVM Environment....Pages 1165-1170
A Jini-Based Prototype Metacomputing Framework....Pages 1171-1174
SKElib : Parallel Programming with Skeletons in C....Pages 1175-1184
Token-Based Read/Write-Locks for Distributed Mutual Exclusion....Pages 1185-1195
On Solving a Problem in Algebraic Geometry by Cluster Computing....Pages 1196-1200
PCI-DDC Application Programming Interface: Performance in User-Level Messaging....Pages 1201-1205
A Clustering Approach for Improving Network Performance in Heterogeneous Systems....Pages 1206-1209
Metacomputing....Pages 1211-1212
Request Sequencing: Optimizing Communication for the Grid....Pages 1213-1222
An Architectural Meta-application Model for Coarse Grained Metacomputing....Pages 1223-1230
Javelin 2.0: Java-Based Parallel Computing on the Internet....Pages 1231-1238
Data Distribution for Parallel CORBA Objects....Pages 1239-1249
Parallel I/O andStor age Technology....Pages 1251-1252
Towards a High-Performance Implementation of MPI-IO on Top of GPFS....Pages 1253-1262
Design and Evaluation of a Compiler-Directed Collective I/O Technique....Pages 1263-1272
Effective File-I/O Bandwidth Benchmark....Pages 1273-1283
Instant Image: Transitive and Cyclical Snapshots in Distributed Storage Volumes....Pages 1284-1291
Scheduling Queries for Tape-Resident Data....Pages 1292-1301
Logging RAID — An Approach to Fast, Reliable, and Low-Cost Disk Arrays....Pages 1302-1311
Problem Solving Environments....Pages 1313-1314
AMANDA - A Distributed System for Aircraft Design....Pages 1315-1322
Problem Solving Environments: Extending the Rôle of Visualization Systems....Pages 1323-1331
An Architecture for Web-Based Interaction and Steering of Adaptive Parallel/Distributed Applications....Pages 1332-1339
Computational Steering in Problem Solving Environments....Pages 1340-1344
Implementing Problem Solving Environments for Computational Science....Pages 1345-1349
Pseudovectorization, SMP, and Message Passing on the Hitachi SR8000-F1....Pages 1351-1361