This book constitutes the refereed proceedings of the 16th International Euro-Par Conference held in Ischia, Italy, in August/September 2010. The 90 revised full papers presented were carefully reviewed and selected from 256 submissions. The papers are organized in topical sections on support tools and environments; performance prediction and evaluation; scheduling and load-balancing; high performance architectures and compilers; parallel and distributed data management; grid, cluster and cloud computing; peer to peer computing; distributed systems and algorithms; parallel and distributed programming; parallel numerical algorithms; multicore and manycore programming; theory and algorithms for parallel computation; high performance networks; and mobile and ubiquitous computing.
Author(s): Thilo Kielmann, Andrea Clematis, Sergei Gorlatch, Alexey Lastovetsky (auth.), Pasqua D’Ambra, Mario Guarracino, Domenico Talia (eds.)
Series: Lecture Notes in Computer Science 6272 : Theoretical Computer Science and General Issues
Edition: 1
Publisher: Springer-Verlag Berlin Heidelberg
Year: 2010
Language: English
Pages: 544
Tags: System Performance and Evaluation; Algorithm Analysis and Problem Complexity; Programming Techniques; Numeric Computing; Processor Architectures; Performance and Reliability
Front Matter....Pages -
Parallel and Distributed Programming....Pages 1-1
Transactional Mutex Locks....Pages 2-13
Exceptions for Algorithmic Skeletons....Pages 14-25
Generators-of-Generators Library with Optimization Capabilities in Fortress....Pages 26-37
User Transparent Task Parallel Multimedia Content Analysis....Pages 38-50
Parallel Simulation for Parameter Estimation of Optical Tissue Properties....Pages 51-62
Parallel Numerical Algorithms....Pages 63-64
Scalability and Locality of Extrapolation Methods for Distributed-Memory Architectures....Pages 65-76
CFD Parallel Simulation Using Getfem++ and Mumps....Pages 77-88
Aggregation AMG for Distributed Systems Suffering from Large Message Numbers....Pages 89-100
A Parallel Implementation of the Jacobi-Davidson Eigensolver and Its Application in a Plasma Turbulence Code....Pages 101-112
Scheduling Parallel Eigenvalue Computations in a Quantum Chemistry Code....Pages 113-124
Scalable Parallelization Strategies to Accelerate NuFFT Data Translation on Multicores....Pages 125-136
Multicore and Manycore Programming....Pages 137-138
JavaSymphony: A Programming and Execution Environment for Parallel and Distributed Many-Core Architectures....Pages 139-150
Scalable Producer-Consumer Pools Based on Elimination-Diffraction Trees....Pages 151-162
Productivity and Performance: Improving Consumability of Hardware Transactional Memory through a Real-World Case Study....Pages 163-174
Exploiting Fine-Grained Parallelism on Cell Processors....Pages 175-186
Optimized On-Chip-Pipelined Mergesort on the Cell/B.E.....Pages 187-198
Near-Optimal Placement of MPI Processes on Hierarchical NUMA Architectures....Pages 199-210
Parallel Enumeration of Shortest Lattice Vectors....Pages 211-222
A Parallel GPU Algorithm for Mutual Information Based 3D Nonrigid Image Registration....Pages 223-234
Multi-GPU and Multi-CPU Parallelization for Interactive Physics Simulations....Pages 235-246
Long DNA Sequence Comparison on Multicore Architectures....Pages 247-259
Adaptive Fault Tolerance for Many-Core Based Space-Borne Computing....Pages 260-274
Maestro: Data Orchestration and Tuning for OpenCL Devices....Pages 275-286
Multithreaded Geant4: Semi-automatic Transformation into Scalable Thread-Parallel Software....Pages 287-303
Parallel Exact Time Series Motif Discovery....Pages 304-315
Optimized Dense Matrix Multiplication on a Many-Core Architecture....Pages 316-327
A Language-Based Tuning Mechanism for Task and Pipeline Parallelism....Pages 328-340
A Study of a Software Cache Implementation of the OpenMP Memory Model for Multicore and Manycore Architectures....Pages 341-352
Programming CUDA-Based GPUs to Simulate Two-Layer Shallow Water Flows....Pages 353-364
Theory and Algorithms for Parallel Computation....Pages 365-366
Analysis of Multi-Organization Scheduling Algorithms....Pages 367-379
Area-Maximizing Schedules for Series-Parallel DAGs....Pages 380-392
Parallel Selection by Regular Sampling....Pages 393-399
Ants in Parking Lots....Pages 400-411
High Performance Networks....Pages 412-412
An Efficient Strategy for Reducing Head-of-Line Blocking in Fat-Trees....Pages 413-427
A First Approach to King Topologies for On-Chip Networks....Pages 428-439
Optimizing Matrix Transpose on Torus Interconnects....Pages 440-451
Mobile and Ubiquitous Computing....Pages 452-453
cTrust: Trust Aggregation in Cyclic Mobile Ad Hoc Networks....Pages 454-465
Maximizing Growth Codes Utility in Large-Scale Wireless Sensor Networks....Pages 466-477
@Flood: Auto-Tunable Flooding for Wireless Ad Hoc Networks....Pages 478-489
On Deploying Tree Structured Agent Applications in Networked Embedded Systems....Pages 490-502
Meaningful Metrics for Evaluating Eventual Consistency....Pages 503-515
Caching Dynamic Information in Vehicular Ad Hoc Networks....Pages 516-527
Collaborative Cellular-Based Location System....Pages 528-539
Back Matter....Pages -