The 5th International Symposium on High Performance Computing (ISHPC–V) was held in Odaiba, Tokyo, Japan, October 20–22, 2003. The symposium was thoughtfully planned, organized, and supported by the ISHPC Organizing C- mittee and its collaborating organizations. The ISHPC-V program included two keynote speeches, several invited talks, two panel discussions, and technical sessions covering theoretical and applied research topics in high–performance computing and representing both academia and industry. One of the regular sessions highlighted the research results of the ITBL project (IT–based research laboratory, http://www.itbl.riken.go.jp/). ITBL is a Japanese national project started in 2001 with the objective of re- izing a virtual joint research environment using information technology. ITBL aims to connect 100 supercomputers located in main Japanese scienti?c research laboratories via high–speed networks. A total of 58 technical contributions from 11 countries were submitted to ISHPC-V. Each paper received at least three peer reviews. After a thorough evaluation process, the program committee selected 14 regular (12-page) papers for presentation at the symposium. In addition, several other papers with fav- able reviews were recommended for a poster session presentation. They are also included in the proceedings as short (8-page) papers. Theprogramcommitteegaveadistinguishedpaperawardandabeststudent paper award to two of the regular papers. The distinguished paper award was given for “Code and Data Transformations for Improving Shared Cache P- formance on SMT Processors” by Dimitrios S. Nikolopoulos. The best student paper award was given for “Improving Memory Latency Aware Fetch Policies for SMT Processors” by Francisco J. Cazorla.
Author(s): Jack Dongarra (auth.), Alex Veidenbaum, Kazuki Joe, Hideharu Amano, Hideo Aiso (eds.)
Series: Lecture Notes in Computer Science 2858
Edition: 1
Publisher: Springer-Verlag Berlin Heidelberg
Year: 2003
Language: English
Pages: 573
Tags: Programming Techniques; Software Engineering; Algorithm Analysis and Problem Complexity; Mathematics of Computing; Simulation and Modeling; Computational Mathematics and Numerical Analysis
Front Matter....Pages -
High Performance Computing Trends and Self Adapting Numerical Software....Pages 1-9
Kilo-instruction Processors....Pages 10-25
CARE: Overview of an Adaptive Multithreaded Architecture....Pages 26-38
Numerical Simulator III – A Terascale SMP-Cluster System for Aerospace Science and Engineering: Its Design and the Performance Issue....Pages 39-53
Code and Data Transformations for Improving Shared Cache Performance on SMT Processors....Pages 54-69
Improving Memory Latency Aware Fetch Policies for SMT Processors....Pages 70-85
Tolerating Branch Predictor Latency on SMT....Pages 86-98
A Simple Low-Energy Instruction Wakeup Mechanism....Pages 99-112
Power-Performance Trade-Offs in Wide and Clustered VLIW Cores for Numerical Codes....Pages 113-126
Field Array Compression in Data Caches for Dynamically Allocated Recursive Data Structures....Pages 127-145
FIBER: A Generalized Framework for Auto-tuning Software....Pages 146-159
Evaluating Heuristic Scheduling Algorithms for High Performance Parallel Processing....Pages 160-173
Pursuing Laziness for Efficient Implementation of Modern Multithreaded Languages....Pages 174-188
SPEC HPG Benchmarks for Large Systems....Pages 189-201
Distribution-Insensitive Parallel External Sorting on PC Clusters....Pages 202-213
Distributed Genetic Algorithm for Inference of Biological Scale-Free Network Structure....Pages 214-221
Is Cook’s Theorem Correct for DNA-Based Computing?....Pages 222-233
LES of Unstable Combustion in a Gas Turbine Combustor....Pages 234-244
Grid Computing Supporting System on ITBL Project....Pages 245-257
A Visual Resource Integration Environment for Distributed Applications on the ITBL System....Pages 258-268
Development of Remote Visualization and Collaborative Visualization System in ITBL Grid Environment....Pages 269-277
Performance of Network Intrusion Detection Cluster System....Pages 278-287
Constructing a Virtual Laboratory on the Internet: The ITBL Portal....Pages 288-297
Evaluation of High-Speed VPN Using CFD Benchmark....Pages 298-306
The Development of the UPACS CFD Environment....Pages 307-319
Virtual Experiment Platform for Materials Design....Pages 320-329
Ab Initio Study of Hydrogen Hydrate Clathrates for Hydrogen Storage within the ITBL Environment....Pages 330-341
RI2N – Interconnection Network System for Clusters with Wide-Bandwidth and Fault-Tolerancy Based on Multiple Links....Pages 342-351
A Bypass-Sensitive Blocking-Preventing Scheduling Technique for Mesh-Connected Multicomputers....Pages 352-359
Broadcast in a MANET Based on the Beneficial Area....Pages 360-367
An Optimal Method for Coordinated En-route Web Object Caching....Pages 368-375
An Improved Algorithm of Multicast Topology Inference from End-to-End Measurements....Pages 376-384
Chordal Topologies for Interconnection Networks....Pages 385-392
Distributed Location of Shared Resources and Its Application to the Load Sharing Problem in Heterogeneous Distributed Systems....Pages 393-401
Design and Implementation of a Parallel Programming Environment Based on Distributed Shared Arrays....Pages 402-411
Design and Implementation of Parallel Modified PrefixSpan Method....Pages 412-422
Parallel LU-decomposition on Pentium Streaming SIMD Extensions....Pages 423-430
Parallel Matrix Multiplication and LU Factorization on Ethernet-Based Clusters....Pages 431-439
Online Remote Trace Analysis of Parallel Applications on High-Performance Clusters....Pages 440-449
Performance Study of a Whole Genome Comparison Tool on a Hyper-Threading Multiprocessor....Pages 450-457
The GSN Library and FORTRAN Level I/O Benchmarks on the NS-III HPC System....Pages 458-467
Large Scale Structures of Turbulent Shear Flow via DNS....Pages 468-475
Molecular Dynamics Simulation of Prion Protein by Large Scale Cluster Computing....Pages 476-485
OpenMP/MPI Hybrid vs. Flat MPI on the Earth Simulator: Parallel Iterative Solvers for Finite Element Method....Pages 486-499
Performance Evaluation of Low Level Multithreaded BLAS Kernels on Intel Processor Based cc-NUMA Systems....Pages 500-510
Support of Multidimensional Parallelism in the OpenMP Programming Model....Pages 511-522
On the Implementation of OpenMP 2.0 Extensions in the Fujitsu PRIMEPOWER Compiler....Pages 523-528
Improve OpenMP Performance by Extending BARRIER and REDUCTION Constructs....Pages 529-539
OpenMP for Adaptive Master-Slave Message Passing Applications....Pages 540-551
OpenGR: A Directive-Based Grid Programming Environment....Pages 552-563
Back Matter....Pages -