This volume presents the proceedings of the Third Joint International Conference on Vector and Parallel Processing (CONPAR 94 - VAPP VI), held in Linz, Austria in September 1994.
The 76 papers contained were carefully selected from a wealth of submissions and address the most important aspects of parallel processing research. The volume is organized into sections on performance analysis and monitoring, parallel program development, parallel algorithms and complexity models, parallel architectures and abstract machines, parallel languages and compiler technology, networks and routing, and scheduling in distributed memory systems.
Author(s): Kai Hwang (auth.), Bruno Buchberger, Jens Volkert (eds.)
Series: Lecture Notes in Computer Science 854
Edition: 1
Publisher: Springer-Verlag Berlin Heidelberg
Year: 1994
Language: English
Pages: 893
Tags: Processor Architectures; Algorithm Analysis and Problem Complexity; Memory Structures; Computer Communication Networks; System Performance and Evaluation; Programming Techniques
Scalability and programmability of massively parallel processors....Pages 1-4
Language constructs and runtime systems for compositional parallel programming....Pages 5-16
Locality optimizations for parallel machines....Pages 17-28
TAU: A portable parallel program analysis environment for pC++....Pages 29-40
Communication pattern based performance prediction on the nCUBE 2 multiprocessor system....Pages 41-52
Performance oriented development of SPMD programs based on task structure specifications....Pages 53-65
Monitoring for detecting bugs and blocking communication....Pages 66-75
Minimizing the log size for execution replay of shared-memory programs....Pages 76-87
Gold Mine specification language for event-based monitoring....Pages 88-99
On the expressivity of a weakest precondition calculus for a simple data-parallel programming language (short version)....Pages 100-111
Reduction in Pei ....Pages 112-123
Towards the synthesis of asynchronous concurrent programs....Pages 124-135
Formal derivation of SIMD parallelism from non-linear recursive specifications....Pages 136-147
Deriving efficient parallel implementations of algorithms operating on general sparse matrices using automatic program transformation....Pages 148-159
Flexible program and architecture specification for massively parallel systems....Pages 160-171
Systematic approach and software tool for systolic design....Pages 172-183
ADVISE agricultural developmental visualisation interactive software environment....Pages 184-195
TOP 2 tool suite for the development and testing of parallel applications....Pages 196-207
Vectorization of the radix r self-sorting FFT....Pages 208-217
Efficient Gram-Schmidt orthogonalisation on an array processor....Pages 218-228
Parallel algorithm for inverting tridiagonal matrix on linear processor array....Pages 229-240
Systolic algorithms for long integer GCD computation....Pages 241-252
Parallelization of non-simultaneous iterative methods for systems of linear equations....Pages 253-264
Hypercube implementation and performance analysis for extrapolation methods....Pages 265-276
An approach to machine-independent parallel programming....Pages 277-288
On logic program parallelisation strategies....Pages 289-300
Paradigms admitting superunitary behaviour in parallel computation....Pages 301-312
Exact solution of linear systems over rational numbers by parallel p -adic arithmetic....Pages 313-323
Parallel computation of modular multivariate polynomial resultants on a shared memory machine....Pages 325-336
Partial Marking GC....Pages 337-348
Parallel recognition and location algorithms for chordal graphs using distance matrices....Pages 349-358
Comparing two probabilistic models of the computational complexity of the branch and bound algorithm....Pages 359-370
Parallel heap construction using multiple selection....Pages 371-380
Engineering high quality parallel software using PARSE....Pages 381-392
Massively parallel volume rendering....Pages 393-404
Parallel visibility computations for parallel radiosity....Pages 405-413
Memory access synchronization in vector multiprocessors....Pages 414-425
A switch cache design for MIN-based shared-memory multiprocessors....Pages 426-437
Modeling cache coherence overhead with geometric objects....Pages 438-448
Accelerated access to shared distributed arrays on distributed memory systems by access objects....Pages 449-460
A comparison of shared virtual memory and message passing programming techniques based on a finite element application....Pages 461-472
Comparison between a message passing and a global shared memory parallel computer by means of a concrete Monte-Carlo algorithm....Pages 473-484
Non homogenous parallel memory operations in a VLIW machine....Pages 485-496
Experience with the implementation of a concurrent graph reduction system on an nCUBE/2 platform....Pages 497-508
The Rewrite Rule Machine node architecture and its performance....Pages 509-520
Automatic spark strategies and granularity for a parallel functional language reducer....Pages 521-532
Virtual tasks for the PACLIB kernel....Pages 533-544
MUSE: a message passing concurrent computer for on-board space systems....Pages 545-556
Self-timed communication strategies for massively parallel systolic architectures....Pages 557-567
VIM language paradigm....Pages 569-580
Distributed Linda tuplespace algorithms and implementations....Pages 581-592
Supporting coarse and fine grain parallelism in an extension of ML....Pages 593-604
Towards mixed computation/communication in parallel scientific libraries....Pages 605-615
Composing functional unit blocks for efficient Interpretation of MIMD code sequences on SIMD processors....Pages 616-627
Using sacks to organize registers in VLIW machines....Pages 628-639
A framework for resource-constrained rate-optimal software pipelining....Pages 640-651
Design of a meta-parallelizer for large scientific applications....Pages 652-663
A software architecture for multidisciplinary applications: Integrating task and data parallelism....Pages 664-676
On scanning space-time mapped while loops....Pages 677-688
Optimizing communications by using compile time analysis....Pages 689-700
Optimal tile sizing....Pages 701-712
Evaluating array expressions on massively parallel machines with communication/computation overlap....Pages 713-724
User interaction and symbolic extensions to dependence analysis....Pages 725-736
Simplifying polynomial constraints over integers to make dependence analysis more precise....Pages 737-748
Asymptotically optimal communication algorithms on faulty folded petersen networks....Pages 749-760
Optimization of diagnostic examination....Pages 761-772
Laying out midimew networks with constant dilation....Pages 773-784
Optimal interval routing....Pages 785-796
Broadcasting in bus interconnection networks....Pages 797-807
Broadcasting in the arrowhead torus....Pages 808-819
An efficient mapping heuristic for mesh-connected parallel architectures based on Mean Field Annealing....Pages 820-831
Mapping and scheduling by genetic algorithms....Pages 832-841
Self-organizing processes....Pages 842-853
Balancing load under large and fast load changes in distributed computing systems — A case study....Pages 854-865
Dynamic load balancing mechanisms for a parallel operating system kernel....Pages 866-877
A new load-prediction scheme based on algorithmic cost functions....Pages 878-889