This volume presents the proceedings of the joint meeting CONPAR 92 - VAPP V, held in Lyon, France, September 1992. The international Conferences on Parallel Processing (CONPAR) and the meetings on Vector and Parallel Processors in computational science (VAPP) have been held jointly since CONPAR 90 - VAPP IV, held in Zurich. The aim of the meeting presented in this volume is to review hardware and architecture developmentstogether with languages and software tools for supporting parallel processing and to highlight advances in models, algorithms, andapplications software on vector and parallel architectures. The papers in the volume are organized into sections on networks, software tools, distributed algorithms, dedicated architectures, numerical applications, systolic algorithms, parallel linear algebra, architectures, shared virtual memory, load balancing, data parallelism, parallel algorithms, image processing, compiling and scheduling, simulation and performance analysis, parallel artificialintelligence, dataflow architectures, parallel programming, and poster presentations.
Author(s): Sandeep N. Bhatt, Fan R. K. Chung (auth.), Luc Bougé, Michel Cosnard, Yves Robert, Denis Trystram (eds.)
Series: Lecture Notes in Computer Science 634
Edition: 1
Publisher: Springer-Verlag Berlin Heidelberg
Year: 1992
Language: English
Pages: 853
Tags: Processor Architectures; Algorithm Analysis and Problem Complexity; System Performance and Evaluation; Programming Techniques; Software Engineering; Operating Systems
Tolerating faults in synchronization networks....Pages 1-12
On incomplete hypercubes....Pages 13-24
Reducing network hardware quantity by employing multi-processor cluster structure in distributed memory parallel processors....Pages 25-30
Connection Machine results for pyramid embedding algorithms....Pages 31-36
Interconnection networks based on block designs....Pages 37-42
Partitioning and mapping communication graphs on a modular reconfigurable parallel architecture....Pages 43-48
Generalized shuffle-exchange networks....Pages 49-54
Execution replay: A mechanism for integrating a visualization tool with a symbolic debugger....Pages 55-66
The software-monitor DELTA-T and its use for performance measurements of some farming variants on the multi-transputer system DAMP....Pages 67-78
Visualization of message passing parallel programs....Pages 79-90
Parallel physical optimization algorithms for data mapping....Pages 91-96
Profiling on a massively parallel computer....Pages 97-102
A multiprocessor multiwindow visualization subsystem....Pages 103-108
Data race detection based on execution replay for parallel applications....Pages 109-114
The C_NET programming environment: An overview....Pages 115-120
P++, a C++ virtual shared grids based programming environment for architecture-independent development of structured grid applications....Pages 121-126
Detection of concurrency-related errors in Joyce....Pages 127-132
Analysis of an efficient distributed algorithm for mutual exclusion (average-case analysis of path reversal)....Pages 133-144
Invariance properties in distributed systems....Pages 145-156
Synchronization of parallel processes in distributed systems....Pages 157-162
Statistical probabilistic clock synchronization algorithm....Pages 163-168
A SIMD architecture for medical imaging....Pages 169-180
Computing the inner product on reconfigurable buses with shift switching....Pages 181-192
A novel sorting array processor....Pages 193-204
The time-parallel solution of parabolic partial differential equations using the frequency-filtering method....Pages 205-216
The combination technique for parallel sparse-grid-preconditioning or -solution of PDE's on workstation networks....Pages 217-228
Comparing the DAP, meiko and suprenum with a fluid dynamic benchmark....Pages 229-240
Parallel detection algorithm of radar signals....Pages 241-246
Efficient linear systolic array for the knapsack problem....Pages 247-258
On the loading, recovery and access of stationary data in systolic arrays....Pages 259-264
Designing modular linear systolic arrays using dependence graph regular partitions....Pages 265-270
Reducing symmetric banded matrices to tridiagonal form — A comparison of a new parallel algorithm with two serial algorithms on the iPSC/860....Pages 271-282
An implementation of the BLAS on the i860: A RISC approach to software for RISC devices....Pages 283-294
Partitioning and mapping for parallel nested dissection on distributed memory architectures....Pages 295-306
On the accuracy of solving triangular systems in parallel-III....Pages 307-318
Linear algebra calculations on the BBN TC2000....Pages 319-330
Parallel homotopy algorithm for large sparse generalized eigenvalue problems: Application to hydrodynamic stability analysis....Pages 331-342
Parallel algorithms for solving linear recurrence systems....Pages 343-348
A new parallel factorization A = DD t BC for band symmetric positive definite matrices....Pages 349-354
Computation of the eigenvalues of real symmetric matrices using a processor farm....Pages 355-360
Numerical performance of an asynchronous Jacobi iteration....Pages 361-366
Matrix inversion algorithm for linear array processor....Pages 367-372
Massively parallel preconditioners for the sparse conjugate gradient method....Pages 373-378
A block algorithm for orthogonalization in elliptic norms....Pages 379-385
Asynchronous polycyclic architecture....Pages 387-398
A comparison of two memory models for high performance computers....Pages 399-404
A decoupled multicomputer architecture with optical full interconnection....Pages 405-410
Very high speed vectorial processors using serial multiport memory as data memory....Pages 411-416
Resource management on a packet-based parallel graph reduction machine....Pages 417-422
A large context multithreaded architecture....Pages 423-428
On the practical efficiency of Randomized Shared Memory....Pages 429-440
KOAN: A Shared Virtual Memory for the iPSC/2 hypercube....Pages 441-452
A scalable distributed shared memory....Pages 453-466
Cost prediction for load-balancing: Application to algebraic computations....Pages 467-478
Envelopes in adaptive local queues for MIMD load balancing....Pages 479-484
Dynamo — A portable tool for dynamic load balancing on distributed memory multicomputers....Pages 485-490
Semantics and compilation of the data-parallel switch statement....Pages 491-502
κ-Project/first step: To improve data manipulations and representations on parallel computers....Pages 503-508
F-Code: A portable software platform for data-parallel languages....Pages 509-514
A parallel best-first B & B with synchronization phases....Pages 515-526
On tests of uniform tree circuits....Pages 527-538
Improved parallel sorting of presorted sequences....Pages 539-544
MIMD dictionary machines: From theory to practice....Pages 545-550
Parallel searching for 3D-objects....Pages 551-556
Parallel Gcd and lattice basis reduction....Pages 557-564
Parallel minimum spanning forest algorithms on the star and pancake interconnection networks....Pages 565-570
Parallel algorithms for the distance transformation....Pages 571-582
A multiprocessor system for displaying quadric CSG models....Pages 583-588
Efficient image computations on reconfigurable meshes (preliminary version)....Pages 589-594
A multiscale parallel thinning algorithm....Pages 595-600
Static scheduling of parallel programs for message passing architectures....Pages 601-612
A polynomial time method for optimal software pipelining....Pages 613-624
Scheduling loops on parallel processors: A simple algorithm with close to optimum performance....Pages 625-636
Automatic parallelization of grid-based applications for the iPSC/860....Pages 637-645
Semantic analysis for parallelizing C....Pages 647-652
A task scheduling algorithm for the parallel expression evaluation in a reconfigurable fully digit on-line network....Pages 653-658
Unimodularity considered non-essential (extended abstract)....Pages 659-664
PEPSIM-ST: A simulator tool for benchmarking....Pages 665-676
Performance modeling of sparse matrix methods for distributed memory architectures....Pages 677-688
ParSim: A tool for the analysis of parallel and distributed programs....Pages 689-700
Estimating the effective performance of program parallelization on shared memory MIMD multiprocessors....Pages 701-706
On the simulation of pipelining of fully digit on-line floating-point adder networks on massively parallel computers....Pages 707-712
A parallel expert system using a backward chaining strategy....Pages 713-718
Optimal speedup conditions for a parallel back-propagation algorithm....Pages 719-724
A time-bounded binding access with low task-switching cost in an OR-parallel prolog....Pages 725-730
Threads and subinstruction level parallelism in a data flow architecture....Pages 731-736
A hybrid dataflow architecture with multiple tokens....Pages 737-742
Parallelism control scheme in a dataflow architecture....Pages 743-748
A Petri-Net-based method to the design of parallel programs for a multiprocessor system....Pages 749-754
A parallel structure for static iterative transformation algorithms....Pages 755-760
The construction of numerical mathematical software for the AMT DAP by program transformation....Pages 761-767
On using object oriented parallel programming to build distributed algebraic abstractions....Pages 769-774
Implementation of the self-organizing feature map on parallel computers....Pages 775-776
Candela — A topology description language....Pages 777-778
Parallelism in generation of sequences for Monte Carlo simulation....Pages 779-780
Data parallel evaluation-interpolation algorithm for solving functional matrix equations....Pages 781-782
Dynamic tree embeddings into de Bruijn graphs....Pages 783-784
Memory access in shared virtual memory....Pages 785-786
Parallel implementations of Jacobi's algorithm for the eigensolution of large matrices using array processors....Pages 787-788
A parallel architecture for a VLSI-hardware-realization of a numerical stable variant of the Simplex-Method....Pages 789-790
Generating physical channel access protocol data unit of CCSDS recommendation by use of the transputers and OCCAM....Pages 791-792
Manifold: Concepts and implementation....Pages 793-794
How to compile systems of recurrence equations into networks of communicating processes....Pages 795-796
Parallel algorithms simulation applied to differential — Algebraic systems, and electrical network application....Pages 797-798
Optimal performances and scheduling for parallel algorithms with equal cost tasks....Pages 799-800
Load balancing in a neighbourhood-based multiprocessor....Pages 801-802
A DAP-based implementation of a portable parallel image processing machine....Pages 803-804
The design of a parallel algorithm to solve the word problem for free partially commutative groups....Pages 805-806
A VLSI multigrid poisson solver amenable to biharmonic equation....Pages 807-808
Monochrome image coding using hierarchical closed loop vector quantizer on a multiprocessing system....Pages 809-810
Superscalar floating-point vector computation in scheme....Pages 811-812
An efficient parallel algorithm for maximal matching....Pages 813-814
Minimal, adaptive and deadlock-free routing for multiprocessors....Pages 815-816
A theoretical study of reconfigurability for numerical algorithms on a reconfigurable network....Pages 817-818
Multiprocessor simulation using object-oriented programming....Pages 819-820
Real-number codes for fault-tolerant matrix inversion on processor arrays....Pages 821-822
Automatic mapping of parallel programs onto processor networks....Pages 823-824
Dynamic allocation on the transputer network....Pages 825-826
Block implementations of the symmetric QR and Jacobi algorithms....Pages 827-828
An improved parallel algorithm for the solution of molecular dynamics problems on MIMD multiprocessors....Pages 829-830
Computational models for image processing: Towards a concise parallel language....Pages 831-832
Broadcasting in faulty Cube-Connected-Cycles with minimum recovery time....Pages 833-834
Parallel robot motion planning in a dynamic environment....Pages 835-836
A hypergraph-based interconnection network for large multicomputers....Pages 837-838
Greedy permutation routing on Cayley graphs....Pages 839-840
Experimenting upon the CG algorithm within the C_NET programming environment....Pages 841-842
An algorithm of broadcasting in the mesh of trees....Pages 843-844
Complexity estimation in the pips parallel programming environment....Pages 845-846
A graphical Petri net based editor for a visualization of distributed and parallel systems....Pages 847-848
Automatic parallelization of divide- and-conquer algorithms....Pages 849-850