Sparse Cholesky factorization on a local-memory multiprocessor

Abstract
The problem of factoring a large sparse positive definite matrix is implemented on a multiprocessor system. The processors are assumed to have substantial local memory but no globally shared memory. They communicate among themselves and with a host processor through message passing. The primary interest is in designing an algorithm which exploits parallelism, rather than in exploiting features of the underlying topology of the hardware. However, part of the study is aimed at determining, for certain sparse matrix problems, whether hardware based on the binary hypercube topology adequately supports the communication requirements for such problems. Numerical results from experiments running on a multiprocessor simulator are included. 20 refs., 7 figs., 5 tabs.