| Sign In to gain access to subscriptions and/or personal tools. |
Static LU Decomposition on Heterogeneous PlatformsLIP, UMR CNRSENS LyonINRIA 5668, Ecole Normale Supérieure de Lyon, France
LIP, UMR CNRSENS LyonINRIA 5668, Ecole Normale Supérieure de Lyon, France
LIP, UMR CNRSENS LyonINRIA 5668, Ecole Normale Supérieure de Lyon, France
LIP, UMR CNRSENS LyonINRIA 5668, Ecole Normale Supérieure de Lyon, France In this paper, the authors deal with algorithmic issues on heterogeneous platforms. They concentrate on dense linear algebra kernels, such as matrix multiplication or LU decomposition. Block-cyclic distribution techniques used in ScaLAPACK are no longer sufficient to balance the load among processors running at different speeds. The main result of this paper is to provide a static data distribution scheme that leads to an asymptotically perfect load balancing for LU decomposition, thereby providing solid foundations toward the design of a cluster-oriented version of ScaLAPACK.
International Journal of High Performance Computing Applications, Vol. 15, No. 3,
310-323 (2001) |
|||