Abstract: A modification of standard design procedure for mapping nested loop algorithms into systolic arrays is described in this article. This modification enables us to obtain non--planar systolic arrays for matrix multiplication with optimal number of processing elements for a given problem size. The modification is based on composition of two linear mappings.
Key words Systolic arrays, matrix multiplication.