Small complex matrix performance issue

Small complex matrix performance issue

There seems to be some kind of significant setup overhead with MKL routines. When doing operations on "small" (6x6) double precision complex matrices (zgemm), MKL performs somewhat worse than "regular" LAPACK/BLAS. MKL pulls ahead with larger matrices.
Unfortunately, though it is possible to "hand-code" an approach for these 6x6 matrices, a "naive" matrix multiply loop is much slower than using zgemm, MKL or not.


3 posts / 0 new
Last post
For more complete information about compiler optimizations, see our Optimization Notice.

MKL ZGEMM performance for complex data types would not be outstanding, unless you have a recent release, with an SSE3 specific version. Even then, for such a small matrix, it may be possible to write an in-line version which is faster. If you are able to use the MKL 8.0, with the version optimized for your processor, and you still find this slowdown, please submit a problem report on

I should have said I am using MKL 8.0 on a Pentium D

Leave a Comment

Please sign in to add a comment. Not a member? Join today