I've found in my benchmark that compared with MKL 7.2, in the release 10.1.1.019 for Linux, cblas_dgemm slows down a lot when matrix size is amall and beta is set to 0 on Pentium 4 machine.
Is this a known issue? Has it been fixed?
Thanks a lot!
cblas_dgemm slows down a lot for Linux on Pentium 4 machine
Jue, 26/11/2009 - 22:29