I just downloaded MKL. Browsing the online documentation, I see that the performance of some VML functions (atan2 for instance) is significantly slower (even a factor of 10, or 20, or worse) on Intel Xeon EM64T than on other architectures (see www.intel.com/software/products/mkl/data/vml/functions/atan2.html).
- Does that simply mean that some of the VML functions have not been optimized for that architecture yet?
- What would a Pentium D be on that performance table? A P4 with SSE3 or a Xeon with EM64T?