I have for evaluation purposes downloaded the vector math libray, where I am particularly interested in exp and log for vectors of length about 20. From the information I found on the Intel website I had expected considerable speed-up but achieved only marginal results compared to the conventional scalar compiler library functions (Intel Fortran v. 9.0) when the compiler was using SSE2 instructions (option -QxW). Can this be correct, or am I missing something?
I do get a significant speedup compared to compilation to default P4.
Vector math library speedup