Hi All.
Since vectorization in one way to achieve parallelization of code withina section of code through Compiler's directives within SMP system.
It seemsauto-vectorization is not infallible as in many cases the compiler can't prove independence of statements, so code stays scalar, or even if the compiler appears to have vectorisedthe code - it might still under-exploit some hardware features and hence generate less efficient code than the programmer would have written using vector intrinsics.
Could anyone explain what arekey Compilers limitations for executing efficient vectorization code which makes a sinerio for SIMD programming totake it's importance?
Any papers or publications which discusses above thoughts?
~BR



