I would like to know how IPP routines are coded.
I am especially interested in the FFT routine.
The reason is FFT consists of mainly complex multiplications.
My experience with Intel C++ compiler is that it cannot vectorize
complex operations properly.
The performance level of FFT on a 2.8C GHz P4 is about 6GFlops.
Now it is impossible that this CPU reaches that performance without full vectorization.
Since the Intel C++ compiler cannot vectorize this code efficiently, I conclude that IPP FFT is coded in assembly.
Can somebody confirm this?
If yes do we have access to the FFT code?