SSE
GDC 2010
Identifying JVM SIMD and SSE Usage with the VTune™ Performance Analyzer
12,000 IPP Functions! Where to begin? (part 1)
Your first encounter with the Intel® IPP library can be overwhelming, due to the number of functions contained within. At Intel we sometimes even "proudly" make statements like "over 12,000 functions in 16 domains" in our marketing literature!
Don't be overwhelmed by such chest-beating marketing statements! (Not that many engineers ever would take them seriously.)
如何使用SSE指令提高FIR算法效率
好的,言归正传!一般的FIR算法如下:

<!--[if !supportEmptyParas]--> <!--[endif]-->
那么如何使用SSE指令来优化它呢?
SSE的指令在于它使用了向量计算,极大的提高了运算速度,所以在优化之前,我们首先要对原有的c代码进行向量构造。因为水平有限,构造了一个粗糙向量(如果有那位网友能够指导我进行优化,我将万分感谢!)。
这里假设相关的所有长度都是16个字节的倍数。

<!--[if !supportEmptyParas]--> <!--[endif]-->
JPEG Bug Fix Details
Threading and the Intel® IPP Library – part 3 of 3
OpenMP Threading and Intel IPP
The low-level primitives within the IPP library generally represent basic atomic operations. This limits threading within the library to ~15-20% of the primitives. Intel OpenMP is used to implement internal threading and is enabled, by default, when you use one of the multi-threaded variants of the library. Multi-threaded versions of the library are only supported on Linux, Windows, and Mac OS X.
Threading and the Intel® IPP Library – part 2 of 3
Threading Choices for Your Intel IPP Application
Source code for some multi-threaded IPP application examples are included in the free sample downloads. Several of these examples implement threading at the application level, and some use the OpenMP* threading that is built into the Intel IPP library. In most cases the performance gains due to multi-threading is substantial.
Threading and the Intel® IPP Library - part 1 of 3
Introduction to Threading in IPP
