This will be the final post in my planned short vectorization series. Although I reserve the right to post more on vectorization in the future!
In my last blog, I introduced the concept of vectorization, which is parallelism across data ele
One of my performance focus areas for this year is vectorization.
In the previous blog, I explained two mini-kernels, the scatter and gather, which can be written up quickly and still have the benefits of compiler vectorization with Array Notations.
It’s been a busy month preparing for SuperComputing ‘09 and
I recently had a question from a customer who had introduced a succesful optimization to a hot function in his application, but did not see as much improvement in the overall application as he expe
Today Intel announced Intel® Parallel Studio XE 2013 (available immediately) and
The upcoming OpenMP 4.0 will be discussed at SC12, and there wil
A Parallel Programming training opportunity (Intel® Xeon® processors and Intel® Xeon Phi™ coprocessors)
SC12 is underway, and the opening gala is tonight. Drop by the Intel booth at the opening (7pm) and check out the amazing space and see what new things Intel has to share!