This will be the final post in my planned short vectorization series. Although I reserve the right to post more on vectorization in the future!
In my last blog, I introduced the concept of vectorization, which is parallelism across data elements in a regi
One of my performance focus areas for this year is vectorization.
Any parent knows the simple rule: "Never help a child with a task he can succeed at himself. Otherwise you don't make any good for the kid, for you and for the whole planet".
I recently had a question from a customer who had introduced a succesful optimization to a hot function in his application, but did not see as much improvement in the overall application as he expe
This blog contains additional content for the article "Advanced Vectorization" from Parallel Universe #12:
The upcoming OpenMP 4.0 will be discussed at SC12, and there wil
It is only a few weeks until you will get a chance to get your hands on the 4th Generation Intel® Core&tm; Processor Family
I was hoping to write a brief two part overview of how to configure the various power settings for the Intel® Xeon Phi™ coprocessor.
This is part of a series of blogs on Embree, a collection of high performance ray tracing kernels. Embree has been released open source since version 1.0.