A couple of back-to-back opportunities to see great talks about harnessing lots of cores, and to give talks about programming options and why we do not need to give up on programmability in our quest
Improving the Compute Performance of Video Processing Software Using AVX (Advanced Vector Extensions) Instructions
This paper describes a case study in which AVX instructions are used to enhance the performance of a de-saturation algorithm (a common video filter). The case study takes the algorithm from a non-SIMD state to AVX-based SIMD.
by Jim Dempsey
Cache Blocking Techniques Overview