One of my performance focus areas for this year is vectorization.
I've known this day was coming - but when I saw Knights Corner clearly sustaining a TeraFlop (DGEMM, wide range of block sizes) per second - I was surprised by my emotional reaction inside.
Real results for many-core processors illustrate the power of a familiar configuration (SMP) even when reduced to a single chip.
How does a high performance SMP on-a-chip sound to you?
Knights Corner: Open source software stack
This article focuses on aspects of porting Fortran codes to the Intel® Xeon Phi™ coprocessor. Most of the documentation for the coprocessor is C/C++ centric.
I had an interesting question come across my desk a few days ago: “Is it still worthwhile to understand T-states?” My first response was to think, “Huh? What the heck is a T-state?”
Applying Intel® Threading Building Blocks Observers for Thread Affinity on Intel® Xeon Phi™ Coprocessors
In spite of the fact that the Intel® Threading Building Blocks (Intel® TBB) library   provides high-level task based parallelism intended to hide sof
I was hoping to write a brief two part overview of how to configure the various power settings for the Intel® Xeon Phi™ coprocessor.