The code used in examples (Chapters 2-4) in our book can be downloaded from the book's website. We appreciate attribution, but there are no restrictions on use of the code - please use and enjoy! You can use the step by step instructions in the book or if you prefer we've included a Makefile for each of the chapter examples to make life a little easier.
OpenMP on the Intel® Xeon Phi™ coprocessor performs as well as on Intel® Xeon processors. However, the slower clock on the Intel Xeon Phi coprocessor and the sheer number of threads accentuates OpenMP overhead. In most cases, the problem is either load imbalance or a significant amount of serial execution and is rarely the overhead itself.
Let’s take a look at the following Intel Vtune screenshot.