I am curious about the CPU implementation of OpenCL for Intel processors.
I ran a small set of benchmarks from clpeak on an i7-4770S (4 cores, Hyper-Threading enabled) under Linux.
According to top, CPU utilization reaches almost 800% during the full run, meaning all CPU resources are utilized.
However, when I run each clpeak benchmark individually, top shows a maximum of 400%.
So running the benchmarks consecutively seems to get more out of the OpenCL runtime than running them one at a time.
Does that mean that when a workload is issued to the OpenCL CPU runtime, it will not use all of the cores, but only some of them?
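
For reference, this is how I am reading the top figures. It is only a rough sketch, assuming top is in its default mode where 100% corresponds to one fully busy logical CPU. The helper names `busy_logical_cpus` and `fraction_of_machine` are my own and are not part of clpeak or OpenCL.

```python
# Assumption: top reports 100% per fully busy logical CPU (its default mode).
LOGICAL_CPUS = 8  # i7-4770S: 4 physical cores x 2 hardware threads


def busy_logical_cpus(top_percent: float) -> float:
    """Convert a top %CPU reading into an equivalent number of busy logical CPUs."""
    return top_percent / 100.0


def fraction_of_machine(top_percent: float, logical_cpus: int = LOGICAL_CPUS) -> float:
    """Fraction of all logical CPUs that the reading corresponds to."""
    return busy_logical_cpus(top_percent) / logical_cpus


if __name__ == "__main__":
    # The two readings described above.
    for label, pct in [("full clpeak run", 800.0), ("single benchmark", 400.0)]:
        print(label, busy_logical_cpus(pct), fraction_of_machine(pct))
```

By this reading, 400% would correspond to half of the logical CPUs being busy, which could be either four hardware threads or one thread on each of the four physical cores; top alone does not tell me which.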