Hello,I am wondering if the runtime OpenCL consumes a significant CPU time? I mean, I am using the HD4000 to free the CPU of some computation which represents only 1-2% of the CPU usage of my process.But in the end, the performance is worst. And I measure almost the same when I just call my setup_openCL() function and do not process this small computation at all.Does it mean there is some kind of treshold in term of computation, below which one it's not efficient to use the GPU with OpenCL? (Memory transfert is not the problem here). Is there any way to fix this? (Besides doing more computation on the GPU).Thank you,Chris.
For more complete information about compiler optimizations, see our Optimization Notice.