We need to use both the CPU and GPU of our Ivy Bridge processor at the same time. However, in Intel OpenCL SDK, the kernel launch call is not asynchronous. In other words, clEnqueueNDRangeKernel waits for the GPU execution to be completed, which is against the OpenCL standard. We have tried the latest Intel OpenCL SDKs (2012 and 2013). Is there a workaround?