the attached PyOpenCL example demonstrates an issue with the below version of the Intel CPU CL runtime, observed on Intel Sandy Bridge and AMD Piledriver. Specifically, the assertion at the end of the code ("the bounding box is found correctly") is violated when running the example with the 2013 runtime. Everything is fine when using the 2012 runtime or the AMD runtime. I'm aware that this is not very efficient code for a CPU device, but I claim that it is correct.
SDK Version: intel_sdk_for_ocl_applications_2013_xe_beta_sdk_3.0.56860_x64