Optimizing memory accesses is the first step toward achieving high performance with OpenCL* on Intel® Processor Graphics. Tune your kernel to access memory at an optimal granularity and with optimal addresses.
The OpenCL* implementation for Intel® Processor Graphics primarily accesses memory through the following caches:
- GPU-specific L3 cache
- Last Level Cache (LLC), shared by the CPU and GPU
The L1 and L2 caches are specific to the sampler and the renderer.
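To make "optimal granularity and optimal addresses" more concrete, the following is a minimal host-side sketch in plain C (not OpenCL* kernel code) that counts how many distinct cache lines a group of work-items would touch for a given access pattern. The function name `cache_lines_touched`, its parameters, and the 64-byte line size used in the usage note are illustrative assumptions, not values or APIs taken from this guide. Check the actual cache-line size of your device before drawing conclusions.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper (not part of any OpenCL API).
 * Work-item i is assumed to read elem_bytes bytes starting at byte offset
 * base + i * stride_bytes. Because the offsets never decrease as i grows,
 * a cache line is new exactly when its index exceeds the last one counted. */
size_t cache_lines_touched(size_t base, size_t num_items, size_t elem_bytes,
                           size_t stride_bytes, size_t line_bytes)
{
    assert(line_bytes > 0 && elem_bytes > 0);

    size_t count = 0;      /* distinct cache lines seen so far */
    size_t last_line = 0;  /* index of the most recently counted line */
    int have_last = 0;     /* becomes 1 once the first line is counted */

    for (size_t i = 0; i < num_items; ++i) {
        size_t start = base + i * stride_bytes;
        size_t first = start / line_bytes;
        size_t last = (start + elem_bytes - 1) / line_bytes;

        for (size_t line = first; line <= last; ++line) {
            if (!have_last || line > last_line) {
                ++count;
                last_line = line;
                have_last = 1;
            }
        }
    }
    return count;
}
```

Under the assumed 64-byte line, 16 work-items reading consecutive 4-byte values from a line-aligned base touch a single cache line. The same reads from a base offset of 32 bytes straddle two lines, and a 64-byte stride between work-items touches 16 lines to fetch the same amount of useful data. This is the kind of difference that alignment and access granularity are meant to control.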