Developer Guide for Intel® SDK for OpenCL™ Applications
The Deep Kernel Analysis feature of the standalone version of the Intel® SDK for OpenCL™ Applications collects profiling data for OpenCL™ kernels running on Intel® Processor Graphics. The data includes:
- Exact kernel runtime for each execution unit and hardware thread (in GPU cycles).
- Exact execution time for selected OpenCL code lines (in GPU cycles).
- Execution unit occupancy and hardware thread utilization over the course of the execution.
Deep Kernel Analysis builds on the automatic host application feature, so you only need to write an OpenCL kernel, assign variables to its arguments, and define the global and local group sizes. Optionally, mark specific OpenCL code lines as IL profiling points. Then click the Deep Analysis button to run the analysis.
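As a rough illustration of how the global and local group sizes relate, the sketch below computes the number of work-groups along one dimension. It assumes the OpenCL 1.x rule that the global size must be an exact multiple of the local size. The helper name work_group_count is hypothetical and is not part of the SDK.

```c
#include <stddef.h>

/* Hypothetical helper, not part of the SDK.
 * Returns the number of work-groups along one dimension,
 * or 0 if the combination of sizes is not valid. */
size_t work_group_count(size_t global_size, size_t local_size)
{
    /* Both sizes must be non-zero. */
    if (global_size == 0 || local_size == 0)
        return 0;

    /* Assumption: OpenCL 1.x semantics, where the global size
     * must be evenly divisible by the local size. */
    if (global_size % local_size != 0)
        return 0;

    return global_size / local_size;
}
```

For example, a global size of 1024 with a local size of 64 gives 16 work-groups, while a global size of 1000 with the same local size is rejected by this check.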
To work with the Deep Kernel Analysis feature, add the following key to the registry: