CPU/GPU Concurrency Analysis

Use the CPU/GPU Concurrency analysis as a starting point to understand the code execution on the various CPU and GPU cores in your system and identify whether your target application is GPU or CPU bound.

How It Works

The tool infrastructure automatically aligns clocks across all cores in the entire system so that you can analyze some CPU-based workloads together with GPU-based workloads within a unified time domain.

Use the CPU/GPU Concurrency analysis to:

  • Explore GPU usage and analyze a software queue for GPU engines at each moment of time

  • Correlate CPU and GPU activity and identify whether your application is GPU or CPU bound

  • Identify GPU and CPU application frame rate and how it depends on vertical synchronization

  • Explore the performance of your application per selected GPU metrics over time

  • Analyze execution of Intel Media SDK tasks over time (for Linux targets only)

  • Explore your application performance for user tasks created with Intel ITT API

Configure and Run Analysis

To view configuration options for the CPU/GPU Concurrency analysis:

Prerequisites: Create a project.

  1. Click the (standalone GUI)/ (Visual Studio IDE)Configure Analysis button on the Intel® VTune™ Amplifier toolbar.

    The Configure Analysis window opens.

  2. From HOW pane, click the Browse button and select CPU/GPU Concurrency from the Platform group.

    The CPU/GPU Concurrency pane opens displaying predefined collection options for this analysis. To modify an option or change the list of collected hardware events, create a new custom analysis type.


    You may generate the command line for this configuration using the Command Line button at the bottom.

  3. Click the Start button to run the analysis.

View Data

VTune Amplifier runs the analysis and opens the data in the GPU Compute/Media Hotspots viewpoint providing various platform data in the following windows:

  • Platform window displays over-time data as GPU usage on a software queue, CPU time usage, OpenCL™ kernels data, Memory Bandwidth, and CPU Frequency.

  • Graphics window displays CPU and GPU usage data per thread.

  • Summary window provides high-level statistics on how your application uses CPU and GPU resources and helps you understand whether your application is GPU bound.

What's Next

If you explored the CPU/GPU Concurrency analysis results and discovered that your application is GPU-bound, you may proceed with the GPU Compute/Media Hotspots analysis for detailed GPU analysis.

See Also

For more complete information about compiler optimizations, see our Optimization Notice.
Select sticky button color: 
Orange (only for download buttons)