Get Started

  • 2020
  • 08/24/2021
  • Public Content

Example: Profile a DPC++ Application on Windows*

Profile a sample matrix_multiply DPC++ (Data Parallel C++) application with Intel® VTune™ Profiler. Get familiar with the product and understand the statistics collected for GPU-bound applications.

Prerequisites

  • Make sure you have Microsoft* Visual Studio (v2017 or newer) installed on your system.
  • Install Intel VTune Profiler from the Intel® oneAPI Base Toolkit or the Intel® System Bring-up Toolkit. These toolkits contain the Intel® oneAPI DPC++/C++ Compiler (dpcpp) required for the profiling process.
  • Set up environment variables. Execute the vars.bat script located in the <vtune-install-dir>\env directory.
  • Ensure that the Intel oneAPI DPC++ Compiler (installed with the Intel oneAPI Base Toolkit) is integrated into Microsoft Visual Studio.
  • Compile the DPC++ code with the -gline-tables-only and -fdebug-info-for-profiling options of the Intel oneAPI DPC++ Compiler.
For information on installing Intel VTune Profiler in the Microsoft* Visual Studio environment, see the VTune Profiler User Guide.

Build the Matrix App

Download the matrix_multiply_vtune code sample package for Intel oneAPI toolkits. The package contains a sample that you can use to build and profile a DPC++ application.
  1. Open Microsoft* Visual Studio.
  2. Click File > Open > Project/Solution. Find the matrix_multiply_vtune folder and select matrix_multiply.sln.
  3. Build this configuration (Project > Build).
  4. Run the program (Debug > Start Without Debugging).
  5. To choose a DPC++ or threaded version of the sample, use preprocessor definitions.
    1. Go to Project Properties > DPC++ > Preprocessor > Preprocessor Definition.
    2. Define DPCPP or USE_THR.
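As a rough illustration of how a preprocessor definition can select a code path at build time, here is a minimal C++ sketch. It is not the sample's actual source: the names Matrix, multiply_rows, multiply, and variant_name are hypothetical, and the DPC++ kernel itself is omitted, so a DPCPP build of this sketch still runs the serial loop. Only the macro names DPCPP and USE_THR come from the steps above.

```cpp
#include <algorithm>
#include <cstddef>
#include <functional>
#include <string>
#include <thread>
#include <vector>

// Hypothetical sketch; the real matrix_multiply sample may be organized differently.
using Matrix = std::vector<float>;  // row-major n x n matrix

// Accumulate rows [row_begin, row_end) of the product a * b into c.
static void multiply_rows(const Matrix& a, const Matrix& b, Matrix& c,
                          std::size_t n, std::size_t row_begin,
                          std::size_t row_end) {
    for (std::size_t i = row_begin; i < row_end; ++i)
        for (std::size_t k = 0; k < n; ++k)
            for (std::size_t j = 0; j < n; ++j)
                c[i * n + j] += a[i * n + k] * b[k * n + j];
}

// Report which variant the preprocessor definitions selected.
std::string variant_name() {
#if defined(DPCPP)
    return "dpcpp";
#elif defined(USE_THR)
    return "threaded";
#else
    return "serial";
#endif
}

Matrix multiply(const Matrix& a, const Matrix& b, std::size_t n) {
    Matrix c(n * n, 0.0f);
    if (n == 0) return c;
#if defined(USE_THR)
    // Threaded variant: split the rows into contiguous chunks, one per thread.
    // Each thread writes a disjoint set of rows of c, so no locking is needed.
    unsigned hw = std::thread::hardware_concurrency();
    std::size_t workers = std::min<std::size_t>(hw ? hw : 2, n);
    std::size_t chunk = (n + workers - 1) / workers;
    std::vector<std::thread> pool;
    for (std::size_t begin = 0; begin < n; begin += chunk) {
        std::size_t end = std::min(begin + chunk, n);
        pool.emplace_back(multiply_rows, std::cref(a), std::cref(b),
                          std::ref(c), n, begin, end);
    }
    for (auto& t : pool) t.join();
#else
    // A DPCPP build would submit a device kernel here; that part is left out
    // of this sketch, so this branch simply runs the loop serially.
    multiply_rows(a, b, c, n, 0, n);
#endif
    return c;
}
```

Calling multiply(a, b, n) on two row-major n x n matrices returns their product, and variant_name() shows which definition was active for the build. Defining the macro in the project properties, rather than in the source, lets the same code build as either variant without edits.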

Run GPU Analysis

Run a GPU analysis on the Matrix sample.
  1. From the Visual Studio toolbar, click the Configure Analysis button.
    The Configure Analysis window opens. By default, it inherits your Visual Studio project settings and specifies matrix_multiply.exe as the application to profile.
  2. In the Configure Analysis window, click the Browse button in the HOW pane.
  3. Select the GPU Compute/Media Hotspots analysis type from the Accelerators group in the Analysis Tree.
  4. Click the Start button to launch the analysis with the predefined options.
VTune Profiler collects data and displays analysis results in the GPU Compute/Media Hotspots viewpoint. In the Summary window, see statistics on CPU and GPU resource usage to understand if your application is GPU-bound. Switch to the Graphics window to see basic CPU and GPU metrics representing code execution over time.

Product and Performance Information

Performance varies by use, configuration and other factors. Learn more at www.Intel.com/PerformanceIndex.