Get Started

  • 2020
  • 05/19/2021
  • Public Content

Example: Profile an OpenMP Application on Linux*

Use
Intel VTune
Profiler
on a Linux machine to profile a sample
iso3dfd_omp_offload
OpenMP application offloaded onto an Intel GPU. Learn how to run a GPU analysis and examine results.

Prerequisites

  • Make sure your system is running Linux* OS kernel 4.14 or a newer version.
  • Use one of these versions of Intel Processor Graphics:
    • Gen 8
    • Gen 9
    • Gen 11
  • Your system should be running on one of these Intel processors:
    • 7th Generation Intel® Core™ i7 Processors (code name Kaby Lake)
    • 8th Generation Intel® Core™ i7 Processors (code name Coffee Lake)
    • 10th Generation Intel® Core™ i7 Processors (code name Ice Lake)
  • For the Linux GUI, use:
    • GTK+ version 2.10 or newer (2.18 and newer versions are recommended)
    • Pango version 1.14 or newer
    • X.Org version 1.0 or newer (1.7 and newer versions are recommended)
  • Download the Intel® oneAPI HPC Toolkit which contains the
    Intel® oneAPI DPC++/C++ Compiler
    (
    icx/icpx
    ) that you need to profile OpenMP applications.
  • Set up environment variables. Execute the
    vars.sh
    script.

Build and Compile the OpenMP Offload Application

  1. Open to the sample directory.
    cd <sample_dir>/DirectProgramming/C++/StructuredGrids/iso3dfd_omp_offload
  2. Compile the OpenMP Offload application.
    mkdir build; cmake -DVERIFY_RESULTS=0 .. make -j
    This generates a
    src/iso3dfd
    executable.
To delete the program, type:
make clean
This removes the executable and object files that you created with the
make
command.

Run a GPU Analysis on the OpenMP Offload Application

You are now ready to run the GPU Offload Analysis on the OpenMP application you compiled.
  1. Open
    VTune
    Profiler
    and click on
    New Project
    to create a project.
  2. On the welcome page, click on
    Configure Analysis
    to set up your analysis.
  3. Select these settings for your analysis.
    • In the
      WHERE
      pane, select
      Local Host
      .
    • In the
      WHAT
      pane, select
      Launch Application
      and specify the
      iso3dfd_omp_offload
      binary as the application to profile.
    • In the
      HOW
      pane, select the
      GPU Offload
      analysis type from the
      Accelerators
      group in the Analysis Tree.
    Configuration for GPU Offload Analysis
  4. Click the
    Start
    button to run the analysis.
VTune
Profiler
collects data and displays analysis results in the
GPU Offload
viewpoint.
  • In the
    Summary
    window, see statistics on CPU and GPU resource usage. Use this data to determine if your application is:
    • GPU-bound
    • CPU-bound
    • Utilizing the compute resources of your system inefficiently
  • Use the information in the
    Platform
    window to see basic CPU and GPU metrics.
  • Investigate specific computing tasks in the
    Graphics
    window.
For a deeper analysis, see a related recipe in the
VTune
Profiler
Performance Analysis Cookbook
. You can also continue your profiling with the GPU Compute/Media Hotspots analysis.

Product and Performance Information

1

Performance varies by use, configuration and other factors. Learn more at www.Intel.com/PerformanceIndex.