User Guide

Contents

Accelerators Analysis Group

The
Accelerators
group introduces analysis types that monitor CPU, GPU and FPGA usage for your application/system.
  • GPU Offload is targeted for applications using a Graphics Processing Unit (GPU) for rendering, video processing, and computations. It helps you identify whether your application is CPU or GPU bound.
  • GPU Compute/Media Hotspots (preview) is targeted for GPU-bound applications and helps analyze GPU kernel execution per code line and identify performance issues caused by memory latency or inefficient kernel algorithms.
  • CPU/FPGA Interaction analysis explores FPGA utilization for each FPGA accelerator and identifies the most time-consuming FPGA computing tasks.
A
PREVIEW FEATURE
may or may not appear in a future production release. It is available for your use in the hopes that you will provide feedback on its usefulness and help determine its future. Data collected with a preview feature is not guaranteed to be backward compatible with future releases.
Prerequisites:
  • Install the sampling driver for hardware event-based sampling collection types. For Linux* and Android* targets, if the sampling driver is not installed,
    VTune
    Profiler
    can work on Perf* (driverless collection).
  • To enable system-wide and uncore event collection, use root or sudo to set
    /proc/sys/kernel/perf_event_paranoid
    to
    0
    .
    $ echo 0>/proc/sys/kernel/perf_event_paranoid

Product and Performance Information

1

Performance varies by use, configuration and other factors. Learn more at www.Intel.com/PerformanceIndex.