Create Efficient Media & OPENCL™ Applications


Get the data you need to optimize OpenCL™ software and deliver high-performance image and video processing pipelines.

Analyze GPU & Platform Data

On newer Intel® processors, you can optionally collect GPU and platform data for tuning OpenCL™ and media applications, and in turn, view correlated GPU and CPU activities.

The timeline provides a detailed view of both CPU and GPU activity.

Easier OpenCL™ Application & GPU Profiling

When tuning OpenCL applications on newer processors, the Architecture Diagram helps you understand GPU hardware metrics and identify bottlenecks. Select an OpenCL kernel of interest and an execution time frame, and then Intel® VTune™ Profiler updates the diagram with accurate performance data.

The GPU Architecture Diagram displays key metrics, making it easier to see the performance bottleneck.

Tune Inefficient Kernel Algorithms

Use GPU In-Kernel Profiling to identify performance issues caused by memory latency or inefficient algorithms. View a profile of where the most time is spent on the OpenCL source and the compiler assembly. Analyze Direct Memory Access (DMA) packet execution with the Packet Queue Depth and Packet Duration histograms.

Performance data displays on the OpenCL application source code so you know exactly where time is being spent.

Additional Capabilities

Single Thread

Optimize single-threaded performance.

Multithread

Effectively use all available cores.

System

See a system-level view of application performance.

HPC & Cloud

Access specialized, in-depth analyses for HPC and cloud computing.

Memory & Storage Management

Diagnose memory, storage, and data plane bottlenecks.

Analyze & Filter Data

Mine data for answers.

Environment

Fits your environment and workflow.

Are you ready to try or purchase Intel VTune Profiler?

Product and Performance Information

1

Intel's compilers may or may not optimize to the same degree for non-Intel microprocessors for optimizations that are not unique to Intel microprocessors. These optimizations include SSE2, SSE3, and SSSE3 instruction sets and other optimizations. Intel does not guarantee the availability, functionality, or effectiveness of any optimization on microprocessors not manufactured by Intel. Microprocessor-dependent optimizations in this product are intended for use with Intel microprocessors. Certain optimizations not specific to Intel microarchitecture are reserverd for Intel microprocessors. Please refer to the applicable product User and Reference Guides for more information regarding the specific instruction sets covered by this notice.

Notice revision #20110804