Introduction
Intel® VTune™
is a performance analysis tool for serial and multithreaded applications. Use
Profiler
VTune
to analyze your choice of algorithm. Identify potential benefits for your application from available hardware resources.
Profiler
Intel® VTune™ Profiler is a new renamed version of the Intel® VTune™ Amplifier.
Use
VTune
to locate or determine:
Profiler
- The most time-consuming (hot) functions in your application and/or on the whole system
- Sections of code that do not effectively utilize available processor time
- The best sections of code to optimize for sequential performance and for threaded performance
- Synchronization objects that affect the application performance
- Whether, where, and why your application spends time on input/output operations
- Whether your application is CPU or GPU bound and how effectively it offloads code to the GPU
- The performance impact of different synchronization methods, different numbers of threads, or different algorithms
- Thread activity and transitions
- Hardware-related issues in your code such as data sharing, cache misses, branch misprediction, and others
Usage Models
- InstallVTuneon Windows*, macOS*, or Linux* platforms and use it to analyze local and remote target systems.Profiler
- On a macOS system, you can installVTuneand run a remote analysis and view collected data on the macOS host. You cannot profile the local macOS system. On all supported systems,Profiler
- On all supported platforms, use the GUI or run analyses from the command line interface () to collect data and perform regression testing.vtune
- UseVTuneas a web server. This is an optimal solution for multi-user environments.Profiler
- Install the standalone GUI client or integrateVTuneinto IDEs, such as Microsoft Visual Studio* or Eclipse*.Profiler
Documentation for versions of Intel® VTune™ Profiler prior to the 2021 release are available for download only. For a list of available documentation downloads by product version, see these pages:
Key Features
This table summarizes the availability of important analysis types per host and remote target platform using
VTune
:
Profiler
Analysis
| Windows Target
| Linux Target
| Android Target
| FreeBSD* Target
|
---|---|---|---|---|
¹Preview only; ²Intel HD Graphics and Intel Iris® Graphics only; ³EBS analysis only
VTune
provides features that facilitate the analysis and interpretation of the results:
Profiler
- Top-down tree analysis: Use to understand which execution flow in your application is more performance-critical.
- Timeline analysis: Analyze thread activity and the transitions between threads.
- ITT API analysis: Use the ITT API to mark significant transition points in your code and analyze performance per frame, task, and so on.
- Architecture diagram: Analyze GPU OpenCL™ applications by exploring the GPU hardware metrics per GPU architecture blocks.
- Source analysis: View source with performance data attributed per source line to explore possible causes of an issue.
- Comparison analysis: Compare performance analysis results for several application runs to localize the performance changes you got after optimization.
- Start data collection paused mode: Click theStart Pausedbutton on the command bar to start the application without collecting performance data and click theResumebutton to enable the collection at the right moment.
- Grouping: Group your data by different granularity in the grid view to analyze the problem from different angles.
- Viewpoints: Choose among preset configurations of windows and panes available for the analysis result. This helps focus on particular performance problems.
- Hot keys to start and stop the analysis: Use a batch file to create hot keys to start and stop a particular analysis.
Because
VTune
requires specific knowledge of assembly-level instructions, its analysis may not operate correctly if a program (target) is compiled to generate non-Intel architecture instructions. In this case, run the analysis with a target executable compiled to generate only Intel instructions. After you finish using
Profiler
VTune
, you can use optimizing compiler options that generate non-Intel architecture instructions.
Profiler