Intel® VTune™ Amplifier XE 2013

Performance profiler for serial and parallel performance analysis.

  • Collect a rich set of data to tune performance and multi-core scalability.
  • Sort, filter and visualize results for quick insight into performance bottlenecks.

Buy Now

Or Download a Free 30-Day Evaluation Version

Optimize Serial and Parallel Performance

Intel® VTune™ Amplifier XE 2013 is the premier performance profiler for C, C++, C#, Fortran, Assembly and Java*.


Functions using the most CPU time float to the top of the list. Click on a function to see the call stack. Double click to see the source with detailed profile data.

Easy
Performance optimization can be difficult, but the performance profiling tool you use shouldn’t be.

Versatile – Rich Set of Performance Profiles
Whether you are tuning for the first time or doing advanced performance optimization, VTune Amplifier XE 2013 provides the data needed to meet a wide variety of tuning needs.  Collect a rich set of performance data for hotspots, threading, locks & waits, DirectX*, bandwidth and more.

Productive – Sort, Filter and Visualize
Good data is not enough.  You need tools to mine the data and make it easy to understand.  Powerful analysis lets you sort, filter and visualize results on the timeline and on your source.

New for 2013!
Call counts, hardware stack sampling, better bandwidth analysis, Java profiling, tune Intel® Xeon Phi™ products, user tasks, DirectX* frames, power analysis and more.


Quotes

“The new VTune™ Amplifier XE brings even more capability to an already indispensable tool. The sampling based call stack hotspots is excellent and alone is worthy of the upgrade. We have also been impressed by how the concurrency and Locks and Waits analysis can even provide useful data on complex applications such as Premiere Pro.”
Rich Gerber - Engineering Manager, MediaCore, Adobe Systems Inc.

“The new interface is a joy to use. Intel® VTune Amplifier XE gives us precise, down-to-the-metal performance data that’s invaluable for pinpointing hotspots and evaluating the effect of optimizations”
Daniel Schwarz, Performance Engineer, Nik Software

“Intel® VTune™ Amplifier XE’s timeline is very information intensive.  It organizes the data I need to tune threaded applications.”
Sergey Zaritchny, Software Development Manager, Open Cascade SAS

“Last week, Intel® VTune™ Amplifier XE helped us find almost 3X performance improvement.  This week it helped us improve the performance another 3X.”
Claire Cates, Principal Developer, SAS Institute Inc.

“One of Intel® VTune™ Amplifier XE’s best features is that it is easy to use.  I did not need to read the documentation.”
Richard Shepherd, Software Engineer, ESRI (UK) Limited

Quickly Locate Code Taking A Lot of CPU Time

Hotspots analysis gives you a sorted list of the functions using a lot of CPU time. Click [+] for the call stacks. Double click to see the source.
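
As a rough illustration of what a sorted hotspot list represents, the sketch below aggregates raw samples by function and orders them by estimated CPU time. It is not VTune Amplifier XE code: the `hotspots` helper, the sample data and the 10 ms interval are assumptions chosen for the example.

```python
from collections import Counter

def hotspots(samples, interval_ms=10.0):
    """Return (function, estimated CPU ms) pairs, hottest first.

    `samples` is a list of function names, one entry per sample taken.
    """
    counts = Counter(samples)
    # Each sample stands for roughly one sampling interval of CPU time.
    return [(fn, n * interval_ms) for fn, n in counts.most_common()]

# Hypothetical samples: the function that was executing at each tick.
samples = ["render", "physics", "render", "io_wait", "render", "physics"]
for fn, ms in hotspots(samples):
    print(f"{fn:10s} {ms:6.1f} ms")
```

The function with the most samples ends up at the top, which is the same ordering idea the hotspot grid uses.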

See the Results on Your Source

A double click from the function list takes you to the hottest spot in the function.

Tune Threading with Locks and Waits Analysis

Quickly find a common cause of slow performance in parallel programs: waiting too long on a lock while the cores are underutilized during the wait. Profiles like hotspots and locks & waits use a software collector that works on both Intel and compatible processors.
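
To make the idea concrete, here is a minimal sketch of the kind of summary a locks and waits view provides: total wait time per synchronization object, longest first. The record layout, thread names and lock names are hypothetical, and the code does not use any VTune Amplifier XE interface.

```python
from collections import defaultdict

def wait_time_by_lock(wait_events):
    """Sum wait time per lock from (thread, lock, start_ms, end_ms) records."""
    totals = defaultdict(float)
    for _thread, lock, start_ms, end_ms in wait_events:
        totals[lock] += end_ms - start_ms
    # Longest total wait first: the most likely serialization point.
    return sorted(totals.items(), key=lambda kv: kv[1], reverse=True)

# Hypothetical wait records for three worker threads.
events = [
    ("worker-1", "cache_mutex", 0.0, 40.0),
    ("worker-2", "cache_mutex", 5.0, 80.0),
    ("worker-3", "log_mutex", 10.0, 12.0),
]
print(wait_time_by_lock(events))
```

A lock that dominates this list while the cores sit idle is the usual candidate for finer-grained locking or a lock-free structure.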

Mine the Data with Timeline Filtering

Select a time range in the timeline to filter out data (e.g., application startup) that masks the information you need. When you select and filter in the timeline, the grid that lists functions using a lot of CPU time updates to show the list filtered for the selected time.
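
The effect of timeline filtering can be sketched as re-aggregating only the samples that fall inside the selected time range. The example below is an assumption-laden illustration (timestamps, function names and the `hotspots_in_range` helper are invented), not the tool's implementation.

```python
from collections import Counter

def hotspots_in_range(samples, start_ms, end_ms):
    """Count samples per function whose timestamp is in [start_ms, end_ms)."""
    selected = (fn for t, fn in samples if start_ms <= t < end_ms)
    return Counter(selected).most_common()

# Hypothetical (timestamp_ms, function) samples; startup dominates the run.
samples = [(0, "load_config"), (10, "load_config"), (20, "load_config"),
           (30, "load_assets"), (40, "simulate"), (50, "render"),
           (60, "simulate")]

print(hotspots_in_range(samples, 0, 70))   # whole run, startup included
print(hotspots_in_range(samples, 40, 70))  # startup filtered out
```

With the whole run selected, startup code tops the list; once the range excludes startup, the steady-state functions become visible.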

Visualize Thread Behavior

See when threads are running and waiting, and when transitions occur. Balance workloads.

Profile a Running Application

There is no need to stop and re-launch the application when profiling:

  • Attach to a running process
  • Profile the whole system with hardware event sampling and filter out what you need later

Low Overhead / High Resolution Hardware Profiling

In addition to hotspots analysis that works on both Intel and compatible processors, VTune Amplifier XE 2013 has lightweight hotspots analysis that uses the Performance Monitoring Unit (PMU) on Intel processors to collect data with very low overhead. Increased resolution (~1 ms vs. ~10 ms) can find hot spots in small functions that run quickly. New! Now with optional stack collection to identify the calling sequence.
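
A back-of-the-envelope calculation shows why the finer interval matters for short functions: the expected number of samples is roughly the CPU time spent in the code divided by the sampling interval. The 4 ms figure below is a hypothetical workload, and the estimate ignores collection overhead and sampling jitter.

```python
def expected_samples(cpu_time_ms, interval_ms):
    """Expected number of samples landing in code that uses cpu_time_ms of CPU."""
    return cpu_time_ms / interval_ms

small_function_ms = 4.0  # hypothetical total CPU time of a short function
for label, interval_ms in (("~10 ms", 10.0), ("~1 ms", 1.0)):
    n = expected_samples(small_function_ms, interval_ms)
    print(f"{label} interval: about {n:.1f} samples")
```

At the coarser interval the function is expected to receive less than one sample and may not appear at all; at the finer interval it collects several.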

Advanced Analysis Like Bandwidth

Preset profiles provide an easy “point and shoot” set-up. Choose Hotspot, Lightweight Hotspot, Concurrency, Locks and Waits or more advanced analyses. No memorizing complex event names. Advanced profiles like memory bandwidth analysis, memory access and branch mispredictions find tuning opportunities. New! Advanced profiles can optionally collect stacks to identify the calling sequence. (Profiles vary by microarchitecture.)

Opportunities Highlighted

The cell is highlighted in pink when there is a potential tuning opportunity. Hover to get suggestions.

No special builds

Use a production build with symbols from your normal compiler.

Low overhead

Accurate results you can count on.

Command line

Automate regression analysis. Simple remote collection.

System Wide Analysis

Tune drivers, kernel modules and multi-process apps.

New! Tune Inlining with Call Counts

When a function is called frequently it may make sense to “inline” the code and eliminate the overhead of the function call. VTune Amplifier XE 2013 now provides statistical call count data to help you make better inlining decisions. It also displays profile results on the source code, even if the code is inlined, making it easier to interpret profile results.
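
One way to reason about call counts is to estimate how much of a function's measured time could be call overhead. The sketch below assumes a fixed per-call cost of 5 ns, which is a placeholder rather than a measured or documented figure, and the call counts and timings are invented for illustration.

```python
def call_overhead_share(call_count, time_ms, overhead_ns_per_call=5.0):
    """Ratio of estimated call overhead to the function's measured time.

    overhead_ns_per_call is an assumed figure; measure it on your platform.
    """
    overhead_ms = call_count * overhead_ns_per_call / 1e6
    return overhead_ms / time_ms

# Hypothetical profile rows: (function, statistical call count, time in ms).
rows = [("clamp", 20_000_000, 250.0), ("load_scene", 1_000, 250.0)]
for fn, calls, ms in rows:
    print(f"{fn:12s} overhead share ~ {call_overhead_share(calls, ms):.5f}")
```

A small function called millions of times shows a high share and is a plausible inlining candidate; a rarely called function with the same total time is not.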

New! Auto Detect Microsoft DirectX* Frames

Got a slow spot in your gameplay? You don’t just want to know where you are spending a lot of time; you want to know where you are spending it while the frame rate is slow. VTune Amplifier XE 2013 can now automatically detect Microsoft DirectX* frames and filter results to show you what is happening in slow frames. Not using DirectX*? Just define the critical region using the API and frame analysis becomes a powerful tool for analyzing latency.
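
Conceptually, frame analysis keeps only the samples taken during frames that exceeded a time budget. The sketch below illustrates that filter with invented frame boundaries, samples and a 16.7 ms budget (about 60 frames per second); it does not call the frame API mentioned above.

```python
from collections import Counter

def slow_frame_hotspots(frames, samples, budget_ms=16.7):
    """Count samples per function, keeping only samples taken in slow frames.

    frames: list of (start_ms, end_ms); samples: list of (time_ms, function).
    """
    slow = [(s, e) for s, e in frames if e - s > budget_ms]
    hits = Counter(fn for t, fn in samples
                   if any(s <= t < e for s, e in slow))
    return hits.most_common()

# Hypothetical data: the third frame takes 48 ms, the others 16 ms.
frames = [(0, 16), (16, 32), (32, 80), (80, 96)]
samples = [(5, "draw"), (20, "draw"), (40, "load_texture"),
           (55, "load_texture"), (70, "draw"), (90, "draw")]
print(slow_frame_hotspots(frames, samples))
```

Over the whole run `draw` has the most samples, but inside the slow frame `load_texture` dominates, which is the distinction the feature is meant to surface.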

New! Better Memory Bandwidth Analysis

VTune Amplifier XE 2013 performs a more accurate memory bandwidth analysis for both reads & writes to cache and memory. It also adds bandwidth analysis for additional processor types.

Intel® Threading Building Blocks

Built-in understanding of parallel programming models means profiling data is described using familiar terms from the source, not with cryptic internal runtime labels.

New! Low Overhead Java* Profiling

Analyze Java or mixed Java and native code.  Results are mapped to the original Java source.  Unlike some Java profilers that instrument the code, VTune Amplifier XE 2013 uses low overhead statistical sampling with either a hardware or software collector.  Hardware collection has extremely low overhead because it uses the on-chip performance monitoring hardware.

New! Analyze User Tasks

The task annotation API lets you annotate your source so VTune Amplifier XE 2013 can display which tasks are executing. For example, if you label the stages of your pipeline, they are marked in the timeline and hovering reveals details. This makes profiling data much easier to understand.
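
The idea of labeling pipeline stages can be sketched with a small stand-in that records a name plus begin and end times for each annotated region. This is not the VTune Amplifier XE task annotation API; `TaskLog`, its `task` method and the stage names are hypothetical and exist only to show the pattern.

```python
import time
from contextlib import contextmanager

class TaskLog:
    """Records (name, start, end) for labeled regions; a stand-in, not a profiler API."""

    def __init__(self, clock=time.perf_counter):
        self.clock = clock
        self.records = []

    @contextmanager
    def task(self, name):
        start = self.clock()
        try:
            yield
        finally:
            # Close the region even if the annotated code raises.
            self.records.append((name, start, self.clock()))

log = TaskLog()
for stage in ("decode", "filter", "encode"):
    with log.task(stage):
        sum(range(10_000))  # placeholder work for the stage

for name, start, end in log.records:
    print(f"{name}: {(end - start) * 1e3:.3f} ms")
```

Each record corresponds to one labeled span that a timeline could draw alongside the thread activity.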

New! Tune for Intel® Xeon Phi™ Products

Hardware profiling is supported for Intel® Xeon Phi™ products and can be launched from the graphical user interface. It can collect lightweight hotspots and advanced event data and has time markers for correlation of data across multiple cards. Software collection (e.g., locks and waits analysis) is not supported on Intel® Xeon Phi™ products.

New! Power Analysis

Unlike other tools that measure average power usage, Power Analysis in VTune Amplifier XE 2013 identifies the cause of each wake-up. Interrupts are mapped to the IRQ and timers are mapped to the source code with a call stack. This gives you actionable information to reduce or consolidate wake-ups, which reduces power usage by letting the processor remain in a low-power state for longer periods of time.
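
To see why consolidating wake-ups helps, the sketch below counts the distinct instants at which any periodic timer fires in a time window. The timer periods are invented, and rounding them to a common multiple is only an option when the application tolerates the changed timing; that tolerance is an assumption of this example.

```python
def wakeups(periods_ms, window_ms):
    """Return the distinct instants at which any periodic timer fires in the window."""
    instants = set()
    for period in periods_ms:
        instants.update(range(period, window_ms + 1, period))
    return instants

# Hypothetical timers: three unrelated periods vs. periods aligned to 10 ms.
separate = wakeups([10, 15, 25], window_ms=300)
aligned = wakeups([10, 20, 30], window_ms=300)
print(len(separate), len(aligned))
```

When the periods share a common base, the slower timers fire at instants where the processor is already awake, so the total number of wake-ups drops.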

New! Tune MPI Applications

Analyze hybrid applications using MPI and OpenMP. Install on a cluster.

New! Support for New Processors

VTune Amplifier XE 2013 is constantly adding support for the latest processors. Updates are released shortly after new processors begin shipping.

New! "Hot keys" Start and Stop Analysis

Add a shortcut to quickly launch performance analysis whenever you see your app running slowly.  Program hot keys to start and stop the collection of performance data.

Technical Specifications

For additional details, please see the release notes.

Videos to help you get started.

Register for future Webinars


Previously recorded Webinars:

  • Accelerating financial services applications using Intel® Parallel Studio XE with the Intel® Xeon Phi™ coprocessor

  • Find 3 performance scaling barriers using Intel® VTune™ Amplifier XE

  • Performance analysis on Intel® Xeon® Phi™ Coprocessor

  • How Intel® Parallel Studio XE is used to improve the HMMER application

More Tech Articles

24-May-2013
1:30 AM PDT
VTune Amplifier XE 2013 installation on clusters
By vladimir-tsymba...
Using VTune Amplifier XE 2013 on clusters or other distributed environments which have an LDAP/NIS administration model and share file resources by NFS is as easy as on a local machine. The only trick is installation of the kernel drivers.

14-May-2013
8:44 PM PDT
An example of using VTune™ Amplifier XE to profile Java class called by C/C++ application (JNI)
By Peter Wang (Intel)
Article <Java support is back in VTune™ Amplifier XE> informs that VTune™ Amplifier XE 2013 already supports Java application, and Attaching to Java* processes for hardware event- . . .

07-May-2013
12:02 PM PDT
How to Tune Applications Using a Top-Down Characterization of Microarchitectural Issues
By Jackson M (Intel)
Applies to: Platforms based on 2nd Generation Intel® Core™ processor family, 3rd Generation Intel® Core™ processor family, Intel® Xeon® processor E5 family Authors: Jackson Marusarz, Shannon Cepeda, Ahmad Yasin 1 Introduction Optimizing applications to take advantage of increasingly complex CPU . . .

Product Documentation and Tutorials

  • For Windows*
  • For Linux*

Supplemental Documentation

16-Jan-2012
3:51 PM PST
Intel Guide for Developing Multithreaded Applications
By Admin
The Intel® Guide for Developing Multithreaded Applications covers topics ranging from general advice applicable to any multithreading method to usage guidelines for Intel® software products to API-specific issues.

  • What is the difference between hotspots and lightweight hotspots?
  • Hotspots                                       | Lightweight Hotspots
    Uses the software collector                    | Uses the hardware collector and the on-chip Performance Monitoring Unit (PMU)
    No driver required                             | Requires a driver
    Runs on Intel® and compatible processors       | Requires a genuine Intel® processor for collection
    ~10 ms resolution                              | ~1 ms resolution (finds smaller functions)
    Collects call stacks to show calling sequences | New! Optional call stack collection
    Works in virtual environments                  | Works in a virtual environment only when supported by the VM vendor (e.g., vSphere* 5.1)

  • Can I install and use Intel VTune Amplifier XE on a system with a compatible processor not manufactured by Intel® Corporation?
  • Yes. Intel VTune Amplifier XE will operate on both Intel® processors and compatible processors when analyzing applications containing Intel® instructions. Profiling features that use the software collectors work on both Intel processors and compatible processors. Profiling features that use the hardware collectors and the on-chip performance monitoring unit require a genuine Intel processor for data collection, but after collection the results can be analyzed on a compatible processor.

  • Can I run a performance analysis on a remote system?
  • Yes.

  • Do I need multiple licenses to do remote data collections?
  • No, you do not need additional licenses for remote collection as long as the remote machine runs the same OS (Linux* or Windows*) as your license. Once you have the product, the command line installer (CLI installer) is part of it and can be installed on any number of systems. You can collect the data on the remote system, but you will need a license to view the data. Copy the results directory to a system with the full product installed for viewing. For more details see “Remote Tuning Workflow” in the documentation. For installation details see “Installing Collectors on Remote Systems” in the release notes.

  • Do I need to recompile?
  • No, you do not need to recompile in order to profile with Intel® VTune Amplifier XE. However, it is recommended that you have debug and symbol information available for your optimized application in order to get the most complete and useful results. Thus, your release build process may need to be modified to add symbol information to the optimized build.

  • Do I need to use the Intel compiler to use Intel® VTune Amplifier XE?
  • No, you do not need the Intel compiler to analyze applications. However, if you are using OpenMP, it is recommended that you use the Intel runtime if possible to get the best results.

  • Why can’t I see my source code?
  • There are several possible reasons why VTune Amplifier XE may be unable to see your source.

    In order for source code to be visible you need to compile your code so that debug information is available. For example, on Linux*, verify you are compiling with the “-g” flag.

    You also need to let VTune Amplifier XE know where your source files, binary files and symbol files are located. To do this, open or create a Project and click on the “Project Properties” button. In the Project properties dialog, click on the “Search Directories” tab. In the pull down menu, click on “All files” and then specify the directory where your files exist. If you have any subdirectories remember to check the “Search subdirectories” box.

  • Do I need to be root to run the hardware collector used with Lightweight Hotspots and Advanced Analysis?
  • No. On Linux*, you need to be root to install the driver for the hardware collector, but once it is installed root access is not required. On Linux*, you do need to be a member of the driver access group (“vtune” by default) to use the hardware collector. The hardware collector is used for lightweight hotspots analysis and advanced analysis. For more information see “Installing the Sampling Driver” in the documentation.

  • What file and directory permissions are required to use VTune™ Amplifier XE?
  • Because the hardware-based sampling analysis types require communication with the Performance Monitoring Unit (PMU) of the central processor, the installer attempts to install a device driver. For Windows*, the driver is signed and the person installing must be part of the Administrators group. On Linux*, the person installing the software must be root or have sudo access to install the driver. However, a Linux user can install the software locally without the device driver and still use the user-mode sampling analysis types: Hotspots, Concurrency, and Locks-and-Waits. If the user is able to install the software as ‘root’, any user that desires to collect hardware-based samples must be part of the user group defined during the install. By default, this is the ‘vtune’ group, but it can be changed by accessing the Advanced options of the installer (install.sh).

  • Why can’t I import results?
  • In order to import results into VTune Amplifier XE, you must first create a project to contain the imported results. In the VTune Amplifier XE, click on the File->New->Project menu. This will bring up a dialog asking you to select a project name. Enter a name and press “OK”. VTune Amplifier XE will display the “Project Properties” dialog. If you are only importing results into the project then you will not need to specify an application name. However, if you want to view source of the imported results, you need to specify the search directories where your source and binaries are located. In the Project properties dialog, click on the “Search Directories” tab. In the pull down menu, click on “All files” and then specify the directory where your files exist. If you have any subdirectories remember to check the “Search subdirectories” box.

  • I added a path to the Search Directories, but nothing changed?
  • The Search Directories are used during finalization, which normally occurs after data collection completes. In order for new “Search Directories” paths to take effect, VTune Amplifier XE must re-resolve your results with the new information provided. Click on the “Analysis Type” tab and then press the “Re-resolve” button on the far right, located directly below the “Start” and “Project Properties” buttons.

  • Why do the sample counts look wrong?
  • Sometimes, the sample counts may be displayed on source lines that are not normally associated with executable code, for example, the closing brace of a ‘for’ or ‘while’ loop. This may appear to be an error but is a result of the instructions generated by the compiler. Viewing the assembly code can reveal that the debug information tags the assembly instructions to which the samples are attributed as belonging to that source line, i.e., the closing brace.

    Other times, viewing the assembly instructions may show that certain hardware events were collected on instructions that could not possibly generate that event, e.g., a memory event on a jump instruction or an arithmetic event on a memory instruction. This is known as “event skid” and is a result of the processor being unable to stop the execution of some micro-ops before sampling the instruction pointer (IP). Thus, the IP is pointing at a subsequent instruction by the time the sample is taken. Typically, you can determine which instruction was responsible for the event by examining the instruction flow.

  • How can I use Intel® VTune Amplifier XE to see how much time is spent doing Disk I/O?
  • If your application is doing blocking I/O, the function call attributed to the file accesses should appear in Hotspots analysis. Additionally, if you have multiple threads waiting to access a single file, the synchronization object protecting the file, e.g., a Critical Section, should show up in the Locks and Waits analysis.

Get Help or Advice

Search Support Articles
Forums - The best place for timely answers from our technical experts and your peers. Use it even for bug reports.
Support - For secure, web-based, engineer-to-engineer support, visit our Intel® Premier Support web site. Intel Premier Support registration is required.
Download, Registration and Licensing Help - Specific help for download, registration, and licensing questions.

Resources

Release Notes - View Release Notes online!
Intel® VTune™ Amplifier XE 2013 documentation
Documentation for other software products