Intel® VTune™ Amplifier XE 2013
Intel® VTune™ Amplifier XE is an easy to use performance and thread profiler for C, C++, C#, Fortran, Java and MPI developers. No special recompiles are needed, just start profiling. Hotspots are highlighted on the source. A powerful timeline makes it easy to tune your application and scale performance on multicore processors.
New for Update 13!
- Microsoft* Windows* 8.1 and integration with Microsoft* Visual Studio* 2013 IDEnow supported! (Windows* only)
- ITT pause/resume APIs supported on the Intel® Xeon Phi™ coprocessor
- Display of externally collected data (CSV format with a predefined structure, only) with VTune Amplifier collected data
- SSH-based remote collection via amplxe-cl
- Debian* 7.1, SLES* 11 SP3 supported (Linux* only)
- Bug fixes
Installer for Intel® Vtune™ Amplifier XE 2013 Update 13 for Linux*
Installer for Intel® Vtune™ Amplifier XE 2013 Update 13 for Windows*
* Other names and brands may be claimed as the property of others.
Microsoft, Windows, Visual Studio, Visual C++, and the Windows logo are trademarks, or registered trademarks of Microsoft Corporation in the United States and/or other countries.
Next: What's New in Update 12?
Now you can use pause/resume ITT API to control collection on Intel® Xeon Phi™ coprocessor. Please note that To profile applications with user APIs on the Intel Xeon Phi coprocessor, environment variables that control collection must be propagated from the host to the Intel Xeon Phi coprocessor card. See User API Collection on the Intel® Xeon Phi™ Coprocessor help topic for more details.
Intel® VTune™ Amplifier enables you to collect data on a remote application from the host system (remote usage mode) via command line interface (amplxe-cl) and view the analysis result locally in the GUI. Remote data collection using the amplxe-cl command running on the host is very similar to the native collection on the target except that the -target ssh:user@target option is added to the command line.
As prerequisites you need to install collectors on the remote target and enable pasword-less SSH access to the target.
Example: to run event-based stack sampling collection for the application:
host>./amplxe-cl --target=ssh:user@target –collect advanced-hotspots -knob collection-detail=stack-sampling -- <path to the application on the target>
To control collection from the command line – pause resume or detach you can use commands from host as follows:
host>./amplxe-cl -r result@@@ -C pause
See Collecting Data Remotely from Command Line help topic to learn details on the collection set up and setting of search directories for proper symbol resolving.
VTune Amplifier provides an option to correlate interval or discrete data, provided by an external collector, with the regular data provided by the analyzer.
For example, you can see how the data captured from SoCs or peripheral devices (camera, touch screen, sensors, and so on) correlate with VTune Amplifier metrics collected for your analysis target.
You can extend standard VTune Amplifier performance analysis and launch a custom data collector directly from the VTune Amplifier. Your custom collector can be an application you analyze with the VTune Amplifier or a collector that can be launched with the VTune Amplifier. Learn more about configuring and launching a custom collector from GUI and command line from Using a Custom Collector help topic.
>amplxe-cl -collect hotspots -knob custom-collector="python.exe C:\work\custom_collector.py" -- notepad.exe
VTune Amplifier can process and integrate performance statistics collected externally with a custom collector in parallel with the native VTune Amplifier analysis. To achieve this, provide the collected custom data as a csv file with a predefined structure and save this file to the VTune Amplifier result directory.
VTune Amplifier can load and process the following data types:
· Interval data with start time and end time
· Samples with a set of counters
Data may be optionally bound to process and thread ID. VTune Amplifier represents data not bound to a particular process and thread (there are no PID and TID values in the csv file) as frames. Data bound to a process and a thread (there are PID and TID values in the csv file) is represented as tasks. Learn more about csv data format from Creating a CSV File with External Data help topic.
Example: Integrating Interval Data Not Bound to a Particular Process
You have a csv file with the following data types:
VTune Amplifier processes this data as frames (there are no TID and PID values specified) and displays the result as follows:
With the VTune Amplifier, you can easily correlate the frame data in the Timeline pane and grid view. You see that frame 4 took longer time to process than subsequent frames 5 and 6 due to the poll_idle() call.