Download this guide (see Article Attachments, below) to learn how to identify performance issues on software running on the 5th generation Intel® Core™ processor family (based on Intel® Microarchitecture Codename Broadwell). The guide explains the General Exploration Analysis viewpoint available in Intel® VTune™ Amplifier XE. It also walks through some of the most common performance issues that the VTune Amplifier XE interface highlights, what each issue means, and some suggested ways to fix them.
The General Exploration Analysis Type in Intel® VTune™ Amplifier XE is used to detect microarchitectural hardware bottlenecks in an application or system. General Exploration uses hardware event counters to detect and locate issues and presents the data in a user-friendly and actionable format. This article will explain the mechanisms used in this analysis, a few best-known-methods for interpreting the results, and the various complexities and issues that arise when doing this type of analysis.
The Mechanisms behind General Exploration
Intel® VTune™ Amplifier XE has the ability to use Performance Monitoring Units (PMUs) on Intel CPUs to count hardware events and use these events to locate performance issues. The most common way to do this is through the General Exploration analysis type. One set of metrics within General Exploration is related to the memory subsystem and can be found in the Back-End Bound > Memory Bound section of the hierarchy. A common question we receive about memory metrics is "can I calculate cache hit and miss rates?".
Download this guide (see Article Attachments, below) to learn how to identify performance issues on software running on the Intel® Xeon® Processor E5 v3 Family (based on Intel® Microarchitecture Codename Haswell). The guide explains the General Exploration Analysis viewpoint available in Intel® VTune™ Amplifier XE. It also walks through some of the most common performance issues that the VTune Amplifier XE interface highlights, what each issue means, and some suggested ways to fix them.
Intel® Threading Building Blocks (Intel TBB) applications may have an incorrectly high amount of Overhead or Spin Time associated with them due to function inlining without corresponding debug information.
Under Linux many commands are executed from the command line, which is OK. But if the program you are starting has a mouse driven GUI in my view the command line doesn't really make sense.
NOTE: In all the following <install dir> means where you installed Intel® System Studio. On my system this is /opt/intel/system_studio_2014.0.025. Use the location you chose during installation.
VTune Amplifier is for me such a case. To start it from the command line you need to
Intel® Transactional Synchronization Extensions (Intel® TSX) provides hardware transactional memory support. It exposes a speculative execution mode to the programmer to improve locking performance. There are many publications about Intel TSX and this article is not focused on explaining the concept. You can refer to the most comprehensive list of TSX-related technical resources in the Roman Dementiev blog.
- Стр. 1