This recipe walks through a workflow for analyzing the CPU utilization of your OpenMP* or hybrid OpenMP-MPI application and identifying the causes of possible inefficiencies.
OpenMP* Imbalance and Scheduling Overhead from Intel® VTune™ Amplifier Performance Analysis Cookbook
This recipe shows how to detect and fix common parallel bottlenecks in OpenMP programs, such as imbalance on barriers and scheduling overhead.
Processor Cores Underutilization: OpenMP* Serial Time from Intel® VTune™ Amplifier Performance Analysis Cookbook
This recipe shows how to identify the fraction of serial execution in an application parallelized with OpenMP, discover additional opportunities for parallelization, and improve the scalability of the application.
Start cooking your performance analysis by understanding tuning techniques, performance metrics, and hardware solutions for collecting statistics. After that, you can drill down to the particular tuning or configuration recipes provided for the Intel® VTune™ Amplifier.
These recipes explore the most typical performance problems that can be detected with the Intel® VTune™ Amplifier and provide possible steps for performance optimization.
This recipe provides steps to identify OS thread migration on NUMA architectures with the Intel® VTune™ Amplifier's Advanced Hotspots analysis.
This recipe shows how to locate inefficient synchronization in your code by running the Advanced Hotspots analysis of the Intel® VTune™ Amplifier with stack collection enabled.
This recipe uses the Intel® VTune™ Amplifier to identify imbalances and communication issues in MPI-enabled applications, allowing you to improve application performance.