Start cooking your performance analysis with understanding tuning techniques, performance metrics and hardware solutions for collecting statistics. After that you can drill down to particular tuning or configuration recipes provided for the Intel® VTune™ Amplifier.
Top-down Microarchitecture Analysis Method from Intel® VTune™ Amplifier Performance Analysis Cookbook
Use this recipe to know how an application is utilizing available hardware resources and how to make it take advantage of CPU microarchitectures. One way to obtain this knowledge is by using on-chip Performance Monitoring Units (PMUs).
This recipe introduces a flow to analyze CPU utilization of your OpenMP* or hybrid OpenMP-MPI application and identify causes of possible inefficiencies.
This recipe explores metrics that characterize core utilization in terms of packet receiving in DPDK-based applications.
This recipe introduces PCIe Bandwidth metrics used in the Intel® VTune™ Amplifier to explore the PCIe traffic for a packet forwarding DPDK-based workload.
These recipes explore the most typical performance problems that can be detected with the Intel® VTune™ Amplifier and provide possible steps for the performance optimization.
This recipe explores profiling a memory-bound linear_regression application using the General Exploration and Memory Access analyses of the Intel® VTune™ Amplifier.
This recipe explores profiling a memory-bound matrix application using the Microarchitecture Exploration and Memory Access analyses of the Intel® VTune™ Amplifier to understand the cause of the frequent DRAM accesses.
This recipe explores profiling a core-bound matrix application using the Microarchitecture Exploration analysis (formerly, General Exploration) of the Intel® VTune™ Amplifier to understand the cause of the poor port utilization and Intel® Advisor to benefit from compiler vectorization.