This recipe introduces a flow to analyze CPU utilization of your OpenMP* or hybrid OpenMP-MPI application and identify causes of possible inefficiencies.
This recipe shows how to detect and fix frequent parallel bottlenecks of OpenMP programs such as imbalance on barriers and scheduling overhead.
Processor Cores Underutilization: OpenMP* Serial Time from Intel® VTune™ Profiler Performance Analysis Cookbook
This recipe shows how to identify a fraction of serial execution in an application parallelized with OpenMP, discover additional opportunities for parallelization, and improve scalability of the application.
Stitch Stacks for Intel® Threading Building Blocks or OpenMP* Analysis from Intel® VTune™ Profiler User Guide
Use the Stitch stacks option to restore a logical call tree for Intel® TBB or OpenMP* applications by catching notifications from the runtime and attach stacks to a point introducing a parallel workload.