User Guide

Contents

Suitability Report

To access this window in the Result tab, click the
Suitability Report
tab, or run the Suitability tool.
This is the starting point for viewing the annotated sites found by the
Intel® Advisor
Suitability tool. Use this window to review the parallel sites in the upper right area. Select a site and view its annotations and related characteristics. Use the list of sites as a to-do list: start at the top and work your way down.
Suitability Report Layout
Suitability Source window layout
  1. Workflow Tab
  2. Result Tab
  3. The main part of the
    Suitability Report
    window, which displays a summary presentation of the collected data. It also provides modeling parameters you can use to customize the predicted parallel performance based on your program's target hardware configuration, parallel framework, as well experiment with refactoring loop characteristics.
Use This
To Do This
Workflow Tab
Run a tool of your choice and see results in the
Result
tab.
Result Tab
Select between available reports.
Upper part of the
Suitability Report
window
Any annotation-related error the Suitability tool detects appears at the top of the
Suitability Report
window. If you see such errors, the displayed Suitability data may not be reliable. To view the source location associated with an error, click the button. To fix the error, read the displayed error message, modify your source code to fix the problem, rebuild your target executable, and run Suitability tool analysis again.
Upper-left part of the
Suitability Report
window
View the
Maximum Program Gain for All Sites
in the program. Your overall goal of adding parallelism is to increase the
Maximum Program Gain for All Sites
so the parallel program will execute as fast as possible. The measured serial execution runtime, predicted parallel runtime, and measured are displayed below
Maximum Program Gain for All Sites
.
Upper-right
Use the upper-right row of to model performance. Choose a hardware configuration and threading model (parallel framework) values from the drop-down lists. If you select a
Target System
for
Intel® Xeon Phi™
processors, an additional value for total
Coprocessor Threads
appears.
Below this row is a grid of data that shows the estimated performance of each parallel site detected during program execution. The
Site Label
shows the argument to the site annotation. Examine the predicted
Site Gain
and
Impact to Program Gain
(higher values are better) to estimate how much each site contributes to the
Maximum Program Gain for All Sites
for all sites (described above). To expand the data under
Combined Site Metrics
or
Site Instance Metrics
, click the icon to the right of that heading; to collapse data, click to the right of that heading.
To view the source associated with a site in the
Suitability Source
window, double-click its name (or right-click and select View Source from the context menu). To edit the source in your code editor, use the
Edit Source
context menu item.
Middle-left above the graph
If you choose a
Target System
of
CPU
, to view detailed characteristics of the selected site as well as its tasks and locks, click the
Site Details
tab.
Graph in the lower-left
The
Scalability of Maximum Site Gain
graph summarizes performance for the selected site. The number of CPU processors or total number of coprocessor threads appears on the horizontal X axis and the target's predicted performance gain appears on the Y axis. To change the default
CPU Count
and the
Maximum CPU Count
, set the Options value.
If you select a
Target System
of
Intel Xeon Phi
or
Offload to Intel Xeon Phi
coprocessor (and select the column
Offload to Intel Xeon Phi
), the graph's appearance changes to a gray-green color and the X axis displays
Coprocessor Threads
instead of
CPU Count
. This graph represents the predicted performance of the many-core
Intel® Xeon Phi™
parallel coprocessor. For many applications, the number of task instances does not scale enough to fully utilize the many cores of the parallel coprocessor, as indicated by a hover tip. Applications that are not appropriate for a
Intel Xeon Phi
processing system have values that appear in the gray part of the graph; in this case, consider modeling other types of the
Target System
. Applications that are appropriate for offload to an
Intel Xeon Phi
processing system have values that appear in the green part of the graph.
The lines between the graph's gray and green areas is a reference baseline, where the reference CPU chosen to calculate the
Intel® Xeon®
processor peak baseline is a dual-socket 8-core
Intel Xeon
processor E5-26xx product family (2.7 GHz, 16 cores total). When the
Maximum Site Gain
exceeds this baseline, you might consider using an
Intel Xeon Phi
coprocessor rather than an
Intel Xeon
or similar processor.
Near the top of the vertical lines for each CPU number, a box and a circle indicate the minimum and maximum predicted gain values. The color shading indicates the predicted benefit. For example, if the minimum-maximum range appears in the red-shaded area, this site is hurting your program's performance (and power use), so you should significantly modify or remove the site and task annotations.
Lower-left
Below the graph is a list of issues that might be preventing better
predicted
performance gains as well as a summary of serial and predicted parallel time. To expand a line, click the down arrow to the right of the item's name. Most issues are related to the
Runtime Modeling
. Later, you can use other Analyzer tools like to measure
actual
performance of your parallel program.
Lower-middle under
Loop Iterations (Tasks) Modeling
or
Tasks Modeling
Use the
Loop Iterations (Tasks) Modeling
(or
Tasks Modeling
) to experiment with different loop structures, iteration counts, and instance durations that might improve the predicted parallel performance.
Lower-right under
Runtime impact for this site
Use the
Runtime Modeling
to learn which parallel overhead categories might have an impact on parallel overhead. If you agree to address a category later by using the chosen parallel framework's capabilities or by tuning the parallel code after you have implemented parallelism, check that category.
Bottom-right
If the chosen
Target System
is
Intel Xeon Phi
or
Offload to Intel Xeon Phi
, additional options appear below the
Runtime Modeling
area. To expand this area, click the down arrow to the right of
Intel Xeon Phi Advanced Modeling
.

Product and Performance Information

1

Intel's compilers may or may not optimize to the same degree for non-Intel microprocessors for optimizations that are not unique to Intel microprocessors. These optimizations include SSE2, SSE3, and SSSE3 instruction sets and other optimizations. Intel does not guarantee the availability, functionality, or effectiveness of any optimization on microprocessors not manufactured by Intel. Microprocessor-dependent optimizations in this product are intended for use with Intel microprocessors. Certain optimizations not specific to Intel microarchitecture are reserved for Intel microprocessors. Please refer to the applicable product User and Reference Guides for more information regarding the specific instruction sets covered by this notice.

Notice revision #20110804