• 2020
  • 06/18/2020
  • Public Content

Intel® Advisor
Tutorial for Adding Efficient SIMD Parallelism to C++ Code Using the Vectorization Advisor

This tutorial, together with a C++ sample application you can use to follow along, shows how to use the Vectorization Advisor on a platform to add efficient SIMD parallelism.
Intel® Advisor provides tools to help ensure your Fortran, C, and C++ native/managed applications realize their full performance potential on modern processors, such as Intel® Xeon Phi™ processors (for the full list, see the release notes):
  • Vectorization Advisor is a vectorization optimization tool that lets you identify high-impact, underoptimized loops, what is blocking vectorization, and where it is safe to force vectorization. It also provides code-specific recommendations for fixing each issue.
  • Roofline Analysis visualizes actual performance against hardware-imposed performance ceilings (rooflines). It provides insights into where the bottlenecks are, which loops are worth optimizing for performance, the likely causes of bottlenecks, and what the next optimization steps should be.
  • Threading Advisor is a fast-track threading design and prototyping tool that lets you analyze, design, tune, and check threading design options without disrupting your normal development.
  • Flow Graph Analyzer is a visual prototyping tool that lets you represent and analyze performance for applications that use the Intel® Threading Building Blocks (Intel® TBB) flow graph interfaces.
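To illustrate the kind of loop the Vectorization Advisor is meant to surface, here is a minimal sketch. The function names `scale_add` and `scale_add_simd` are hypothetical and are not taken from the tutorial's sample application. In the first function the compiler cannot prove that `out` and `in` do not overlap, so it may assume a dependency and either skip vectorization or guard it with a runtime check. The second function uses the OpenMP `#pragma omp simd` directive to state that vectorizing is safe, which holds only if the caller guarantees the two arrays do not overlap. The pragma has an effect only when OpenMP SIMD support is enabled at compile time (for example `-fopenmp-simd` with GCC or Clang, or `-qopenmp-simd` with the Intel compiler); otherwise it is ignored.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical example: out and in are plain pointers, so the compiler
// must consider that they might alias. That assumed dependency can
// block or limit automatic vectorization of this loop.
void scale_add(float* out, const float* in, float factor, std::size_t n) {
    for (std::size_t i = 0; i < n; ++i) {
        out[i] = out[i] + factor * in[i];
    }
}

// Same computation, but the programmer asserts that the iterations are
// independent. This is only correct if out and in never overlap.
void scale_add_simd(float* out, const float* in, float factor, std::size_t n) {
#pragma omp simd
    for (std::size_t i = 0; i < n; ++i) {
        out[i] = out[i] + factor * in[i];
    }
}
```

Both functions produce the same result for non-overlapping arrays; the difference is only in what the compiler is allowed to assume. Checking that such an assertion is actually safe before adding it is the step the tutorial describes as increasing confidence that vectorization is safe.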
Intel® Advisor is available as a standalone product and as part of the following products:
  • Intel® System Studio Professional Edition
  • Intel® System Studio Ultimate Edition
  • Intel® Parallel Studio XE Professional Edition
  • Intel® Parallel Studio XE Cluster Edition
  • Intel® oneAPI Base Toolkit
About This Tutorial
This tutorial shows how to:
  • Identify loops that will benefit most from vectorization.
  • Identify what is blocking effective vectorization.
  • Increase the confidence that vectorization is safe.
  • Explore the benefit of alternative data reorganizations.
It was last updated for the 2018 product release.
Compilations and analyses were run using a 2018 version of an Intel compiler on an Intel® Core™ i5-6300U CPU with 4 logical CPUs.
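As a rough illustration of what "alternative data reorganizations" can mean, the sketch below contrasts an array-of-structures (AoS) layout with a structure-of-arrays (SoA) layout. All type and function names here (`PointAoS`, `PointsSoA`, `to_soa`, `scale_x_aos`, `scale_x_soa`) are hypothetical and do not come from the tutorial's sample application. With AoS, consecutive `x` values are separated by the `y` and `z` fields, so a loop over `x` accesses memory with a stride; with SoA, the `x` values are contiguous, which generally suits SIMD loads and stores better.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Array of structures: the fields of one point are adjacent in memory,
// so consecutive x values are three floats apart (strided access).
struct PointAoS {
    float x, y, z;
};

// Structure of arrays: all x values are contiguous (unit-stride access).
struct PointsSoA {
    std::vector<float> x, y, z;
};

// Convert an AoS container into the SoA layout.
PointsSoA to_soa(const std::vector<PointAoS>& pts) {
    PointsSoA out;
    out.x.reserve(pts.size());
    out.y.reserve(pts.size());
    out.z.reserve(pts.size());
    for (const PointAoS& p : pts) {
        out.x.push_back(p.x);
        out.y.push_back(p.y);
        out.z.push_back(p.z);
    }
    return out;
}

// Scales only the x field; memory is touched with a stride of 3 floats.
void scale_x_aos(std::vector<PointAoS>& pts, float factor) {
    for (std::size_t i = 0; i < pts.size(); ++i) {
        pts[i].x *= factor;
    }
}

// Same operation on the SoA layout; memory is touched contiguously.
void scale_x_soa(PointsSoA& pts, float factor) {
    for (std::size_t i = 0; i < pts.x.size(); ++i) {
        pts.x[i] *= factor;
    }
}
```

Whether the SoA version is actually faster depends on the loop, the data size, and the target hardware, so a change like this is worth measuring before and after rather than assuming a gain.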
Estimated Duration
15-20 minutes.
Learning Objectives
After you complete this tutorial, you should be able to:
  • Explain two recommended Vectorization Advisor workflows (usage scenarios).
  • Identify compiler/linker options that produce the most accurate and complete Vectorization Advisor analysis results.
  • Run Vectorization Advisor analysis tools.
  • View, interpret, and manipulate data collected by Vectorization Advisor analysis tools.
More Resources
The concepts and procedures in this tutorial apply regardless of programming language; however, a similar tutorial using a sample application in another programming language may be available at Intel® Advisor Tutorials.
The following sites also offer tutorials for other Intel products.
In addition, you can find more resources in:

Product and Performance Information


Intel's compilers may or may not optimize to the same degree for non-Intel microprocessors for optimizations that are not unique to Intel microprocessors. These optimizations include SSE2, SSE3, and SSSE3 instruction sets and other optimizations. Intel does not guarantee the availability, functionality, or effectiveness of any optimization on microprocessors not manufactured by Intel. Microprocessor-dependent optimizations in this product are intended for use with Intel microprocessors. Certain optimizations not specific to Intel microarchitecture are reserved for Intel microprocessors. Please refer to the applicable product User and Reference Guides for more information regarding the specific instruction sets covered by this notice.

Notice revision #20110804