Intel® Parallel Studio XE

Analyzing Intel® MPI applications using Intel® Advisor

Many of today’s HPC applications use the Intel® MPI Library to implement their parallelism, but using Intel’s analysis tools in a multi-process environment can be tricky. Intel® Advisor helps you improve vectorization, memory, and threading performance, and its Roofline chart helps you visualize performance bottlenecks. To get the most out of your results when analyzing Intel MPI applications with Intel Advisor, follow the steps below.

Remote analysis flow

First, collect data using the following command on the target:
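The exact command is not shown here. As a minimal sketch only, assuming the common pattern of starting the Advisor command-line collector (`advixe-cl`) under `mpirun`, a collection command could be assembled as follows. The rank count, project directory, and application name are placeholders, and `build_survey_command` is a hypothetical helper, not part of any Intel tool.

```python
import shlex


def build_survey_command(num_ranks, project_dir, app_argv):
    """Return an argv list that runs every MPI rank under an Advisor Survey collection.

    Hypothetical helper for illustration; all argument values are assumptions.
    """
    if num_ranks < 1:
        raise ValueError("num_ranks must be at least 1")
    if not app_argv:
        raise ValueError("app_argv must name the application to run")
    return [
        "mpirun", "-n", str(num_ranks),     # MPI launcher and number of ranks
        "advixe-cl", "-collect", "survey",  # Advisor command-line Survey analysis
        "-project-dir", project_dir,        # directory where results are stored
        "--", *app_argv,                    # everything after "--" is the application
    ]


if __name__ == "__main__":
    # Placeholder values; substitute your own project directory and binary.
    cmd = build_survey_command(4, "./advisor_project", ["./my_mpi_app"])
    print(shlex.join(cmd))
```

Building the command as a list rather than a single string avoids quoting problems if the application arguments contain spaces.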

  • Getting Started with Intel® Cluster Checker for Linux*

    Intel® Cluster Checker verifies the configuration and performance of Linux-based clusters and checks compliance with the Intel® Scalable System Framework architecture specification. If it finds issues, Intel® Cluster Checker diagnoses them and may recommend how to repair the cluster.

    Intel® Cluster Checker has the following features:

  • Intel® Parallel Studio XE 2017 Update 1 Readme

    Deliver top application performance and reliability with Intel® Parallel Studio XE 2017. This C++ and Fortran software development suite simplifies designing, building, debugging, and tuning applications that take advantage of scalable MPI, thread, and vector parallel processing to boost application performance.
  • Running Intel® Parallel Studio XE Analysis Tools on Clusters with Slurm* / srun

    Because HPC applications target high performance, users want to analyze their runtime behavior. To get a representative picture of that behavior, it can be important to gather analysis data at the same scale as regular production runs. Doing so, however, means that shared-memory-focused analysis types would run on every node of the job in parallel. This might not be in the user’s best interest, especially since the behavior of a well-balanced MPI application should be very similar across all nodes.
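One way to avoid profiling everywhere is to wrap the application so that only selected ranks run under the analysis tool. The following is a rough sketch under stated assumptions: it reads the Slurm-provided `SLURM_PROCID` environment variable, uses the Advisor collector `advixe-cl` as the example tool, and writes to a separate project directory per rank. The function name `command_for_rank`, the directory layout, and the chosen rank set are hypothetical.

```python
import os


def command_for_rank(app_argv, project_dir, profiled_ranks, env=None):
    """Return the argv this rank should execute.

    Ranks listed in profiled_ranks run under the analysis tool;
    all other ranks run the application unmodified.
    """
    env = os.environ if env is None else env
    # srun sets SLURM_PROCID to the global rank of the task; default to 0 outside Slurm.
    rank = int(env.get("SLURM_PROCID", "0"))
    if rank in profiled_ranks:
        return [
            "advixe-cl", "-collect", "survey",
            # Assumption: one result directory per profiled rank.
            "-project-dir", f"{project_dir}/rank_{rank}",
            "--", *app_argv,
        ]
    return list(app_argv)


if __name__ == "__main__":
    # Placeholder application name; only rank 0 is profiled in this example.
    argv = command_for_rank(["./my_mpi_app"], "./advisor_project", {0})
    print(" ".join(argv))
```

A wrapper script started by `srun` could pass the returned list to `os.execvp(argv[0], argv)` so that the chosen command replaces the wrapper process.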

  • The New Issue of The Parallel Universe is Out: Modernize Your Code for Intel® Xeon Phi™ Processors

    Are you ready for the future of programming?

    High-performance computing is changing fast, with trends like machine learning and next-generation hardware like the Intel® Xeon Phi™ processor. To help developers maximize the possibilities, Intel® Parallel Studio XE 2017 delivers a host of new capabilities to support important trends like machine learning.
