Benchmark Intel® MPI

Intel® Parallel Studio XE 2016 Cluster Edition Initial Release Readme

Intel® Parallel Studio XE 2016 Cluster Edition for Linux* and Windows*

Deliver top application performance and reliability with the Cluster Edition of Intel® Parallel Studio XE 2016. This C++ and Fortran software development suite simplifies the design, build, debug, and tune of applications that take advantage of scalable MPI, thread and vector parallel processing to boost application performance.

Key Features

  • Linux*
  • Microsoft Windows* (XP, Vista, 7)
  • Microsoft Windows* 10
  • Microsoft Windows* 8.x
  • C/C++
  • Fortran
  • Intel® Parallel Studio XE Cluster Edition
  • Message Passing Interface
  • Elaborazione basata su cluster
  • Intel® Trace Analyzer and Collector 9.1 Update 1 Readme

    The Intel® Trace Analyzer and Collector for Linux* and Windows* is a low-overhead scalable event-tracing library with graphical analysis that reduces the time it takes an application developer to enable maximum performance of cluster applications. This package is for users who develop on and build for Intel® 64 architectures on Linux* and Windows*, as well as customers running on the Intel® Xeon Phi™ coprocessor on Linux*. The package also includes an optional download on OS X* for analysis only.  You must have a valid license to download, install and use this product.

  • Sviluppatori
  • Linux*
  • Microsoft Windows* 10
  • Microsoft Windows* 8.x
  • C/C++
  • Fortran
  • Intel® Trace Analyzer and Collector
  • Message Passing Interface
  • Elaborazione basata su cluster
  • MPI-1 Benchmarks

    IMB-MPI1 component of the Intel® MPI Benchmarks provides benchmarks for MPI-1 functions. IMB-MPI1 contains the following benchmarks:

    Standard Mode

    Multiple Mode

    PingPong

    Multi-PingPong

    PingPongSpecificSource (excluded by default)

    Scatter

    The benchmark for the MPI_Scatter function. The root process inputs X*np bytes (X for each process). All processes receive X bytes. The root of the operation is changed round-robin.

    Property

    Description

    Measured pattern

    Accumulate

    This is the benchmark for the MPI_Accumulate function. It reduces a vector of length L = x/sizeof(float) of float items. The MPI data type is MPI_FLOAT. The MPI operation is MPI_SUM. See the basic definitions and a schematic view of the pattern below.

    Accumulate Definition

    Property

    Igather

    The benchmark for MPI_Igather that measures communication and computation overlap.

    Property

    Description

    Measured pattern

    MPI_Igather/IMB_cpu_exploit/MPI_Wait

    MPI data type

    MPI_BYTE

    Root

    All_put_all

    The benchmark tests the scenario when all processes communicate with each other using MPI_Put operation. To avoid congestion due to simultaneous access to the memory of a process by all other processes, different ranks choose different targets at each particular step. For example, while looping through all the possible target ranks, the next target is chosen as follows: (target_rank + current_rank)%num_ranks.

    Unidir_put

    This benchmark measures the MPI_Put operation in passive target communication mode. The benchmark runs on two active processes. The target process is waiting in the MPI_Barrier call.

    Property

    Description

    Measured pattern

    MPI_Put/MPI_Win_flush

    MPI data type

    Iscriversi a Benchmark Intel® MPI