Intel® MPI Library

Making applications perform better on Intel® architecture-based clusters with multiple fabric flexibility

  • Scalability verified up to 150k processes
  • Supports the latest MPI-3 standard
  • MPICH ABI compatibility

Available in Intel® Parallel Studio XE Cluster Edition.
Buy Now

$499.00
Or Download a Free 30-Day Evaluation Version

Students, educators, academic researchers, and open source contributors may qualify for Free Software Tools.

Deliver Flexible, Efficient, and Scalable Cluster Messaging

Intel® MPI Library 5.0 focuses on making applications perform better on Intel® architecture-based clusters—implementing the high performance Message Passing Interface Version 3.0 specification on multiple fabrics. It enables you to quickly deliver maximum end user performance even if you change or upgrade to new interconnects, without requiring changes to the software or operating environment.

Use this high performance MPI message library to develop applications that can run on multiple cluster interconnects chosen by the user at runtime. Benefit from a free runtime environment kit for products developed with Intel® MPI Library. Get excellent performance for enterprise, divisional, departmental, workgroup, and personal High Performance Computing.


Quotes

“Fast and accurate state of the art general purpose CFD solvers is the focus at S & I Engineering Solutions Pvt, Ltd. Scalability and efficiency are key to us when it comes to our choice and use of MPI Libraries. The Intel® MPI Library has enabled us to scale to over 10k cores with high efficiency and performance.”
Nikhil Vijay Shende, Director,
S & I Engineering Solutions, Pvt. Ltd.

Performance

Optimized shared memory path for multicore platforms allows more communication throughput and lower latencies. Native InfiniBand interface (OFED verbs) also provides support for lower latencies. Multi-rail capability for higher bandwidth and increased interprocess communication and Tag Matching Interface (TMI) support for higher performance on Intel® True Scale, Qlogic* PSM, and Myricom* MX solutions.

  • Low latency MPI implementation up to 2 times as fast as alternative MPI libraries
  • Enable optimized shared memory dynamic connection mode for large SMP nodes
  • Increase performance with improved DAPL, OFA, and TMI fabric support
  • Accelerate applications using the enhanced tuning utility for MPI

Scalability

Implementing the high performance MPI 3.0 specification on multiple fabrics, Intel® MPI Library for Windows* and Linux* focuses on making applications perform better on IA-based clusters. Intel® MPI Library enables you to quickly deliver maximum end-user performance, even if you change or upgrade to new interconnects without requiring major modifications to the software or to the operating environment. Intel also provides a free runtime environment kit for products developed with the Intel® MPI Library.

  • Scaling verified up to 150k Processes
  • Thread safety allows you to trace hybrid multithreaded MPI applications for optimal performance on multi- and many-core Intel® Architecture.
  • Improved start scalability through the mpiexec.hydra process manager

Interconnect Independence & Flexible Runtime Fabric Selection

Whether you need to run TCP sockets, shared memory, or one of many Remote Direct Memory Access (RDMA) based interconnects, including InfiniBand*, Intel® MPI Library covers all configurations by providing an accelerated universal, multi-fabric layer for fast interconnects via the Direct Access Programming Library (DAPL*) or the Open Fabrics Association (OFA*) methodology. Develop MPI code independent of the fabric, knowing it will run efficiently on whatever network is chosen by the user at runtime.

  • Get high-performance interconnects, including Intel® True Scale, Myrinet* MX, and QLogic* PSM interfaces as well as TCP, shared memory, and others
  • Efficiently work through the Direct Access Programming Library (DAPL*), Open Fabrics Association (OFA*), and Tag Matching Interface (TMI*), making it easy for you to test and run applications on a variety of network fabrics.
    Optimizations to all levels of cluster fabrics: from shared memory thru Ethernet and RDMA-based fabrics to the tag matching interconnects

Intel® MPI Library dynamically establishes the connection, but only when needed, which reduces the memory footprint. It also automatically chooses the fastest transport available. Memory requirements are reduced by several methods including a two-phase communication buffer enlargement capability which allocates only the memory space actually required.

MPI 3.0 Standard Support

The next major evolution of the Message Passing Interface is with the release of the MPI-3.0 standard. Significant changes to remote memory access (RMA) one-sided communications, addition of non-blocking collective operations, and large counts messages greater than 2GB will enhance usability and performance. Now available in the Intel® MPI Library 5.0.

Binary compatibility

Intel® MPI Library offers binary compatibility with existing MPI-1.x and MPI-2.x applications. Even if you’re not ready to move to the new standard, you can still take advantage of the latest Intel® MPI Library performance improvements without recompiling. Furthermore, the Intel® MPI Library is an active collaborator in the MPICH ABI Compatibility Initiative, ensuring any MPICH-compiled code can use our runtimes.

Support for Mixed Operating Systems

Run a single MPI job using a cluster with mixed operating systems (Windows* OS and Linux OS*) under the Hydra process manager. Get more flexibility in job deployment with this added functionality.

Latest Processor Support

Intel consistently offers the first set of tools to take advantage of the latest performance enhancements in the newest Intel product, while preserving compatibility with older Intel and compatible processors. New support includes AVX2, TSX, FMA3 and AVX-512.

Videos to help you get started.

Register for future Webinars


Previously recorded Webinars:

  • MPI-3 Is Here: Optimize and Perform with Intel MPI Tools
  • Intel® MPI library implementation of a new MPI3.0 standard - new features and performance benchmarks
  • Increase Cluster MPI Application Performance with a "MPI Tune" Up
  • MPI on Intel® Xeon Phi™ coprocessor

More Tech Articles

Mapping of Intel® MPI Library versions to bundle suites
Von Gergana Slavova (Intel)Veröffentlicht am 08/28/20140
Introduction: Mapping the Intel® MPI Library numbers to specific suites and update versions Intel® Parallel Studio XE 2015 Update 1 Cluster Edition (released 26 November 2014) Intel® MPI Library 5.0 Intel® Registration Center Activation Date (yr.mo.day) Windows Version / build Linux …
Using Intel® MPI Library 5.0 with MPICH based applications
Von Dmitry Sivkov (Intel)Veröffentlicht am 08/25/20140
Why it is needed? Different MPI implementations have their specific benefits and advantages. So in the specific cluster environment the HPC application with the other MPI implementation can probably perform better.  Intel® MPI Library has the following benefits: Support of the wide range of clus…
NOAA NIM with Support for Intel® Xeon Phi™ Coprocessor
Von Ashish Jha (Intel)Veröffentlicht am 07/03/20140
Non-hydrostatic Icosahedral Model is a weather forecasting model developed by NOAA. G6 K96 which is a smaller data-set which scales best up to 4 cluster nodes. G9 is useful for studying larger clusters. The code supports the symmetric mode of operation of the Intel® Xeon® processor (Referred to as …
Intel® Cluster Tools Open Source Downloads
Von Gergana Slavova (Intel)Veröffentlicht am 03/06/20140
This article makes available third-party libraries and sources that were used in the creation of Intel® Software Development Products. Intel provides this software pursuant to their applicable licenses. Products and Versions: Intel® Trace Analyzer and Collector for Linux* gcc-3.2.3-42.zip (which…
Intel Developer Zone Beiträge abonnieren

Supplemental Documentation

Intel® Parallel Studio XE 2015 Update 3 Cluster Edition Readme
Von Gergana Slavova (Intel)Veröffentlicht am 04/24/20150
The Intel® Parallel Studio XE 2015 Update 3 Cluster Edition for Linux* and Windows* combines all Intel® Parallel Studio XE and Intel® Cluster Tools into a single package. This multi-component software toolkit contains the core libraries and tools to efficiently develop, optimize, run, and distribut…
Intel® Parallel Studio XE 2015 Update 2 Cluster Edition Readme
Von Gergana Slavova (Intel)Veröffentlicht am 02/06/20150
The Intel® Parallel Studio XE 2015 Update 2 Cluster Edition for Linux* and Windows* combines all Intel® Parallel Studio XE and Intel® Cluster Tools into a single package. This multi-component software toolkit contains the core libraries and tools to efficiently develop, optimize, run, and distribut…
Intel® Parallel Studio XE 2015 Update 1 Cluster Edition Readme
Von Gergana Slavova (Intel)Veröffentlicht am 11/24/20140
The Intel® Parallel Studio XE 2015 Update 1 Cluster Edition for Linux* and Windows* combines all Intel® Parallel Studio XE and Intel® Cluster Tools into a single package. This multi-component software toolkit contains the core libraries and tools to efficiently develop, optimize, run, and distribut…
Intel® Parallel Studio XE 2015 Cluster Edition Initial Release Readme
Von Gergana Slavova (Intel)Veröffentlicht am 08/15/20140
The Intel® Parallel Studio XE 2015 Cluster Edition for Linux* and Windows* combines all Intel® Parallel Studio XE and Intel® Cluster Tools into a single package. This multi-component software toolkit contains the core libraries and tools to efficiently develop, optimize, run, and distribute paralle…
Intel Developer Zone Beiträge abonnieren

You can reply to any of the forum topics below by clicking on the title. Please do not include private information such as your email address or product serial number in your posts. If you need to share private information with an Intel employee, they can start a private thread for you.

New topic    Search within this forum     Subscribe to this forum


Intel® Parallel Studio XE 2016 Beta program has started!
Von Gergana Slavova (Intel)0
The Intel® Parallel Studio XE 2016 Beta program is now available! In this beta test, you will have early access to Intel® Parallel Studio XE 2016 products and the opportunity to provide feedback to help make our products better. Registration is easy through the pre-Beta survey site. This suite of products brings together exciting new technologies along with improvements to Intel’s existing software development tools: Expanded Standards and Features – Scaling Development Efforts Forward Additional language support for C11 and C++14, Fortran 2008 Submodules and IMPURE ELEMENTAL, and C Interoperability from Fortran 2015, and OpenMP* 4.1 TR 3.  New support for SIMD operator use with SSE integer types, Intel® Cilk™ Plus combined Parallel and SIMD loops, OpenMP* 4.0 user-defined reductions (C++ only), enhanced uninitialized variable detection (Fortran only), feature improvements to Intel’s Language Extensions for Offload, annotated source listings, and a new directory structure.  All ava…
Problem with Intel MPI on >1023 processes
Von Jack S.4
I have been testing code using Intel MPI (version 4.1.3  build 20140226) and the Intel compiler (version 15.0.1 build 20141023) with 1024 or more total processes. When we attempt to run on 1024 or more processes we receive the following error:  MPI startup(): ofa fabric is not available and fallback fabric is not enabled  Anything less than 1024 processes does not produce this error, and I also do not receive this error with 1024 processes using OpenMPI and GCC. I am using the High Performance Conjugate Gradient benchmark as my test code, although we have received the same errors with other test codes. 
Problems with Intel MPI
Von Palina L.2
I have trouble with running Intel MPI on cluster with different different numbers of processors on nodes (12 and 32). I use Intel MPI 4.0.3 and it works correctly on 20 nodes with 12 processors (Intel(Xeon(R)CPU X5650 @2.67)) at each, and all processors works correctly, then I try to run Intel MPI on other 3 nodes with 32 processors (Intel(Xeon(R)CPU E5-4620 v2@2.00) at each and they work correctly too. But when I try to run my tasks on all nodes with different types of processors and the same type of Intel MPI I cant use more than 48 processors. Spead falls. I use option --machinefile mpirun -machinefile mpihosts.txt ./wrf.exe mpihosts.txt (cn01:12 cn02:12 ... cn29:32 cn30:32) How can I use Intel MPI 4.0.3 correctly on all of these nodes?
Mapping ranks consecutively on nodes
Von 4f0drlp7eyj34
Hi,    Running Intel MPI 4.1.3    Contrary to the user guide, which states for the default round-robin mapping, To change this default behavior, set the number of processes per host by using the -perhost option, and set the total number of processes by using the -n option. See Local Options for details. The first <# of processes> indicated by the -perhost option is executed on the first host; the next <# of processes> is executed on the next host, and so on. , when I try to run on 2 nodes and I_MPI_DEBUG=4, I see [cchang@n0290]$ mpirun -n 4 -perhost 2 ./hello_MPIMP_multinode [0] MPI startup(): Rank    Pid      Node name  Pin cpu [0] MPI startup(): 0       54622    n0290      {0,1,2,3,4,5,6,7,8,9,10,11} [0] MPI startup(): 1       53310    n0289      {0,1,2,3,4,5,6,7,8,9,10,11} [0] MPI startup(): 2       54623    n0290      {12,13,14,15,16,17,18,19,20,21,22,23} [0] MPI startup(): 3       53311    n0289      {12,13,14,15,16,17,18,19,20,21,22,23} Hello world: rank 0 of …
Intel MPI 5.0.3.048 Data Transmission Corruption Issue
Von Stephen L.0
We are experiencing 3 failure modes with Intel MPI 5.0.3.048 on RHEL. Please change this to a private thread so that we can discuss details. Stephen Lecrenski
MPI: polling 'passive' rma operations
Von zp34
Hi, lately I'm wondering if your implementation of the passive target communication was ever really ment for usage... Despite the fact that it isn't really passive (since one has to call some mpi functions on the target to get the mpi_win_unlock ever to return), I couldn't even figure out which mpi functions exactly must/can be invoked to achieve the flushing. In the release notes is only written: The following MPI-2.2 features are not supported by the Intel(R) MPI Library: o Passive target one-sided communication when  the target process does not call any MPI functions By now I've tested several 'MPI functions', but the only one that seems to work is a blocking MPI_RECV (and only then if the recv has to wait for the message). - Thats very unsatisfactory So my question: exist there at least some special functions that I can use for polling the rma window? I know this isn't foreseen of the mpi standard as less as your kind of implementation is, but it would render the whole thing a bi…
Intel MPI, perhost, and SLURM: Can I override SLURM?
Von thematt2
All, (Note: I'm also asking this on the slurm-dev list.) I'm hoping you can help me with a question. Namely, I'm on a cluster that uses SLURM and lets say I ask for 2 28-core Haswell nodes to run interactively and I get them. Great, so my environment now has things like: SLURM_NTASKS_PER_NODE=28 SLURM_TASKS_PER_NODE=28(x2) SLURM_JOB_CPUS_PER_NODE=28(x2) SLURM_CPUS_ON_NODE=28 Now, let's run a simple HelloWorld on, say, 48 processors (and pipe through sort to see things a bit better): (1047) $ mpirun -np 48 -print-rank-map ./helloWorld.exe | sort -k2 -g srun.slurm: cluster configuration lacks support for cpu binding (borgj102:0,1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27) (borgj105:28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47) Process 0 of 48 is on borgj102 Process 1 of 48 is on borgj102 Process 2 of 48 is on borgj102 Process 3 of 48 is on borgj102 Process 4 of 48 is on borgj102 Process 5 of 48 is on borgj102 Proce…
mpitune get &quot;could not dump the session, because unknown encoding: utf-8&quot;
Von Xinwei X. (Intel)1
Hi forum, I try the following command on a server: (impi 5.0.2.044, icc 2015.2.164) mpitune -of analysis.conf -application \"mpirun -n 24 -host `hostname` ./myexe\" It did run for a while but output nothing of analysis.conf. Meanwhile the console output message like: ERR | Could not dump the session, because unknown encoding: utf-8 I try to change LANG=C, as locale outputs: LANG=C LC_CTYPE="C" ...(all other environment variables are "C") How can I run successfully of mpitune to get analysis.conf which has real contents. Thanks!
Foren abonnieren

Licensing

  • What kinds of licenses are available for the Intel® MPI Library?
  • The Runtime license includes everything you need to run Intel MPI-based applications. The license is free and permanent. The Developer license includes everything needed to build and run applications. It is fee-based and permanent. It allows free redistribution of the components needed to run Intel MPI-based applications.

  • When is a Developer license required for the Intel® MPI Library?
  • The two kits (developer and runtime) can co-exist on a machine and it is fine for customers of Intel MPI-based applications to relink the application to include user subroutines. If the customer is actually writing MPI code (calling MPI_* functions directly), then a Developer license would be needed.

  • I am an ISV and am planning to ship my product with Intel® MPI Library. Do my customers have to buy the Intel® MPI Library Development Kit in order to use my software?
  • No. There are currently 3 different models if ISVs want to ship with Intel MPI Library.
    1) An ISV can redistribute the runtime components of the Intel MPI Library available from the development kit (see the redist.txt file in the Intel MPI Library installation directory for list of redistributable files).
    2) If a customer would rather install the Intel MPI Library as a system component, the Runtime Environment Kit can be downloaded free of charge from the Intel MPI Library product page.
    3) The Intel® MPI Library Runtime Environment (RTO) can be pre-installed by the vendor and shipped with the application.

Downloads

Compatibility

  • Does the Intel® MPI Library support 32-bit applications on 64-bit operating systems?
  • No. The Intel® MPI Library only supports 64-bit apps on 64-bit operating systems on Intel® 64. For more details, visit our Deprecation page.

  • Is there a Microsoft* Windows* version of the Intel® MPI Library?
  • Yes. The Intel MPI Library for Windows is available now.

  • Does the Intel MPI Library run on AMD platforms?
  • Yes. The Intel® MPI Library is known to run on AMD platforms, and we have had no issue reports specific to AMD platforms so far.

  • Does the Intel® MPI Library support parallel I/O calls?
  • Yes. The parallel file I/O part of the MPI-2 standard is fully implemented by the Intel® MPI Library 5.0. Some of the currently supported file systems include Unix File System (UFS), Network File System (NFS), Parallel Virtual File System (PVFS2), and Lustre*.  For a complete list, check the Release Notes.

  • Does the Intel® MPI Library support one-sided communication?
  • Yes. The Intel® MPI Library supports both active target and passive target one-sided communication. The only exception is the passive target one-sided communication in case the target process does not call any MPI functions. Further support is available through the new one-sided calls and memory models in MPI-3.0.

  • Does the Intel® MPI Library support heterogeneous clusters?
  • Yes. The Intel® MPI Library now supports clusters running different operating systems as well as an environment of mixed Intel processors. The library provides default optimizations depending on the detected architecture.

  • What DAPL* version does the Intel® MPI Library support?
  • The Intel® MPI Library uses Direct Access Programming Library (DAPL) as a fabric independent API to run on fast interconnects like InfiniBand* or Myrinet*. Currently the Intel MPI Library supports DAPL* version 1.1, 1.2 as well as DAPL* version 2.0-capable providers. Intel MPI automatically determines the version of DAPL standard to which the provider conforms.

  • What compilers does the Intel® MPI Library support?
  • The Intel® MPI Library supports Intel® Compilers 13.1 through 15.0 (or higher), as well as GNU* C, C++, Fortran77 3.3 or higher, and GNU* Fortran95 4.0 or higher. Additionally, the Intel® MPI Library provides a bundled source kit that offers support for the PGI* C, PGI* Fortran 77, and Absoft* Fortran 77 compilers out of the box, with the following caveats:

    • The PGI* compiled source files must not transfer long double entities
    • The Absoft* based build procedure must use the -g77, -B108 compiler option
    • You must take care of installing and selecting the right compilers
    • You must make sure that the respective compiler runtime is installed on all nodes

    You may have to build extra Intel® MPI binding libraries if you need support for PGI* C++, PGI* Fortran 95, and Absoft* Fortran 95 bindings. If you need access to this additional binding kit, contact us via the Intel® Premier Support portal @ http://premier.intel.com

  • Does the Intel® MPI Library work with any common resource managers?
  • Yes. The Intel® MPI Library supports OpenPBS*, PBS Pro*, Torque, LSF*, Parallelnavi*, NetBatch*, SLURM*, SGE*, LoadLeveler* and Lava* batch schedulers. The simplified job startup command mpirun recognizes when it is run inside a session started by any PBS compatible resource manager (like OpenPBS*, PBS Pro*, Torque*), as well as LSF*. See the Intel® MPI Library Reference Manual for a description of this command.

  • I have a mixed application which uses both MPI and OpenMP* calls. Does the Intel® MPI Library support this type of hybrid functionality?
  • Yes, Intel MPI does support mixed MPI/OpenMP applications.

Technical

  • Is the Intel® MPI Library fault-tolerant?
  • Yes, to an extent. Note that the MPI standard does not yet define proper handling of aborted MPI ranks. By default, the Intel® MPI Library will stop the entire application if any of the processes exit abnormally. This behavior can be overwritten via a runtime option where the library does allow for an application to continue execution even if one of the processes stops responding. Check the Intel® MPI Library Reference Manual for details and application requirements.

  • Is the Intel® MPI Library thread safe?
  • Yes. The Intel® MPI Library includes thread safe libraries at level MPI_THREAD_MULTIPLE. Several threads can make the Intel MPI Library calls simultaneously. Use the compiler driver -mt_mpi option to link the thread safe version of the Intel MPI Library. Use the thread safe libraries if you request the thread support at the following levels:

    MPI_THREAD_FUNNELED,
    MPI_THREAD_SERIALIZED, or
    MPI_THREAD_MULTIPLE.

  • How can I learn what version of the Intel® MPI Library is installed on the system?
  • You can use mpirun –V to get versioning and build information:

    mpirun –V
    This will output version information.

    If this is an official package, look up the mpisupport.txt file or the Release Notes and search for a version information there:
    cat /opt/intel/mpi/5.0/mpisupport.txt

    If Intel MPI has been installed in RPM mode, try to query the RPM database:
    rpm –qa | grep intel-mpi

    Finally, for full build identification information, set I_MPI_VERSION to 1 and run any MPI program, grepping for "Build":
    mpirun –n 2 –env ./a.out | grep –i build
    This will turn up a couple of lines with the build date. Most of this information is also imbedded into the library and can be queried using the strings utility:
    strings /opt/intel/mpi/5.0/lib/libmpi.so | grep –i build

Intel® MPI Library 5.0

Getting Started?

Click the Learn tab for guides and links that will quickly get you started.

Get Help or Advice

Search Support Articles
Forums - The best place for timely answers from our technical experts and your peers. Use it even for bug reports.
Support - For secure, web-based, engineer-to-engineer support, visit our Intel® Premier Support web site. Intel Premier Support registration is required.
Download, Registration and Licensing Help - Specific help for download, registration, and licensing questions.

Resources

Release Notes - View Release Notes online!
Intel® MPI Library Product Documentation - View documentation online!
Documentation for other software products