Intel® MPI Library

Making applications perform better on Intel® architecture-based clusters with multiple fabric flexibility

  • Scalability verified up to 150k processes
  • Supports the latest MPI-3 standard
  • MPICH ABI compatibility

Available in Intel® Parallel Studio XE Cluster Edition.
Buy Now

Or Download a Free 30-Day Evaluation Version

You may qualify for free software tools.

Deliver Flexible, Efficient, and Scalable Cluster Messaging

Intel® MPI Library 5.0 focuses on making applications perform better on Intel® architecture-based clusters—implementing the high performance Message Passing Interface Version 3.0 specification on multiple fabrics. It enables you to quickly deliver maximum end user performance even if you change or upgrade to new interconnects, without requiring changes to the software or operating environment.

Use this high performance MPI message library to develop applications that can run on multiple cluster interconnects chosen by the user at runtime. Benefit from a free runtime environment kit for products developed with Intel® MPI Library. Get excellent performance for enterprise, divisional, departmental, workgroup, and personal High Performance Computing.


“Fast and accurate state of the art general purpose CFD solvers is the focus at S & I Engineering Solutions Pvt, Ltd. Scalability and efficiency are key to us when it comes to our choice and use of MPI Libraries. The Intel® MPI Library has enabled us to scale to over 10k cores with high efficiency and performance.”
Nikhil Vijay Shende, Director,
S & I Engineering Solutions, Pvt. Ltd.


Optimized shared memory path for multicore platforms allows more communication throughput and lower latencies. Native InfiniBand interface (OFED verbs) also provides support for lower latencies. Multi-rail capability for higher bandwidth and increased interprocess communication and Tag Matching Interface (TMI) support for higher performance on Intel® True Scale, Qlogic* PSM, and Myricom* MX solutions.

  • Low latency MPI implementation up to 2 times as fast as alternative MPI libraries
  • Enable optimized shared memory dynamic connection mode for large SMP nodes
  • Increase performance with improved DAPL, OFA, and TMI fabric support
  • Accelerate applications using the enhanced tuning utility for MPI


Implementing the high performance MPI 3.0 specification on multiple fabrics, Intel® MPI Library for Windows* and Linux* focuses on making applications perform better on IA-based clusters. Intel® MPI Library enables you to quickly deliver maximum end-user performance, even if you change or upgrade to new interconnects without requiring major modifications to the software or to the operating environment. Intel also provides a free runtime environment kit for products developed with the Intel® MPI Library.

  • Scaling verified up to 150k Processes
  • Thread safety allows you to trace hybrid multithreaded MPI applications for optimal performance on multi- and many-core Intel® Architecture.
  • Improved start scalability through the mpiexec.hydra process manager

Interconnect Independence & Flexible Runtime Fabric Selection

Whether you need to run TCP sockets, shared memory, or one of many Remote Direct Memory Access (RDMA) based interconnects, including InfiniBand*, Intel® MPI Library covers all configurations by providing an accelerated universal, multi-fabric layer for fast interconnects via the Direct Access Programming Library (DAPL*) or the Open Fabrics Association (OFA*) methodology. Develop MPI code independent of the fabric, knowing it will run efficiently on whatever network is chosen by the user at runtime.

  • Get high-performance interconnects, including Intel® True Scale, Myrinet* MX, and QLogic* PSM interfaces as well as TCP, shared memory, and others
  • Efficiently work through the Direct Access Programming Library (DAPL*), Open Fabrics Association (OFA*), and Tag Matching Interface (TMI*), making it easy for you to test and run applications on a variety of network fabrics.
    Optimizations to all levels of cluster fabrics: from shared memory thru Ethernet and RDMA-based fabrics to the tag matching interconnects

Intel® MPI Library dynamically establishes the connection, but only when needed, which reduces the memory footprint. It also automatically chooses the fastest transport available. Memory requirements are reduced by several methods including a two-phase communication buffer enlargement capability which allocates only the memory space actually required.

MPI 3.0 Standard Support

The next major evolution of the Message Passing Interface is with the release of the MPI-3.0 standard. Significant changes to remote memory access (RMA) one-sided communications, addition of non-blocking collective operations, and large counts messages greater than 2GB will enhance usability and performance. Now available in the Intel® MPI Library 5.0.

Binary compatibility

Intel® MPI Library offers binary compatibility with existing MPI-1.x and MPI-2.x applications. Even if you’re not ready to move to the new standard, you can still take advantage of the latest Intel® MPI Library performance improvements without recompiling. Furthermore, the Intel® MPI Library is an active collaborator in the MPICH ABI Compatibility Initiative, ensuring any MPICH-compiled code can use our runtimes.

Support for Mixed Operating Systems

Run a single MPI job using a cluster with mixed operating systems (Windows* OS and Linux OS*) under the Hydra process manager. Get more flexibility in job deployment with this added functionality.

Latest Processor Support

Intel consistently offers the first set of tools to take advantage of the latest performance enhancements in the newest Intel product, while preserving compatibility with older Intel and compatible processors. New support includes AVX2, TSX, FMA3 and AVX-512.

Videos to help you get started.

Register for future Webinars

Previously recorded Webinars:

  • MPI-3 Is Here: Optimize and Perform with Intel MPI Tools
  • Intel® MPI library implementation of a new MPI3.0 standard - new features and performance benchmarks
  • Increase Cluster MPI Application Performance with a "MPI Tune" Up
  • MPI on Intel® Xeon Phi™ coprocessor

More Tech Articles

Mapping of Intel® MPI Library versions to bundle suites
By Gergana Slavova (Intel)Posted 08/28/20140
Introduction: Mapping the Intel® MPI Library numbers to specific suites and update versions Intel® Parallel Studio XE 2015 Update 1 Cluster Edition (released 26 November 2014) Intel® MPI Library 5.0 Intel® Registration Center Activation Date ( Windows Version / build Linu...
Using Intel® MPI Library 5.0 with MPICH based applications
By Dmitry Sivkov (Intel)Posted 08/25/20140
Why it is needed? Different MPI implementations have their specific benefits and advantages. So in the specific cluster environment the HPC application with the other MPI implementation can probably perform better.  Intel® MPI Library has the following benefits: Support of the wide range of cl...
NOAA NIM with Support for Intel® Xeon Phi™ Coprocessor
By Ashish Jha (Intel)Posted 07/03/20140
Non-hydrostatic Icosahedral Model is a weather forecasting model developed by NOAA. G6 K96 which is a smaller data-set which scales best up to 4 cluster nodes. G9 is useful for studying larger clusters. The code supports the symmetric mode of operation of the Intel® Xeon® processor (Referred to a...
Intel® Cluster Tools Open Source Downloads
By Gergana Slavova (Intel)Posted 03/06/20140
This article makes available third-party libraries and sources that were used in the creation of Intel® Software Development Products. Intel provides this software pursuant to their applicable licenses. Products and Versions: Intel® Trace Analyzer and Collector for Linux* (whi...
Subscribe to Intel Developer Zone Articles

Supplemental Documentation

Intel® Parallel Studio XE 2015 Update 2 Cluster Edition Readme
By Gergana Slavova (Intel)Posted 02/06/20150
The Intel® Parallel Studio XE 2015 Update 2 Cluster Edition for Linux* and Windows* combines all Intel® Parallel Studio XE and Intel® Cluster Tools into a single package. This multi-component software toolkit contains the core libraries and tools to efficiently develop, optimize, run, and distrib...
Intel® Parallel Studio XE 2015 Update 1 Cluster Edition Readme
By Gergana Slavova (Intel)Posted 11/24/20140
The Intel® Parallel Studio XE 2015 Update 1 Cluster Edition for Linux* and Windows* combines all Intel® Parallel Studio XE and Intel® Cluster Tools into a single package. This multi-component software toolkit contains the core libraries and tools to efficiently develop, optimize, run, and distrib...
Intel® Parallel Studio XE 2015 Cluster Edition Initial Release Readme
By Gergana Slavova (Intel)Posted 08/15/20140
The Intel® Parallel Studio XE 2015 Cluster Edition for Linux* and Windows* combines all Intel® Parallel Studio XE and Intel® Cluster Tools into a single package. This multi-component software toolkit contains the core libraries and tools to efficiently develop, optimize, run, and distribute paral...
Using Multiple DAPL* Providers with the Intel® MPI Library
By James Tullos (Intel)Posted 09/19/20130
Introduction If your MPI program sends messages of drastically different sizes (for example, some 16 byte messages, and some 4 megabyte messages), you want optimum performance at all message sizes.  This cannot easily be obtained with a single DAPL* provider.  This is due to latency being a major...
Subscribe to Intel Developer Zone Articles

You can reply to any of the forum topics below by clicking on the title. Please do not include private information such as your email address or product serial number in your posts. If you need to share private information with an Intel employee, they can start a private thread for you.

New topic    Search within this forum     Subscribe to this forum

Profiling a complex MPI Application : CESM (Community Earth System Model)
By Nitin Kundapur B.2
Hello.  CESM is a complex MPI climate model which is a highly parallel application.  I am looking for ways to profile CESM runs. The default profiler provides profiling data for only a few routines. I have tried using external profilers like TAU, HPC Toolkit, Allinea Map, ITAC Traceanalyzer and VTune.  As I was running CESM across a cluster (with 8 nodes - 16 processors each), it was most beneficial to use HPC Toolkit and Allinea Map for profiling. However, I am keen on finding two metrics for each CESM routine executed.  These are : 1) Total execution time of the function 2) Number of function calls made Both of these do not provide the number of function calls made for a routine.  The number of function calls made is important because this will help me find the time taken for execution of each call of a function. Just wanted to know if this has been achieved by anyone. Is there a way to do this with any of these tools?    Thanks, Nitin K Bhat SERC, Indian Institute of Science
Performance issues of Intel MPI on Windows 7 SP 1 with 2x18 cores cpus.
By Frank R.2
Dear support team, I have a question about a performance difference between Windows 7 SP 1 and RHEL 6.5. The situation is as follows: The hardware is a DELL precision rack 7910, see link for exact specification (click on components): We installed Linux RHEL 6.5 on this machine and ran our product (compiled with Intel C/C++/Fortran 13.1.3 (gcc version 4.4.7 compatibility) and Intel MPI on Linux). After that, we installed Windows 7 SP 1 on this machine and ran our product (compiled with Intel C/C++/Fortran and Intel MPI on Windows) again. What we observed is a big performance drop on 1 and 2 cpu on Windows in comparison to Linux. If we go up to 8, 16, 32 cpus we got nearly the same performance on Windows as on Linux, but we got heavy oscillation in computation time only on Windows (sometimes 16 cpus faster than 32 cpu). On Intel MPI we didn't see this os...
MPI_Init_thread or MPI_Init failed in child process
By Yongjun L.7
I have two programs, A and B. They all are developed with MPI. A will call B.  If I directly start A and call B, every thing is OK. If I start A with mpiexec, like mpiexec -localonly 2 A.exe, and call B. MPI_Init_thread or MPI_Init will fail in B.  Below is the error message I got. [01:2668]..ERROR:Error while connecting to host, No connection could be made because the target machine actively refused it. (10061) [01:2668]..ERROR:Connect on sock (host=localhost, port=53649) failed, exhaused all end points SMPDU_Sock_post_connect failed. [1] PMI_ConnectToHost failed: unable to post a connect to localhost:53649, error: Undefined dynamic error code uPMI_ConnectToHost returning PMI_FAIL [1] PMI_Init failed. Fatal error in PMPI_Init_thread: Other MPI error, error stack: MPIR_Init_thread(659): MPID_Init(154).......: channel initialization failed MPID_Init(448).......: PMI_Init returned -1 Can anyone tell me what is the problem? How to solve it? Thanks Yongjun
need to type "Enter" ?
By dingjun.chencmgl.ca1
Hi, Everyone, I am running my hybrid MPI/OpenMP jobs on 3-nodes Infiniband PCs Linux cluster. each node has one MPI process that has 15 OpenMP threads. This means my job runs with 3 MPI processes and each MPI process has 15 threads. the hosts.txt file is given as follows: coflowrhc4-5:1 coflowrhc4-6:1 coflowrhc4-7:1  I wrote the following batch file as follows: /************** batch file******************/ export CMG_LIC_HOST=rlmserv export exe=/cmg/dingjun/imexLocal/imex_xsamg_dave.exe export LD_LIBRARY_PATH=/cmg/dingjun/imexLocal/linux_x64/lib export OMP_SCHEDULE=static,1 export KMP_AFFINITY=compact,0 export datadir=/cmg/dingjun/imexdatasets/7testproblems/mx1041_rb cd /cmg/dingjun/imexdatasets/7testproblems/mx1041_rb mpirun -machinefile hosts.txt ${exe} -fgmres -f ${datadir}/mx1041x105x10loa2_rb_xsamg.dat -log -jacdoms 16 -parasol 16 -o mx1041x105x10loa2_rb_xsamg_3MPI15threads_run7 export datadir=/cmg/dingjun/imexdatasets/7testproblems/mx521_rb cd /cmg/dingjun/imexdataset...
Intel MPI and I_MPI_EXTRA_FILESYSTEM: How to tell it's on?
By thematt3
All, I hope the Intel MPI experts here can help me out. Intel MPI was recently installed on our cluster, a cluster that uses a GPFS filesystem. Looking at the release notes I saw that "I_MPI_EXTRA_FILESYSTEM_LIST gpfs" was now available. Great! I thought I'd try to see if I can see an effect or not. However, I'm having trouble detecting whether it's on or not. I tried running a simple Hello World (no I/O, but simple) with I_MPI_DEBUG=9. When I do so, I get the usual splat of information but if I pass in "-genv I_MPI_EXTRA_FILESYSTEM on -genv I_MPI_EXTRA_FILESYSTEM_LIST gpfs" or not, I never see anything in the I_MPI_DEBUG output that says if I enabled it or not. I even tried I_MPI_DEBUG=100, but nothing. Is there a way to know if this has been enabled? I was hoping to try to figure out an MPI-I/O benchmark that would let me see a difference, but if I can't tell if Intel MPI is actually enabling it, I'm a bit wary to thrash my disks without being sure. Thanks, Matt
Cannot use jemalloc with IntelMPI
By Eloi Gaudry3
Hi, I've tried to bench several memory allocators on Linux (64-bit) such as ptmalloc2, tcmalloc and jemalloc with an application linked against IntelMPI ( Launching any application linked with jemalloc will cause the execution to abort with a signal 11. But the same application, when not linked with IntelMPI will work without any issue. Is IntelMPI doing its own malloc/free ? How can this issue be overcome ? Thanks, Eloi  
How many processes we can run run HPL on with Intel MPI ?
By Nihir Parikh6
If we are using evaluation version of Intel MPI/compiler/MKL (cluster tools) how many cores/processes we can run HPL on?
problem when multiple MPI versions installed
By tangzhanghong98@yahoo.com8
Dear all, I have a problem to launch processes when multiple MPI versions installed. The processes work before I installed latest MPI C:\Program Files (x86)\Intel\MPI\>mpiexec -wdir "Z:\test" -mapall -hosts 10 n01 6 n02 6 n03 6 n04 6 n05 6 n06 6 n07 6 n08 6 n09 6 n10 6 Z:\test However, after I installed MPI, the following errors displayed when I launch mpiexec in the environment of Aborting: unable to connect to N01, smpd version mismatch   I have already run the following command in the environment of before launching mpiexec: hydra_service -stop   Could anyone help me to take a look at it? Is it possible to let both versions work in the cluster?   Thanks, Zhanghong Tang  
Subscribe to Forums


  • What kinds of licenses are available for the Intel® MPI Library?
  • The Runtime license includes everything you need to run Intel MPI-based applications. The license is free and permanent. The Developer license includes everything needed to build and run applications. It is fee-based and permanent. It allows free redistribution of the components needed to run Intel MPI-based applications.

  • When is a Developer license required for the Intel® MPI Library?
  • The two kits (developer and runtime) can co-exist on a machine and it is fine for customers of Intel MPI-based applications to relink the application to include user subroutines. If the customer is actually writing MPI code (calling MPI_* functions directly), then a Developer license would be needed.

  • I am an ISV and am planning to ship my product with Intel® MPI Library. Do my customers have to buy the Intel® MPI Library Development Kit in order to use my software?
  • No. There are currently 3 different models if ISVs want to ship with Intel MPI Library.
    1) An ISV can redistribute the runtime components of the Intel MPI Library available from the development kit (see the redist.txt file in the Intel MPI Library installation directory for list of redistributable files).
    2) If a customer would rather install the Intel MPI Library as a system component, the Runtime Environment Kit can be downloaded free of charge from the Intel MPI Library product page.
    3) The Intel® MPI Library Runtime Environment (RTO) can be pre-installed by the vendor and shipped with the application.



  • Does the Intel® MPI Library support 32-bit applications on 64-bit operating systems?
  • No. The Intel® MPI Library only supports 64-bit apps on 64-bit operating systems on Intel® 64. For more details, visit our Deprecation page.

  • Is there a Microsoft* Windows* version of the Intel® MPI Library?
  • Yes. The Intel MPI Library for Windows is available now.

  • Does the Intel MPI Library run on AMD platforms?
  • Yes. The Intel® MPI Library is known to run on AMD platforms, and we have had no issue reports specific to AMD platforms so far.

  • Does the Intel® MPI Library support parallel I/O calls?
  • Yes. The parallel file I/O part of the MPI-2 standard is fully implemented by the Intel® MPI Library 5.0. Some of the currently supported file systems include Unix File System (UFS), Network File System (NFS), Parallel Virtual File System (PVFS2), and Lustre*.  For a complete list, check the Release Notes.

  • Does the Intel® MPI Library support one-sided communication?
  • Yes. The Intel® MPI Library supports both active target and passive target one-sided communication. The only exception is the passive target one-sided communication in case the target process does not call any MPI functions. Further support is available through the new one-sided calls and memory models in MPI-3.0.

  • Does the Intel® MPI Library support heterogeneous clusters?
  • Yes. The Intel® MPI Library now supports clusters running different operating systems as well as an environment of mixed Intel processors. The library provides default optimizations depending on the detected architecture.

  • What DAPL* version does the Intel® MPI Library support?
  • The Intel® MPI Library uses Direct Access Programming Library (DAPL) as a fabric independent API to run on fast interconnects like InfiniBand* or Myrinet*. Currently the Intel MPI Library supports DAPL* version 1.1, 1.2 as well as DAPL* version 2.0-capable providers. Intel MPI automatically determines the version of DAPL standard to which the provider conforms.

  • What compilers does the Intel® MPI Library support?
  • The Intel® MPI Library supports Intel® Compilers 13.1 through 15.0 (or higher), as well as GNU* C, C++, Fortran77 3.3 or higher, and GNU* Fortran95 4.0 or higher. Additionally, the Intel® MPI Library provides a bundled source kit that offers support for the PGI* C, PGI* Fortran 77, and Absoft* Fortran 77 compilers out of the box, with the following caveats:

    • The PGI* compiled source files must not transfer long double entities
    • The Absoft* based build procedure must use the -g77, -B108 compiler option
    • You must take care of installing and selecting the right compilers
    • You must make sure that the respective compiler runtime is installed on all nodes

    You may have to build extra Intel® MPI binding libraries if you need support for PGI* C++, PGI* Fortran 95, and Absoft* Fortran 95 bindings. If you need access to this additional binding kit, contact us via the Intel® Premier Support portal @

  • Does the Intel® MPI Library work with any common resource managers?
  • Yes. The Intel® MPI Library supports OpenPBS*, PBS Pro*, Torque, LSF*, Parallelnavi*, NetBatch*, SLURM*, SGE*, LoadLeveler* and Lava* batch schedulers. The simplified job startup command mpirun recognizes when it is run inside a session started by any PBS compatible resource manager (like OpenPBS*, PBS Pro*, Torque*), as well as LSF*. See the Intel® MPI Library Reference Manual for a description of this command.

  • I have a mixed application which uses both MPI and OpenMP* calls. Does the Intel® MPI Library support this type of hybrid functionality?
  • Yes, Intel MPI does support mixed MPI/OpenMP applications.


  • Is the Intel® MPI Library fault-tolerant?
  • Yes, to an extent. Note that the MPI standard does not yet define proper handling of aborted MPI ranks. By default, the Intel® MPI Library will stop the entire application if any of the processes exit abnormally. This behavior can be overwritten via a runtime option where the library does allow for an application to continue execution even if one of the processes stops responding. Check the Intel® MPI Library Reference Manual for details and application requirements.

  • Is the Intel® MPI Library thread safe?
  • Yes. The Intel® MPI Library includes thread safe libraries at level MPI_THREAD_MULTIPLE. Several threads can make the Intel MPI Library calls simultaneously. Use the compiler driver -mt_mpi option to link the thread safe version of the Intel MPI Library. Use the thread safe libraries if you request the thread support at the following levels:


  • How can I learn what version of the Intel® MPI Library is installed on the system?
  • You can use mpirun –V to get versioning and build information:

    mpirun –V
    This will output version information.

    If this is an official package, look up the mpisupport.txt file or the Release Notes and search for a version information there:
    cat /opt/intel/mpi/5.0/mpisupport.txt

    If Intel MPI has been installed in RPM mode, try to query the RPM database:
    rpm –qa | grep intel-mpi

    Finally, for full build identification information, set I_MPI_VERSION to 1 and run any MPI program, grepping for "Build":
    mpirun –n 2 –env ./a.out | grep –i build
    This will turn up a couple of lines with the build date. Most of this information is also imbedded into the library and can be queried using the strings utility:
    strings /opt/intel/mpi/5.0/lib/ | grep –i build

Intel® MPI Library 5.0

Getting Started?

Click the Learn tab for guides and links that will quickly get you started.

Get Help or Advice

Search Support Articles
Forums - The best place for timely answers from our technical experts and your peers. Use it even for bug reports.
Support - For secure, web-based, engineer-to-engineer support, visit our Intel® Premier Support web site. Intel Premier Support registration is required.
Download, Registration and Licensing Help - Specific help for download, registration, and licensing questions.


Release Notes - View Release Notes online!
Intel® MPI Library Product Documentation - View documentation online!
Documentation for other software products