Below is a simple reproduction case for the issue we're facing:
Version: Beta 2015 Update 1
Product: Intel® Cluster Studio XE bundle installer
Operating System: Linux*
Problem Description: When attempting to upgrade from an existing installation of Intel® Cluster Studio XE 2015 Beta to the new Intel® Cluster Studio XE 2015 Beta Update 1, distributed installation on the compute nodes of a cluster may fail with the following error message:
Hello. I've been attempting to follow the instructions on this page https://software.intel.com/sites/products/documentation/hpc/ics/itac/81/Getting_Started.htm using Intel mpirun but have either not been able to get the trace files to be generated or I cannot find them. An example of my command line input is:
mpirun -trace -n 8 <app> <parameters>
I was wondering if anyone had experience running the Intel MPI library on a cluster consisting entirely of VMs. Is this possible? Which library or product is appropriate for this need? Windows or Linux, either is fine. I can't seem to find any guidance in the documentation.
The Intel® Trace Analyzer and Collector for Linux* and Windows* is a low-overhead scalable event-tracing library with graphical analysis that reduces the time it takes an application developer to enable maximum performance of cluster applications. This package is for users who develop on and build for Intel® 64 architectures on Linux* and Windows*, as well as customers running on the Intel® Xeon Phi™ coprocessor on Linux*. You must have a valid license to download, install and use this product.
The Intel® MPI Library is a high-performance interconnect-independent multi-fabric library implementation of the industry-standard Message Passing Interface, v3.0 (MPI-3.0) specification. This package is for MPI users who develop on and build for Intel® 64 architectures on Linux* and Windows*, as well as customers running on the Intel® Xeon Phi™ coprocessor on Linux*. You must have a valid license to download, install, and use this product.
I am running an MPI application using IntelMPI. I could run this application using 1process successfully. While running it with more than 1process I am getting the following error.
After an update of our Cluster I started receiving dapl errors. I am compiling my fortran code with impi 4.1.2. The error occurs if I try running my code with complicated models having large memory demand. The dapl errors occur after completing about 75% of the entire job. This is a rather strange problem because I have 64GB of memory on my nodes. This should be more than enough to fit even the most complicated problems. Moreover these inputs were running without any objections before the update.