Intel® Clusters and HPC Technology

Unusual requirement?

Dear All,

We are involved in setting up an HPC cluster with about 25 Dell PowerEdge 720 servers, each equipped with 172 GB of RAM and 24 Intel cores running at 2,4 GHz). Every node is connected to a Gigabit Ethernet switch and to a 56 Gbps Mellanox Infiniband switch that provides storage access.

IntelMPI over Mellanox 10GB RDMA chipset

Hi

What is the best DAPL provider to used with IntelMPI 5.1  when running with a 10Gb RDMA chipsset from Mellanox?

Currently I'm using I_MPI_DAPL_PROVIDER=ofa-v2-scm-roe-mlx4_0-1

I added some others extra parameters to workaround the resource limitation that we have on the chip

  export I_MPI_DAPL_RDMA_RNDV_WRITE=on

   export I_MPI_RDMA_MAX_MSG_SIZE=1048576

 

Thanks

Thierry

 

How to run application without mpiexec?

Greetings forum,

I would like to manually start my application (paraview's pvserver) on each node of a windows cluster without using mpiexec. The reason is that the application needs to make visible windows so that it can drive a CAVE display system. mpiexec will not work for me, because it uses smpd, which is a service, and since Vista windows services can not interact with the user. (Alternatively, is there a way to run smpd.exe as a startup application instead of a service?)

a cluster of two virtual machines cannot be created,it said cannot access "my another node name" using ssh command

I want to create a cluster of two virtual machines. I have followed the file "parallel_studio_xe_2015_update3/doc/Install_Guide.htm#prerequisites". However, when I get the  eighth step in Prerequisites,it occurs this situation:

Yes,cws02 is the master node ,cws01 is the other node.It requires me to input my current virtual machines's key word,but after i enter that ,it stayed there so long and never changed.I can't understand.

clusters of two virtual machines cannot be created ,the master can't access another

I want to create a cluster of two virtual machines. I have followed the file "parallel_studio_xe_2015_update3/doc/Install_Guide.htm#prerequisites". However, when I get the  eighth step in Prerequisites,it occurs this situation:

Yes,cws02 is the master node ,cws01 is the other node.It requires me to input my current virtual machines's key word,but after i enter that ,it stayed there so long and never changed.I can't understand.

After that ,I command "./install_GUI.sh",and it goes smoothly until it get the  installation step,it tell me this:

open MPI e intel MPI DATATYPE

Dear all,

 

I have notice small difference between OPEN-MPI and intel MPI. 

For example in MPI_ALLREDUCE in intel MPI is not allowed to use the same variable in send and receiving Buff.

 

I have written my code in OPEN-MPI, run in on a intel-MPI cluster. 

Now I have the following error:

 

Fatal error in MPI_Isend: Invalid communicator, error stack:
MPI_Isend(158): MPI_Isend(buf=0x1dd27b0, count=1, INVALID DATATYPE, dest=0, tag=0, comm=0x0, request=0x7fff9d7dd9f0) failed

 

 

 

MPI_Request_free() hangs

Hi,

Trying to make use of the MPI persistent communication primitives in our application, I'm ending up with the following sequence of events:

MPI_Ssend_init(msg, msg_length, MPI_BYTE, 0, tag, comm, &req);
MPI_Start(&req);
MPI_Cancel(&req);
MPI_Wait(&req, MPI_STATUS_IGNORE);
MPI_Request_free(&req); // <-- HANGS

The only other node is blocked in an MPI_Barrier(comm);

file not present or wrong filename - when using >=4 procs

Hi,

I have an intel i7-4770 cpu.

Running my CFD simulation code using 1 - 3procs works well. However, when using >=4 procs, I almost always get 2 lines of the error:

file not present or wrong filename.

Occasionally, I can get the code working, but it's like 1 out of 20 tries, although it can be quite random.

I guess it has to do with some file opening:

do ijk=0,num_procs-1

    call MPI_Barrier(MPI_COMM_WORLD,ierr)

S’abonner à Intel® Clusters and HPC Technology