mpirun and lsf 8

mpirun -ppn N starts N processes per node.

It works fine over ssh:

$ mpirun -hosts "mn1,mn2" -ppn 2 -np 4 ~/a.out
Hello world!I'm 0 of 4 on mn1
Hello world!I'm 2 of 4 on mn2
Hello world!I'm 1 of 4 on mn1
Hello world!I'm 3 of 4 on mn2

However, it does not work under LSF (-n 32 assigns two nodes with 16 processors each to the job):

$ bsub -q pruebas -n 32 -I mpirun -ppn 2 -np 4 ~/a.out

Job <29169> is submitted to queue <pruebas>.
<<Waiting for dispatch ...>>
<<Starting on mn339>>
Hello world!I'm 0 of 4 on mn339
Hello world!I'm 1 of 4 on mn339
Hello world!I'm 2 of 4 on mn339
Hello world!I'm 3 of 4 on mn339

mpirun does detect that it is running inside an LSF job.

Any ideas?

Hi Jose,

What version of the Intel® MPI Library are you using?  I would recommend using the latest, Version 4.1 Update 1, as there are improvements to compatibility with LSF* in this version.

You can try using the ptile capability of LSF* to specify how many ranks per host should be run.

Sincerely,
James Tullos
Technical Consulting Engineer
Intel® Cluster Tools
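
The ptile suggestion above could look roughly like the sketch below. It is a dry run that only prints the submission command. The queue name and binary come from the original post; the helper name ptile_submit_cmd is hypothetical, and the thread does not confirm how a given Intel MPI version places ranks on the resulting allocation.

```shell
# Dry-run helper (hypothetical name): print a bsub command that asks LSF
# for a fixed number of slots per host via the span[ptile=...] resource string.
#   $1 = total MPI ranks (also the number of LSF slots requested)
#   $2 = slots LSF should place on each host
ptile_submit_cmd() {
    ranks=$1
    per_host=$2
    # echo shows the command without its shell quoting; when running it
    # for real, keep the quotes around the -R argument.
    echo bsub -q pruebas -n "$ranks" -R "span[ptile=$per_host]" -I mpirun -np "$ranks" ~/a.out
}

# 4 ranks in total, 2 per host, as in the ssh example.
ptile_submit_cmd 4 2
```

Once the printed command looks right, run it directly instead of echoing it. Note that this requests only as many slots as there are ranks.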

Hi James,

How does Intel MPI 4.1 Update 1 launch tasks under LSF? Does it use the native LSF mechanism, talking to the LSF daemons, which then create the tasks on the target nodes? Or does it simply read a host file prepared by LSF and ssh to those hosts?

We have a very large LSF shop here, and integration of MPI codes with the scheduler matters to us (for suspending or killing jobs, and for accurately accounting CPU and memory usage per process/rank).

Thanks,

Michael

R/D High-Performance Computing and Engineering

James,

Thank you for your message. I have 4.1.0 (I'll update as soon as possible).

Your ptile suggestion is a very good one, and it works in some cases. However, I want to use all the processors on the nodes but start a smaller number of MPI processes (the remaining processors will run threads of those MPI processes).
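
One possible workaround for the hybrid case described above, sketched under assumptions: keep -n 32 so LSF reserves both nodes, then build the -hosts list inside the job from LSB_MCPU_HOSTS, which LSF sets to "host slots host slots ...". The function name lsf_hosts_csv is hypothetical, the fallback value is example data only, and the thread does not confirm that Intel MPI 4.1.0 honors -hosts and -ppn inside an LSF job.

```shell
# Turn LSB_MCPU_HOSTS (e.g. "mn1 16 mn2 16") into "mn1,mn2".
lsf_hosts_csv() {
    # Unquoted on purpose: split the variable on whitespace into $1, $2, ...
    set -- $LSB_MCPU_HOSTS
    hosts=""
    # Entries come in (hostname, slot count) pairs; keep only the hostname.
    while [ "$#" -ge 2 ]; do
        hosts="${hosts:+$hosts,}$1"
        shift 2
    done
    printf '%s\n' "$hosts"
}

# Example data so the sketch also runs outside a job (assumption, not real output).
: "${LSB_MCPU_HOSTS:=mn1 16 mn2 16}"

# Dry run: print the mpirun command that would be used inside the job.
echo mpirun -hosts "$(lsf_hosts_csv)" -ppn 2 -np 4 ~/a.out
```

The variable is only set inside the job, so this would have to live in a wrapper script submitted with bsub -n 32 rather than be typed on the bsub command line.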
