mpd problems

mpd problems

Imagen de 4sissi

Hi all,

I'm using Intel-MPI 3.2.011 on a cluster with 9 nodes and 36 cpus and a master node with 2 cpus. Ethernet interconnects all nodes.

The mpdboot commands on master:

/opt/intel/impi/3.2.0.011/bin64/mpd --ncpus=2 -e -d &

/opt/intel/impi/3.2.0.011/bin64/mpdboot --rsh=/usr/bin/ssh --totalnum=10 -1 --file=$HOME//machines.LINUX --verbose --ncpus=2 &b

bring out on nodes the daemon:

[root@sissi0 ~]# ps aux | grep mpd
giorgio 3141 0.0 0.1 156704 5632 ? S 11:32 0:00 python /opt/intel/impi/3.2/bin64/mpd.py -h sissi2 -p 40057 --ifhn=10.1.1.10 --ncpus=4 --myhost=sissi0 --myip=10.1.1.10 -e -d -s 10

On master node I got the following:

LAUNCHED mpd on sissi.xxxx.xx via
RUNNING: mpd on sissi.xxxx.xx
LAUNCHED mpd on sissi8 via sissi.xxxx.xx
LAUNCHED mpd on sissi1 via sissi.xxxx.xx
LAUNCHED mpd on sissi2 via sissi.inogs.it
LAUNCHED mpd on sissi3 via sissi.inogs.it
RUNNING: mpd on sissi8
RUNNING: mpd on sissi2
LAUNCHED mpd on sissi0 via sissi8
LAUNCHED mpd on sissi4 via sissi8
LAUNCHED mpd on sissi5 via sissi8
RUNNING: mpd on sissi1
LAUNCHED mpd on sissi6 via sissi8
RUNNING: mpd on sissi3
LAUNCHED mpd on sissi7 via sissi3
RUNNING: mpd on sissi5
RUNNING: mpd on sissi0
RUNNING: mpd on sissi4
RUNNING: mpd on sissi7
RUNNING: mpd on sissi6
mpdboot_sissi.inogs.it (handle_mpd_output 752): from mpd on sissi0, invalid port info:
sissi0: Connection refused

Can someone help me out to resolve this issue?

giorgio

publicaciones de 4 / 0 nuevos
Último envío
Para obtener más información sobre las optimizaciones del compilador, consulte el aviso sobre la optimización.
Imagen de Andrey Derbunovich (Intel)

Hi,

It is dificult to determine a reason of such issue without having mpd.log files. I would suggest you submit an issue report at https://premier.intel.com

Best regards,

Andrey

Imagen de Mostafa N.

Hi,

I have a corrupted MPD on my machine. Whenever I call any of the commands "mpdtrace" or "mpdallexit" I get the following message:

mpdroot: cannot connect to local mpd at: /tmp/mpd2.console_root
probable cause: no mpd daemon on this machine
possible cause: unix socket /tmp/mpd2.console_root has been removed
mpdtrace (__init__ 1524): forked process failed; status=255

I have my mpd running:

mpd& 
ps -ef | grep mpd

Note: I am running on my own machine! Single Node, Quad core (intel i7 core)

Any helps would be appreciated.

Imagen de James Tullos (Intel)

Hi Mostafa,

Please see my reply to your other post at http://software.intel.com/en-us/forums/topic/380080#comment-1730473.

Sincerely,
James Tullos
Technical Consulting Engineer
Intel® Cluster Tools

Inicie sesión para dejar un comentario.