I am trying to benchmark Intel's version of Caffe, which I downloaded from:
Training works fine on a single local machine (without MPI), but when I launch it through MPI with the following command:
mpirun -v -n 2 -ppn 1 -machinefile /home/demouser/mpd.hosts /home/demouser/intelcaffe/build/tools/caffe train -solver /home/demouser/dogvscat/dogvscat_solver.prototxt
I receive the following error message:
(4328): /localdisk/jenkins/mlsl-build/src/comms_ep.cpp:CommsAlloc:535: ASSERT 'ptr' FAILED: NULL pointer
Does anybody know why this occurs? I cannot debug it myself because the source for the MLSL library is not available, only the binary blob.