I'm trying to run pNetCDF on lustre. The test code and pNetCDF library are both compiled with intel mpi library v4.0.2. Our lustre file system has 40 OSTs.
When running with stripes = 1 or processes = 32, the test codes works well and can output data correctly.
However, when I set stripe = 40 and run with 64 processes, the test code crashed as :
rank 19 in job 1 c25b09_39645 caused collective abort of all ranks
exit status of rank 19: killed by signal 9
The test code is attacted. Thank you in advance.