I have a system with 16 cores and 2 IB interfaces. I need to use both rails, so have been using export I_MPI_OFA_NUM_ADAPTERS=2 export I_MPI_OFA_NUM_PORTS=1 export I_MPI_OFA_RAIL_SCHEDULER=ROUND_ROBIN export I_MPI_FABRICS="shm:ofa"But this has the undesirable effect of sending some of the data from the second 8 cores to the IB on the first eight cores and vice-versa.I triedexport I_MPI_OFA_NUM_ADAPTERS=2export I_MPI_OFA_NUM_PORTS=1export I_MPI_OFA_RAIL_SCHEDULER=PROCESS_BINDexport I_MPI_FABRICS="shm:ofa"But this assigns process 0 to interface 1, process 1 to interface 2, process 3 to interface 1, etc. This sends alot of data from one set of cores to the opposing interface. Furthermore, the code is written so that it assumes that process 0 and process 1 are in the same cpu so it tries to do a cyclic data movement.I need to assign process 0 - 7 to interface 1 and process 8 - 15 to interface 2.Is this possible?ThanksDavidThanks
For more complete information about compiler optimizations, see our Optimization Notice.