There appears to be a bug in the way Intel MPI interacts with SLURM. I had the following hostlist in SLURM_JOB_NODELIST:
Other MPI implementations, such as OpenMPI, have had no problems interpreting this. However, when Intel MPI used that node list, it tried to find itc017. That isn't even a valid hostname, let alone one in that hostlist.
I wrote a script to work around this: it generates the correct host list and passes it explicitly to Intel MPI. However, it would be better to fix this inside Intel MPI itself.
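
For anyone who runs into the same problem, here is a minimal sketch of this kind of workaround. It is not the script mentioned above, only an illustration. It assumes the node list uses the usual compressed SLURM form with at most one bracketed group per name, and the host names in the example (node..., gpu5) and the function names are made up for illustration.

```python
import re


def split_top_level(nodelist):
    """Split on commas that are not inside square brackets."""
    parts, depth, current = [], 0, []
    for ch in nodelist:
        if ch == "[":
            depth += 1
        elif ch == "]":
            depth -= 1
        if ch == "," and depth == 0:
            parts.append("".join(current))
            current = []
        else:
            current.append(ch)
    if current:
        parts.append("".join(current))
    return parts


def expand_nodelist(nodelist):
    """Expand a compressed node list into individual host names.

    Assumption: each entry has at most one [...] group. Entries with
    several groups are returned unchanged rather than guessed at.
    """
    hosts = []
    for part in split_top_level(nodelist):
        match = re.fullmatch(r"([^\[\]]*)\[([^\[\]]+)\]([^\[\]]*)", part)
        if match is None:
            hosts.append(part)  # plain host name, nothing to expand
            continue
        prefix, ranges, suffix = match.groups()
        for item in ranges.split(","):
            if "-" in item:
                low, high = item.split("-", 1)
                width = len(low)  # preserve zero padding such as 01, 02
                for n in range(int(low), int(high) + 1):
                    hosts.append(f"{prefix}{n:0{width}d}{suffix}")
            else:
                hosts.append(f"{prefix}{item}{suffix}")
    return hosts


if __name__ == "__main__":
    # Hypothetical node list, not the one from the job described above.
    print("\n".join(expand_nodelist("node[01-03,07],gpu5")))
```

The resulting names can be written to a host file, one per line, and handed to the launcher explicitly instead of letting it parse SLURM_JOB_NODELIST on its own. On a system where the SLURM tools are available, `scontrol show hostnames "$SLURM_JOB_NODELIST"` produces the same kind of expanded list and is probably the safer choice, since it covers forms this sketch does not.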