As far as I can tell, the _mm512_prefetch_i32gather_ps intrinsic should emit a VGATHERPF0DPS instruction when _MM_HINT_T0 is specified and a VGATHERPF1DPS instruction when _MM_HINT_T1 is specified. However, it appears to emit VGATHERPF0DPS regardless of which hint is passed. Is this a bug? (I'm using composer_xe_2013_sp1.0.080.)
Additionally, it looks to me like ICC does aggressive software prefetching for simple cases, but never emits prefetches on its own for gathers. Is this correct?
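For the first question, a minimal test case along these lines should be enough to compare the two hints. This is only a sketch: the function names and the HAVE_GATHER_PREFETCH macro are made up for illustration, the scale of 4 assumes 32-bit float elements, and the guard assumes the compiler defines __MIC__ or __AVX512PF__ when the gather-prefetch intrinsics are available. The index array is assumed to be 64-byte aligned.

```c
/* Hypothetical reproducer; all function and macro names here are illustrative. */
#if defined(__MIC__) || defined(__AVX512PF__)
#include <immintrin.h>
#define HAVE_GATHER_PREFETCH 1
#else
#define HAVE_GATHER_PREFETCH 0   /* intrinsics unavailable: functions become no-ops */
#endif

/* Expected to compile to VGATHERPF0DPS.
   idx must point to 16 ints aligned to 64 bytes. */
void prefetch_gather_t0(const float *base, const int *idx)
{
#if HAVE_GATHER_PREFETCH
    __m512i vidx = _mm512_load_epi32(idx);
    _mm512_prefetch_i32gather_ps(vidx, base, 4, _MM_HINT_T0);
#else
    (void)base;
    (void)idx;
#endif
}

/* Expected to compile to VGATHERPF1DPS; the question is whether it does. */
void prefetch_gather_t1(const float *base, const int *idx)
{
#if HAVE_GATHER_PREFETCH
    __m512i vidx = _mm512_load_epi32(idx);
    _mm512_prefetch_i32gather_ps(vidx, base, 4, _MM_HINT_T1);
#else
    (void)base;
    (void)idx;
#endif
}
```

Compiling this to assembly (for example with icc -mmic -S) and searching the output for vgatherpf0dps and vgatherpf1dps should show whether the hint argument has any effect on the instruction chosen.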