Is there any reference for the latency of AVX2 instructions, such as latency for vgather, vpshufb, etc.? I got some related information from APPENDIX C of the Intel® 64 and IA-32 Architectures Optimization Reference Manual, but looks not all the AVX2 instructions are fully contained in that manual.
AVX2 latency
Para obter mais informações sobre otimizações de compiladores, consulte Aviso sobre otimizações.


