Is there any reference for the latency of AVX2 instructions, such as latency for vgather, vpshufb, etc.? I got some related information from APPENDIX C of the Intel® 64 and IA-32 Architectures Optimization Reference Manual, but looks not all the AVX2 instructions are fully contained in that manual.
AVX2 latency
Para obtener más información sobre las optimizaciones del compilador, consulte el aviso sobre la optimización.


