Computes the sum of a 32/64-bit unsigned integer value and a borrow-in value, then subtracts the result from a 32/64-bit unsigned integer value. The corresponding 4th Generation Intel® Core™ Processor extension instruction is
Computes the average of unsigned 8/16-bit integer data elements of two vectors. The corresponding Intel® AVX2 instruction is VPAVGB or VPAVGW.
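As a rough illustration of the averaging operation described above, the following plain C sketch models what each element of the result holds. It is a scalar model, not the intrinsic itself: the function names `avg_u8`, `avg_u16`, and `avg_u8_vec` are hypothetical, and the sketch assumes the rounding behavior of VPAVGB/VPAVGW, where 1 is added to the sum before it is halved.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical scalar model of one 8-bit lane: (a + b + 1) >> 1.
   The sum is formed in a wider type so it cannot overflow. */
uint8_t avg_u8(uint8_t a, uint8_t b) {
    return (uint8_t)(((uint32_t)a + (uint32_t)b + 1u) >> 1);
}

/* Hypothetical scalar model of one 16-bit lane, same rounding rule. */
uint16_t avg_u16(uint16_t a, uint16_t b) {
    return (uint16_t)(((uint32_t)a + (uint32_t)b + 1u) >> 1);
}

/* Applies the 8-bit model element by element, the way the vector
   operation treats every lane independently. */
void avg_u8_vec(const uint8_t *a, const uint8_t *b, uint8_t *out, size_t n) {
    assert(a != NULL && b != NULL && out != NULL);
    for (size_t i = 0; i < n; ++i) {
        out[i] = avg_u8(a[i], b[i]);
    }
}
```

Because of the added 1, an odd sum rounds up: averaging 1 and 2 gives 2, not 1.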
Takes a byte element from the source operand and broadcasts it to all elements of the result vector. The corresponding Intel® AVX2 instruction is VPBROADCASTB.
Multiply-adds scalar single-precision floating-point values using three float32 vectors. The corresponding FMA instruction is VFMADD<XXX>SS, where XXX could be 132, 213, or 231.
Multiply-subtracts negated scalar single-precision floating-point values of three float32 vectors. The corresponding FMA instruction is VFNMSUB<XXX>SS, where XXX could be 132, 213, or 231.
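The two scalar FMA operations described above can be sketched in plain C as follows. This is a scalar model under stated assumptions, not the intrinsics themselves: the type `f32x4` and the functions `fmadd_ss_model` and `fnmsub_ss_model` are hypothetical names, the sketch assumes that only the lowest element is computed while the upper three elements are copied from the first operand, and it approximates the single rounding of a fused operation by computing in `double` before narrowing to `float`, which can differ from true fused rounding in rare cases.

```c
/* Hypothetical stand-in for a 128-bit vector of four float32 values. */
typedef struct {
    float e[4];
} f32x4;

/* Approximates a fused a*b + c: the product of two floats is exact in
   double, so only the final sum and the narrowing step round. */
float mul_add_once(float a, float b, float c) {
    return (float)((double)a * (double)b + (double)c);
}

/* Model of scalar multiply-add: low element = a0 * b0 + c0. */
f32x4 fmadd_ss_model(f32x4 a, f32x4 b, f32x4 c) {
    f32x4 r = a; /* upper three elements pass through from a (assumption) */
    r.e[0] = mul_add_once(a.e[0], b.e[0], c.e[0]);
    return r;
}

/* Model of negated multiply-subtract: low element = -(a0 * b0) - c0. */
f32x4 fnmsub_ss_model(f32x4 a, f32x4 b, f32x4 c) {
    f32x4 r = a; /* upper three elements pass through from a (assumption) */
    r.e[0] = mul_add_once(-a.e[0], b.e[0], -c.e[0]);
    return r;
}
```

With low elements 2, 3, and 4, the multiply-add model yields 2 * 3 + 4 = 10 and the negated multiply-subtract model yields -(2 * 3) - 4 = -10, while elements 1 through 3 of the result match the first operand.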
Gathers 2/4 quadword values from memory referenced by the given base address, qword indices and scale, and using the given qword mask values. The corresponding Intel® AVX2 instruction is