Calculates the dot product of float32 vectors. The corresponding Intel® AVX instruction is
- float32 vector used for the operation
- float32 vector also used for the operation
- a constant of integer type where the high four bits of the mask determine how the resultant elements are summed and the low four bits determine whether the summed resultant value is to be broadcast to the destination vector or not
First performs a SIMD multiplication of the lower four packed single-precision floating-point elements (float32 elements) from the first source vector
m1with corresponding elements in the second source vector
Each of the four resulting single-precision elements is conditionally summed depending on the high four bits in the
The resulting summed value is broadcast to each of the lower 4 positions in the destination vector, if the corresponding lower bit of the
maskis "1". If the corresponding lower bit of the
maskis zero, the corresponding lower element in the destination vector is set to zero.
The process is then replicated with the high elements of the source vectors.
Result of the operation.