Developer Guide and Reference

Contents

Intrinsics for Division Operations (512-bit)

The prototypes for Intel® Advanced Vector Extensions 512 (Intel® AVX-512) intrinsics are located in the
zmmintrin.h
header file.
To use these intrinsics, include the
immintrin.h
file as follows:
#include <immintrin.h>
Intrinsic Name
Operation
Corresponding
Intel® AVX-512 Instruction
_mm512_div_pd
,
_mm512_mask_div_pd
,
_mm512_maskz_div_pd
_mm512_div_round_pd
,
_mm512_mask_div_round_pd
,
_mm512_maskz_div_round_pd
Calculates quotient of rounded division operation of packed float64 elements.
VDIVPD
_mm512_div_ps
,
_mm512_mask_div_ps
,
_mm512_maskz_div_ps
_mm512_div_round_ps
,
_mm512_mask_div_round_ps
,
_mm512_maskz_div_round_ps
Calculates quotient of rounded division operation of packed float32 elements.
VDIVPS
_mm_mask_div_sd
,
_mm_maskz_div_sd
_mm_div_round_sd
,
_mm_mask_div_round_sd
,
_mm_maskz_div_round_sd
Calculates quotient of rounded division operation of scalar float64 elements.
VDIVSD
_mm_mask_div_ss
,
_mm_maskz_div_ss
_mm_div_round_ss
,
_mm_mask_div_round_ss
,
_mm_maskz_div_round_ss
Calculates quotient of rounded division operation of scalar float32 elements.
VDIVSS
variable
definition
k
writemask used as a selector
a
first source vector element
b
second source vector element
src
source element to use based on writemask result
round
Rounding control values; these can be one of the following (along with the
sae
suppress all exceptions flag):
  • _MM_FROUND_TO_NEAREST_INT
    - rounds to nearest even
  • _MM_FROUND_TO_NEG_INF
    - rounds to negative infinity
  • _MM_FROUND_TO_POS_INF
    - rounds to positive infinity
  • _MM_FROUND_TO_ZERO
    - rounds to zero
  • _MM_FROUND_CUR_DIRECTION
    - rounds using default from MXCSR register
_mm512_div_pd
extern __m512d __cdecl _mm512_div_pd(__m512d a, __m512d b);
Divides packed float64 elements in
a
by packed elements in
b
, and stores the result.
_mm512_mask_div_pd
extern __m512d __cdecl _mm512_mask_div_pd(__m512d src, __mmask8 k, __m512d a, __m512d b);
Divides packed float64 elements in
a
by packed elements in
b
, and stores the result using writemask
k
(elements are copied from
src
when the corresponding mask bit is not set).
_mm512_maskz_div_pd
extern __m512d __cdecl _mm512_maskz_div_pd(__mmask8 k, __m512d a, __m512d b);
Divides packed float64 elements in
a
by packed elements in
b
, and stores the result using zeromask
k
(elements are zeroed out when the corresponding mask bit is not set).
_mm512_div_round_pd
extern __m512d __cdecl _mm512_div_round_pd(__m512d a, __m512d b, int round);
Divides packed float64 elements in
a
by packed elements in
b
, and stores the result.
_mm512_mask_div_round_pd
extern __m512d __cdecl _mm512_mask_div_round_pd(__m512d src, __mmask8 k, __m512d a, __m512d b, int round);
Divides packed float64 elements in
a
by packed elements in
b
, and stores the result us