Developer Guide and Reference

Contents

Intrinsics for FP Addition Operations

The prototypes for Intel® Advanced Vector Extensions 512 (Intel® AVX-512) intrinsics are located in the
zmmintrin.h
header file.
To use these intrinsics, include the
immintrin.h
file as follows:
#include <immintrin.h>
Intrinsic Name
Operation
Corresponding
Intel® AVX-512 Instruction
_mm512_add_round_pd
,
_mm512_mask_add_round_pd
,
_mm512_maskz_add_round_pd
Add rounded float64 vectors.
VADDPD
_mm512_add_pd
,
_mm512_mask_add_pd
,
_mm512_maskz_add_pd
Add rounded float64 vectors.
VADDPD
_mm512_add_round_ps
,
_mm512_mask_add_round_ps
,
_mm512_maskz_add_round_ps
Add rounded float32 vectors.
VADDPS
_mm512_add_ps
,
_mm512_mask_add_ps
,
_mm512_maskz_add_ps
Add rounded float32 vectors.
VADDPS
_mm_add_round_sd
,
_mm_mask_add_round_sd
,
_mm_maskz_add_round_sd
Add scalar float64 vectors.
VADDSD
_mm_mask_add_sd
,
_mm_maskz_add_sd
Add scalar float64 vectors.
VADDSD
_mm_add_round_ss
,
_mm_mask_add_round_ss
,
_mm_maskz_add_round_ss
Add scalar float32 vectors.
VADDSS
_mm_mask_add_ss
,
_mm_maskz_add_ss
Add scalar float32 vectors.
VADDPD
variable
definition
k
writemask used as a selector
a
first source vector element
b
second source vector element
src
source element to use based on writemask result
round
Rounding control values; these can be one of the following (along with the
sae
suppress all exceptions flag):
  • _MM_FROUND_TO_NEAREST_INT
    - rounds to nearest even
  • _MM_FROUND_TO_NEG_INF
    - rounds to negative infinity
  • _MM_FROUND_TO_POS_INF
    - rounds to positive infinity
  • _MM_FROUND_TO_ZERO
    - rounds to zero
  • _MM_FROUND_CUR_DIRECTION
    - rounds using default from MXCSR register
_mm512_add_pd
extern __m512d __cdecl _mm512_add_pd(__m512d a, __m512d b);
Adds packed float64 elements in
a
and
b
, and stores the result.
_mm512_mask_add_pd
extern __m512d __cdecl _mm512_mask_add_pd(__m512d src, __mmask8 k, __m512d a, __m512d b);
Adds packed float64 elements in
a
and
b
, and stores the result using writemask
k
(elements are copied from
src
when the corresponding mask bit is not set).
_mm512_maskz_add_pd
extern __m512d __cdecl _mm512_maskz_add_pd(__mmask8 k, __m512d a, __m512d b);
Adds packed float64 elements in
a
and
b
, and stores the result using zeromask
k
(elements are zeroed out when the corresponding mask bit is not set).
_mm512_add_round_pd
extern __m512d __cdecl _mm512_add_r