Developer Guide and Reference

Contents

Intrinsics for Reciprocal Operations (512-bit)

The prototypes for Intel® Advanced Vector Extensions 512 (Intel® AVX-512) intrinsics are located in the
zmmintrin.h
header file.
To use these intrinsics, include the
immintrin.h
file as follows:
#include <immintrin.h>
Intrinsic Name
Operation
Corresponding
Intel® AVX-512 Instruction
_mm512_rcp14_pd
,
_mm512_mask_rcp14_pd
,
_mm512_maskz_rcp14_pd
Computes the approximate reciprocal of packed float64 elements.
VRCP14PD
_mm512_rcp14_ps
,
_mm512_mask_rcp14_ps
,
_mm512_maskz_rcp14_ps
Computes the approximate reciprocal of packed float32 elements.
VRCP14PS
_mm_rcp14_sd
,
_mm_mask_rcp14_sd
,
_mm_maskz_rcp14_sd
Computes the approximate reciprocal of scalar float64 elements.
VRCP14SD
_mm_rcp14_ss
,
_mm_mask_rcp14_ss
,
_mm_maskz_rcp14_ss
Computes the approximate reciprocal of scalar float32 elements.
VRCP14SS
_mm512_rcp28_pd
,
_mm512_mask_rcp28_pd
,
_mm512_maskz_rcp28_pd
_mm512_rcp28_round_pd
,
_mm512_mask_rcp28_round_pd
,
_mm512_maskz_rcp28_round_pd
Computes the approximate reciprocal of packed float64 elements with bounded relative error.
VRCP28PD
_mm_rcp28_sd
,
_mm_mask_rcp28_sd
,
_mm_maskz_rcp28_sd
_mm_rcp28_round_sd
,
_mm_mask_rcp28_round_sd
,
_mm_maskz_rcp28_round_sd
Computes the approximate reciprocal of scalar float64 elements with bounded relative error.
VRCP28SD
_mm512_rcp28_ps
,
_mm512_mask_rcp28_ps
,
_mm512_maskz_rcp28_ps
_mm512_rcp28_round_ps
,
_mm512_mask_rcp28_round_ps
,
_mm512_maskz_rcp28_round_ps
Computes the approximate reciprocal of packed float32 elements with bounded relative error.
VRCP28PS
_mm_rcp28_ss
,
_mm_mask_rcp28_ss
,
_mm_maskz_rcp28_ss
_mm_rcp28_round_ss
,
_mm_mask_rcp28_round_ss
,
_mm_maskz_rcp28_round_ss
Computes the approximate reciprocal of scalar float32 elements with bounded relative error.
VRCP28SS
_mm512_recip_pd
,
_mm512_mask_recip_pd
Computes the approximate reciprocal of packed float64 elements.
None.
_mm512_recip_ps
,
_mm512_mask_recip_ps
Computes the approximate reciprocal of packed float32 elements.
None.
variable
definition
k
writemask used as a selector
a
first source vector element
b
second source vector element
src
source element to use based on writemask result
_mm512_rcp14_pd
extern __m512d __cdecl _mm512_rcp14_pd(__m512d a);
Computes the approximate reciprocal of packed float64 elements in
a
, and stores the result.
The maximum relative error for this approximation is less than 2
(-14)
.
_mm512_mask_rcp14_pd