Proxy
ConstProxy
Proxy
soa1d_container::const_accessor
__assume_aligned
__declspec
align
align_value
avoid_false_share
code_align
concurrency_safe
const
cpu_dispatch
cpu_specific
mpx
target
vector
vector_variant
align
align_value
avoid_false_share
code_align
concurrency_safe
const
cpu_dispatch
cpu_specific
mpx
target
vector
vector_variant
__INTEL_OFFLOAD
Writing Target-Specific Code Using a Pragma
Writing Code that Should Not Be Built for CPU-Only Execution
__MIC__
Writing Target-Specific Code Using a Pragma
Offload Using a Pragma
__regcall
_allow_cpu_features
_Cilk_offload
Rules for Using _Cilk_shared and _Cilk_offload
keyword
_Cilk_offload_to
_Cilk_offload
keyword
_Cilk_shared
Rules for Using _Cilk_shared and _Cilk_offload
keyword
_InterlockedCompareExchange_HLEAcquire
_InterlockedCompareExchange_HLERelease
_InterlockedCompareExchange64_HLEAcquire
_InterlockedCompareExchange64_HLERelease
_InterlockedCompareExchangePointer_HLEAcquire
_InterlockedCompareExchangePointer_HLERelease
_InterlockedExchangeAdd_HLEAcquire
_InterlockedExchangeAdd_HLERelease
_InterlockedExchangeAdd64_HLEAcquire
_InterlockedExchangeAdd64_HLERelease
_may_i_use_cpu_feature
_mm_div_epi16
_mm_div_epi32
_mm_div_epi64
_mm_div_epi8/
_mm_div_epu16
_mm_div_epu32
_mm_div_epu64
_mm_div_epu8
_mm_rem_epi16
_mm_rem_epi32
_mm_rem_epi64
_mm_rem_epi8
_mm_rem_epu16
_mm_rem_epu32
_mm_rem_epu64
_mm_rem_epu8
_mm256_div_epi16
_mm256_div_epi32
_mm256_div_epi64
_mm256_div_epi8
_mm256_div_epu16
_mm256_div_epu32
_mm256_div_epu64
_mm256_div_epu8
_mm256_rem_epi16
_mm256_rem_epi32
_mm256_rem_epi64
_mm256_rem_epi8
_mm256_rem_epu16
_mm256_rem_epu32
_mm256_rem_epu64
_mm256_rem_epu8
_Offload_report
_Offload_report API
_PGOPTI_Prof_Reset_All
_rdseed16_step
_rdseed32_step
_rdseed64_step
_Simd keyword
_Store_HLERelease
_Store64_HLERelease
_StorePointer_HLERelease
_xabort
_xbegin
_xend
_xtest
--sysroot compiler option (Linux* only)
--version compiler option
-A compiler option
-alias-const compiler option
-align compiler option
-ansi compiler option
-ansi-alias compiler option
-ansi-alias-check compiler option
-auto-ilp32 compiler option
-auto-p32 compiler option
-ax compiler option
-B compiler option
-Bdynamic compiler option (Linux* only)
-Bstatic compiler option (Linux* only)
-Bsymbolic compiler option (Linux* only)
-Bsymbolic-functions compiler option (Linux* only)
-c compiler option
c
Creating Libraries
-C compiler option
-check compiler option
-check-pointers compiler option
-check-pointers-dangling compiler option
-check-pointers-mpx compiler option
-check-pointers-narrowing compiler option
-check-pointers-undimensioned compiler option
-cilk-serialize compiler option
-clang-name compiler option
-clangxx-name compiler option
-complex-limited-range compiler option
-cxxlib compiler option (Linux* only)
-cxxlib-nostd compiler option (Linux* only)
-D compiler option
-daal compiler option
-dD compiler option
-debug compiler option
-diag compiler option
-diag-disable compiler option
-diag-disable=all compiler option
-diag-dump compiler option
-diag-enable compiler option
-diag-enable=all compiler option
-diag-enable=power compiler option
-diag-error compiler option
-diag-error-limit compiler option
-diag-file compiler option
-diag-file-append compiler option
-diag-id-numbers compiler option
-diag-once compiler option
-diag-remark compiler option
-diag-warning compiler option
-dM compiler option
-dN compiler option
-dryrun compiler option
-dumpmachine compiler option
-dumpversion compiler option
-dynamic-linker compiler option (Linux* only)
-dynamiclib compiler option
Using Intel Shared Libraries
Creating Libraries
-dynamiclib compiler option (macOS* only)
-E compiler option
-early-template-check compiler option
-EP compiler option
-F compiler option (macOS)
-Fa compiler option
-fabi-version compiler option
-falias compiler option
-falign-functions compiler option
-falign-loops compiler option
-falign-stack compiler option (Linux* only)
-fargument-alias compiler option
-fargument-noalias-global compiler option
-fasm-blocks compiler option
-fast compiler option
-fast-transcendentals compiler option
-fasynchronous-unwind-tables compiler option
-fblocks compiler option (macOS*)
-fbuiltin compiler option
-fcode-asm compiler option
-fcommon compiler option
-fdata-sections compiler option
-fdefer-pop compiler option
-feliminate-unused-debug-types compiler option
-femit-class-debug-always compiler option (Linux* only)
-fexceptions compiler option
-fextend-arguments compiler option
-ffat-lto-objects compiler option (Linux* only)
-ffnalias compiler option
-ffreestanding compiler option
-ffriend-injection compiler option
-ffunction-sections compiler option
-fgnu89-inline compiler option
-fimf-absolute-error compiler option
-fimf-accuracy-bits compiler option
-fimf-arch-consistency compiler option
-fimf-domain-exclusion compiler option
-fimf-force-dynamic-target compiler option
-fimf-max-error compiler option
-fimf-precision compiler option
-fimf-use-svml compiler option
-finline compiler option
-finline-functions compiler options
-finline-limit compiler option
-finstrument-functions compiler option
-fjump-tables compiler option
-fkeep-static-consts compiler option
-fma compiler option
-fmath-errno compiler option
-fmerge-constants compiler option (Linux* only)
-fmerge-debug-strings compiler option (Linux* only)
-fminshared compiler option
-fmpc-privatize compiler option (Linux* only)
-fms-dialect compiler option (Linux* only)
-fmudflap compiler option (Linux* only)
-fno-asynchronous-unwind-tables compiler option
-fno-gnu-keywords compiler option
-fno-implicit-inline-templates compiler option
-fno-implicit-templates compiler option
-fno-operator-names compiler option
-fno-rtti compiler option
-fnon-call-exceptions compiler option
-fnon-lvalue-assign compiler option
-fnsplit compiler option (Linux* only)
-fomit-frame-pointer compiler option
-fopenmp compiler option
-foptimize-sibling-calls compiler option
-fp compiler option
-fp-model compiler option
fp-model, fp
how to use
-fp-model consistent compiler option
-fp-port compiler option
-fp-speculation compiler option
-fp-stack-check compiler option
-fp-trap compiler option
-fp-trap-all compiler option
-fpack-struct compiler option
-fpascal-strings compiler option
-fpermissive compiler option
-fpic compiler option
Using Intel Shared Libraries
Creating Libraries
fpic
-fpie compiler option (Linux* only)
-fprotect-parens compiler option
-freg-struct-return compiler option
-fshort-enums compiler option
-fsource-asm compiler option
-fstack-protector compiler option
-fstack-protector-all compiler option
-fstack-protector-strong compiler option
-fstack-security-check compiler option
-fstrict-aliasing compiler option
-fsyntax-only compiler option
-ftemplate-depth compiler option
-ftls-model compiler option
-ftrapuv compiler option
-ftz compiler option
Setting the FTZ and DAZ Flags
ftz, Qftz
-funroll-all-loops compiler option
-funroll-loops compiler option
-funsigned-bitfields compiler option
-funsigned-char compiler option
-fuse-ld compiler option
-fvar-tracking compiler option
-fvar-tracking-assignments compiler option
-fverbose-asm compiler option
-fvisibility compiler option
-fvisibility-inlines-hidden compiler option
-fzero-initialized-in-bss compiler option
-g compiler option
-g0 compiler option
-g1 compiler option
-g2 compiler option
-g3 compiler option
-gcc compiler option
-gcc-include-dir compiler option (Linux only)
-gcc-name compiler option (Linux* only)
-gcc-sys compiler option
-gdwarf-2 compiler option
-gdwarf-3 compiler option
-gdwarf-4 compiler option
-global-hoist compiler option
-gnu-prefix compiler option (Linux* only)
-grecord-gcc-switches compiler option (Linux* only)
-gsplit-dwarf compiler option (Linux* only)
-guide compiler option
-guide-data-trans compiler option
-guide-file compiler option
-guide-file-append compiler option
-guide-opts compiler option
-guide-par compiler option
-guide-profile compiler option
-guide-vec compiler option
-gxx-name compiler option (Linux* only)
-H compiler option
-help compiler option
-help-pragma compiler option
-I compiler option
-I- compiler option
-icc compiler option
-idirafter compiler option
-imacros compiler option
-inline-calloc compiler option
-inline-factor compiler option
-inline-forceinline compiler option
-inline-level compiler option
-inline-max-per-compile compiler option
-inline-max-per-routine compiler option
-inline-max-size compiler option
-inline-max-total-size compiler option
-inline-min-caller-growth compiler option
-inline-min-size compiler option
-intel-extensions compiler option
-intel-freestanding compiler option
-intel-freestanding-target-os compiler option
-ip compiler option
-ip-no-inlining compiler option
-ip-no-pinlining compiler option
-ipo compiler option
Using IPO
ipo, Qipo
-ipo-c compiler option
-ipo-jobs compiler option
-ipo-S compiler option
-ipo-separate compiler option
-ipp compiler option
-ipp-link compiler option
-iprefix compiler option
-iquote compiler option
-isystem compiler option
-iwithprefix compiler option
-iwithprefixbefore compiler option
-Kc++ compiler option
-l compiler option
-L compiler option
-m compiler option
-M compiler option
-m32 compiler option (Linux* only)
-m64 compiler option
-m80387 compiler option
-malign-double compiler option
-malign-mac68k compiler option (macOS*)
-malign-natural compiler option (macOS*)
-malign-power compiler option (macOS*)
-map-opts compiler option
-march compiler option
-masm compiler option (Linux* only)
-mauto-arch compiler option
-mcmodel compiler option (Linux* only)
-mconditional-branch compiler option
-mcpu compiler option
-MD compiler option
-mdynamic-no-pic compiler option (macOS*)
-MF compiler option
-MG compiler option
-minstruction compiler option
-mkl compiler option
-mlong-double compiler option (Linux* only)
-MM compiler option
-MMD compiler option
-momit-leaf-frame-pointer
-MP compiler option
-mp1 compiler option
-MQ compiler option
-mregparm compiler option (Linux* only)
-mregparm-version compiler option (Linux* only)
-mstringop-inline-threshold compiler option
-mstringop-strategy compiler option
-MT compiler option
-mtune compiler option
-multibyte-chars compiler option
-multiple-processes compiler option
-no-bss-init compiler option
-no-libgcc compiler option
-nodefaultlibs compiler option
-nolib-inline compiler option
-nostartfiles compiler option
-nostdinc++ compiler option
-nostdlib compiler option
-o compiler option
-O compiler option
-Ofast compiler option
-Os compiler option
-p compiler option
-P compiler option
-par-affinity compiler option
-par-loops compiler option
-par-num-threads compiler option
-par-report compiler option
qopt-report, Qopt-report
See -qopt-report
-par-runtime-control compiler option
-par-schedule compiler option
-par-threshold compiler option
-parallel compiler option
-parallel-source-info compiler option
-pc compiler option
-pch compiler option
-pch-create compiler option
-pch-dir compiler option
-pch-use compiler option
-pie compiler option
-pragma-optimization-level compiler option
-prec-div compiler option
-prec-sqrt compiler option
-print-multi-lib compiler option
-print-sysroot compiler option (Linux* only)
-prof-data-order compiler options
-prof-dir compiler option
-prof-file compiler option
-prof-func-groups compiler option (Linux* only)
-prof-func-order compiler options
-prof-gen compiler option
Profile an Application with Instrumentation
prof-gen, Qprof-gen
-prof-gen-sampling compiler option (Linux* only)
-prof-gen:srcpos compiler option
Profile an Application with Instrumentation
code coverage tool
test priorization tool
-prof-hotness-threshold compiler option
-prof-src-dir compiler option
-prof-src-root compiler option
-prof-src-root-cwd compiler option
-prof-use compiler option
prof-use, Qprof-use
code coverage tool
profmerge utility
-prof-use-sampling compiler option (Linux* only)
-prof-value-profiling compiler option
-profile-functions compiler option
-profile-loops compiler option
-profile-loops-report compiler option
-pthread compiler option
-qcf-protection compiler option
-qdiag-disable linking option
-qdiag-enable linking option
-qhelp linking option
-Qinstall compiler option
-Qlocation compiler option
-qno-offload compiler option
-qoffload compiler option
-qoffload-arch compiler option (Linux* only)
-qoffload-attribute-target compiler option
-qoffload-option compiler option
-qopenmp compiler option
-qopenmp option
-qopenmp-lib compiler option
OpenMP* Source Compatibility and Interoperability with Other Compilers
qopenmp-lib, Qopenmp-lib
-qopenmp-link compiler option
-qopenmp-offload compiler option
-qopenmp-report compiler option
qopt-report, Qopt-report
See -qopt-report
-qopenmp-simd compiler option
-qopenmp-stubs compiler option
-qopenmp-threadprivate compiler option
-qopt-args-in-regs compiler option
-qopt-assume-safe-padding compiler option
-qopt-block-factor compiler option
-qopt-calloc compiler option (Linux only)
-qopt-class-analysis compiler option
-qopt-dynamic-align compiler option
-qopt-jump-tables compiler option
-qopt-malloc-options compiler option
-qopt-matmul compiler option
-qopt-mem-layout-trans compiler option
-qopt-multi-version-aggressive compiler option
-qopt-prefetch compiler option
-qopt-prefetch-distance compiler option
-qopt-prefetch-issue-excl-hint compiler option
-qopt-ra-region-strategy compiler option
-qopt-report compiler option
-qopt-report-annotate compiler option
-qopt-report-annotate-position compiler option
-qopt-report-embed compiler option
-qopt-report-file compiler option
-qopt-report-filter compiler option
-qopt-report-format compiler option
-qopt-report-help compiler option
-qopt-report-names compiler option
-qopt-report-per-object compiler option
-qopt-report-phase compiler option
-qopt-report-routine compiler option
-qopt-streaming-stores compiler option
-qopt-subscript-in-range compiler option
-qopt-threads-per-core compiler option
-qopt-zmm-usage compiler option
-Qoption compiler option
-qoverride-limits compiler option
-qp compiler option
-qsimd-honor-fp-model compiler option
-qsimd-serialize-fp-reduction compiler option
-rcd compiler option
-regcall compiler option
-restrict compiler option
-S compiler option
-save-temps compiler option
-scalar-rep compiler option
-shared compiler option
Using Intel Shared Libraries
Creating Libraries
-shared compiler option (Linux* only)
-shared-intel compiler option
shared-intel
Using Intel Shared Libraries
-shared-libgcc compiler option (Linux* only)
-simd compiler option
-simd-function-pointers compiler option
-sox compiler option (Linux* only)
-static compiler option (Linux* only)
-static-intel compiler option
-static-libgcc compiler option (Linux* only)
-static-libstdc++ compiler option (Linux* only)
-staticlib compiler option (macOS*)
-std compiler option
-stdlib compiler option (macOS*)
-strict-ansi compiler option
-T compiler option (Linux* only)
-tbb compiler option
-tcollect compiler option
-tcollect-filter compiler option
-traceback compiler option
-u compiler option
-U compiler option
-undef compiler option
-unroll compiler option
-unroll-aggressive compiler option
-use-asm compiler option
-use-intel-optimized-headers compiler option
-use-msasm compiler option
-v compiler option
-V compiler option
-vec compiler option
-vec-guard-write compiler option
-vec-report compiler option
qopt-report, Qopt-report
See -qopt-report
-vec-threshold compiler option
-vecabi compiler option
-w compiler option
w
w, W
-Wa compiler option
-Wabi compiler option
-Wall compiler option
-watch compiler option
-Wbrief compiler option
-Wcheck compiler option
-Wcomment compiler option
-Wcontext-limit compiler option
-wd compiler option
-Wdeprecated compiler option
-we compiler option
-Weffc++ compiler option
-Werror compiler option
-Werror-all compiler option
-Wextra-tokens compiler option
-Wformat compiler option
-Wformat-security compiler option
-Wic-pointer compiler option
-Winline compiler option
-Wl compiler option
-Wmain compiler option
-Wmissing-declarations compiler option
-Wmissing-prototypes compiler option
-wn compiler option
-Wnon-virtual-dtor compiler option
-wo compiler option
-Wp compiler option
-Wp64 compiler option
-Wpch-messages compiler option
-Wpointer-arith compiler option
-wr compiler option
-Wremarks compiler option
-Wreorder compiler option
-Wreturn-type compiler option
-Wshadow compiler option
-Wsign-compare compiler option
-Wstrict-aliasing compiler option
-Wstrict-prototypes compiler option
-Wtrigraphs compiler option
-Wuninitialized compiler option
-Wunknown-pragmas compiler option
-Wunused-function compiler option
-Wunused-variable compiler option
-ww compiler option
-Wwrite-strings compiler option
-x (type) compiler option
-x compiler option
-X compiler option
-xHost compiler option
-Xlinker compiler option
-Zp compiler option
.dpi file
Profmerge and Proforder Tools
Test Prioritization Tool
Code Coverage Tool
.dyn file
Profmerge and Proforder Tools
Test Prioritization Tool
Code Coverage Tool
.dyn files
.spi file
Test Prioritization Tool
Code Coverage Tool
/arch compiler option
/bigobj compiler option
/c compiler option
/C compiler option
/check compiler option
/D compiler option
/debug compiler option
/E compiler option
/EH compiler option
/EP compiler option
/F compiler option
/Fa compiler option
/FA compiler option
/fast compiler option
/FC compiler option
/Fd compiler option
/FD compiler option
/Fe compiler option
/FI compiler option
/fixed compiler option
/Fm compiler option
/Fo compiler option
/fp compiler option
fp-model, fp
how to use
/Fp compiler option
/fp:consistent compiler option
/Fr compiler option
/FR compiler option
/GA compiler option
/Gd compiler option
/Ge compiler option
/Gf compiler option
/GF compiler option
/Gh compiler option
/GH compiler option
/Gm compiler option
/Gr compiler option
/GR compiler option
/Gs compiler option
/GS compiler option
/GT compiler option
/guard compiler option
/guard:cf compiler option
/Gv compiler option
/GX compiler option
/Gy compiler option
/Gz compiler option
/GZ compiler option
/H compiler option
/help compiler option
/homeparams compiler option
/hotpatch compiler option
/I compiler option
/I- compiler option
/J compiler option
/LD compiler option
Creating Libraries
LD
/link compiler option
/MD compiler option
MD
Creating Libraries
/MP compiler option
/MP-force compiler option
/MT compiler option
Creating Libraries
MT
/noBool compiler option
/nologo compiler option
/O compiler option
/Oa compiler option
/Ob compiler option
/Od compiler option
/Oi compiler option
/openmp compiler option
/Os compiler option
/Ot compiler option
/Ow compiler option
/Ox compiler option
/Oy compiler option
/P compiler option
/pdbfile compiler option
/QA compiler option
/Qalias-args
/Qalias-const compiler option
/Qalign-loops compiler option
/Qansi-alias compiler option
/Qansi-alias-check compiler option
/Qauto-arch compiler option
/Qauto-ilp32 compiler option
/Qax compiler option
/Qcf-protection compiler option
/Qcheck-pointers compiler option
/Qcheck-pointers-dangling compiler option
/Qcheck-pointers-mpx compiler option
/Qcheck-pointers-narrowing compiler option
/Qcheck-pointers-undimensioned compiler option
/Qcilk-serialize compiler option
/Qcomplex-limited-range compiler option
/Qconditional-branch compiler option
/Qcontext-limit compiler option
/Qcov-dir compiler option
/Qcov-file compiler option
/Qcov-gen compiler option
Profile an Application with Instrumentation
Qcov-gen
code coverage tool
/Qcxx-features compiler option
/Qdaal compiler option
/QdD compiler option
/Qdiag compiler option
/Qdiag-disable compiler option
/Qdiag-disable:all compiler option
/Qdiag-dump compiler option
/Qdiag-enable compiler option
/Qdiag-enable:all compiler option
/Qdiag-enable:power compiler option
/Qdiag-error compiler option
/Qdiag-error-limit compiler option
/Qdiag-file compiler option
/Qdiag-file-append compiler option
/Qdiag-id-numbers compiler option
/Qdiag-once compiler option
/Qdiag-remark compiler option
/Qdiag-warning compiler option
/QdM compiler option
/QdN compiler option
/Qeffc++ compiler option
/Qeliminate-unused-debug-types compiler option
/Qextend-arguments compiler option
/Qfast-transcendentals compiler option
/Qfma compiler option
/Qfnalign compiler option
/Qfnsplit compiler option
/Qfp-port compiler option
/Qfp-speculation compiler option
/Qfp-stack-check compiler option
/Qfp-trap compiler option
/Qfp-trap-all compiler option
/Qfreestanding compiler option
/Qftz compiler option
Setting the FTZ and DAZ Flags
ftz, Qftz
/Qgcc-dialect compiler option
/Qglobal-hoist compiler option
/Qguide compiler option
/Qguide-data-trans compiler option
/Qguide-file compiler option
/Qguide-file-append compiler option
/Qguide-opts compiler option
/Qguide-par compiler option
/Qguide-profile compiler option
/Qguide-vec compiler option
/QH compiler option
/Qicl compiler option
/Qimf-absolute-error compiler option
/Qimf-accuracy-bits compiler option
/Qimf-arch-consistency compiler option
/Qimf-domain-exclusion compiler option
/Qimf-force-dynamic-target compiler option
/Qimf-max-error compiler option
/Qimf-precision compiler option
/Qimf-use-svml compiler option
/Qinline-calloc compiler option
/Qinline-dllimport compiler option
/Qinline-factor compiler option
/Qinline-forceinline compiler option
/Qinline-max-per-compile compiler option
/Qinline-max-per-routine compiler option
/Qinline-max-size compiler option
/Qinline-max-total-size compiler option
/Qinline-min-caller-growth compiler option
/Qinline-min-size compiler option
/Qinstruction compiler option
/Qinstrument-functions compiler option
/Qintel-extensions compiler option
/Qip compiler option
/Qip-no-inlining compiler option
/Qip-no-pinlining compiler option
/Qipo compiler option
Using IPO
ipo, Qipo
/Qipo-c compiler option
/Qipo-jobs compiler option
/Qipo-S compiler option
/Qipo-separate compiler option
/Qipp compiler option
/Qipp-link compiler option
/Qkeep-static-consts compiler option
/Qlocation compiler option
/Qlong-double compiler option
/QM compiler option
/Qm32 compiler option
/Qm64 compiler option
/Qmap-opts compiler option
/QMD compiler option
/QMF compiler option
/QMG compiler option
/Qmkl compiler option
/QMM compiler option
/QMMD compiler option
/Qms compiler option
/QMT compiler option
/Qmultibyte-chars compiler option
/Qno-builtin-name compiler option
/Qnobss-init compiler option
/Qoffload compiler option
/Qoffload- compiler option
/Qoffload-arch compiler option
/Qoffload-attribute-target compiler option
/Qoffload-option compiler option
/Qopenmp compiler option
/Qopenmp option
/Qopenmp-lib compiler option
OpenMP* Source Compatibility and Interoperability with Other Compilers
qopenmp-lib, Qopenmp-lib
/Qopenmp-offload compiler option
/Qopenmp-report compiler option
qopt-report, Qopt-report
See -qopt-report
/Qopenmp-simd compiler option
/Qopenmp-stubs compiler option
/Qopenmp-threadprivate compiler option
/Qopt-args-in-regs compiler option
/Qopt-assume-safe-padding compiler option
/Qopt-block-factor compiler option
/Qopt-class-analysis compiler option
/Qopt-dynamic-align compiler option
/Qopt-jump-tables compiler option
/Qopt-matmul compiler option
/Qopt-mem-layout-trans compiler option
/Qopt-multi-version-aggressive compiler option
/Qopt-prefetch compiler option
/Qopt-prefetch-distance compiler option
/Qopt-prefetch-issue-excl-hint compiler option
/Qopt-ra-region-strategy compiler option
/Qopt-report compiler option
/Qopt-report-annotate compiler option
/Qopt-report-annotate-position compiler option
/Qopt-report-embed compiler option
/Qopt-report-file compiler option
/Qopt-report-filter compiler option
/Qopt-report-format compiler option
/Qopt-report-help compiler option
/Qopt-report-names compiler option
/Qopt-report-per-object compiler option
/Qopt-report-phase compiler option
/Qopt-report-routine compiler option
/Qopt-streaming-stores compiler option
/Qopt-subscript-in-range compiler option
/Qopt-threads-per-core compiler option
/Qopt-zmm-usage compiler option
/Qoption compiler option
/Qoverride-limits compiler option
/Qpar-adjust-stack compiler option
/Qpar-affinity compiler option
/Qpar-loops compiler option
/Qpar-num-threads compiler option
/Qpar-report compiler option
qopt-report, Qopt-report
See -qopt-report
/Qpar-runtime-control compiler option
/Qpar-schedule compiler option
/Qpar-threshold compiler option
/Qparallel compiler option
/Qparallel-source-info compiler option
/Qpatchable-addresses compiler option
/Qpc compiler option
/Qpchi compiler option
/Qprec compiler option
/Qprec-div compiler option
/Qprec-sqrt compiler option
/Qprof-data-order compiler option
/Qprof-dir compiler option
/Qprof-file compiler option
/Qprof-func-order compiler option
/Qprof-gen compiler option
Profile an Application with Instrumentation
prof-gen, Qprof-gen
/Qprof-gen:srcpos compiler option
Profile an Application with Instrumentation
code coverage tool
test priorization tool
/Qprof-hotness-threshold compiler option
/Qprof-src-dir compiler option
/Qprof-src-root compiler option
/Qprof-src-root-cwd compiler option
/Qprof-use compiler option
prof-use, Qprof-use
profmerge utility
/Qprof-use compiler option
Code Coverage Tool
code coverage tool
/Qprof-value-profiling compiler option
/Qprofile-functions compiler option
/Qprofile-loops compiler option
/Qprofile-loops-report compiler option
/Qprotect-parens compiler option
/Qrcd compiler option
/Qregcall compiler option
/Qrestrict compiler option
/Qsafeseh compiler option
/Qsave-temps compiler option
/Qscalar-rep compiler option
/Qsfalign compiler option
/Qsimd compiler option
/Qsimd-function-pointers compiler option
/Qsimd-honor-fp-model compiler option
/Qsimd-serialize-fp-reduction compiler option
/Qstd compiler option
/Qstringop-inline-threshold compiler option
/Qstringop-strategy compiler option
/Qtbb compiler option
/Qtcollect compiler option
/Qtcollect-filter compiler option
/Qtemplate-depth compiler option
/Qtrapuv compiler option
/Qunroll compiler option
/Qunroll-aggressive compiler option
/Quse-asm compiler option
/Quse-intel-optimized-headers compiler option
/Quse-msasm-symbols compiler option
/QV compiler option
/Qvc compiler option
/Qvec compiler option
/Qvec-guard-write compiler option
/Qvec-report compiler option
qopt-report, Qopt-report
See -qopt-report
/Qvec-threshold compiler option
/Qvecabi compiler option
/Qvla compiler option
/Qwd compiler option
/Qwe compiler option
/Qwn compiler option
/Qwo compiler option
/Qwr compiler option
/Qww compiler option
/Qx compiler option
/QxHost compiler option
/Qzero-initialized-in-bss compiler option
/RTC compiler option
/S compiler option
/showIncludes compiler option
/Tc compiler option
/TC compiler option
/Tp compiler option
/TP compiler option
/traceback compiler option
/tune compiler option
/u compiler option
/U compiler option
/V compiler option
/vd compiler option
/vmb compiler option
/vmg compiler option
/vmm compiler option
/vms compiler option
/vmv compiler option
/w compiler option
/W compiler option
/Wall compiler option
/watch compiler option
/Wcheck compiler option
/Werror-all compiler option
/WL compiler option
/Wp64 compiler option
/Wpch-messages compiler option
/Wport compiler option
/WX compiler option
/X compiler option
/Y- compiler option
/Yc compiler option
/Yd compiler option
/Yu compiler option
/Z7 compiler option
/Za compiler option
/Zc compiler option
/Ze compiler option
/Zg compiler option
/Zi compiler option
/ZI compiler option
/Zl compiler option
Creating Libraries
Zl
/Zo compiler option
/Zp compiler option
/Zs compiler option
3rd Generation Intel® Core™ Processor Instruction Extensions
4th Generation Intel® Core™ Processor Instruction Extensions
64-bit executable
Building the Target
building
absolute error
fimf-absolute-error, Qimf-absolute-error
option defining for math library function results
access_by
adding a source file
adding additional Options for Target Compilation, Target Linking, and Target Archiver
adding additional Options for Target Linking
adding files
adding the compiler
Adding the Compiler to Eclipse*
in Eclipse*
advanced PGO options
Advanced Vector Extensions
Intrinsics for Arithmetic Operations
Intrinsics for Bitwise Operations
Intrinsics for Blend and Conditional Merge Operations
Intrinsics for Compare Operations
Intrinsics for Conversion Operations
Intrinsics for Load and Store Operations
Intrinsics to Determine Minimum and Maximum Values
Intrinsics for Miscellaneous Operations
Overview: Intrinsics for Intel® Advanced Vector Extensions Instructions
Details of Intel® Advanced Vector Extensions Intrinsics
Intrinsics for Packed Test Operations
Intrinsics for Permute Operations
Intrinsics for Shuffle Operations
Intrinsics for Unpack and Interleave Operations
Intrinsics Generating Vectors of Undefined Values
Support Intrinsics for Vector Typecasting Operations
arithmetic operations
bitwise logical operations
blend and conditional merge operations
compare operations
conversion operations
load operations
minimum and maximum operations
miscellaneous operations
overview
Overview: Intrinsics for Intel® Advanced Vector Extensions Instructions
Details of Intel® Advanced Vector Extensions Intrinsics
packed test operations
permute operations
shuffle operations
store operations
unpack and interleave operations
vector generation operations
vector typecasting operations
Advanced Vector Extensions 2
Intrinsics for Arithmetic Operations
Intrinsics for Arithmetic Shift Operations
Intrinsics for Operations to Manipulate Integer Data at Bit-Granularity
Intrinsics for Bitwise Operations
Intrinsics for Blend Operations
Intrinsics for Broadcast Operations
Intrinsics for Compare Operations
Intrinsics for Fused Multiply Add Operations
Intrinsics for GATHER Operations
Intrinsics for Insert/Extract Operations
Intrinsics for Masked Load/Store Operations
Intrinsics for Logical Shift Operations
Intrinsics for Miscellaneous Operations
Intrinsics for Pack/Unpack Operations
Intrinsics for Packed Move with Extend Operations
Intrinsics for Permute Operations
Intrinsics for Shuffle Operations
arithmetic operations
arithmetic shift operations
bit manipulation operations
bitwise logical operations
blend operations
broadcast operations
compare operations
fused multiply-add (FMA) operations
GATHER operations
insert and extract operations
load and store operations
logical shift operations
miscellaneous operations
pack and unpack operations
packed move operations
permute operations
shuffle operations
affinity, OpenMP*, specifying
aliasing
ffnalias, Ow
falias, Oa
option specifying assumption in functions
option specifying assumption in programs
align
Function Annotations and the SIMD Directive for Vectorization
attribute
align_value
attribute
aligned
align
attribute
aligned_new
aligned_offset
aligning data
alloc
alloc_section
ALLOCATABLE
Code Coverage Tool
Programming with Auto-parallelization
basic block
code coverage
data flow
visual presentation
alternate compiler options
alternate tools and locations
amplxe-pgo-report
annotated source listing
qopt-report-annotate, Qopt-report-annotate
qopt-report-annotate-position, Qopt-report-annotate-position
option enabling
option specifying position of messages
ANSI/ISO standard
aos1d_container
n_container
n_index_generator
Constructing an n_container
Shape
n_index_t (needs new content)
aos1d_container
max_val
index_d template function
n_extent_generator
Bounds
make_ n_container template function
extent_d template function
aos1d_container::accessor
n_bounds_t
Accessor Concept
n_bounds_generator
soa1d_container::accessor and aos1d_container::accessor
bounds_t
bounds_d Template Function
sdlt::bounds Template Function
aos1d_container::const_accessor
APIs for coprocessor
application tests
applications
Redistributing Libraries When Deploying Applications
O
deploying
option specifying code optimization for
ar tool
array notations
C/C++ Extensions for Array Notations Overview
array declarations
array section
assignment maps
function maps
gather and scatter operations
multi-dimensional casting operations
operator maps
programming hints
reduction and shift operations
assembler
Wa
use-asm, Quse-asm
option passing options to
option producing objects through
assembler output file
masm
option specifying a dialect for
assembly files
Specifying Assembly Files
naming
assembly listing file
Fa
option specifying generation of
asynchronous data transfer
About Asynchronous Computation (Intel® MIC Architecture)
About Asynchronous Data Transfer (Intel® MIC Architecture)
Asynchronous I/O async_class methods
clear_queue
get_error_operation_id
get_last_error
get_last_operation_id
get_status
resume_queue
stop_queue
wait
clear_queue()
get_error_operation_id()
get_last_error()
get_last_operation_id()
get_status()
resume_queue()
stop_queue()
wait()
Asynchronous I/O Extensions
Intel's C++ Asynchronous I/O Extensions for Windows* Operating Systems
Intel's C++ Asynchronous I/O Library for Windows* Operating Systems
Intel's C++ Asynchronous I/O Class for Windows* Operating Systems
introduction
library
template class
Asynchronous I/O library functions
aio_cancel
aio_error
aio_fsync
aio_read
aio_return
aio_suspend
aio_write
Handling Errors Caused by Asynchronous I/O Functions
Example for aio_cancel Function
Example for aio_error and aio_return Functions
Example for aio_read and aio_write Functions
Example for aio_suspend Function
Example for lio_listio Function
lio_listio
aio_cancel()
aio_error()
aio_fsync()
aio_read()
aio_return()
aio_suspend()
aio_write()
errno macro
Error Handling
examples
Example for aio_cancel Function
Example for aio_error and aio_return Functions
Example for aio_read and aio_write Functions
Example for aio_suspend Function
Example for lio_listio Function
aio_cancel()
aio_error()
aio_read()
Example for aio_read and aio_write Functions
aio_write()
aio_return
aio_suspend()
aio_write()
lio_listio()
lio_listio()
Asynchronous I/O template class
Template Class async_class
async_class
thread_control
attribute
align
align_value
avoid_false_share
code_align
concurrency_safe
const
cpu_dispatch
cpu_specific
mpx
target
vector
vector_variant
align
align_value
aligned
avoid_false_share
code_align
concurrency_safe
const
cpu_dispatch
cpu_specific
mpx
target
vector
vector_variant
attribute for placing variables and functions on coprocessor
attribute, applying target
auto parallelism
guide-par, Qguide-par
option setting guidance for
auto-parallelism
guide, Qguide
option setting guidance for
auto-parallelization
Automatic Parallelization
enabling
environment variables
guidelines
overview
programming with
Auto-parallelization
Language Support for Auto-parallelization
language support
auto-parallelizer
Automatic Parallelization
option enabling generation of multithreaded code
option setting threshold for loops
auto-vectorization
Avoiding Mixed Data Type Arithmetic Expressions
option setting guidance for
guide, Qguide
guide-vec, Qguide-vec
auto-vectorization hints
auto-vectorization of innermost loops
auto-vectorizer
Vectorization
Automatic Vectorization Overview
Automatic Vectorization
Using Automatic Vectorization
AVX
Vectorization
Automatic Vectorization Overview
Automatic Vectorization
SSE
Vectorization
Automatic Vectorization Overview
Automatic Vectorization
SSE2
Vectorization
Automatic Vectorization Overview
Automatic Vectorization
SSE3
Vectorization
Automatic Vectorization Overview
Automatic Vectorization
SSSE3
Vectorization
Automatic Vectorization Overview
Automatic Vectorization
using
automatically-aligned dynamic allocation
avoid
Avoiding Mixed Data Type Arithmetic Expressions
inefficient data types
mixed arithmetic expressions
avoid_false_share
attribute
AVX
Intrinsics for Arithmetic Operations
Intrinsics for Bitwise Operations
Intrinsics for Blend and Conditional Merge Operations
Intrinsics for Compare Operations
Intrinsics for Conversion Operations
Intrinsics for Load and Store Operations
Intrinsics to Determine Minimum and Maximum Values
Intrinsics for Miscellaneous Operations
Overview: Intrinsics for Intel® Advanced Vector Extensions Instructions
Details of Intel® Advanced Vector Extensions Intrinsics
Intrinsics for Packed Test Operations
Intrinsics for Permute Operations
Intrinsics for Shuffle Operations
Intrinsics for Unpack and Interleave Operations
Intrinsics Generating Vectors of Undefined Values
Support Intrinsics for Vector Typecasting Operations
arithmetic operations
bitwise logical operations
blend and conditional merge operations
compare operations
conversion operations
load operations
minimum and maximum operations
miscellaneous operations
overview
Overview: Intrinsics for Intel® Advanced Vector Extensions Instructions
Details of Intel® Advanced Vector Extensions Intrinsics
packed test operations
permute operations
shuffle operations
store operations
unpack and interleave operations
vector generation operations
vector typecasting operations
AVX2
Intrinsics for Arithmetic Operations
Intrinsics for Arithmetic Shift Operations
Intrinsics for Operations to Manipulate Integer Data at Bit-Granularity
Intrinsics for Bitwise Operations
Intrinsics for Blend Operations
Intrinsics for Broadcast Operations
Intrinsics for Compare Operations
Intrinsics for Fused Multiply Add Operations
Intrinsics for GATHER Operations
Intrinsics for Insert/Extract Operations
Intrinsics for Masked Load/Store Operations
Intrinsics for Logical Shift Operations
Intrinsics for Miscellaneous Operations
Intrinsics for Pack/Unpack Operations
Intrinsics for Packed Move with Extend Operations
Intrinsics for Permute Operations
Intrinsics for Shuffle Operations
arithmetic operations
arithmetic shift operations
bit manipulation operations
bitwise logical operations
blend operations
broadcast operations
compare operations
fused multiply-add (FMA) operations
GATHER operations
insert and extract operations
load and store operations
logical shift operations
miscellaneous operations
pack and unpack operations
packed move operations
permute operations
shuffle operations
base platform toolset
bit fields and signs
block_loop
building a project
with Eclipse*
building an application that runs natively on Intel® MIC Architecture
building multiple projects
building targets
Building the Target
for macOS*
building with Intel® C++
builds
Performing Parallel Project Builds
parallel project
C++0x
std, Qstd
option enabling support of
C++11
std, Qstd
option enabling support of
c99
std, Qstd
option enabling support of
calling conventions
capturing IPO output
changing number of threads
OpenMP* Pragmas Summary
summary table of
checking
Checking the Floating-point Stack State
floating-point stacks
stacks
Checking floating-point stack state
cilk grainsize
Clang compiler
clang-name
option specifying name of
Clang++ compiler
clangxx-name
option specifying the name of
Class Libraries
C++ Classes and SIMD Operations
Capabilities of C++ SIMD Classes
Terms, Conventions, and Syntax Defined
Arithmetic Operators
Cacheability Support Operators
Compare Operators
Conditional Select Operators for Fvec Classes
Constructors and Initialization
Conversions
Data Alignment
Debug Operations
Load and Store Operators
Logical Operators
Minimum and Maximum Operators
Move Mask Operators
Fvec Notation Conventions
Floating-point Vector Classes
Unpack Operators
Addition and Subtraction Operators
Assignment Operator
Clear MMX™ State Operator
Comparison Operators
Conditional Select Operators
Conversions between Fvec and Ivec
Debug Operations
Integer Functions for Streaming SIMD Extensions
Integer Vector Classes
Logical Operators
Multiplication Operators
Pack Operators
Rules for Operators
Shift Operators
Unpack Operators
Classes Quick Reference
C++ classes and SIMD operations
capabilities of C++ SIMD classes
conventions
floating-point vector classes
Arithmetic Operators
Cacheability Support Operators
Compare Operators
Conditional Select Operators for Fvec Classes
Constructors and Initialization
Conversions
Data Alignment
Debug Operations
Load and Store Operators
Logical Operators
Minimum and Maximum Operators
Move Mask Operators
Fvec Notation Conventions
Floating-point Vector Classes
Unpack Operators
arithmetic operators
cacheability support operators
compare operators
conditional select operators
constructors and initialization
conversions
data alignment
debug operators
load operators
logical operators
minimum and maximum operators
move mask operators
notation conventions
overview
store operators
unpack operators
integer vector classes
Addition and Subtraction Operators
Assignment Operator
Clear MMX™ State Operator
Comparison Operators
Conditional Select Operators
Conversions between Fvec and Ivec
Debug Operations
Integer Functions for Streaming SIMD Extensions
Integer Vector Classes
Logical Operators
Multiplication Operators
Pack Operators
Rules for Operators
Shift Operators
Unpack Operators
addition operators
Addition and Subtraction Operators
subtraction operators
assignment operator
clear MMX™ state operators
comparison operators
conditional select operators
conversions between fvec and ivec
debug operators
Debug Operations
element access operator
element assignment operators
functions for SSE
ivec classes
logical operators
multiplication operators
pack operators
rules for operators
shift operators
unpack operators
Quick reference
syntax
terms
Classes
Programming Example
programming example
code
Methods to Optimize Code Size
m
ax, Qax
arch
march
xHost, QxHost
x, Qx
Writing Target-Specific Code Using a Pragma
Offload Using a Pragma
Writing Target-Specific Code with _Cilk_offload
Writing Code that Should Not Be Built for CPU-Only Execution
methods to optimize size of
option generating feature-specific
m
ax, Qax
option generating feature-specific for Windows* OS
option generating for specified CPU
option generating specialized
option generating specialized and optimized
target-specific
Writing Target-Specific Code Using a Pragma
Offload Using a Pragma
Writing Target-Specific Code with _Cilk_offload
writing only for offload or native builds
code
Microsoft Compatibility
mixing managed and unmanaged
Code Coverage
Options: Code Coverage dialog box
in Microsoft Visual Studio*
Code Coverage dialog box
code coverage tool
color scheme
exporting data
pgopti.dpi file
pgopti.spi file
syntax of
code coverage tool
Code Coverage Tool
dynamic counters in
code layout
code size
Methods to Optimize Code Size
Disable or Decrease the Amount of Inlining
Disable Recognition and Expansion of Intrinsic Functions
Disable Loop Unrolling
Dynamically Link Intel-Provided Libraries
Exclude Unused Code and Data from the Executable
Optimize Exception Handling Data on Linux and macOS* Systems
Disable Passing Arguments in Registers Instead of On the Stack
Strip Symbols from Your Binaries
Disable Automatic Vectorization
Avoid Unnecessary 16-Byte Alignment
Avoid References to Compiler-Specific Libraries
methods to optimize
option affecting inlining
option disabling expansion of functions
option disabling loop unrolling
option dynamically linking libraries
option excluding data
option for certain exception handling
option passing arguments in registers
option stripping symbols
Strip Symbols from Your Binaries
Disable Automatic Vectorization
option to avoid 16-byte alignment (Linux* only)
option to avoid library references
code_align
attribute
codecov tool
Qcov-gen
Qcov-dir
Qcov-file
option producing an instrumented file for
option specifying a directory for profiling output for
option specifying a file name for summary files for
command line
command-line window
Using the Command Line on Windows*
setting up
compatibility
Microsoft Compatibility
with Microsoft* Visual Studio*
compilation phases
compilation units
Developer Directed Inline Expansion of User Functions
option to prevent linking as shareable object
compiler
Introducing the Intel® C++ Compiler
Related Information
overview
Introducing the Intel® C++ Compiler
Related Information
compiler
Compilation Phases
compilation phases
compiler command-line options
grecord-gcc-switches
option recording
compiler differences
Compilation and Execution Differences
between Intel® C++ and Microsoft* Visual C++*
compiler directives
Explicit Vector Programming
Vectorization
Automatic Vectorization Overview
Automatic Vectorization
for vectorization
Explicit Vector Programming
Vectorization
Automatic Vectorization Overview
Automatic Vectorization
compiler information
Saving Compiler Information in Your Executable
saving in your executable
compiler installation
Qinstall
option specifying root directory for
compiler operation
Understanding File Extensions
Invoking the Intel® Compiler
input files
invoking from the command line
compiler operation
About Building Native Intel® MIC Architecture Applications
building an application that runs natively on Intel® MIC Architecture
Compiler Optimization Report window
compiler option mapping tool
compiler options
Alphabetical List of Compiler Options
Alternate Compiler Options
Deprecated and Removed Compiler Options
Specifying Symbol Visibility Explicitly (Linux* and macOS*)
General Rules for Compiler Options
Displaying Options Passed to Offload Compilation
Displaying General Option Information From the Command Line
Passing Options to the Linker
New Options
map-opts, Qmap-opts
sox
What Appears in the Compiler Option Descriptions
alphabetical list of
alternate
deprecated and removed
for visibility
general rules for
how to display informational lists
Displaying Options Passed to Offload Compilation
Displaying General Option Information From the Command Line
linker-related
new
option mapping to equivalents
option saving in executable or object file
overview of descriptions of
compiler options
Using Compiler Options
Other Considerations
Other Considerations
Portability Options
GCC-Compatible Warning Options
command-line syntax
for optimization
Other Considerations
Other Considerations
for portability
gcc-compatible warning
option categories
using
compiler reports
Requesting Compiler Reports with the xi* Tools
requesting with xi* tools
compiler selection
Selecting the Compiler Version
in Visual Studio*
compiler setup
compilers
Multi-Version Compiler Support
using multiple versions
compilervars environment script
compilervars.bat
Specifying the Location of Compiler Components with compilervars
Other Considerations
compilervars.csh
compilervars.sh
compiling
Other Considerations
compiling considerations
compiling
Other Considerations
gcc* code with Intel® C++ Compiler
compiling considerations
compiling large programs
compiling with IPO
complex operations
complex-limited-range, Qcomplex-limited-range
option enabling algebraic expansion of
concurrency_safe
attribute
conditional check
vec-guard-write, Qvec-guard-write
option performing in a vectorized loop
conditional parallel region execution
Compiler Directed Inline Expansion of Functions
inline expansion
configuration files
configurations
Selecting a Configuration
debug and release
console
watch
option displaying information to
const
attribute
control-flow enforcement technology protection
qcf-protection, Qcf-protection
option enabling
conventions
Notational Conventions
in the documentation
converting to Intel® C++ Compiler project system
coprocessorThread allocation on processor
correct usage of countable loop
COS
Loop Constructs
correct usage of
counters for dynamic profile
Resetting the Dynamic Profile Counters
Resetting Profile Information
CPU
march
option generating code for specified
CPU dispatch
CPU time
Test Prioritization Tool
Inline Expansion of Functions
DPI lists
for inline function expansion
cpu_dispatch
attribute
cpu_specific
attribute
cpuid
create libraries using IPO
creating
Creating a New Project
projects
creating a new project
in Eclipse*
cross compilation
About Building Native Intel® MIC Architecture Applications
about
data alignment
data alignment optimizations
qopt-dynamic-align, Qopt-dynamic-align
option disabling dynamic
data format
Programming with Auto-parallelization
High-Level Optimization (HLO)
Explicit Vector Programming
Vectorization
Automatic Vectorization Overview
Automatic Vectorization
partitioning
prefetching
type
Explicit Vector Programming
Vectorization
Automatic Vectorization Overview
Automatic Vectorization
data ordering optimization
data transformation
guide, Qguide
guide-data-trans, Qguide-data-trans
option setting guidance for
guide, Qguide
guide-data-trans, Qguide-data-trans
data types
Using Efficient Data Types
efficiency
data, moving from one variable to another
data, transferring asynchronously
About Asynchronous Computation (Intel® MIC Architecture)
About Asynchronous Data Transfer (Intel® MIC Architecture)
dataflow analysis
DAZ flag
debug information
Linking Debug Information
in program database file
option generating full
option generating in DWARF 2 format
option generating in DWARF 3 format
option generating in DWARF 4 format
option generating levels of
debugging
Linking Debug Information
option affecting information generated
debug (Linux* and macOS*)
debug (Windows*)
option specifying settings to enhance
debug (Linux* and macOS*)
debug (Windows*)
denormal exceptions
denormal numbers
denormal results
ftz, Qftz
option flushing to zero
denormalized numbers (IEEE*)
Special Values
NaN values
denormals
deploying applications
deprecated compiler options
diagnostic messages
diag, Qdiag
diag-once, Qdiag-once
diag-file, Qdiag-file
option affecting which are issued
option controlling auto-parallelizer
option controlling display of
option controlling OpenMP
option controlling vectorizer
option enabling or disabling
option issuing only once
option sending to file
diagnostics
dialog boxes
Code Coverage dialog box
Code Coverage Settings dialog box
Configure Analysis dialog box
Options: Intel® Performance Libraries dialog box
Options: Code Coverage dialog box
Options: Compilers dialog box
Options: Converter dialog box
Options: Profile Guided Optimization (PGO) dialog box
Profile Guided Optimization dialog box
Use Intel® C++ dialog box
Code Coverage
Code Coverage Settings
Configure Analysis
Intel® Performance Libraries
Options: Code Coverage
Options: Compilers
Options: Converter)
Options: Intel® Performance Libraries
Options: Profile Guided Optimization
PGO dialog box
Profile Guided Optimization dialog box
Use Intel C++
difference operators
differential coverage
directory
isystem
B
option adding to start of include path
option specifying for executables
option specifying for includes and libraries
directory paths
Specifying Directory Paths
in Microsoft Visual Studio*
disabling
Compiler Directed Inline Expansion of Functions
inlining
distribute_point
distributing applications
dllimport functions
Qinline-dllimport
option controlling inlining of
DO constructs
documentation
Notational Conventions
conventions for
driver tool commands
v
dryrun
option specifying to show and execute
option specifying to show but not execute
dsymutil
dual core thread affinity
dumping profile information
Dumping Profile Information
Interval Profile Dumping
DWARF debug information
gsplit-dwarf
option creating object file containing
DYLD_LIBRARY_PATH
dyn files
Profile an Application with Instrumentation
Dumping Profile Information
Dumping and Resetting Profile Information
dynamic information
Profile-Guided Optimization (PGO)
dumping profile information
files
resetting profile counters
Resetting the Dynamic Profile Counters
Resetting Profile Information
threads
dynamic libraries
dynamiclib
option invoking tool to generate
dynamic linker
dynamic-linker
option specifying an alternate
dynamic shared object
shared
option producing a
dynamic-link libraries (DLLs)
MD
option searching for unresolved references in
dynamic-linking of libraries
Bdynamic
option enabling
ebp register
fomit-frame-pointer, Oy
option determining use in optimizations
Eclipse*
Adding a C Source File
Using Cheat Sheets
Creating a New Project
Excluding Source Files from a Build
Exporting Makefiles
Intel® C/C++ Error Parser
Global Symbols and Visibility Attributes (Linux* and macOS*)
Using Intel® Performance Libraries with Eclipse*
Using Eclipse* (Linux*)
Multi-Version Compiler Support
Running a Project
Specifying Symbol Visibility Explicitly (Linux* and macOS*)
adding a source file
cheat sheets
creating a new project
Eclipse* integration
Excluding Source Files from a Build
Exporting Makefiles
excluding source files from build
exporting makefiles
error parser
excluding source files from build
exporting makefiles
in Eclipse*
global symbols
integration
Using Intel® Performance Libraries with Eclipse*
adding the compiler
building a project
cheat sheets
creating a new project
excluding source files from build
exporting makefiles
global symbols
Intel® C/C++ Error Parser
makefiles
multi-version compiler support
running a project
setting options
visibility declaration attribute
integration overview
Intel® C/C++ Error Parser
projects
Multi-Version Compiler Support
multi-version compiler support
running a project
in Eclipse*
using Intel® Performance Libraries
visibility declaration attribute
Eclipse* integration
Using Eclipse* (Linux*)
Adding the Compiler to Eclipse*
building a project
makefiles
efficiency
efficient
Compiler Directed Inline Expansion of Functions
Inlining Report
inlining
Compiler Directed Inline Expansion of Functions
Inlining Report
efficient data types
EMMS Instruction
The EMMS Instruction: Why You Need It
EMMS Usage Guidelines
about
using
endian data
Intel® Compiler Extension Routines to OpenMP*
Enabling Auto-parallelization
Dumping Profile Information
Loop Constructs
Interval Profile Dumping
OpenMP* Run-time Library Routines
OpenMP* Examples
Profile an Application with Instrumentation
and OpenMP* extension routines
auto-parallelization
dumping profile information
for auto-parallelization
loop constructs
PROF_DUMP_INTERVAL
routines overriding
using OpenMP*
using profile-guided optimization
enhanced debugging information
Zo
option generating
Enter index keyword
svrng_copy_engine
svrng_delete_distribution
svrng_new_mcg31m1_engine/svrng_new_mcg31m1_ex
svrng_new_rand_engine/svrng_new_rand_ex
svrng_new_mt19937_engine/svrng_new_mt19937_ex
svrng_new_rand0_engine/svrng_new_rand0_ex
svrng_set_status
svrng_generate[1|2|4|8|16|32]_[uint|ulong]
svrng_get_status
svrng_leapfrog_engine
svrng_generate[1|2|4|8|16|32]_[int|float|double]
svrng_new_uniform_distribution_[int|float|double]/svrng_update_uniform_distribution_[int|float|double]
svrng_skipahead_engine
svrng_new_mcg59_engine/svrng_new_mcg59_ex
svrng_delete_engine
svrng_new_normal_distribution_[float|double]/svrng_update_normal_distribution_[float|double]
Enter index keyword
Distribution Initialization and Finalization
Error Handling
Usage Model
Data Types and Calling Conventions
Service Routines
Parallel Computation Support
Random Values Generation
enums
environment variables
Supported Environment Variables
I/O proxy control
Linux*
macOS*
run-time
setting
setting with compilervars file
using to control coprocessor's environment
Windows*
environment variables
Managing Libraries
LD_LIBRARY_PATH
environment, controlling coprocessor
environment, controlling coprocessor's
error messages
error parser
examples
Example for aio_cancel Function
Example for aio_error and aio_return Functions
Example for aio_suspend Function
Example for lio_listio Function
aio_cancel()
aio_error()
aio_return()
aio_suspend()
lio_listio()
exception handling
fexceptions
option generating table of
exceptions
fnon-call-exceptions
option allowing trapping instructions to throw
exclude code
Code Coverage Tool
code coverage tool
execution environment routines
execution flow
execution mode
exit(), calling from an oflload region
explicit vector programming
Explicit Vector Programming
array notations
elemental functions
smid
extended control registers
Overview: Intrinsics for Managing Extended Processor States and Registers
Intrinsics for Reading and Writing the Content of Extended Control Registers
managing
reading
writing
extended processor states
Overview: Intrinsics for Managing Extended Processor States and Registers
managing
extract offload target binary
extract offload target executable
false sharing
feature requirements
feature-specific code
ax, Qax
x, Qx
option generating
option generating and optimizing
find device number of coprocessor
find number of coprocessors
fixed_offset
float-to-integer conversion
rcd, Qrcd
option enabling fast
float64 vector intrinsics
Double-precision Floating-point Vector Intrinsics
Intel® Streaming SIMD Extensions 3
floating-point array operation
Floating-point array: Handling
floating-point calculations
fp-model, fp
option controlling semantics of
option enabling consistent results
Floating-point environment
-fp-model compiler option
/fp compiler option
pragma fenv_access
floating-point exceptions
Reducing the Impact of Denormal Exceptions
denormal exceptions
floating-point numbers
Special Values
special values
floating-point numbers
Floating-Point Formats
formats for
floating-point operations
fp-model, fp
fp-port, Qfp-port
option controlling semantics of
option rounding results of
Floating-point Operations
Programming Tradeoffs in Floating-Point Applications
programming tradeoffs
Floating-point Optimizations
-fp-model compiler option
/fp compiler option
floating-point precision
pc, Qpc
prec-div, Qprec-div
prec-sqrt, Qprec-sqrt
mp1, Qprec
option controlling for significand
option improving for divides
option improving for square root
option improving general
floating-point stack
Floating-Point Formats
option checking
floating-point stacks
Checking the Floating-point Stack State
checking
FMA instructions
fma, Qfma
option enabling
forceinline
format function security problems
Wformat-security
option issuing warning for
FP comparison operations
Intrinsics for FP Comparison Operations
_mm_comi_round_sd
_mm_comi_round_ss
_mm[_mask]_cmp_round_sd_mask
_mm[_mask]_cmp_round_ss_mask
_mm[_mask]_cmp_sd_mask
_mm[_mask]_cmp_ss_mask
_mm512[_mask]_cmp_epi64_mask
_mm512[_mask]_cmp_epu64_mask
_mm512[_mask]_cmp_round_pd_mask
_mm512[_mask]_cmp_round_ps_mask
frame pointer
momit-leaf-frame-pointer
option affecting leaf functions
FTZ flag
Function annotations
Function Annotations and the SIMD Directive for Vectorization
__declspec(align)
__declspec(vector)
function entry and exit points
finstrument-functions, Qinstrument-functions
option determining instrumentation of
function expansion
function grouping
prof-func-groups
option enabling or disabling
function grouping optimization
function multiversioning
function order list
function order lists
function ordering optimization
function preemption
function profiling
p
option compiling and linking for
function splitting
fnsplit, Qfnsplit
option enabling or disabling
functions
falign-functions, Qfnalign
option aligning on byte boundary
functions
Declaration in Scope of Function Defined in a Namespace
global
scope of
functions and data objects
qoffload-attribute-target, Qoffload-attribute-target
option flagging with offload attribute
functions, placing on the coprocessor
functions, shared
fused multiply-add instructions
fma, Qfma
option enabling
g++ compiler
gxx-name
option specifying name of
g++* language extensions
gcc C++ run-time libraries
idirafter
X
cxxlib
include file path
option adding a directory to second
option removing standard directories from
option specifying to link to
gcc compiler
gcc-name
option specifying name of
gcc-compatible warning options
gcc* compatibility
gcc* considerations
gcc* interoperability
gcc* language extensions
general compiler directives
Programming with Auto-parallelization
Inline Expansion of Functions
Profile-Guided Optimization (PGO)
Programming Guidelines for Vectorization
PGO API Support
for auto-parallelization
for inlining functions
for profile-guided optimization
for vectorization
profiling information
global function symbols
Bsymbolic-functions
option binding references to shared library definitions
global routines and data objects
qoffload-option, Qoffload-option
option specifying options to be used for a specified target and tool
global symbols
Global Symbols and Visibility Attributes (Linux* and macOS*)
option binding references to shared library definitions
GNU C++ compatibility
gnu utilities
gnu-prefix
option letting you add a prefix to names of
guided auto parallelism
messages overview
options
overview
using
Guided Auto Parallelism
guided auto-parallelism
guide-file-append, Qguide-file-append
guide-file, Qguide-file
option appending output to a file
option sending output to file
guided auto-parallelism messages
GAP Message (Diagnostic ID 30506)
GAP Message (Diagnostic ID 30513)
GAP Message (Diagnostic ID 30515)
GAP Message (Diagnostic ID 30519)
GAP Message (Diagnostic ID 30521)
GAP Message (Diagnostic ID 30522)
GAP Message (Diagnostic ID 30523)
GAP Message (Diagnostic ID 30525)
GAP Message (Diagnostic ID 30526)
GAP Message (Diagnostic ID 30528)
GAP Message (Diagnostic ID 30531)
GAP Message (Diagnostic ID 30532)
GAP Message (Diagnostic ID 30533)
GAP Message (Diagnostic ID 30534)
GAP Message (Diagnostic ID 30535)
GAP Message (Diagnostic ID 30536)
GAP Message (Diagnostic ID 30537)
GAP Message (Diagnostic ID 30538)
GAP Message (Diagnostic ID 30753)
GAP Message (Diagnostic ID 30754)
GAP Message (Diagnostic ID 30755)
GAP Message (Diagnostic ID 30756)
GAP Message (Diagnostic ID 30757)
GAP Message (Diagnostic ID 30758)
GAP Message (Diagnostic ID 30759)
GAP Message (Diagnostic ID 30760)
diagnostic id 30506
diagnostic id 30513
diagnostic id 30515
diagnostic id 30519
diagnostic id 30521
diagnostic id 30522
diagnostic id 30523
diagnostic id 30525
diagnostic id 30526
diagnostic id 30528
diagnostic id 30531
diagnostic id 30532
diagnostic id 30533
diagnostic id 30534
diagnostic id 30535
diagnostic id 30536
diagnostic id 30537
diagnostic id 30538
diagnostic id 30753
diagnostic id 30754 (Linux* only)
diagnostic id 30755
diagnostic id 30756
diagnostic id 30757
diagnostic id 30758
diagnostic id 30759
diagnostic id 30760
half-float conversion
hardware lock elision
Intrinsics for Hardware Lock Elision Operations
Intrinsics for Intel® Transactional Synchronization Extensions (Intel® TSX)
help
Getting Help and Support
using in Microsoft Visual Studio*
heterogeneous application
Appending Archiver Options for Creating Libraries
Appending Linker Options
building
Appending Archiver Options for Creating Libraries
Appending Linker Options
heterogeneous programming
Overview: Heterogeneous Programming
overview
high performance programming
Profile-Guided Optimization (PGO)
applications for
high-level optimizer
HLO
hot patching
hotpatch
option preparing a routine for
hotness threshold
prof-hotness-threshold, Qprof-hotness-threshold
option setting
i/o on the coprocessor
IA-32 architecture based applications
High-Level Optimization (HLO)
HLO
ICV
IEEE Standard for Floating-Point Arithmetic, IEEE 754-2008
IEEE*
Special Values
floating-point values
include files
initiating an offload
inline
inline function expansion
inline-level, Ob
option specifying level of
inlining
Developer Directed Inline Expansion of User Functions
Profile-Guided Optimization (PGO)
Inline Expansion of Functions
Compiler Directed Inline Expansion of Functions
Inlining Report
compiler directed
developer directed
option disabling full and partial
option disabling partial
option forcing
option specifying lower limit for large routines
option specifying maximum size of function for
option specifying maximum times for a routine
option specifying maximum times for compilation unit
option specifying total size routine can grow
option specifying upper limit for small routine
preemption
inlining options
inline-factor, Qinline-factor
option specifying percentage multiplier for
inlining report
input files
input/output
instrumentation
Dumping Profile Information
compilation
execution
feedback compilation
option enabling or disabling for specified functions
program
instrumentation calls
profile-loops-report, Qprofile-loops-report
profile-functions, Qprofile-functions
profile-loops, Qprofile-loops
option generating report for loops
option inserting for function
option inserting for loops
instrumented binaries
Code Coverage Tool
.spi file
instrumented binaries application
Test Prioritization Tool
.spi file
integer comparison operations
integer vector intrinsics
Integer Vector Intrinsic
Intel® Streaming SIMD Extensions 3
integrating Intel® C++ with Microsoft* Visual Studio*
intel_omp_task
intel_omp_taskq
Intel-provided libraries
shared-intel
static-intel
option linking dynamically
option linking statically
Intel's C++ asynchronous I/O template class
Example for Using async_class Template Class
Usage Example
Intel's Memory Allocator Library
Intel's Numeric String Conversion Library
Overview: Intel's Numeric String Conversion Library
libistrconv
Function List
Intel's Numeric String Conversion Library
Intel® 64 architecture based applications
High-Level Optimization (HLO)
HLO
Intel® Cilk™ Plus
__cilkrts_bump_loop_rank
__cilkrts_bump_worker_rank
__cilkrts_end_cilk
__cilkrts_get_nworkers
__cilkrts_get_pedigree
__cilkrts_get_total_workers
__cilkrts_get_worker_number
__cilkrts_init
__cilkrts_set_param
Debugging an Intel® Cilk™ Plus Program
C/C++ Extensions for Array Notations Programming Model
Cache Efficiency and Bandwidth
cilk_for
cilk_spawn
cilk_sync
Advanced Topic: How to Write a New Reducer
Deadlocks
Exception Handling
Intel® Cilk™ Plus Execution Model
False Sharing
Glossary
Granularity
Intel® Cilk Plus™ Keywords
Holders
Using Holders - An Example
Holder Syntax
Lock Contention
Holding a Lock Across a Strand Boundary
Locks Cause Determinancy Races
Mapping Strands to Workers
Memory Allocation Bottlenecks
Optimize the Serial Program
Pedigrees
Performance Considerations for Intel® Cilk™ Plus Programs
Common Performance Pitfalls
Safety, Correctness, and Performance
Using Reducers - More Examples
Reducer Library
Using Reducers - A Simple Example
How Reducers Work
Reducers
Intel® Cilk™ Plus Run Time System API
SIMD-Enabled Functions
SIMD-Enabled Function Pointers
Strands
Summary of Intel® Cilk™ Plus Language Features
Timing Programs and Program Segments
Work and Span
__cilkrts_bump_loop_rank
__cilkrts_bump_worker_rank
__cilkrts_end_cilk
__cilkrts_get_nworkers
__cilkrts_get_pedigree
__cilkrts_get_total_workers
__cilkrts_get_worker_number
__cilkrts_init
__cilkrts_set_param
__cilkrts_set_param
Debugging an Intel® Cilk™ Plus Program
C/C++ extensions for array notations
cache efficiency
cilk_for
cilk_spawn
cilk_sync
custom reducers
deadlocks
environment variable
exception handling
execution model
false sharing
glosssary
granularity
header file
holders
Holders
Using Holders - An Example
Holder Syntax
locks
Lock Contention
Holding a Lock Across a Strand Boundary
Locks Cause Determinancy Races
mapping strands to workers
memory allocation
optimizing serial programs
pedigrees
performance considerations
performance pitfalls
reducer considerations
reducer examples
reducer library
reducers
Using Reducers - A Simple Example
How Reducers Work
Reducers
Run Time System API
SIMD-enabled functions
SIMD-enabled functions pointers
strands
summary of language features
timing for performance
work and span
Intel® Cilk™ Plus
Intel® Cilk™ Plus
introduction
Intel® IPP libraries
ipp-link, Qipp-link
ipp, Qipp
option letting you choose the library to link to
option letting you link to
Intel® linking tools
Intel® MIC Architecture features
qoffload-attribute-target, Qoffload-attribute-target
qoffload-option, Qoffload-option
qoffload, Qoffload
qopenmp-offload, Qopenmp-offload
option flagging with offload attribute
option specifying options to be used for a specified target and tool
option to ignore language constructs for offloading
option to ignore offloading for target pragmas
option to specify mode for offloading
Intel® MKL
mkl, Qmkl
option letting you link to libraries
Intel® TBB libraries
tbb, Qtbb
option letting you link to
Intel® Trace Collector API
tcollect, Qtcollect
option inserting probes to call
Intel® AVX
Intel® AVX Intrinsic
_mm256_stream_si256
_mm256_stream_si256 (VMOVNTDQ)
Intel® AVX-512
Intrinsics for Integer Comparison Operations
Intrinsics for Integer Reduction Operations
comparison operations
reduction operations
Intel® C/C++ Error Parser
Intel® C++
Using the Command Line on Windows*
command-line environment
Intel® C++ Class Libraries
overview
Intel® C++ Compiler command prompt window
Intel® C++ Compiler extension routines
Intel® compatibility libraries for OpenMP*
Intel® extension environment variables
Intel® Hyper-Threading Technology
Enabling Further Loop Parallelization for Multicore Platforms
parallel loops
thread pools
Intel® IEEE 754-2008 Binary Floating-Point Conformance Library
Overview: Intel® IEEE 754-2008 Binary Floating-Point Conformance Library
formatOf general-computational operations
formatOf General-Computational Operations Functions
from_string
to_int64_floor
to_int64_rninta
to_int64_xceil
to_int64_xfloor
to_uint32_int
to_uint32_rninta
to_uint32_xceil
to_uint32_xfloor
add
binary32_to_binary64
binary64_to_binary32
div
fma
from_hexstring
from_int32
from_int64
from_uint32
from_uint64
mul
sqrt
sub
to_hexstring
to_int32_ceil
to_int32_floor
to_int32_int
to_int32_rnint
to_int32_rninta
to_int32_xceil
to_int32_xfloor
to_int32_xint
to_int32_xrnint
to_int32_xrninta
to_int64_ceil
to_int64_int
to_int64_rnint
to_int64_xint
to_int64_xrnint
to_int64_xrninta
to_string
to_uint32_ceil
to_uint32_floor
to_uint32_rnint
to_uint32_xint
to_uint32_xrnint
to_uint32_xrninta
to_uint64_ceil
to_uint64_floor
to_uint64_int
to_uint64_rnint
to_uint64_rninta
to_uint64_xceil
to_uint64_xfloor
to_uint64_xint
to_uint64_xrnint
to_uint64_xrninta
homogeneous general-computational operations
homogeneous general-computational operations
Homogeneous General-Computational Operations Functions
ilogb
maxnum
maxnum_mag
minnum
minnum_mag
next_down
next_up
rem
round_integral_exact
round_integral_nearest_away
round_integral_nearest_even
round_integral_negative
round_integral_positive
round_integral_zero
scalbn
non-computational operations
Function List
isNaN
isNormal
isSubnormal
lowerFlags
restoreFlags
testFlags
testSavedFlags
totalOrderMag
class
defaultMode
getBinaryRoundingDirection
is754version1985
is754version2008
isCanonical
isFinite
isInfinite
isSignaling
isSignMinus
isZero
radix
raiseFlags
restoreModes
saveFlags
setBinaryRoundingDirectionsaveModes
totalOrder
nonhomogeneous general-computational operations
quiet-computational operations
Function List
copy
negate
copysign
signaling-computational operations
Function List
signaling_greater_equal
quiet_equal
quiet_greater
quiet_greater_equal
quiet_greater_unordered
quiet_less
quiet_less_equal
quiet_less_unordered
quiet_not_equal
quiet_not_greater
quiet_not_less
quiet_ordered
quiet_unordered
signaling_equal
signaling_greater
signaling_greater_unordered
signaling_less
signaling_less_ unordered
signaling_less_equal
signaling_not_equal
signaling_not_greater
signaling_not_less
using the library
Intel® Integrated Performance Primitives
Using Intel® Performance Libraries with Microsoft Visual Studio*
Changing the Selected Intel® Performance Libraries
Intel® Math Kernel Library
Using Intel® Performance Libraries with Microsoft Visual Studio*
Changing the Selected Intel® Performance Libraries
Intel® Math Library
Other Considerations
C99 macros
fpclassify
isfinite
isgreater
isgreaterequal
isinf
isless
islessequal
islessgreater
isnan
isnormal
isunordered
signbit
Intel® MIC Architecture
Device-Only Memory Allocation
Allocating Memory for Parts of Arrays
Moving Data from One Variable to Another
Intel® Many Integrated Core Architecture (Intel® MIC Architecture)
allocating dynamic memory using modifiers
allocating memory for parts of arrays
moving data from one variable to another
overview
Intel® Performance Libraries
Using Intel® Performance Libraries with Microsoft Visual Studio*
Changing the Selected Intel® Performance Libraries
Intel® Integrated Performance Primitives (Intel® IPP)
Using Intel® Performance Libraries with Microsoft Visual Studio*
Changing the Selected Intel® Performance Libraries
Intel® Math Kernel Library (Intel® MKL)
Using Intel® Performance Libraries with Microsoft Visual Studio*
Changing the Selected Intel® Performance Libraries
Intel® Threading Building Blocks (Intel® TBB)
Using Intel® Performance Libraries with Microsoft Visual Studio*
Changing the Selected Intel® Performance Libraries
Intel® SSE4 intrinsics
Application Targeted Accelerators Intrinsics
Floating Point Dot Product Intrinsics
application targeted accelerator intrinsics
intrinsics
Intel® Streaming SIMD Extensions
Cacheability Support Intrinsics
Compare Intrinsics
Conversion Intrinsics
Details about Intel® Streaming SIMD Extensions Intrinsics
Integer Intrinsics
Load Intrinsics
Logical Intrinsics
Macro Function for Matrix Transposition
Macro Functions to Read and Write Control Registers
Macro Function for Shuffle Operations
Miscellaneous Intrinsics
Overview: Intel® Streaming SIMD Extensions (Intel® SSE)
Writing Programs with Intel® Streaming SIMD Extensions (Intel® SSE) Intrinsics
Set Intrinsics
Store Intrinsics
cacheability support operations
compare operations
conversion operations
data types
integer operations
load operations
logical operations
macro functions
Macro Function for Matrix Transposition
Macro Functions to Read and Write Control Registers
Macro Function for Shuffle Operations
matrix transposition
shuffle function
Macro Functions to Read and Write Control Registers
Macro Function for Shuffle Operations
miscellaneous operations
overview
programming with Intel® SSE intrinsics
registers
set operations
store operations
Intel® Streaming SIMD Extensions (Intel® SSE)
Intel® Streaming SIMD Extensions 2
Cacheability Support Intrinsics
Casting Support Intrinsics
Arithmetic Intrinsics
Compare Intrinsics
Conversion Intrinsics
Load Intrinsics
Logical Intrinsics
Set Intrinsics
Store Intrinsics
Arithmetic Intrinsics
Compare Intrinsics
Conversion Intrinsics
Load Intrinsics
Logical Intrinsics
Move Intrinsics
Set Intrinsics
Shift Intrinsics
Store Intrinsics
Miscellaneous Intrinsics
Overview: Intel® Streaming SIMD Extensions 2 (Intel® SSE2)
Pause Intrinsic
Macro Function for Shuffle
cacheability support intrinsics
casting support intrinsics
FP arithmetic intrinsics
FP compare intrinsics
FP conversion intrinsics
FP load intrinsics
FP logical intrinsics
FP set intrinsics
FP store intrinsics
integer arithmetic intrinsics
integer compare intrinsics
integer conversion intrinsics
integer load intrinsics
integer logical intrinsics
integer move intrinsics
integer set intrinsics
integer shift intrinsics
integer store intrinsics
miscellaneous intrinsics
overview
pause intrinsic
shuffle macro
Intel® Streaming SIMD Extensions 2
Macro Functions
macro functions
Intel® Streaming SIMD Extensions 3
Overview: Intel® Streaming SIMD Extensions 3 (Intel® SSE3)
overview
Intel® Streaming SIMD Extensions 3
Macro Functions
macro functions
Intel® Streaming SIMD Extensions 4
Application Targeted Accelerators Intrinsics
Cacheability Support Intrinsic
Floating Point Rounding Intrinsics
Floating Point Dot Product Intrinsics
Packed Blending Intrinsics
Packed Compare for Equal Intrinsic
Packed Compare Intrinsics
Packed DWORD to Unsigned WORD Intrinsic
Packed Format Conversion Intrinsics
Packed Integer Min/Max Intrinsics
Register Insertion/Extraction Intrinsics
Test Intrinsics
DWORD Multiply Intrinsics
application targeted accelerator intrinsics
cacheability support intrinsic
floating-point rounding intrinsics
FP dot product intrinsics
packed blending intrinsics
packed compare for equal intrinsic
packed compare intrinsics
packed DWORD to unsigned WORD intrinsic
packed format conversion intrinsics
packed integer min/max intrinsics
register insertion/extraction intrinsics
test intrinsics
Test Intrinsics
DWORD Multiply Intrinsics
Intel® Streaming SIMD Extensions4
Overview: Intel® Streaming SIMD Extensions 4 (Intel® SSE4)
overview
Intel® Threading Building Blocks
Using Intel® Performance Libraries with Microsoft Visual Studio*
Changing the Selected Intel® Performance Libraries
intermediate files
save-temps, Qsave-temps
option saving during compilation
intermediate representation (IR)
Using IPO
Interprocedural Optimization (IPO)
internal compiler limits
qoverride-limits, Qoverride-limits
option overriding certain
interoperability
GCC* Compatibility and Interoperability
with g++*
with gcc*
interprocedural optimizations
Profile-Guided Optimization (PGO)
Compiler Directed Inline Expansion of Functions
Inlining Report
capturing intermediate output
code layout
compilation
compiling
considerations
creating libraries
initiating
issues
large programs
linking
Using IPO
Interprocedural Optimization (IPO)
option enabling additional
option enabling between files
option enabling for single file compilation
overview
performance
using
whole program analysis
xiar
xild
xilibtool
into
intrinsics
Intrinsics Returning Vectors of Undefined Values
_rdrand16_step(), _rdrand32_step(), _rdrand64_step()
Intrinsics that Allow Reading from and Writing to the FS Base and GS Base Registers
Intrinsics for Converting Half Floats that Map to 3rd Generation Intel® Core™ Processor Instructions
_mm_cvtph_ps()
_mm_cvtps_ph()
_mm256_cvtph_ps()
_mm256_cvtps_ph()
Overview: Intrinsics for 3rd Generation Intel® Core™ Processor Instruction Extensions
Intrinsics that Generate Random Numbers of 16/32/64 Bit Wide Random Integers
_addcarry_u32(), _addcarry_u64()
_addcarryx_u32(), _addcarryx_u64()
_subborrow_u32(), _subborrow_u64()
Intrinsics for Multi-Precision Arithmetic
Overview: Intrinsics for 4th Generation Intel® Core™ Processor Instruction Extensions
Intrinsics
Intrinsics for Carry-less Multiplication Instruction and Advanced Encryption Standard Instructions
Overview: Intrinsics for Carry-less Multiplication Instruction and Advanced Encryption Standard Instructions
String and Block Copy Intrinsics
Floating-point Intrinsics
Integer Arithmetic Intrinsics
Miscellaneous Intrinsics
Overview: Intrinsics across Intel® Architectures
Alignment Support
Overview: Data Alignment, Memory Allocation Intrinsics, and Inline Assembly
Details about Intrinsics
Intrinsics for Saving and Restoring the Extended Processor States
Intrinsics for Reading and Writing the Content of Extended Control Registers
Intrinsics for Managing Extended Processor States and Registers
Intrinsics for Converting Half Floats
Overview: Intrinsics to Convert Half Float Types
Inline Assembly
Intrinsics for Intel® Advanced Vector Extensions
Intrinsics for Intel® Advanced Vector Extensions 2
_mm256_hadd_ps
_mm256_addsub_pd
_mm256_addsub_ps
_mm256_div_pd
_mm256_div_ps
_mm256_dp_ps
_mm256_hadd_pd
_mm256_hsub_pd
_mm256_hsub_ps
_mm256_mul_pd
_mm256_mul_ps
_mm256_rcp_ps
_mm256_rsqrt_ps
_mm256_sqrt_pd
_mm256_sqrt_ps
Intrinsics for Arithmetic Operations
Intrinsics for Bitwise Operations
_mm256_and_pd
_mm256_and_ps
_mm256_andnot_pd
_mm256_andnot_ps
_mm256_or_pd
_mm256_or_ps
_mm256_xor_pd
_mm256_xor_ps
Intrinsics for Blend and Conditional Merge Operations
Intrinsics for Compare Operations
Intrinsics for Conversion Operations
Intrinsics for Load and Store Operations
Intrinsics to Determine Minimum and Maximum Values
Intrinsics for Miscellaneous Operations
_mm256_undefined_pd()
_mm256_undefined_ps()
_mm256_undefined_si256
_mm256_max_pd
_mm256_max_ps
_mm256_min_pd
_mm256_min_ps
Overview: Intrinsics for Intel® Advanced Vector Extensions Instructions
Details of Intel® Advanced Vector Extensions Intrinsics
Intrinsics for Packed Test Operations
Intrinsics for Permute Operations
Intrinsics for Shuffle Operations
Intrinsics for Unpack and Interleave Operations
Intrinsics Generating Vectors of Undefined Values
Support Intrinsics for Vector Typecasting Operations
Intrinsics for Arithmetic Operations
Intrinsics for Arithmetic Shift Operations
Intrinsics for Operations to Manipulate Integer Data at Bit-Granularity
Intrinsics for Bitwise Operations
Intrinsics for Blend Operations
Intrinsics for Broadcast Operations
Intrinsics for Compare Operations
Intrinsics for Fused Multiply Add Operations
Intrinsics for GATHER Operations
Intrinsics for Insert/Extract Operations
Intrinsics for Masked Load/Store Operations
Intrinsics for Logical Shift Operations
_mm_maskload_epi32/64, _mm256_maskload_epi32/64
_mm_maskstore_epi32/64, _mm256_maskstore_epi32/64
Intrinsics for Miscellaneous Operations
Intrinsics for Pack/Unpack Operations
Intrinsics for Packed Move with Extend Operations
Intrinsics for Permute Operations
Intrinsics for Shuffle Operations
Intrinsics for Intel® Transactional Synchronization Extensions (Intel® TSX)
Arithmetic Intrinsics
Cacheability Support Intrinsics
Compare Intrinsics
Conversion Intrinsics
Details about Intel® Streaming SIMD Extensions Intrinsics
Integer Intrinsics
Load Intrinsics
Logical Intrinsics
Macro Function for Matrix Transposition
Macro Functions to Read and Write Control Registers
Macro Function for Shuffle Operations
Miscellaneous Intrinsics
Overview: Intel® Streaming SIMD Extensions (Intel® SSE)
Writing Programs with Intel® Streaming SIMD Extensions (Intel® SSE) Intrinsics
Intrinsics to Read and Write Registers
Set Intrinsics
Store Intrinsics
Cacheability Support Intrinsics
Casting Support Intrinsics
Arithmetic Intrinsics
Compare Intrinsics
Conversion Intrinsics
Load Intrinsics
Logical Intrinsics
Set Intrinsics
Store Intrinsics
Arithmetic Intrinsics
Compare Intrinsics
Conversion Intrinsics
Load Intrinsics
Logical Intrinsics
Move Intrinsics
Set Intrinsics
Shift Intrinsics
Store Intrinsics
Miscellaneous Intrinsics
Overview: Intel® Streaming SIMD Extensions 2 (Intel® SSE2)
Pause Intrinsic
Macro Function for Shuffle
Single-precision Floating-point Vector Intrinsics
Double-precision Floating-point Vector Intrinsics
Integer Vector Intrinsic
Miscellaneous Intrinsics
Overview: Intel® Streaming SIMD Extensions 3 (Intel® SSE3)
Application Targeted Accelerators Intrinsics
Cacheability Support Intrinsic
DWORD Multiply Intrinsics
Floating Point Rounding Intrinsics
Floating Point Dot Product Intrinsics
Overview: Intel® Streaming SIMD Extensions 4 (Intel® SSE4)
Packed Blending Intrinsics
Packed Compare for Equal Intrinsic
Packed Compare Intrinsics
Packed DWORD to Unsigned WORD Intrinsic
Packed Format Conversion Intrinsics
Packed Integer Min/Max Intrinsics
Register Insertion/Extraction Intrinsics
Test Intrinsics
Intrinsics for Later Generation Intel® Core™ Processor Instruction Extensions
Allocating and Freeing Aligned Memory Blocks
Details about MMX™ Technology Intrinsics
Compare Intrinsics (MMX™ technology)
The EMMS Instruction: Why You Need It
EMMS Usage Guidelines
General Support Intrinsics (MMX™ technology)
Logical Intrinsics (MMX™ technology)
Overview: Intrinsics for MMX™ Technology
Packed Arithmetic Intrinsics (MMX™ technology)
Set Intrinsics (MMX™ technology)
Shift Intrinsics (MMX™ technology)
Naming and Usage Syntax
References
Absolute Value Intrinsics
Addition Intrinsics
Concatenate Intrinsics
Multiplication Intrinsics
Negation Intrinsics
Overview: Supplemental Streaming SIMD Extensions 3 (SSSE3)
Shuffle Intrinsics
Subtraction Intrinsics
_mm_cexp_ps, _mm256_cexp_ps
_mm_clog_ps, _mm256_clog_ps
_mm_csqrt_ps, _mm256_csqrt_ps
_mm_cdfnorminv_pd, _mm256_cdfnorminv_pd
_mm_cdfnorminv_ps, _mm256_cdfnorminv_ps
_mm_erf_pd, _mm256_erf_pd
_mm_erf_ps, _mm256_erf_ps
_mm_erfc_pd, _mm256_erfc_pd
_mm_erfc_ps, _mm256_erfc_ps
_mm_erfinv_pd, _mm256_erfinv_pd
_mm_erfinv_ps, _mm256_erfinv_ps
_mm_exp2_pd, _mm256_exp2_pd
_mm_exp2_ps, _mm256_exp2_ps
_mm_hypot_ps, _mm256_hypot_ps
_mm_exp_pd, _mm256_exp_pd
_mm_exp_ps, _mm256_exp_ps
_mm_exp10_pd, _mm256_exp10_pd
_mm_exp10_ps, _mm256_exp10_ps
_mm_expm1_pd, _mm256_expm1_pd
_mm_expm1_ps, _mm256_expm1_ps
_mm_hypot_pd, _mm256_hypot_pd
_mm_pow_pd, _mm256_pow_pd
_mm_pow_ps, _mm256_pow_ps
_mm_log_pd, _mm256_log_pd
_mm_log_ps, _mm256_log_ps
_mm_log10_pd, _mm256_log10_pd
_mm_log10_ps, _mm256_log10_ps
_mm_log1p_pd, _mm256_log1p_pd
_mm_log1p_ps, _mm256_log1p_ps
_mm_log2_pd, _mm256_log2_pd
_mm_log2_ps, _mm256_log2_ps
_mm_logb_pd, _mm256_logb_pd
_mm_logb_ps, _mm256_logb_ps
Overview: Intrinsics for Short Vector Math Library (SVML) Functions
_mm_sqrt_ps, _mm256_sqrt_ps
_mm_cbrt_pd, _mm256_cbrt_pd
_mm_cbrt_ps, _mm256_cbrt_ps
_mm_invcbrt_pd, _mm256_invcbrt_pd
_mm_invcbrt_ps, _mm256_invcbrt_ps
_mm_invsqrt_pd, _mm256_invsqrt_pd
_mm_invsqrt_ps, _mm256_invsqrt_ps
_mm_sqrt_pd, _mm256_sqrt_pd
_mm_sinh_ps, _mm256_sinh_ps
_mm_acos_pd, _mm256_acos_pd
_mm_acos_ps, _mm256_acos_ps
_mm_acosh_pd, _mm256_acosh_pd
_mm_acosh_ps, _mm256_acosh_ps
_mm_asin_pd, _mm256_asin_pd
_mm_asin_ps, _mm256_asin_ps
_mm_asinh_pd, _mm256_asinh_pd
_mm_asinh_ps, _mm256_asinh_ps
_mm_atan_pd, _mm256_atan_pd
_mm_atan_ps, _mm256_atan_ps
_mm_atan2_pd, _mm256_atan2_pd
_mm_atan2_ps, _mm256_atan2_ps
_mm_atanh_pd, _mm256_atanh_pd
_mm_atanh_ps, _mm256_atanh_ps
_mm_cos_pd, _mm256_cos_pd
_mm_cos_ps, _mm256_cos_ps
_mm_cosd_pd, _mm256_cosd_pd
_mm_cosd_ps, _mm256_cosd_ps
_mm_cosh_pd, _mm256_cosh_pd
_mm_cosh_ps, _mm256_cosh_ps
_mm_sin_pd, _mm256_sin_pd
_mm_sin_ps, _mm256_sin_ps
_mm_sincos_pd, _mm256_sincos_pd
_mm_sincos_ps, _mm256_sincos_ps
_mm_sind_pd, _mm256_sind_pd
_mm_sind_ps, _mm256_sind_ps
_mm_sinh_pd, _mm256_sinh_pd
_mm_tan_pd, _mm256_tan_pd
_mm_tan_ps, _mm256_tan_ps
_mm_tand_pd, _mm256_tand_pd
_mm_tand_ps, _mm256_tand_ps
_mm_tanh_pd, _mm256_tanh_pd
_mm_tanh_ps, _mm256_tanh_ps
Intel® SSE2
Intrinsics Returning Vectors of Undefined Values
intrinsics returning vectors of undefined values
Intrinsics Returning Vectors of Undefined Values
_mm_undefined_pd()
_mm_undefined_si128()
3rd Generation Intel® Core™ Processor Instruction Extensions
_rdrand16_step(), _rdrand32_step(), _rdrand64_step()
Intrinsics that Allow Reading from and Writing to the FS Base and GS Base Registers
Intrinsics for Converting Half Floats that Map to 3rd Generation Intel® Core™ Processor Instructions
_mm_cvtph_ps()
_mm_cvtps_ph()
_mm256_cvtph_ps()
_mm256_cvtps_ph()
Overview: Intrinsics for 3rd Generation Intel® Core™ Processor Instruction Extensions
Intrinsics that Generate Random Numbers of 16/32/64 Bit Wide Random Integers
_rdrand16_step()
_rdrand32_step()
_rdrand64_step()
base registers
Intrinsics that Allow Reading from and Writing to the FS Base and GS Base Registers
_readfsbase_u32()
_readfsbase_u64()
_readgsbase_u32()
_readgsbase_u64()
_writefsbase_u32()
_writefsbase_u64()
_writegsbase_u32()
_writegsbase_u64()
half-float
Intrinsics for Converting Half Floats that Map to 3rd Generation Intel® Core™ Processor Instructions
_mm_cvtph_ps()
_mm_cvtps_ph()
_mm256_cvtph_ps()
_mm256_cvtps_ph()
conversion
Intrinsics for Converting Half Floats that Map to 3rd Generation Intel® Core™ Processor Instructions
_mm_cvtph_ps()
_mm_cvtps_ph()
_mm256_cvtph_ps()
_mm256_cvtps_ph()
_mm_cvtph_ps()
Intrinsics for Converting Half Floats that Map to 3rd Generation Intel® Core™ Processor Instructions
_mm_cvtph_ps()
_mm_cvtps_ph()
_mm_cvtps_ph())
_mm256_cvtph_ps()
Intrinsics for Converting Half Floats that Map to 3rd Generation Intel® Core™ Processor Instructions
_mm256_cvtph_ps()
_mm256_cvtps_ph()
_mm256_cvtps_ph()
Intrinsics for Converting Half Floats that Map to 3rd Generation Intel® Core™ Processor Instructions
overview
random number generation (RDRAND)
4th Generation Intel® Core™ Processor Instruction Extensions
_addcarry_u32(), _addcarry_u64()
_addcarryx_u32(), _addcarryx_u64()
_subborrow_u32(), _subborrow_u64()
Intrinsics for Multi-Precision Arithmetic
Overview: Intrinsics for 4th Generation Intel® Core™ Processor Instruction Extensions
Intrinsics that Generate Random Numbers of 16/32/64 Bit Wide Random Integers
_addcarry_u32()
_addcarry_u64()
_addcarryx_u32()
_addcarryx_u64()
_subborrow_u32()
_subborrow_u64()
Multi-Precision Arithmetic
overview
random number generation (RDSEED)
about
Advanced Encryption Standard (AES) Implementation
Intrinsics for Carry-less Multiplication Instruction and Advanced Encryption Standard Instructions
Overview: Intrinsics for Carry-less Multiplication Instruction and Advanced Encryption Standard Instructions
_mm_aesdec_si128
_mm_aesdeclast_si128
_mm_aesenc_si128
_mm_aesenclast_si128
_mm_aesimc_si128
_mm_aeskeygenassist_si128
overview
All Intel Architectures
String and Block Copy Intrinsics
string and block copy operations
All Intel® Architectures
Floating-point Intrinsics
Integer Arithmetic Intrinsics
Miscellaneous Intrinsics
Overview: Intrinsics across Intel® Architectures
floating point operations
integer arithmetic operations
miscellaneous operations
Miscellaneous Intrinsics
_BitScanForward
_BitScanReverse
_bittest
_bittestandreset
_bittestandset
-bittestandcomplement
overview
carry-less multiplication instruction
Overview: Intrinsics for Carry-less Multiplication Instruction and Advanced Encryption Standard Instructions
_mm_clmulepi64_si128
data alignment
Alignment Support
Overview: Data Alignment, Memory Allocation Intrinsics, and Inline Assembly
data types
extended processor states
Intrinsics for Saving and Restoring the Extended Processor States
restoring
saving
for managing extended processor states and registers
Intrinsics for Reading and Writing the Content of Extended Control Registers
Intrinsics for Managing Extended Processor States and Registers
_fxrstor()
_fxrstor64()
_fxsave()
_fxsave64()
_xgetbv()
_xrstor()
_xrstor64()
_xrstors()
_xrstors64()
_xsave()
_xsave64()
_xsavec()
_xsavec64()
_xsaveopt()
_xsaveopt64()
_xsaves()
_xsaves64()
_xsetbv()
restoring extended processor states
saving extended processor states
half-float conversion
Intrinsics for Converting Half Floats
Overview: Intrinsics to Convert Half Float Types
_cvtsh_ss
_cvtss_sh
_mm_cvtph_ps
_mm_cvtps_ph
overview
inline assembly
Inline Assembly
Overview: Data Alignment, Memory Allocation Intrinsics, and Inline Assembly
Intel® Advanced Vector Extensions (AVX)
Intel® Advanced Vector Extensions 2 (Intel® AVX2)
Intrinsics for Intel® Advanced Vector Extensions 2
overview
Intel® AVX
_mm256_hadd_ps
_mm256_addsub_pd
_mm256_addsub_ps
_mm256_div_pd
_mm256_div_ps
_mm256_dp_ps
_mm256_hadd_pd
_mm256_hsub_pd
_mm256_hsub_ps
_mm256_mul_pd
_mm256_mul_ps
_mm256_rcp_ps
_mm256_rsqrt_ps
_mm256_sqrt_pd
_mm256_sqrt_ps
Intrinsics for Arithmetic Operations
Intrinsics for Bitwise Operations
_mm256_and_pd
_mm256_and_ps
_mm256_andnot_pd
_mm256_andnot_ps
_mm256_or_pd
_mm256_or_ps
_mm256_xor_pd
_mm256_xor_ps
Intrinsics for Blend and Conditional Merge Operations
Intrinsics for Compare Operations
Intrinsics for Conversion Operations
Intrinsics for Load and Store Operations
Intrinsics to Determine Minimum and Maximum Values
Intrinsics for Miscellaneous Operations
_mm256_undefined_pd()
_mm256_undefined_ps()
_mm256_undefined_si256
_mm256_max_pd
_mm256_max_ps
_mm256_min_pd
_mm256_min_ps
Overview: Intrinsics for Intel® Advanced Vector Extensions Instructions
Details of Intel® Advanced Vector Extensions Intrinsics
Intrinsics for Packed Test Operations
Intrinsics for Permute Operations
Intrinsics for Shuffle Operations
Intrinsics for Unpack and Interleave Operations
Intrinsics Generating Vectors of Undefined Values
Support Intrinsics for Vector Typecasting Operations
arithmetic intrinsics
_mm256_hadd_ps
_mm256_addsub_pd
_mm256_addsub_ps
_mm256_div_pd
_mm256_div_ps
_mm256_dp_ps
_mm256_hadd_pd
_mm256_hsub_pd
_mm256_hsub_ps
_mm256_mul_pd
_mm256_mul_ps
_mm256_rcp_ps
_mm256_rsqrt_ps
_mm256_sqrt_pd
_mm256_sqrt_ps
_mm256_hadd_ps
_mm256_addsub_pd (VADDSUBPD)
_mm256_addsub_ps (VADDSUBPS)
_mm256_div_pd (VDIVPD)
_mm256_div_ps (VDIVPS)
_mm256_dp_ps (VDPPS)
_mm256_hadd_pd (VHADDPD)
_mm256_hsub_pd (VHSUBPD)
_mm256_hsub_ps (VHSUBPS)
_mm256_mul_pd (VMULPD)
_mm256_mul_ps (VMULPS)
_mm256_rcp_pd (VRCPPS)
_mm256_rsqrt_ps (VRSQRTPS)
_mm256_sqrt_pd (VSQRTPD)
_mm256_sqrt_ps (VSQRTPS)
arithmetic operations
Intrinsics for Arithmetic Operations
_mm256_add_pd (VADDPD)
_mm256_add_ps (VADDPS)
_mm256_sub_pd (VSUBPD)
_mm256_sub_ps (VSUBPS)
bitwise logical operations
bitwise operations
_mm256_and_pd
_mm256_and_ps
_mm256_andnot_pd
_mm256_andnot_ps
_mm256_or_pd
_mm256_or_ps
_mm256_xor_pd
_mm256_xor_ps
__mm256_and_pd (VANDPD)
_mm256_and_ps (VANDPS)
_mm256_andnot_pd (VANDNPD)
_mm256_andnot_ps (VANDNPS)
_mm256_or_pd (VORPD)
_mm256_or_ps (VORPS)
_mm256_xor_pd (VXORPD)
_mm256_xor_ps (VXORPS)
blend and conditional merge operations
Intrinsics for Blend and Conditional Merge Operations
_mm256_blendv_ps (VBLENDVPS)
_mm256_blend_pd (VBLENDPD)
_mm256_blend_ps (VBLENDPS)
_mm256_blendv_pd (VBLENDVPD)
compare operations
Intrinsics for Compare Operations
_m256_cmp_pd (VCMPPD)
_mm_cmp_pd (VCMPPD)
_mm_cmp_ps (VCMPPS)
_mm_cmp_sd (VCMPSD)
_mm_cmp_ss (VCMPSS)
_mm256_cmp_ps (VCMPPS)
conversion operations
Intrinsics for Conversion Operations
_mm256_cvtepi32_pd (VCVTDQ2PD)
_mm256_cvtepi32_ps (VCVTDQ2PS)
_mm256_cvtpd_epi32 (VCVTPD2DQ)
_mm256_cvtpd_ps (VCVTPD2PS)
_mm256_cvtps_epi32 (VCVTPS2DQ)
_mm256_cvtps_pd (VCVTPS2PD)
_mm256_cvtsd_f64 (vmovsd)
_mm256_cvtss_f32 (vmovss)
_mm256_cvttpd_epi32 (VCVTTPD2DQ)
_mm256_cvttps_epi32 (VCVTTPS2DQ)
_mm256_cvtsi256_si32
_mm256_cvttps_epi32
load operations
Intrinsics for Load and Store Operations
_mm_broadcast_ss (VBROADCASTSS)
_mm_maskload_pd (VMASKMOVPD)
_mm_maskload_ps (VMASKMOVPS)
_mm_maskstore_pd (VMASKMOVPD)
_mm_maskstore_ps (VMASKMOVPS)
_mm256_add_ps (VMASKMOVPS)
_mm256_broadcast_pd (VBROADCASTF128)
_mm256_broadcast_ps (VBROADCASTF128)
_mm256_broadcast_sd (VBROADCASTSD)
_mm256_broadcast_ss (VBROADCASTSS)
_mm256_load_pd (VMOVAPD)
_mm256_load_ps (VMOVAPS)
_mm256_load_si256 (VMOVDQA)
_mm256_loadu_pd (VMOVUPD)
_mm256_loadu_ps (VMOVUPS)
_mm256_loadu_si256 (VMOVDQU)
_mm256_maskload_pd (VMASKMOVPD)
_mm256_maskstore_pd (VMASKMOVPD)
_mm256_maskstore_ps (VMASKMOVPS)
_mm256_store_pd (VMOVAPD)
_mm256_store_ps (VMOVAPS)
_mm256_store_si256 (VMOVDQA)
_mm256_storeu_pd (VMOVUPD)
_mm256_storeu_ps (VMOVUPS)
_mm256_storeu_si256 (VMOVDQU)
_mm256_stream_pd (VMOVNTPD)
_mm256_stream_ps (VMOVNTPS)
minimum and maximum operations
miscellaneous operations
Intrinsics for Miscellaneous Operations
_mm256_extractf128_pd (VEXTRACTF128)
_mm256_extractf128_ps (VEXTRACTF128)
_mm256_extractf128_si256 (VEXTRACTF128)
_mm256_insertf128_pd (VINSERTF128)
_mm256_insertf128_ps (VINSERTF128)
_mm256_insertf128_si256 (VINSERTF128)
_mm256_lddqu_si256 (VLDDQU)
_mm256_movedup_pd (VMOVDDUP)
_mm256_movehdup_ps (VMOVSHDUP)
_mm256_moveldup_ps (VMOVSLDUP)
_mm256_movemask_pd (VMOVMSKPD)
_mm256_movemask_ps (VMOVMSKPS)
_mm256_round_pd (VROUNDPD)
_mm256_round_ps (VROUNDPS)
_mm256_set_epi16
_mm256_set_epi32
_mm256_set_epi64x
_mm256_set_epi8
_mm256_set_pd
_mm256_set_ps
_mm256_set1_epi16
_mm256_set1_epi32
_mm256_set1_epi64x
_mm256_set1_epi8
_mm256_set1_pd
_mm256_set1_ps
_mm256_setr_epi16
_mm256_setr_epi32
_mm256_setr_epi64x
_mm256_setr_epi8
_mm256_setr_pd
_mm256_setr_ps
_mm256_setzero_pd
_mm256_setzero_ps
_mm256_setzero_si256
_mm256_zeroall (VZEROALL)
_mm256_zeroupper (VZEROUPPER)
operations returning vectors of undefined values
_mm256_undefined_pd()
_mm256_undefined_ps()
_mm256_undefined_si256
_mm256_undefined_pd()
_mm256_undefined_ps()
_mm256_undefined_si128
operations to determine maximum value
_mm256_max_pd
_mm256_max_ps
_mm256_max_pd (VMAXPD)
_mm256_max_ps (VMAXPS)
operations to determine minimum value
_mm256_min_pd
_mm256_min_ps
_mm256_min_pd (VMINPD)
_mm256_min_ps (VMINPS)
overview
Overview: Intrinsics for Intel® Advanced Vector Extensions Instructions
Details of Intel® Advanced Vector Extensions Intrinsics
packed test operations
Intrinsics for Packed Test Operations
_mm_testc_pd (VTESTPD)
_mm_testc_ps (VTESTPS)
_mm_testnzc_pd (VTESTPD)
_mm_testnzc_ps (VTESTPS)
_mm_testz_pd (VTESTPD)
_mm_testz_ps (VTESTPS)
_mm256_testc_pd (VTESTPD)
_mm256_testc_ps (VTESTPS)
_mm256_testc_si256 (VPTEST)
_mm256_testnzc_pd (VTESTPD)
_mm256_testnzc_ps (VTESTPS)
_mm256_testnzc_si256 (VPTEST)
_mm256_testz_pd (VTESTPD)
_mm256_testz_ps (VTESTPS)
_mm256_testz_si256 (VPTEST)
permute operations
Intrinsics for Permute Operations
_mm_permute_pd (VPERMILPD)
_mm_permute_ps (VPERMILPS)
_mm_permutevar_pd (VPERMILPD)
_mm_permutevar_ps (VPERMILPS)
_mm256_permute_pd (VPERMILPD)
_mm256_permute_ps (VPERMILPS)
_mm256_permute2f128_pd (VPERM2F128)
_mm256_permute2f128_ps (VPERM2F128)
_mm256_permute2f128_si256 (VPERM2F128)
_mm256_permutevar_pd (VPERMILPD)
_mm256_permutevar_ps (VPERMILPS)
shuffle operations
Intrinsics for Shuffle Operations
_mm256_shuffle_pd (VSHUFPD)
_mm256_shuffle_ps (VSHUFPS)
unpack and interleave operations
Intrinsics for Unpack and Interleave Operations
_mm256_unpackhi_pd (VUNPCKHPD)
_mm256_unpackhi_ps (VUNPCKHPS)
_mm256_unpacklo_pd (VUNPCKLPD)
_mm256_unpacklo_ps (VUNPCKLPS)
vector generation operations
vector typecasting operations
Support Intrinsics for Vector Typecasting Operations
_mm256_castpd_ps
_mm256_castpd_si256
_mm256_castpd128_pd256
_mm256_castpd256_pd128
_mm256_castps_pd
_mm256_castps_si256
_mm256_castps128_ps256
_mm256_castps256_ps128
_mm256_castsi128_si256
_mm256_castsi256_pd
_mm256_castsi256_ps
_mm256_castsi256_si128
Intel® AVX2
Intrinsics for Arithmetic Operations
Intrinsics for Arithmetic Shift Operations
Intrinsics for Operations to Manipulate Integer Data at Bit-Granularity
Intrinsics for Bitwise Operations
Intrinsics for Blend Operations
Intrinsics for Broadcast Operations
Intrinsics for Compare Operations
Intrinsics for Fused Multiply Add Operations
Intrinsics for GATHER Operations
Intrinsics for Insert/Extract Operations
Intrinsics for Masked Load/Store Operations
Intrinsics for Logical Shift Operations
_mm_maskload_epi32/64, _mm256_maskload_epi32/64
_mm_maskstore_epi32/64, _mm256_maskstore_epi32/64
Intrinsics for Miscellaneous Operations
Intrinsics for Pack/Unpack Operations
Intrinsics for Packed Move with Extend Operations
Intrinsics for Permute Operations
Intrinsics for Shuffle Operations
Intrinsics for Intel® Transactional Synchronization Extensions (Intel® TSX)
arithmetic operations
Intrinsics for Arithmetic Operations
_mm256_abs_epi16 (VPABSW)
_mm256_abs_epi32 (VPABSD)
_mm256_abs_epi8 (VPABSB)
_mm256_add_epi16 (VPADDW)
_mm256_add_epi32 (VPADDD)
_mm256_add_epi64 (VPADDQ)
_mm256_add_epi8 (VPADDB)
_mm256_adds_epi16 (VPADDSW)
_mm256_adds_epi8 (VPADDSB)
_mm256_adds_epu16 (VPADDUSW)
_mm256_adds_epu8 (VPADDUSB)
_mm256_avg_epu16 (VPAVGW)
_mm256_avg_epu8 (VPAVGB)
_mm256_hadd_epi16 (VPHADDW)
_mm256_hadd_epi32 (VPHADDD)
_mm256_hadds_epi16 (VPHADDSW)
_mm256_hsub_epi16 (VPHSUBW)
_mm256_hsub_epi32 (VPHSUBD)
_mm256_hsubs_epi16 (VPHSUBSW)
_mm256_madd_epi16 (VPMADDW)
_mm256_maddubs_epi16 (VPMADDUBSW)
_mm256_mpsadbw_epu8 (VMPSADBW)
_mm256_mul_epi32 (VPMULDQ)
_mm256_mul_epu32 (VPMULUDQ)
_mm256_mulhi_epi16 (VPMULHW)
_mm256_mulhi_epu16 (VPMULHUW)
_mm256_mulhrs_epi16 (VPMULHRSW)
_mm256_mullo_epi16 (VPMULLW)
_mm256_mullo_epi32 (VPMULLD)
_mm256_sad_epu8 (VPSADBW)
_mm256_sign_epi16 (VPSIGNW)
_mm256_sign_epi32 (VPSIGND)
_mm256_sign_epi8 (VPSIGNB)
_mm256_sub_epi16 (VPSUBW)
_mm256_sub_epi32 (VPSUBD)
_mm256_sub_epi64 (VPSUBQ)
_mm256_sub_epi8 (VPSUBB)
_mm256_subs_epi16 (VPSUBSW)
_mm256_subs_epi8 (VPSUBSB)
_mm256_subs_epu16 (VPSUBUSW)
_mm256_subs_epu8 (VPSUBUSB)
arithmetic shift operations
Intrinsics for Arithmetic Shift Operations
_mm_srav_epi32 (VPSRAVD)
_mm256_sra_epi16 (VPSRAW)
_mm256_sra_epi32 (VPSRAD)
_mm256_srai_epi16 (VPSRAW)
_mm256_srai_epi32 (VPSRAD)
_mm256_srav_epi32 (VPSRAVD)
bit manipulation operations
Intrinsics for Operations to Manipulate Integer Data at Bit-Granularity
_bextr_u32 (BEXTR)
_bextr_u64 (BEXTR)
_blsi_u32 (BLSI)
_blsi_u64 (BLSI)
_blsmsk_u32 (BLSMSK)
_blsmsk_u64 (BLSMSK)
_blsr_u64 (BLSR)
_bslr_u32 (BLSR)
_lzcnt_u32 (LZCNT)
_lzcnt_u32/64
_bzhi_u32/64
_lzcnt_u64 (LZCNT)
_lzcnt_u32/64
_bzhi_u32/64
_pdep_u32 (PDEP)
_pdep_u64 (PDEP)
_pext_u32 (PEXT)
_pext_u64 (PEXT)
_tzcnt_u32 (TZCNT)
_tzcnt_u64 (TZCNT)
bitwise logical operations
Intrinsics for Bitwise Operations
_mm256_xor_si256 (VPXOR)
_mm256_and_si256 (VPAND)
_mm256_andnot_si256 (VPANDN)
_mm256_or_si256 (VPOR)
blend operations
Intrinsics for Blend Operations
_mm_blend_epi32
_mm256_blend_epi16 (VPBLENDW)
_mm256_blend_epi32 (VPBLENDD)
_mm256_blend_epi32 (VPBLENDVB)
broadcast operations
Intrinsics for Broadcast Operations
_mm_broadcastb_epi8 (VPBROADCASTB)
_mm_broadcastd_epi32 (VPBROADCASTD)
_mm_broadcastq_epi64 (VPBROADCASTQ)
_mm_broadcastsd_pd (VBROADCASTSD)
_mm_broadcastss_ps (VBROADCASTSS)
_mm_broadcastw_epi16 (VPBROADCASTW)
_mm256_broadcastb_epi8 (VPBROADCASTB)
_mm256_broadcastd_epi32 (VPBROADCASTD)
_mm256_broadcastq_epi64 (VPBROADCASTQ)
_mm256_broadcastsd_pd (VBROADCASTSD)
_mm256_broadcastsi128_si256 (VBROADCASTI128)
_mm256_broadcastsi128_si256 (VPERM2I128)
_mm256_broadcastss_ps (VBROADCASTSS)
_mm256_broadcastw_epi16 (VPBROADCASTW)
compare operations
Intrinsics for Compare Operations
_mm256_cmpeq_epi16 (VPCMPEQW)
_mm256_cmpeq_epi32 (VPCMPEQD)
_mm256_cmpeq_epi64 (VPCMPEQQ)
_mm256_cmpeq_epi8 (VPCMPEQB)
_mm256_cmpgt_epi16 (VPCMPGTW)
_mm256_cmpgt_epi32 (VPCMPGTD)
_mm256_cmpgt_epi64 (VPCMPGTQ)
_mm256_cmpgt_epi8 (VPCMPGTB)
_mm256_max_epi16 (VPMAXSW)
_mm256_max_epi32 (VPMAXSD)
_mm256_max_epi8 (VPMAXSB)
_mm256_max_epu16 (VPMAXUW)
_mm256_max_epu32 (VPMAXUD)
_mm256_max_epu8 (VPMAXUB)
_mm256_min_epi16 (VPMINSW)
_mm256_min_epi32 (VPMINSD)
_mm256_min_epi8 (VPMINSB)
_mm256_min_epu16 (VPMINUW)
_mm256_min_epu32 (VPMINUD)
_mm256_min_epu8 (VPMINUB)
fused multiply-add (FMA) operations
Intrinsics for Fused Multiply Add Operations
_mm_fmadd_pd (VFMADD###)
_mm_fmadd_ps (VFMADD###)
_mm_fmadd_sd (VFMADD###)
_mm_fmadd_ss (VFMADD###)
_mm_fmaddsub_pd (VFMADDSUB###)
_mm_fmaddsub_ps (VFMADDSUB###)
_mm_fmsub_pd (VFMSUB###)
_mm_fmsub_ps (VFMSUB###)
_mm_fmsub_sd (VFMSUB###)
_mm_fmsub_ss (VFMSUB###)
_mm_fmsubadd_pd (VFMSUBADD###)
_mm_fmsubadd_ps (VFMSUBADD###)
_mm_fnmadd_pd (VFNMADD###)
_mm_fnmadd_ps (VFNMADD###)
_mm_fnmadd_sd (VFNMADD###)
_mm_fnmadd_ss (VFNMADD###)
_mm_fnmsub_pd (VFNMSUB###)
_mm_fnmsub_ps (VFNMSUB###)
_mm_fnmsub_sd (VFNMSUB###)
_mm_fnmsub_ss (VFNMSUB###)
_mm256_fmadd_pd (VFMADD###)
_mm256_fmadd_ps (VFMADD###)
_mm256_fmadd_sd (VFMADD###)
_mm256_fmadd_ss (VFMADD###)
_mm256_fmaddsub_pd (VFMADDSUB###)
_mm256_fmaddsub_ps (VFMADDSUB###)
_mm256_fmsub_pd (VFMSUB###)
_mm256_fmsub_ps (VFMSUB###)
_mm256_fmsub_sd (VFMSUB###)
_mm256_fmsub_ss (VFMSUB###)
_mm256_fmsubadd_pd (VFMSUBADD###)
_mm256_fmsubadd_ps (VFMSUBADD###)
_mm256_fnmadd_pd (VFNMADD###)
_mm256_fnmadd_ps (VFNMADD###)
_mm256_fnmadd_sd (VFNMADD###)
_mm256_fnmadd_ss (VFNMADD###)
_mm256_fnmsub_pd (VFNMSUB###)
_mm256_fnmsub_ps (VFNMSUB###)
_mm256_fnmsub_sd (VFNMSUB###)
_mm256_fnmsub_ss (VFNMSUB###)
GATHER operations
Intrinsics for GATHER Operations
_mm_i32gather_epi32 (VPGATHERDD)
_mm_i32gather_epi64 (VPGATHERDQ)
_mm_i32gather_pd (VGATHERDPD)
_mm_i64gather_epi32 (VPGATHERQD)
_mm_i64gather_epi64 (VPGATHERQQ)
_mm_i64gather_pd (VGATHERQPD)
_mm_i64gather_ps (VGATHERQPS)
_mm_mask_i32gather_epi32 (VPGATHERDD)
_mm_mask_i32gather_epi64 (VPGATHERDQ)
_mm_mask_i32gather_ps (VGATHERDPS)
_mm_i32gather_ps, _mm256_i32gather_ps
_mm_mask_i32gather_ps, _mm256_mask_i32gather_ps
_mm_mask_i64gather_epi32 (VPGATHERQD)
_mm_mask_i64gather_epi64 (VPGATHERQQ)
_mm_mask_i64gather_pd (VGATHERQPD)
_mm_mask_i64gather_ps (VGATHERQPS)
_mm256_i32gather_epi32 (VPGATHERDD)
_mm256_i32gather_epi64 (VPGATHERDQ)
_mm256_i64gather_epi32 (VPGATHERQD)
_mm256_i64gather_epi64 (VPGATHERQQ)
_mm256_i64gather_pd (VGATHERQPD)
_mm256_i64gather_ps (VGATHERQPS)
_mm256_mask_i32gather_epi32 (VPGATHERDD)
_mm256_mask_i32gather_epi64 (VPGATHERDQ)
_mm256_mask_i32gather_pd (VGATHERDPD)
_mm_mask_i32gather_pd, _mm256_mask_i32gather_pd
_mm_i32gather_pd, _mm256_i32gather_pd
_mm256_mask_i32gather_ps (VGATHERDPS)
_mm_i32gather_ps, _mm256_i32gather_ps
_mm_mask_i32gather_ps, _mm256_mask_i32gather_ps
_mm256_mask_i64gather_epi32 (VPGATHERQD)
_mm256_mask_i64gather_epi64 (VPGATHERQQ)
_mm256_mask_i64gather_pd (VGATHERQPD)
_mm256_mask_i64gather_ps (VGATHERQPS)
insert and extract operations
Intrinsics for Insert/Extract Operations
_mm256_extractepi16
_mm256_extractepi32
_mm256_extractepi64
_mm256_extractepi8
_mm256_extracti128_si256 (VEXTRACTI128)
_mm256_insertepi16
_mm256_insertepi32
_mm256_insertepi64
_mm256_insertepi8
_mm256_inserti128_si256 (VINSERTI128)
load and store operations
logical shift operations
Intrinsics for Logical Shift Operations
_mm256_srl_epi16 (VPSRLW)
_mm256_srli_epi16 (VPSRLW)
_mm_sllv_epi16 (VPSLLVD)
_mm_sllv_epi32 (VPSLLVQ)
_mm_srlv_epi16 (VPSRLVD)
_mm_srlv_epi32 (VPSRLVQ)
_mm256_sll_epi16 (VPSLLW)
_mm256_sll_epi32 (VPSLLD)
_mm256_sll_epi64 (VPSLLQ)
_mm256_slli_epi16 (VPSLLW)
_mm256_slli_epi32 (VPSLLD)
_mm256_slli_epi64 (VPSLLQ)
_mm256_slli_si256 (VPSLLDQ)
_mm256_sllv_epi32 (VPSLLVD)
_mm256_sllv_epi64 (VPSLLVQ)
_mm256_srl_epi32 (VPSRLD)
_mm256_srl_epi64 (VPSRLQ)
_mm256_srli_epi32 (VPSRLD)
_mm256_srli_epi64 (VPSRLQ)
_mm256_srli_si256 (VPSRLDQ)
_mm256_srlv_epi32 (VPSRLVD)
_mm256_srlv_epi64 (VPSRLVQ)
masked load and store operations
_mm_maskload_epi32/64, _mm256_maskload_epi32/64
_mm_maskstore_epi32/64, _mm256_maskstore_epi32/64
_mm256_maskload_epi32 (VPMASKMOVD)
_mm256_maskload_epi64 (VPMASKMOVQ)
_mm256_maskstore_epi32 (VPMASKMOVD)
_mm256_maskstore_epi64 (VPMASKMOVQ)
miscellaneous operations
Intrinsics for Miscellaneous Operations
_mm256_alignr_epi8 (VPALIGNRB)
_mm256_movemask_epi8 (VPMOVMSKB)
_mm256_stream_load_si256 (VMOVNTDQA)
pack and unpack operations
Intrinsics for Pack/Unpack Operations
_mm256_packs_epi16 (VPACKSSWB)
_mm256_packs_epi32 (VPACKSSDW)
_mm256_packus_epi16 (VPACKUSWB)
_mm256_packus_epi32 (VPACKUSDW)
_mm256_unpackhi_epi16 (VPUNPCKHWD)
_mm256_unpackhi_epi32 (VPUNPCKHDQ)
_mm256_unpackhi_epi64 (VPUNPCKHQDQ)
_mm256_unpackhi_epi8 (VPUNPCKHBW)
_mm256_unpacklo_epi16 (VPUNPCKLWD)
_mm256_unpacklo_epi32 (VPUNPCKLDQ)
_mm256_unpacklo_epi64 (VPUNPCKLQDQ)
_mm256_unpacklo_epi8 (VPUNPCKLBW)
packed move operations
Intrinsics for Packed Move with Extend Operations
_mm256_cvtepu8_epi16 (VPMOVZXBW)
_mm256_cvtepi16_epi32 (VPMOVSXWD)
_mm256_cvtepi16_epi64 (VPMOVSXWQ)
_mm256_cvtepi32_epi64 (VPMOVSXDQ)
_mm256_cvtepi8_epi16 (VPMOVSXBW)
_mm256_cvtepi8_epi32 (VPMOVSXBD)
_mm256_cvtepi8_epi64 (VPMOVSXBQ)
_mm256_cvtepu16_epi32 (VPMOVZXWD)
_mm256_cvtepu16_epi64 (VPMOVZXWQ)
_mm256_cvtepu32_epi64 (VPMOVZXDQ)
_mm256_cvtepu8_epi32 (VPMOVZXBD)
_mm256_cvtepu8_epi64 (VPMOVZXBQ)
permute operations
Intrinsics for Permute Operations
_mm256_permute4x64_epi64 (VPERMQ)
_mm256_permute4x64_pd (VPERMPD)
_mm256_permutevar8x32_epi32 (VPERM2I128)
_mm256_permutevar8x32_epi32 (VPERMD)
_mm256_permutevar8x32_epi32 (VPERMPS)
shuffle operations
Intrinsics for Shuffle Operations
_mm256_shuffle_epi32 (VPSHUFD)
_mm256_shuffle_epi8
_mm256_shuffle_epi8 (VPSHUFB)
_mm256_sufflehi_epi16 (VPSHUFHW)
_mm256_sufflelo_epi16 (VPSHUFLW)
Transactional Synchronization Extensions
Intel® SSE
Arithmetic Intrinsics
Cacheability Support Intrinsics
Compare Intrinsics
Conversion Intrinsics
Details about Intel® Streaming SIMD Extensions Intrinsics
Integer Intrinsics
Load Intrinsics
Logical Intrinsics
Macro Function for Matrix Transposition
Macro Functions to Read and Write Control Registers
Macro Function for Shuffle Operations
Miscellaneous Intrinsics
Overview: Intel® Streaming SIMD Extensions (Intel® SSE)
Writing Programs with Intel® Streaming SIMD Extensions (Intel® SSE) Intrinsics
Intrinsics to Read and Write Registers
Set Intrinsics
Store Intrinsics
arithmetic operations
Arithmetic Intrinsics
add_ps
add_ss
div_ps
div_ss
max_ps
max_ss
min_ps
min_ss
mul_ps
mul_ss
rcp_ps
rcp_ss
rsqrt_ps
rsqrt_ss
sqrt_ps
sqrt_ss
sub_ps
sub_ss
cacheability support operations
Cacheability Support Intrinsics
prefetch
sfence
stream_pi
stream_ps
compare operations
Compare Intrinsics
cmpeq_ps
cmpeq_ss
cmpge_ps
cmpge_ss
cmpgt_ps
cmpgt_ss
cmple_ps
cmple_ss
cmplt_ps
cmplt_ss
cmpneq_ps
cmpneq_ss
cmpnge_ps
cmpnge_ss
cmpngt_ps
cmpngt_ss
cmpnle_ps
cmpnle_ss
cmpnlt_ps
cmpnlt_ss
cmpord_ps
cmpord_ss
cmpunord_ps
cmpunord_ss
comieq_ss
comige_ss
comigt_ss
comile_ss
comilt_ss
comineq_ss
ucomieq_ss
ucomige_ss
ucomigt_ss
ucomile_ss
ucomilt_ss
ucomineq_ss
conversion operations
Conversion Intrinsics
cvtpi16_ps
cvtpi32_ps
cvtpi32x2_ps
cvtpi8_ps
cvtps_pi16
cvtps_pi32
cvtps_pi8
cvtpu16_ps
cvtpu8_ps
cvtsi32_ss
cvtsi64_ss
cvtss_f32
cvtss_si32
cvtss_si64
cvttps_pi32
cvttss_si32
cvttss_si64
data types
integer operations
Integer Intrinsics
avg_pu16
avg_pu8
extract_pi16
insert_pi16
maskmove_si641
max_pi16
max_pu8
min_pi16
min_pu8
movemask_pi8
mulhi_pu16
sad_pu8
shuffle_pi16
load operations
Load Intrinsics
load_ps
load_ps1
load_ss
loadh_pi
loadl_pi
loadr_ps(
loadu_ps
logical operations
Logical Intrinsics
and_ps
andnot_ps
or_ps
xor_ps
macros
Macro Function for Matrix Transposition
Macro Functions to Read and Write Control Registers
Macro Function for Shuffle Operations
matrix transposition
read control register
shuffle function
write control register
miscellaneous operations
Miscellaneous Intrinsics
_mm_undefined_ps()
move_ss
movehl_ps
movelh_ps
movemask_ps
shuffle_ps
unpackhi_ps
unpacklo_ps
overview
programming with Intel® SSE intrinsics
read/write register intrinsics
Intrinsics to Read and Write Registers
getcsr
setcsr
registers
set operations
Set Intrinsics
set_ps
set_ps1
set_ss
setr_ps
setzero_ps
store operations
Store Intrinsics
store_ps
store_ps1
store_ss
storeh_pi
storel_pi
storer_ps
storeu_ps
Intel® SSE2
Cacheability Support Intrinsics
Casting Support Intrinsics
Arithmetic Intrinsics
Compare Intrinsics
Conversion Intrinsics
Load Intrinsics
Logical Intrinsics
Set Intrinsics
Store Intrinsics
Arithmetic Intrinsics
Compare Intrinsics
Conversion Intrinsics
Load Intrinsics
Logical Intrinsics
Move Intrinsics
Set Intrinsics
Shift Intrinsics
Store Intrinsics
Miscellaneous Intrinsics
Overview: Intel® Streaming SIMD Extensions 2 (Intel® SSE2)
Pause Intrinsic
Macro Function for Shuffle
cacheability support operations
Cacheability Support Intrinsics
clflush
clflushopt
lfence
mfence
stream_pd
stream_si128
stream_si32
casting support
Casting Support Intrinsics
_mm_castpd_ps
_mm_castpd_si128
_mm_castps_pd
_mm_castps_si128
_mm_castsi128_pd
_mm_castsi128_ps
FP arithmetic operations
Arithmetic Intrinsics
add_pd
add_sd
div_pd
div_sd
max_pd
max_sd
min_pd
min_sd
mul_pd
mul_sd
sqrt_pd
sqrt_sd
sub_pd
sub_sd
FP compare operations
Compare Intrinsics
cmpeq_pd
cmpeq_sd
cmpge_pd
cmpge_sd
cmpgt_pd
cmpgt_sd
cmple_pd
cmple_sd
cmplt_pd
cmplt_sd
cmpneq_pd
cmpneq_sd
cmpnge_pd
cmpnge_sd
cmpngt_pd
cmpngt_sd
cmpnle_pd
cmpnle_sd
cmpnlt_pd
cmpnlt_sd
cmpord_pd
cmpord_sd
cmpunord_pd
cmpunord_sd
comieq_sd
comige_sd
comigt_sd
comile_sd
comilt_sd
comineq_sd
ucomieq_sd
ucomige_sd
ucomigt_sd
ucomile_sd
ucomilt_sd
ucomineq_sd
FP conversion operations
Conversion Intrinsics
cvtepi32_pd
cvtpd_epi32
cvtpd_pi32
cvtpd_ps
cvtpi32_pd
cvtps_pd
cvtsd_f64
cvtsd_si32
cvtsd_ss
cvtsi32_sd
cvtss_sd
cvttpd_epi32
cvttpd_pi32
cvttsd_si32
FP load operations
Load Intrinsics
load_pd
load_sd
load1_pd
loadh_pd
loadl_pd
loadr_pd
loadu_pd
FP logical operations
Logical Intrinsics
and_pd
andnot_pd
or_pd
xor_pd
FP set operations
Set Intrinsics
move_sd
set_pd
set_sd
set1_pd
setr_pd
setzero_pd
FP store operations
Store Intrinsics
store_pd
store_sd
store1_pd
storeh_pd
storel_pd
storer_pd
storeu_pd
integer arithmetic operations
Arithmetic Intrinsics
add_epi16
add_epi32
add_epi64
add_epi8
add_si64
adds_epi16
adds_epi8
adds_epu16
adds_epu8
avg_epu16
avg_epu8
madd_epi16
max_epi16
max_epu8
min_epi16
min_epu8
mul_epu32
mul_su32
mulhi_epi16
mulhi_epu16
mullo_epi16
sad_epu8
sub_epi16
sub_epi32
sub_epi64
sub_epi8
sub_si64
subs_epi16
subs_epi8
subs_epu16
subs_epu8
integer compare operations
Compare Intrinsics
cmpeq_epi16
cmpeq_epi32
cmpeq_epi8
cmpgt_epi16
cmpgt_epi32
cmpgt_epi8
cmplt_epi16
cmplt_epi32
cmplt_epi8
integer conversion operations
Conversion Intrinsics
cvtepi32_ps
cvtps_epi32
cvtsd_si64
cvtsi64_sd
cvttps_epi32
cvttsd_si64
integer load operations
Load Intrinsics
load_si128
loadl_epi64
loadu_si128
integer logical operations
Logical Intrinsics
and_si128
andnot_si128
or_si128
xor_si128
integer move operations
Move Intrinsics
cvtsi128_si32
cvtsi128_si64
cvtsi32_si128
cvtsi64_si128
integer set operations
Set Intrinsics
set_epi16
set_epi32
set_epi64
set_epi8
set1_epi16
set1_epi32
set1_epi64
set1_epi8
setr_epi16
setr_epi32
setr_epi64
setr_epi8
setzero_si128
integer shift operations
Shift Intrinsics
sll_epi16
sll_epi32
sll_epi64
slli_epi16
slli_epi32
slli_epi64
slli_si128
sra_epi16
sra_epi32
srai_epi16
srai_epi32
srl_epi16
srl_epi32
srl_epi64
srli_epi16
srli_epi32
srli_epi64
srli_si128
integer store operations
Store Intrinsics
maskmoveu_si128
store_si128
storel_epi64
storeu_si128
miscellaneous operations
Miscellaneous Intrinsics
extract_epi16
insert_epi16
move_epi64
movemask_epi8
movemask_pd
movepi64_pi64
movpi64_pi64
packs_epi16
packs_epi32
packus_epi16
shuffle_epi32
shuffle_pd
shufflehi_epi16
shufflelo_epi16
unpackhi_epi16
unpackhi_epi32
unpackhi_epi64
unpackhi_epi8
unpackhi_pd
unpacklo_epi16
unpacklo_epi32
unpacklo_epi64
unpacklo_epi8
unpacklo_pd
overview
pause intrinsic
shuffle macro
Intel® SSE3
Single-precision Floating-point Vector Intrinsics
Double-precision Floating-point Vector Intrinsics
Integer Vector Intrinsic
Miscellaneous Intrinsics
Overview: Intel® Streaming SIMD Extensions 3 (Intel® SSE3)
float32 vector intrinsics
Single-precision Floating-point Vector Intrinsics
addsub_ps
hadd_ps
hsub_ps
movehdup_ps
moveldup_ps
float64 vector intrinsics
Double-precision Floating-point Vector Intrinsics
addsub_pd
hadd_pd
hsub_pd
loaddup_pd
movedup_pd
integer vector intrinsic
Integer Vector Intrinsic
lddqu_si128
miscellaneous intrinsics
overview
Intel® SSE4
Application Targeted Accelerators Intrinsics
Cacheability Support Intrinsic
DWORD Multiply Intrinsics
Floating Point Rounding Intrinsics
Floating Point Dot Product Intrinsics
Overview: Intel® Streaming SIMD Extensions 4 (Intel® SSE4)
Packed Blending Intrinsics
Packed Compare for Equal Intrinsic
Packed Compare Intrinsics
Packed DWORD to Unsigned WORD Intrinsic
Packed Format Conversion Intrinsics
Packed Integer Min/Max Intrinsics
Register Insertion/Extraction Intrinsics
Test Intrinsics
application targeted accelerator intrinsics
Application Targeted Accelerators Intrinsics
_mm_crc32_u16
_mm_crc32_u32
_mm_crc32_u64
_mm_crc32_u8
_mm_popcnt_u64
_mm_popcnt_u32
cacheability support intrinsic
Cacheability Support Intrinsic
_mm_stream_load_si128
MOVNTDQA
DWORD multiply operations
DWORD Multiply Intrinsics
_m128i _mm_mul_epi32
_m128i _mm_mullo_epi32
floating-point rounding operations
Floating Point Rounding Intrinsics
_mm_ceil_pd
_mm_ceil_ps
_mm_ceil_sd
_mm_ceil_ss
_mm_floor_pd
_mm_floor_ps
_mm_floor_sd
_mm_floor_ss
_mm_round_pd
_mm_round_ps
_mm_round_sd
_mm_round_ss
FP dot product operations
Floating Point Dot Product Intrinsics
_mm_dp_pd
_mm_dp_ps
overview
packed blending operations
Packed Blending Intrinsics
_mm_blend_epi16
_mm_blend_pd
_mm_blend_ps
_mm_blendv_epi8
_mm_blendv_pd
_mm_blendv_ps
packed compare for equal intrinsic
_mm_cmpeq_epi64
PCMPEQQ
packed compare operations
Packed Compare Intrinsics
_cmpestra
_cmpestrc
_cmpestri
_cmpestrm
_cmpestro
_cmpestrs
_cmpestrz
_cmpistra
_cmpistrc
_cmpistri
_cmpistrm
_cmpistro
_cmpistrs
_cmpistrz
PCMPESTRA
PCMPESTRC
PCMPESTRI
PCMPESTRM
PCMPESTRO
PCMPESTRS
PCMPESTRZ
PCMPISTRA
PCMPISTRC
PCMPISTRI
PCMPISTRM
PCMPISTRO
PCMPISTRS
PCMPISTRZ
packed DWORD to unsigned WORD intrinsic
Packed DWORD to Unsigned WORD Intrinsic
_mm_packus_epi32
PACKUSDW
packed format conversion operations
Packed Format Conversion Intrinsics
_mm_cvtepi16_epi32
_mm_cvtepi16_epi64
_mm_cvtepi32_epi64
_mm_cvtepi8_epi16
_mm_cvtepi8_epi32
_mm_cvtepi8_epi64
_mm_cvtepu16_epi32
_mm_cvtepu16_epi64
_mm_cvtepu32_epi64
_mm_cvtepu8_epi16
_mm_cvtepu8_epi32
_mm_cvtepu8_epi64
PMOVSXBD
PMOVSXBQ
PMOVSXBW
PMOVSXDQ
PMOVSXWD
PMOVSXWQ
PMOVZXBD
PMOVZXBQ
PMOVZXBW
PMOVZXDQ
PMOVZXWD
PMOVZXWQ
packed integer min/max intrinsics
_mm_max_epi16
_mm_max_epi32
_mm_max_epi8
_mm_max_epu32
_mm_min_epi16
_mm_min_epi32
_mm_min_epi8
_mm_min_epu32
PMAXSB
PMAXSD
PMAXUD
PMAXUW
PMINSB
PMINSD
PMINUW
register insertion/extraction operations
Register Insertion/Extraction Intrinsics
_mm_extract_epi16
_mm_extract_epi32
_mm_extract_epi64
_mm_extract_epi8
_mm_extract_ps
_mm_insert_epi32
_mm_insert_epi64
_mm_insert_epi8
_mm_insert_ps
EXTRACTPS
INSERTPS
PEXTRB
PEXTRD
PEXTRQ
PEXTRW
PINSRB
PINSRD
PINSRQ
test operations
Test Intrinsics
_mm_testc_si128
_mm_testnzc_si128
_mm_testz_si128
Intel® Streaming SIMD Extensions
Arithmetic Intrinsics
Intrinsics to Read and Write Registers
arithmetic operations
register intrinsics
Intel® Streaming SIMD Extensions 3
Single-precision Floating-point Vector Intrinsics
Miscellaneous Intrinsics
float32 vector intrinsics
miscellaneous intrinsics
Later Generation Intel® Core™ Processor Instruction Extensions
memory allocation
Allocating and Freeing Aligned Memory Blocks
Overview: Data Alignment, Memory Allocation Intrinsics, and Inline Assembly
MMX™ Technology
Details about MMX™ Technology Intrinsics
data types
registers
MMX™ Technology
Compare Intrinsics (MMX™ technology)
The EMMS Instruction: Why You Need It
EMMS Usage Guidelines
General Support Intrinsics (MMX™ technology)
Logical Intrinsics (MMX™ technology)
Overview: Intrinsics for MMX™ Technology
Packed Arithmetic Intrinsics (MMX™ technology)
Set Intrinsics (MMX™ technology)
Shift Intrinsics (MMX™ technology)
compare operations
Compare Intrinsics (MMX™ technology)
cmpeq_pi16
cmpeq_pi32
cmpeq_pi8
cmpgt_pi16
cmpgt_pi32
cmpgt_pi8
EMMS instruction
The EMMS Instruction: Why You Need It
EMMS Usage Guidelines
about
using
general support operations
General Support Intrinsics (MMX™ technology)
cvtm64_si64
cvtsi32_si64
cvtsi64_m64
cvtsi64_si32
empty
packs_pi16
packs_pi32
packs_pu16
unpackhi_pi16
unpackhi_pi32
unpackhi_pi8
unpacklo_pi16
unpacklo_pi32
unpacklo_pi8
logical operations
Logical Intrinsics (MMX™ technology)
and_si64
andnot_si64
or_si64
xor_si64
overview
packed arithmetic operations
Packed Arithmetic Intrinsics (MMX™ technology)
add_pi16
add_pi32
add_pi8
adds_pi16
adds_pi8
adds_pu16
adds_pu8
madd_pi16
mulhi_pi16
mullo_pi16
sub_pi16
sub_pi32
sub_pi8
subs_pi16
subs_pi8
subs_pu16
subs_pu8
set operations
Set Intrinsics (MMX™ technology)
set_pi16
set_pi32
set_pi8
set1_pi16
set1_pi32
set1_pi8
setr_pi16
setr_pi32
setr_pi8
setzero_si64
shift operations
Shift Intrinsics (MMX™ technology)
sll_pi16
sll_pi32
slli_pi16
slli_pi32
slli_pi64
sra_pi16
sra_pi32
srai_pi16
srai_pi32
srl_pi16
srl_pi32
srl_pi64
srli_pi16
srli_pi32
srli_pi64
naming and syntax
references
registers
SSSE3
Absolute Value Intrinsics
Addition Intrinsics
Concatenate Intrinsics
Multiplication Intrinsics
Negation Intrinsics
Overview: Supplemental Streaming SIMD Extensions 3 (SSSE3)
Shuffle Intrinsics
Subtraction Intrinsics
absolute value operations
Absolute Value Intrinsics
_mm_abs_epi16
_mm_abs_epi32
_mm_abs_epi8
_mm_abs_pi16
_mm_abs_pi32
_mm_abs_pi8
addition operations
Addition Intrinsics
_mm_hadd_epi16
_mm_hadd_epi32
_mm_hadd_pi16
_mm_hadd_pi32
_mm_hadds_epi16
_mm_hadds_pi16
concatenate operations
Concatenate Intrinsics
_mm_alignr_epi8
_mm_alignr_pi8
multiplication operations
Multiplication Intrinsics
_mm_maddubs_epi16
_mm_maddubs_pi16
_mm_mulhrs_epi16
_mm_mulhrs_pi16
negation operations
overview
shuffle operations
subtraction operations
SVML
_mm_cexp_ps, _mm256_cexp_ps
_mm_clog_ps, _mm256_clog_ps
_mm_csqrt_ps, _mm256_csqrt_ps
_mm_cdfnorminv_pd, _mm256_cdfnorminv_pd
_mm_cdfnorminv_ps, _mm256_cdfnorminv_ps
_mm_erf_pd, _mm256_erf_pd
_mm_erf_ps, _mm256_erf_ps
_mm_erfc_pd, _mm256_erfc_pd
_mm_erfc_ps, _mm256_erfc_ps
_mm_erfinv_pd, _mm256_erfinv_pd
_mm_erfinv_ps, _mm256_erfinv_ps
_mm_exp2_pd, _mm256_exp2_pd
_mm_exp2_ps, _mm256_exp2_ps
_mm_hypot_ps, _mm256_hypot_ps
_mm_exp_pd, _mm256_exp_pd
_mm_exp_ps, _mm256_exp_ps
_mm_exp10_pd, _mm256_exp10_pd
_mm_exp10_ps, _mm256_exp10_ps
_mm_expm1_pd, _mm256_expm1_pd
_mm_expm1_ps, _mm256_expm1_ps
_mm_hypot_pd, _mm256_hypot_pd
_mm_pow_pd, _mm256_pow_pd
_mm_pow_ps, _mm256_pow_ps
_mm_log_pd, _mm256_log_pd
_mm_log_ps, _mm256_log_ps
_mm_log10_pd, _mm256_log10_pd
_mm_log10_ps, _mm256_log10_ps
_mm_log1p_pd, _mm256_log1p_pd
_mm_log1p_ps, _mm256_log1p_ps
_mm_log2_pd, _mm256_log2_pd
_mm_log2_ps, _mm256_log2_ps
_mm_logb_pd, _mm256_logb_pd
_mm_logb_ps, _mm256_logb_ps
Overview: Intrinsics for Short Vector Math Library (SVML) Functions
_mm_sqrt_ps, _mm256_sqrt_ps
_mm_cbrt_pd, _mm256_cbrt_pd
_mm_cbrt_ps, _mm256_cbrt_ps
_mm_invcbrt_pd, _mm256_invcbrt_pd
_mm_invcbrt_ps, _mm256_invcbrt_ps
_mm_invsqrt_pd, _mm256_invsqrt_pd
_mm_invsqrt_ps, _mm256_invsqrt_ps
_mm_sqrt_pd, _mm256_sqrt_pd
_mm_sinh_ps, _mm256_sinh_ps
_mm_acos_pd, _mm256_acos_pd
_mm_acos_ps, _mm256_acos_ps
_mm_acosh_pd, _mm256_acosh_pd
_mm_acosh_ps, _mm256_acosh_ps
_mm_asin_pd, _mm256_asin_pd
_mm_asin_ps, _mm256_asin_ps
_mm_asinh_pd, _mm256_asinh_pd
_mm_asinh_ps, _mm256_asinh_ps
_mm_atan_pd, _mm256_atan_pd
_mm_atan_ps, _mm256_atan_ps
_mm_atan2_pd, _mm256_atan2_pd
_mm_atan2_ps, _mm256_atan2_ps
_mm_atanh_pd, _mm256_atanh_pd
_mm_atanh_ps, _mm256_atanh_ps
_mm_cos_pd, _mm256_cos_pd
_mm_cos_ps, _mm256_cos_ps
_mm_cosd_pd, _mm256_cosd_pd
_mm_cosd_ps, _mm256_cosd_ps
_mm_cosh_pd, _mm256_cosh_pd
_mm_cosh_ps, _mm256_cosh_ps
_mm_sin_pd, _mm256_sin_pd
_mm_sin_ps, _mm256_sin_ps
_mm_sincos_pd, _mm256_sincos_pd
_mm_sincos_ps, _mm256_sincos_ps
_mm_sind_pd, _mm256_sind_pd
_mm_sind_ps, _mm256_sind_ps
_mm_sinh_pd, _mm256_sinh_pd
_mm_tan_pd, _mm256_tan_pd
_mm_tan_ps, _mm256_tan_ps
_mm_tand_pd, _mm256_tand_pd
_mm_tand_ps, _mm256_tand_ps
_mm_tanh_pd, _mm256_tanh_pd
_mm_tanh_ps, _mm256_tanh_ps
complex functions
_mm_cexp_ps, _mm256_cexp_ps
_mm_clog_ps, _mm256_clog_ps
_mm_csqrt_ps, _mm256_csqrt_ps
_mm_cexp_ps, _mm256_cexp_ps
_mm_clog_ps, _mm256_clog_ps
_mm_csqrt_ps, _mm256_csqrt_ps
error functions
_mm_cdfnorminv_pd, _mm256_cdfnorminv_pd
_mm_cdfnorminv_ps, _mm256_cdfnorminv_ps
_mm_erf_pd, _mm256_erf_pd
_mm_erf_ps, _mm256_erf_ps
_mm_erfc_pd, _mm256_erfc_pd
_mm_erfc_ps, _mm256_erfc_ps
_mm_erfinv_pd, _mm256_erfinv_pd
_mm_erfinv_ps, _mm256_erfinv_ps
_mm_cdfnorminv_pd, _mm256_cdfnorminv_pd
_mm_cdfnorminv_ps, _mm256_cdfnorminv_ps
_mm_erf_pd, _mm256_erf_pd
_mm_erf_ps, _mm256_erf_ps
_mm_erfc_pd, _mm256_erfc_pd
_mm_erfc_ps, _mm256_erfc_ps
_mm_erfinv_pd, _mm256_erfinv_pd
_mm_erfinv_ps, _mm256_erfinv_ps
exponential functions
_mm_exp2_pd, _mm256_exp2_pd
_mm_exp2_ps, _mm256_exp2_ps
_mm_hypot_ps, _mm256_hypot_ps
_mm_exp_pd, _mm256_exp_pd
_mm_exp_ps, _mm256_exp_ps
_mm_exp10_pd, _mm256_exp10_pd
_mm_exp10_ps, _mm256_exp10_ps
_mm_expm1_pd, _mm256_expm1_pd
_mm_expm1_ps, _mm256_expm1_ps
_mm_hypot_pd, _mm256_hypot_pd
_mm_pow_pd, _mm256_pow_pd
_mm_pow_ps, _mm256_pow_ps
_mm_exp2_pd, _mm256_exp2_pd
_mm_exp2_ps, _mm256_exp2_ps
_mm_hypot_ps, _mm256_hypot_ps
_mm_exp_pd, _mm256_exp_pd
_mm_exp_ps, _mm256_exp_ps
_mm_exp10_pd, _mm256_exp10_pd
_mm_exp10_ps, _mm256_exp10_ps
_mm_expm1_pd, _mm256_expm1_pd
_mm_expm1_ps, _mm256_expm1_ps
_mm_hypot_pd, _mm256_hypot_pd
_mm_pow_pd, _mm256_pow_pd
_mm_pow_ps, _mm256_pow_ps
logarithmic functions
_mm_log_pd, _mm256_log_pd
_mm_log_ps, _mm256_log_ps
_mm_log10_pd, _mm256_log10_pd
_mm_log10_ps, _mm256_log10_ps
_mm_log1p_pd, _mm256_log1p_pd
_mm_log1p_ps, _mm256_log1p_ps
_mm_log2_pd, _mm256_log2_pd
_mm_log2_ps, _mm256_log2_ps
_mm_logb_pd, _mm256_logb_pd
_mm_logb_ps, _mm256_logb_ps
_mm_log_pd, _mm256_log_pd
_mm_log_ps, _mm256_log_ps
_mm_log10_pd, _mm256_log10_pd
_mm_log10_ps, _mm256_log10_ps
_mm_log1p_pd, _mm256_log1p_pd
_mm_log1p_ps, _mm256_log1p_ps
_mm_log2_pd, _mm256_log2_pd
_mm_log2_ps, _mm256_log2_ps
_mm_logb_pd, _mm256_logb_pd
_mm_logb_ps, _mm256_logb_ps
overview
square and cube root functions
_mm_sqrt_ps, _mm256_sqrt_ps
_mm_cbrt_pd, _mm256_cbrt_pd
_mm_cbrt_ps, _mm256_cbrt_ps
_mm_invcbrt_pd, _mm256_invcbrt_pd
_mm_invcbrt_ps, _mm256_invcbrt_ps
_mm_invsqrt_pd, _mm256_invsqrt_pd
_mm_invsqrt_ps, _mm256_invsqrt_ps
_mm_sqrt_pd, _mm256_sqrt_pd
_mm_sqrt_ps, _mm256_sqrt_ps
_mm_cbrt_pd, _mm256_cbrt_pd
_mm_cbrt_ps, _mm256_cbrt_ps
_mm_invcbrt_pd, _mm256_invcbrt_pd
_mm_invcbrt_ps, _mm256_invcbrt_ps
_mm_invsqrt_pd, _mm256_invsqrt_pd
_mm_invsqrt_ps, _mm256_invsqrt_ps
_mm_sinh_pd, _mm256_sinh_pd
trigonometric functions
_mm_sinh_ps, _mm256_sinh_ps
_mm_acos_pd, _mm256_acos_pd
_mm_acos_ps, _mm256_acos_ps
_mm_acosh_pd, _mm256_acosh_pd
_mm_acosh_ps, _mm256_acosh_ps
_mm_asin_pd, _mm256_asin_pd
_mm_asin_ps, _mm256_asin_ps
_mm_asinh_pd, _mm256_asinh_pd
_mm_asinh_ps, _mm256_asinh_ps
_mm_atan_pd, _mm256_atan_pd
_mm_atan_ps, _mm256_atan_ps
_mm_atan2_pd, _mm256_atan2_pd
_mm_atan2_ps, _mm256_atan2_ps
_mm_atanh_pd, _mm256_atanh_pd
_mm_atanh_ps, _mm256_atanh_ps
_mm_cos_pd, _mm256_cos_pd
_mm_cos_ps, _mm256_cos_ps
_mm_cosd_pd, _mm256_cosd_pd
_mm_cosd_ps, _mm256_cosd_ps
_mm_cosh_pd, _mm256_cosh_pd
_mm_cosh_ps, _mm256_cosh_ps
_mm_sin_pd, _mm256_sin_pd
_mm_sin_ps, _mm256_sin_ps
_mm_sincos_pd, _mm256_sincos_pd
_mm_sincos_ps, _mm256_sincos_ps
_mm_sind_pd, _mm256_sind_pd
_mm_sind_ps, _mm256_sind_ps
_mm_sinh_pd, _mm256_sinh_pd
_mm_tan_pd, _mm256_tan_pd
_mm_tan_ps, _mm256_tan_ps
_mm_tand_pd, _mm256_tand_pd
_mm_tand_ps, _mm256_tand_ps
_mm_tanh_pd, _mm256_tanh_pd
_mm_tanh_ps, _mm256_tanh_ps
_mm_sinh_ps, _mm256_sinh_ps
_mm_acos_pd, _mm256_acos_pd
_mm_acos_ps, _mm256_acos_ps
_mm_acosh_pd, _mm256_acosh_pd
_mm_acosh_ps, _mm256_acosh_ps
_mm_asin_pd, _mm256_asin_pd
_mm_asin_ps, _mm256_asin_ps
_mm_asinh_pd, _mm256_asinh_pd
_mm_asinh_ps, _mm256_asinh_ps
_mm_atan_pd, _mm256_atan_pd
_mm_atan_ps, _mm256_atan_ps
_mm_atan2_pd, _mm256_atan2_pd
_mm_atan2_ps, _mm256_atan2_ps
_mm_atanh_pd, _mm256_atanh_pd
_mm_atanh_ps, _mm256_atanh_ps
_mm_cos_pd, _mm256_cos_pd
_mm_cos_ps, _mm256_cos_ps
_mm_cosd_pd, _mm256_cosd_pd
_mm_cosd_ps, _mm256_cosd_ps
_mm_cosh_pd, _mm256_cosh_pd
_mm_cosh_ps, _mm256_cosh_ps
_mm_sin_pd, _mm256_sin_pd
_mm_sin_ps, _mm256_sin_ps
_mm_sincos_pd, _mm256_sincos_pd
_mm_sincos_ps, _mm256_sincos_ps
_mm_sind_pd, _mm256_sind_pd
_mm_sind_ps, _mm256_sind_ps
_mm_sinh_pd, _mm256_sinh_pd
_mm_tan_pd, _mm256_tan_pd
_mm_tan_ps, _mm256_tan_ps
_mm_tand_pd, _mm256_tand_pd
_mm_tand_ps, _mm256_tand_ps
_mm_tanh_pd, _mm256_tanh_pd
_mm_tanh_ps, _mm256_tanh_ps
intrinsics
Macro Functions
Intel® SSE2
Macro Functions
macro functions
Intel® SSE3
Macro Functions
macro functions
Intrinsics for Intel® Advanced Vector Extensions 512 (Intel® AVX-512)
Intrinsics for Integer Comparison Operations
Intrinsics for FP Comparison Operations
absolute value operations
Intrinsics for Absolute Value Operations
_mm512[_mask[z]]_abs_epi32
_mm512[_mask[z]]_abs_epi64
architectural enhancements
arithmetic operations
Intrinsics for FP Multiplication Operations
Intrinsics for FP Subtraction Operations
Intrinsics for Integer Addition Operations
Intrinsics for Integer Multiplication Operations
Intrinsics for Integer Subtraction Operations
FP multiplicationoperations
Intrinsics for FP Multiplication Operations
_mm_mask_mul_round_sd
_mm_mask_mul_round_ss
_mm_maskz_mul_round_sd
_mm_maskz_mul_round_ss
_mm_mul_round_sd
_mm_mul_round_ss
_mm512_maskz_mul_round_pd
_mm512_maskz_mul_round_ps
FP subtraction operations
Intrinsics for FP Subtraction Operations
_mm_mask[z]_sub_sd
_mm_mask[z]_sub_ss
_mm512[_mask[z]]_sub_pd
_mm512[_mask[z]]_sub_ps
_mm512[_mask[z]]_sub_round_pd
_mm512[_mask[z]]_sub_round_ps
_mm512[_mask[z]]_sub_round_sd
_mm512[_mask[z]]_sub_round_ss
integer addition
Intrinsics for Integer Addition Operations
_mm512[_mask[z]]_add_epi32
_mm512[_mask[z]]_add_epi64
integer multiplication operations
Intrinsics for Integer Multiplication Operations
_mm512[_mask[z]]_mul_epi32
_mm512[_mask[z]]_mul_epu32
_mm512[_mask]_mulhi_epi32
_mm512[_mask]_mulhi_epu32
_mm512[_mask]_mullo_epi32
_mm512[_mask]_mullox_epi64
integer subtraction operations
Intrinsics for Integer Subtraction Operations
_mm512[_mask[z]]_sub_round_epi32
_mm512[_mask[z]]_sub_round_epi64
arithmetic operations
Intrinsics for FP Addition Operations
FP addition operations
Intrinsics for FP Addition Operations
_mm_mask[z]_add_sd
_mm_mask[z]_add_ss
_mm[_mask[z]]_add_round_sd
_mm[_mask[z]]_add_round_ss
_mm512_mask[z]_add_pd
_mm512[_mask[z]]_add_ps
_mm512[_mask[z]]_add_round_pd
_mm512[_mask[z]]_add_round_ps
bit manipulation operations
Intrinsics for Integer Bit Manipulation and Conflict Detection Operations
_mm512_lzcnt_epi32
_mm512_lzcnt_epi64
_mm512_mask_lzcnt_epi32
_mm512_mask_lzcnt_epi64
_mm512_maskz_lzcnt_epi32
_mm512_maskz_lzcnt_epi64
bit rotation operations
Intrinsics for Integer Bit Rotation Operations
_mm512[_mask[z]]_rol_epi32
_mm512[_mask[z]]_rol_epi64
_mm512[_mask[z]]_rolv_epi32
_mm512[_mask[z]]_rolv_epi64
_mm512[_mask[z]]_ror_epi32
_mm512[_mask[z]]_ror_epi64
_mm512[_mask[z]]_rorv_epi32
_mm512[_mask[z]]_rorv_epi64
bitwise logical operations
Intrinsics for Bitwise Logical Operations
_mm512[_mask[z]]_and_epi32
_mm512[_mask[z]]_and_epi64
_mm512[_mask[z]]_andnot_epi32
_mm512[_mask[z]]_andnot_epi64
_mm512[_mask[z]]_or_epi32
_mm512[_mask[z]]_or_epi64
_mm512[_mask[z]]_xor_epi32
_mm512[_mask[z]]_xor_epi64
blend operations
Intrinsics for Blend Operations
_mm512_mask_blend_epi32
_mm512_mask_blend_epi64
_mm512_mask_blend_pd
_mm512_mask_blend_ps
conflict detection operations
Intrinsics for Test Operations
_mm512[_mask[z]]_conflict_epi32
_mm512[_mask[z]]_conflict_epi64
data types
extract operations
Intrinsics for FP Insert and Extract Operations
Intrinsics for Integer Insert and Extract Operations
_mm512[_mask[z]]_extractf32x4_ps
_mm512[_mask[z]]_extractf64x4_pd
_mm512[_mask[z]]_extracti32x4_epi32
Intrinsics for FP Insert and Extract Operations
Intrinsics for Integer Insert and Extract Operations
_mm512[_mask[z]]_extracti64x4_epi64
Intrinsics for FP Insert and Extract Operations
Intrinsics for Integer Insert and Extract Operations
FP broadcast operations
Intrinsics for FP Broadcast Operations
_mm512[_mask[z]]_broadcast_f32x4
_mm512[_mask[z]]_broadcast_f64x4
_mm512[_mask[z]]_broadcastsd_pd
_mm512[_mask[z]]_broadcastss_ps
FP conversion operations
Intrinsics for FP Conversion Operations
_mm_cvt_roundsd_i32
_mm_cvt_roundsd_i64
_mm_cvt_roundsd_u32
_mm_cvt_roundsd_u64
_mm_cvt_roundss_i32
_mm_cvt_roundss_i64
_mm_cvt_roundss_u32
_mm_cvt_roundss_u64
_mm_cvtt_roundsd_i32
_mm_cvtt_roundsd_i64
_mm_cvtt_roundsd_u32
_mm_cvtt_roundsd_u64
_mm_cvtt_roundss_i32
_mm_cvtt_roundss_i64
_mm_cvtt_roundss_u32
_mm_cvtt_roundss_u64
_mm512[_mask[z]]_cvt_roundpd_epi32
_mm512[_mask[z]]_cvt_roundpd_epu32
_mm512[_mask[z]]_cvt_roundpd_ps
_mm512[_mask[z]]_cvt_roundph_ps
_mm512[_mask[z]]_cvt_roundps_epi32
_mm512[_mask[z]]_cvt_roundps_epu32
_mm512[_mask[z]]_cvt_roundps_pd
_mm512[_mask[z]]_cvt_roundps_ph
_mm512[_mask[z]]_cvt_roundsd_ss
_mm512[_mask[z]]_cvt_roundss_sd
_mm512[_mask[z]]_cvtt_roundpd_epi32
_mm512[_mask[z]]_cvtt_roundpd_epu32
_mm512[_mask[z]]_cvtt_roundps_epi32
_mm512[_mask[z]]_cvtt_roundps_epu32
FP division operations
Intrinsics for FP Division Operations
_mm[_mask[z]]_div_round_sd
_mm[_mask[z]]_div_round_ss
_mm512[_mask[z]]_div_round_pd
FP expand and load operations
Intrinsics for FP Expand and Load Operations
_mm512_mask[z]_expandloadu_pd
_mm512_mask[z]_expandloadu_ps
_mm512[_mask[z]]_expand_pd
_mm512[_mask[z]]_expand_ps
FP Fused Multiply-Add (FMA) operations
Intrinsics for FP Fused Multiply-Add (FMA) Operations
_mm512_mask[3][z]_fmadd_round_sd
_mm512_mask[3][z]_fmadd_round_ss
_mm512_mask[3][z]_fmadd_sd
_mm512_mask[3][z]_fmadd_ss
_mm512_mask[3][z]_fnmadd_round_sd
_mm512_mask[3][z]_fnmadd_round_ss
_mm512_mask[3][z]_fnmadd_sd
_mm512_mask[3][z]_fnmadd_ss
_mm512[_mask[3][z]]_fmadd_pd
_mm512[_mask[3][z]]_fmadd_ps
_mm512[_mask[3][z]]_fmadd_round_pd
_mm512[_mask[3][z]]_fmadd_round_ps
_mm512[_mask[3][z]]_fmaddsub_pd
_mm512[_mask[3][z]]_fmaddsub_ps
_mm512[_mask[3][z]]_fmaddsub_round_pd
_mm512[_mask[3][z]]_fmaddsub_round_ps
_mm512[_mask[3][z]]_fmaddsub_round_sd
_mm512[_mask[3][z]]_fmaddsub_round_ss
_mm512[_mask[3][z]]_fmaddsub_sd
_mm512[_mask[3][z]]_fmaddsub_ss
_mm512[_mask[3][z]]_fmsub_pd
_mm512[_mask[3][z]]_fmsub_ps
_mm512[_mask[3][z]]_fmsub_round_pd
_mm512[_mask[3][z]]_fmsub_round_ps
_mm512[_mask[3][z]]_fmsub_round_sd
_mm512[_mask[3][z]]_fmsub_round_ss
_mm512[_mask[3][z]]_fmsub_sd
_mm512[_mask[3][z]]_fmsub_ss
_mm512[_mask[3][z]]_fnmadd_pd
_mm512[_mask[3][z]]_fnmadd_ps
_mm512[_mask[3][z]]_fnmadd_round_pd
_mm512[_mask[3][z]]_fnmadd_round_ps
_mm512[_mask[3][z]]_fnmaddsub_pd
_mm512[_mask[3][z]]_fnmaddsub_ps
_mm512[_mask[3][z]]_fnmaddsub_round_pd
_mm512[_mask[3][z]]_fnmaddsub_round_ps
_mm512[_mask[3][z]]_fnmaddsub_round_sd
_mm512[_mask[3][z]]_fnmaddsub_round_ss
_mm512[_mask[3][z]]_fnmaddsub_sd
_mm512[_mask[3][z]]_fnmaddsub_ss
_mm512[_mask[3][z]]_fnmsub_pd
_mm512[_mask[3][z]]_fnmsub_ps
_mm512[_mask[3][z]]_fnmsub_round_pd
_mm512[_mask[3][z]]_fnmsub_round_ps
_mm512[_mask[3][z]]_fnmsub_round_sd
_mm512[_mask[3][z]]_fnmsub_round_ss
_mm512[_mask[3][z]]_fnmsub_sd
_mm512[_mask[3][z]]_fnmsub_ss
FP gather and scatter operations
FP Loads and store operations
FP move operations
Intrinsics for FP Move Operations
_mm512_mask_mov_pd
_mm512_mask_mov_ps
_mm512_mask_move_sd
_mm512_mask_move_ss
_mm512_mask_movedup_pd
_mm512_mask_movehdup_ps
_mm512_mask_moveldup_ps
_mm512_maskz_mov_pd
_mm512_maskz_mov_ps
_mm512_maskz_move_sd
_mm512_maskz_move_ss
_mm512_maskz_movedup_pd
_mm512_maskz_movehdup_ps
_mm512_maskz_moveldup_ps
_mm512_movedup_pd
_mm512_movehdup_ps
_mm512_moveldup_ps
FP permute operations
Intrinsics for FP Permutation Operations
_mm512[_mask[2][z]]_permutex2var_pd
_mm512[_mask[2][z]]_permutex2var_ps
_mm512[_mask[z]]_permute_pd
_mm512[_mask[z]]_permute_ps
_mm512[_mask[z]]_permutevar_pd
_mm512[_mask[z]]_permutevar_ps
_mm512[_mask[z]]_permutex_pd
_mm512[_mask[z]]_permutexvar_pd
_mm512[_mask[z]]_permutexvar_ps
_mm512[_mask]_permute4f128_ps
FP reduction operations
Intrinsics for FP Reduction Operations
_mm512[_mask]_reduce_add_pd
_mm512[_mask]_reduce_add_ps
_mm512[_mask]_reduce_max_pd
_mm512[_mask]_reduce_max_ps
_mm512[_mask]_reduce_min_pd
_mm512[_mask]_reduce_min_ps
_mm512[_mask]_reduce_mul_pd
_mm512[_mask]_reduce_mul_ps
FP shuffle operations
Intrinsics for FP Shuffle Operations
_mm512[_mask[z]]_shuffle_f32x4
_mm512[_mask[z]]_shuffle_f64x2
_mm512[_mask[z]]_shuffle_pd
_mm512[_mask[z]]_shuffle_ps
FP unpack operations
Intrinsics for FP Pack and Unpack Operations
_mm512_mask_unpackhi_pd
_mm512_mask_unpackhi_ps
_mm512_mask_unpacklo_pd
_mm512_mask_unpacklo_ps
_mm512_maskz_unpackhi_pd
_mm512_maskz_unpackhi_ps
_mm512_maskz_unpacklo_pd
_mm512_maskz_unpacklo_ps
_mm512_unpackhi_pd
_mm512_unpackhi_ps
_mm512_unpacklo_pd
_mm512_unpacklo_ps
insert operations
Intrinsics for FP Insert and Extract Operations
Intrinsics for Integer Insert and Extract Operations
_mm_extract_ps
_mm_insert_ps
_mm256_insertf128_pd
_mm256_insertf128_ps
_mm256_insertf128_si256
Intrinsics for FP Insert and Extract Operations
Intrinsics for Integer Insert and Extract Operations
_mm512[_mask[z]]_insertf32x4
_mm512[_mask[z]]_insertf64x4
Intrinsics for FP Insert and Extract Operations
Intrinsics for Integer Insert and Extract Operations
_mm512[_mask[z]]_inserti64x4
integer bit shift operations
Intrinsics for Integer Bit Shift Operations
_mm512[_mask[z]]_sll_epi32
_mm512[_mask[z]]_sll_epi64
_mm512[_mask[z]]_slli_epi64
_mm512[_mask[z]]_sllv_epi64
_mm512[_mask[z]]_sra_epi32
_mm512[_mask[z]]_sra_epi64
_mm512[_mask[z]]_srai_epi64
_mm512[_mask[z]]_srav_epi64
_mm512[_mask[z]]_srl_epi32
_mm512[_mask[z]]_srl_epi64
_mm512[_mask[z]]_srli_epi32
_mm512[_mask[z]]_srli_epi64
_mm512[_mask[z]]_srlv_epi32
_mm512[_mask[z]]_srlv_epi64
integer broadcast operations
Intrinsics for Integer Broadcast Operations
_mm512_broadcastmb_epi64
_mm512_broadcastmw_epi32
_mm512[_mask[z]]_broadcast_i32x4
_mm512[_mask[z]]_broadcast_i64x4
_mm512[_mask[z]]_broadcastd_epi32
_mm512[_mask[z]]_broadcastq_epi64
integer compression operations
integer conversion operations
Intrinsics for Integer Conversion Operations
_mm_cvt_roundi32_ss
_mm_cvt_roundi64_sd
_mm_cvt_roundi64_ss
_mm_cvt_roundu32_ss
_mm_cvt_roundu64_sd
_mm_cvt_roundu64_ss
_mm_cvtu32_sd
_mm512_cvtsi512_si32
_mm512[_mask[z]]_cvt_roundepi32_ps
_mm512[_mask[z]]_cvt_roundepu32_ps
_mm512[_mask[z]]_cvtepi16_epi32
_mm512[_mask[z]]_cvtepi16_epi64
_mm512[_mask[z]]_cvtepi32_epi16
_mm512[_mask[z]]_cvtepi32_epi64
_mm512[_mask[z]]_cvtepi32_epi8
_mm512[_mask[z]]_cvtepi32_pd
_mm512[_mask[z]]_cvtepi64_epi16
_mm512[_mask[z]]_cvtepi64_epi32
_mm512[_mask[z]]_cvtepi64_epi8
_mm512[_mask[z]]_cvtepi8_epi32
_mm512[_mask[z]]_cvtepi8_epi64
_mm512[_mask[z]]_cvtepu16_epi32
_mm512[_mask[z]]_cvtepu32_epi64
_mm512[_mask[z]]_cvtepu32_pd
_mm512[_mask[z]]_cvtepu8_epi64
_mm512[_mask[z]]_cvtsepi32_epi16
_mm512[_mask[z]]_cvtsepi32_epi8
_mm512[_mask[z]]_cvtsepi64_epi16
_mm512[_mask[z]]_cvtsepi64_epi32
_mm512[_mask[z]]_cvtsepi64_epi8
_mm512[_mask[z]]_cvtusepi32_epi16
_mm512[_mask[z]]_cvtusepi32_epi8
_mm512[_mask[z]]_cvtusepi64_epi16
_mm512[_mask[z]]_cvtusepi64_epi32
_mm512[_mask[z]]_cvtusepi64_epi8
integer gather and scatter operations
integer move operations
Intrinsics for Integer Move Operations
_mm512_mask[z]_mov_epi32
_mm512_mask[z]_mov_epi64
integer permute operations
Intrinsics for Integer Permutation Operations
_mm512[_mask[2][z]]_permutex2var_epi32
_mm512[_mask[2][z]]_permutex2var_epi64
_mm512[_mask[z]]_permutex_epi64
_mm512[_mask[z]]_permutexvar_epi32
integer reduction operations
Intrinsics for Integer Reduction Operations
_mm512[_mask]_reduce_add_epi64
_mm512[_mask]_reduce_and_epi64
_mm512[_mask]_reduce_max_epi64
_mm512[_mask]_reduce_max_epu64
_mm512[_mask]_reduce_min_epi64
_mm512[_mask]_reduce_min_epu64
_mm512[_mask]_reduce_mul_epi64
_mm512[_mask]_reduce_or_epi64
integer shuffle operations
Intrinsics for Integer Shuffle Operations
_mm512[_mask[z]]_shuffle_epi32
_mm512[_mask[z]]_shuffle_f32x4
_mm512[_mask[z]]_shuffle_f64x2
_mm512[_mask[z]]_shuffle_i32x4
_mm512[_mask[z]]_shuffle_i64x2
_mm512[_mask[z]]_shuffle_pd
_mm512[_mask[z]]_shuffle_ps
load and store operations
mathematics operations
minimum and maximum FP operations
Intrinsics for Determining Minimum and Maximum FP Values
_mm[_mask[z]]_max_round_sd
_mm[_mask[z]]_max_round_ss
_mm[_mask[z]]_min_round_sd
_mm[_mask[z]]_min_round_ss
_mm512[_mask[z]]_max_round_pd
_mm512[_mask[z]]_max_round_ps
_mm512[_mask[z]]_min_round_pd
_mm512[_mask[z]]_min_round_ps
minimum and maximum integer operations
Intrinsics for Determining Minimum and Maximum Integer Values
_mm512[_mask[z]]_max_epi32
_mm512[_mask[z]]_max_epi64
_mm512[_mask[z]]_max_epu32
_mm512[_mask[z]]_max_epu64
_mm512[_mask[z]]_min_epi32
_mm512[_mask[z]]_min_epi64
_mm512[_mask[z]]_min_epu32
_mm512[_mask[z]]_min_epu64
miscellaneous FP operations
miscellaneous integer operations
Intrinsics for Miscellaneous Integer Operations
_mm512[_mask[z]]_alignr_epi32
_mm512[_mask[z]]_alignr_epi64
overview
registers
scale operations
Intrinsics for Scale Operations
_mm512_mask_scalef_round_pd
_mm512_mask_scalef_round_ps
_mm512_mask_scalef_round_sd
_mm512_mask_scalef_round_ss
_mm512_maskz_scalef_round_pd
_mm512_maskz_scalef_round_ps
_mm512_maskz_scalef_round_sd
_mm512_maskz_scalef_round_ss
_mm512_scalef_round_pd
_mm512_scalef_round_ps
_mm512_scalef_round_sd
_mm512_scalef_round_ss
set operations
Intrinsics for Set Operations
_mm512_undefined
_mm512_undefined_epi32
_mm512_undefined_pd
_mm512_undefined_ps
SVML
Intrinsics for Short Vector Math Library (SVML) Operations
Intrinsics for Error Function Operations
Intrinsics for Logarithmic Operations
Intrinsics for Root Function Operations
division operations
Intrinsics for Short Vector Math Library (SVML) Operations
_mm[_mask[z]]_div_round_sd
_mm[_mask[z]]_div_round_ss
_mm512[_mask[z]]_div_round_pd
error function operations
Intrinsics for Short Vector Math Library (SVML) Operations
Intrinsics for Error Function Operations
exponential operations
logarithmic operations
logarithmic operations
Intrinsics for Logarithmic Operations
_mm512[_mask]_log_pd
_mm512[_mask]_log_ps
_mm512[_mask]_log10_pd
_mm512[_mask]_log10_ps
_mm512[_mask]_log1p_pd
_mm512[_mask]_log1p_ps
_mm512[_mask]_log2_pd
_mm512[_mask]_log2_ps
_mm512[_mask]_logb_pd
_mm512[_mask]_logb_ps
reciprocal operations
remainder operations
root and cube root operations
root operations
Intrinsics for Root Function Operations
_mm512[_mask]_cbrt_pd
_mm512[_mask]_cbrt_ps
_mm512[_mask]_hypot_pd
_mm512[_mask]_hypot_ps
_mm512[_mask]_invsqrt_pd
_mm512[_mask]_invsqrt_ps
_mm512[_mask]_sqrt_pd
_mm512[_mask]_sqrt_ps
rounding operations
Intrinsics for Short Vector Math Library (SVML) Operations
_mm512[_mask]__pd
_mm512[_mask]__ps
trigonometric operations
Intrinsics for Short Vector Math Library (SVML) Operations
_mm512[_mask]_acos_pd
_mm512[_mask]_acos_ps
_mm512[_mask]_acosh_pd
_mm512[_mask]_acosh_ps
_mm512[_mask]_asin_pd
_mm512[_mask]_asin_ps
_mm512[_mask]_asinh_pd
_mm512[_mask]_asinh_ps
_mm512[_mask]_atan_pd
_mm512[_mask]_atan_ps
_mm512[_mask]_atan2_pd
_mm512[_mask]_atan2_ps
_mm512[_mask]_atanh_pd
_mm512[_mask]_atanh_ps
_mm512[_mask]_cos_pd
_mm512[_mask]_cos_ps
_mm512[_mask]_cosd_pd
_mm512[_mask]_cosd_ps
_mm512[_mask]_cosh_pd
_mm512[_mask]_cosh_ps
_mm512[_mask]_sin_pd
_mm512[_mask]_sin_ps
_mm512[_mask]_sind_pd
_mm512[_mask]_sind_ps
_mm512[_mask]_sinh_pd
_mm512[_mask]_sinh_ps
_mm512[_mask]_tan_pd
_mm512[_mask]_tan_ps
_mm512[_mask]_tand_pd
_mm512[_mask]_tand_ps
_mm512[_mask]_tanh_pd
_mm512[_mask]_tanh_ps
SVML operations
Intrinsics for Exponential Operations
Intrinsics for Reciprocal Operations
exponential operations
Intrinsics for Exponential Operations
_mm512[_mask[z]]_exp2a23_pd
_mm512[_mask[z]]_mm512_exp2a23_ps
_mm512[_mask[z]]_mm512_exp2a23_round_pd
_mm512[_mask[z]]_mm512_exp2a23_round_ps
_mm512[_mask]_mm512_exp_pd
_mm512[_mask]_mm512_exp_ps
_mm512[_mask]_mm512_exp10_pd
_mm512[_mask]_mm512_exp10_ps
_mm512[_mask]_mm512_exp2_pd
_mm512[_mask]_mm512_exp2_ps
_mm512[_mask]_mm512_expm1_pd
_mm512[_mask]_mm512_expm1_ps
_mm512[_mask]_mm512_pow_ps
_mm512[_mask]_pow_pd
reciprocal operations
test operations
Intrinsics for Test Operations
_mm512_mask_test_epi64_mask
_mm512_mask_testn_epi32_mask
_mm512_mask_testn_epi64_mask
_mm512_test_epi64_mask
_mm512_testn_epi32_mask
_mm512_testn_epi64_mask
typecast operations
Intrinsics for Typecast Operations
_mm512_castpd_ps
_mm512_castpd_si512
_mm512_castpd128_pd512
_mm512_castpd256_pd512
_mm512_castpd512_pd128
_mm512_castpd512_pd256
_mm512_castpd512_ps128
_mm512_castps_pd
_mm512_castps_si512
_mm512_castps128_ps512
_mm512_castps256_ps512
_mm512_castps512_ps128
_mm512_castps512_ps256
_mm512_castsi128_si512
_mm512_castsi256_si512
_mm512_castsi512_pd
_mm512_castsi512_ps
_mm512_castsi512_si128
_mm512_castsi512_si256
unpack operations
Intrinsics for Integer Pack and Unpack Operations
_mm512_[mask[z]_]unpackhi_epi32
_mm512_[mask[z]_]unpackhi_epi64
_mm512_[mask[z]_]unpackhi_pd
_mm512_[mask[z]_]unpackhi_ps
_mm512_[mask[z]_]unpacklo_epi32
_mm512_[mask[z]_]unpacklo_epi64
_mm512_[mask[z]_]unpacklo_pd
_mm512_[mask[z]_]unpacklo_ps
vector mask operations
Intrinsics for Vector Mask Operations
_mm512_kand
_mm512_kandn
_mm512_kmov
_mm512_knot
_mm512_kor
_mm512_kortestc
_mm512_kortestz
_mm512_kunpackb
_mm512_kxnor
_mm512_kxor
Intrinsics for Intel® Advanced Vector Extensions 512 (Intel® AVX-512)
Intrinsics for Intel® Advanced Vector Extensions 512 (Intel® AVX-512) 4FMAPS Instructions
Intrinsics for Intel® Advanced Vector Extensions 512 (Intel® AVX-512) 4VNNIW Instructions
Intrinsics for Integer Expand and Load Operations
Intrinsics for Intel® Advanced Vector Extensions 512 (Intel® AVX-512) VPOPCNTDQ Instructions
4FMAPS Instructions
4VNNIW Instructions
integer expand and load operations
Intrinsics for Integer Expand and Load Operations
_mm512_mask[z]_expand_epi32
_mm512_mask[z]_expand_epi64
_mm512_mask[z]_expandloadu_epi32
_mm512_mask[z]_expandloadu_epi64
VPOPCNTDQ Instructions
Intrinsics for Intel® Advanced Vector Extensions 512 (Intel® AVX-512) BW, DQ, and VL instructions
Intrinsics for Bit Manipulation Operations
Intrinsics for Comparison Operations
Intrinsics for Conversion Operations
Intrinsics for Load Operations
Intrinsics for Logical Operations
Intrinsics for Miscellaneous Operations
Intrinsics for Move Operations
Intrinsics for Set Operations
Intrinsics for Shift Operations
Intrinsics for Store Operations
bit manipulation operations
comparison operations
conversion operations
load operations
logical operations
miscellaneous operations
Move operations
set operations
shift operations
store operations
invoking Intel® C++ Compiler
IPO
ipo-jobs, Qipo-jobs
option specifying jobs during the link phase of
IR
ivdep
IVDEP
High-Level Optimization (HLO)
effect when tuning applications
jmpbuf
jump
jump tables
qopt-jump-tables, Qopt-jump-tables
option enabling generation of
keyword
_Cilk_offload
_Cilk_shared
_Cilk_offload
_Cilk_offload_to
_Cilk_shared
KMP_AFFINITY
Thread Affinity Interface (Linux* and Windows*)
modifier
offset
permute
type
KMP_LIBRARY
KMP_TOPOLOGY_METHOD
KMP_TOPOLOGY_METHOD environment variable
language extensions
GCC* Compatibility and Interoperability
g++*
gcc*
LD_LIBRARY_PATH
LIB environment variable
libgcc library
shared-libgcc
static-libgcc
option linking dynamically
option linking statically
libistrconv Library
Function List
Intel's Numeric String Conversion Library
Overview: Intel's Numeric String Conversion Library
Intel's Numeric String Conversion functions
Numeric String Conversion
Numeric String Conversion Functions
libm
libqkmalloc Library
libraries
creating
OpenMP* run-time routines
OpenMP* Run-time Library Routines
Intel® Compiler Extension Routines to OpenMP*
option enabling dynamic linking of
option enabling static linking of
option letting you link to Intel® DAAL
option preventing linking with shared
option preventing use of standard
option printing location of system
redistributing
specifying
libraries
Creating Libraries
Managing Libraries
Using Intel Shared Libraries
-c compiler option
-fPIC compiler option
-shared compiler option
creating your own
LD_LIBRARY_PATH
managing
shared
Using Intel Shared Libraries
Creating Libraries
static
libraries, using in offloaded code
library
L
l
option searching in specified directory for
option to search for
Library extensions
C++ Library Extensions
valarray implementation
library functions
Inline Expansion of Functions
Intel extension
OpenMP* run-time routines
library math functions
fmath-errno
option testing errno after calls to
libstdc++ library
static-libstdc++
option linking statically
linear_index
linker
Xlinker
link
option passing linker option to
option passing options to
linker options
Passing Options to the Linker
specifying
linking
Linking Debug Information
Compilation Phases
option preventing use of startup files and libraries when
option preventing use of startup files when
option suppressing
linking debug information
linking options
linking tools
Linking Tools and Options
xild
Interprocedural Optimization (IPO)
IPO-Related Performance Issues
Creating a Library from IPO Objects
xilibtool
xilink
Interprocedural Optimization (IPO)
IPO-Related Performance Issues
linking tools IR
linking with IPO
Linux* compiler options
Specifying Object Files
c
o
Linux* compiler options
Specifying Include Files
Specifying Alternate Tools and Paths
Specifying Assembly Files
I
Qlocation
Qoption
S
X
lock routines
log file from offloaded code
long double data type
mlong-double
option overriding default configuration of
longjmp
loop alignment
falign-loops, Qalign-loops
option enabling
loop blocking factor
qopt-block-factor, Qopt-block-factor
option specifying
loop unrolling
Programming Guidelines for Vectorization
using the HLO optimizer
loop_count
loops
Loop Constructs
constructs
dependencies
distribution
interchange
option performing run-time checks for
option specifying blocking factor for
option specifying maximum times to unroll
option using aggressive unrolling for
parallelization
Vectorization and Loops
Programming with Auto-parallelization
transformations
vectorization
Vectorization and Loops
Vectorizing a Loop Using the _Simd Keyword
macOS* compiler options
Specifying Alternate Tools and Paths
Qlocation
Qoption
macro names
D
option associating with an optional value
macros
Additional Predefined Macros
GCC* Compatibility and Interoperability
Equivalent Macros
ISO Standard Predefined Macros
main cover
main thread
Qpar-adjust-stack
option adjusting the stack size for
maintainability
Avoiding Mixed Data Type Arithmetic Expressions
allocation
makefiles
Modifying Your makefile
Modifying Your makefile
modifying
Modifying Your makefile
Modifying Your makefile
makefiles, using
managed and unmanaged code
manual processor dispatch
Math Library
Overview: Intel® Math Library
code examples
function list
Complex Functions
Exponential Functions
Hyperbolic Functions
Miscellaneous Functions
Nearest Integer Functions
Remainder Functions
Special Functions
Trigonometric Functions
using
Math library
Complex Functions
Exponential Functions
Hyperbolic Functions
Miscellaneous Functions
Nearest Integer Functions
Remainder Functions
Special Functions
Trigonometric Functions
Complex Functions
cabs library function
cacos library function
cacosh library function
carg library function
casin library function
casinh library function
catan library function
catanh library function
ccos library function
ccosh library function
cexp library function
cexp10 library function
cimag library function
cis library function
clog library function
clog2 library function
conj library function
cpow library function
cproj library function
creal library function
csin library function
csinh library function
csqrt library function
ctan library function
ctanh library function
Exponential Functions
cbrt library function
exp library function
exp10 library function
exp2 library function
expm1 library function
frexp library function
hypot library function
ilogb library function
ldexp library function
log library function
log10 library function
log1p library function
log2 library function
logb library function
pow library function
scalb library function
scalbn library function
sqrt library function
Hyperbolic Functions
acosh library function
asinh library function
atanh library function
cosh library function
sinh library function
sinhcosh library function
tanh library function
Miscellaneous Functions
copysign library function
fabs library function
fdim library function
finite library function
fma library function
fmax library function
fmin library function
Miscellaneous Functions
nextafter library function
Nearest Integer Functions
ceil library function
floor library function
llrint library function
llround library function
lrint library function
lround library function
modf library function
nearbyint library function
rint library function
round library function
trunc library function
Remainder Functions
fmod library function
remainder library function
remquo library function
Special Functions
annuity library function
compound library function
erf library function
erfc library function
gamma library function
gamma_r library function
j0 library function
j1 library function
jn library function
lgamma library function
lgamma_r library function
tgamma library function
y0 library function
y1 library function
yn library function
Trigonometric Functions
acos library function
acosd library function
asin library function
asind library function
atan library function
atan2 library function
atand library function
atand2 library function
cos library function
cosd library function
cot library function
cotd library function
sin library function
sincos library function
sincosd library function
sind library function
tan library function
tand library function
math library functions
fimf-domain-exclusion, Qimf-domain-exclusion
fimf-arch-consistency, Qimf-arch-consistency
fimf-precision, Qimf-precision
option indicating domain for input arguments
option producing consistent results
option specifying a level of accuracy for
matmul library call
qopt-matmul, Qopt-matmul
option replacing matrix multiplication loop nests with
matrix multiplication loop nests
qopt-matmul, Qopt-matmul
option identifying and replacing
measuring time and data transferred in offload regions
memory layout transformations
qopt-mem-layout-trans, Qopt-mem-layout-trans
option controlling level of
memory loads
global-hoist, Qglobal-hoist
option enabling optimizations to move
memory management
Managing Memory Allocation for Pointer Variables
alloc_if
free_if
pointer variables
memory model
mcmodel
option specifying large
option specifying small or medium
option to use specific
memory, allocating for parts of arrays
memory, shared
Message Fabric Interface (MPI) support
Microsoft Visual Studio*
Creating a New Project
enabling optimization reports
getting started with
Guided Auto Parallelism
Intel® Performance Libraries
optimization reports, enabling
property pages
target platform
Specifying a Target Platform
for projects
for solutions
using code coverage
using Profile Guided Optimization
Microsoft* Visual C++
Qvc
option specifying compatibility with
Microsoft* Visual Studio
Qvc
option specifying compatibility with
Microsoft* Visual Studio*
Microsoft Compatibility
compatibility
integration
min_val
mixing vectorizable types in a loop
mock object files
MOVBE instructions
minstruction, Qinstruction
option generating
MPI support
mpx
attribute
multiple coprocessors
Calling Functions on the CPU to Modify the Coprocessor's Execution Environment
declaring target number
declaring target type
multiple processes
multiple-processes, MP
option creating
multithreaded programs
multithreading
Programming with Auto-parallelization
OpenMP* Support Libraries
MXCSR register
native Intel® MIC Architecture applications
About Building Native Intel® MIC Architecture Applications
about building
new implementation of parallel loops
par-loops, Qpar-loops
option enabling
noblock_loop
nofusion
noinline
noparallel
noprefetch
normalized floating-point number
Not-a-Number (NaN)
nounroll
nounroll_and_jam
novector
number of hardware threads per core
qopt-threads-per-core, Qopt-threads-per-core
option specifying
object file
ipo-separate, Qipo-separate
bigobj
option generating one per source file
option increasing number of sections in
object files
Specifying Object Files
specifying
offload
offload compilation
Displaying Options Passed to Offload Compilation
watch
qoffload-option, Qoffload-option
list of options passed to
option listing all options passed to
option passing additional options to
passing additional options to
offload extract tool
offload regions, measuring time and data transferred in
offload_attribute
OFFLOAD_REPORT environment variable
offload_transfer
offload_wait
offloading code
omp simdoff
OMP_STACKSIZE environment variable
Open Source tools
openmp_version
OpenMP*
Linking Debug Information
C/C++ interoperability
Fortran and C/C++ interoperability
Intel® Xeon Phi™ coprocessor support
support overview
using
advanced issues
combined construct
compatibility libraries
OpenMP* Source Compatibility and Interoperability with Other Compilers
OpenMP* Support Libraries
compatibility with other compilers
composite construct
debugging
environment variables
extensions for Intel® Compiler
guidelines for using libraries
header files
KMP_AFFINITY
legacy libraries
OpenMP* Source Compatibility and Interoperability with Other Compilers
OpenMP* Support Libraries
library file names
load balancing
object-level interoperability
omp.h
parallel processing thread model
performance
run-time library routines
SIMD-enabled functions
source compatibility
support libraries
OpenMP*
OpenMP* Examples
examples of
OpenMP* affinity, specifying
OpenMP* API
qopenmp, Qopenmp
qopenmp-stubs, Qopenmp-stubs
qopenmp-threadprivate, Qopenmp-threadprivate
option enabling
option enabling programs in sequential mode
option specifying threadprivate
OpenMP* clauses summary
OpenMP* header files
OpenMP* Libraries
Using the OpenMP* Libraries
using
OpenMP* pragmas
OpenMP* Pragmas Summary
using
syntax
OpenMP* run-time library
qopenmp-link
qopenmp-lib, Qopenmp-lib
option controlling which is linked to
option specifying
OpenMP*, defaults
OpenMP*, loop constructs
OpenMP* Run-time Library Routines
numbers
opt report
Inlining Report
inlining report
optimization
Other Considerations
Other Considerations
option enabling prefetch insertion
option generating single assembly file from multiple files
option generating single object file from multiple files
option specifying code
optimization report
Options: Optimization Reports dialog box
Optimization Reports: Enabling in Microsoft Visual Studio*
qopt-report-help, Qopt-report-help
qopt-report-routine, Qopt-report-routine
qopt-report-filter, Qopt-report-filter
qopt-report-per-object, Qopt-report-per-object
qopt-report, Qopt-report
qopt-report-embed, Qopt-report-embed
qopt-report-names, Qopt-report-names
qopt-report-file, Qopt-report-file
qopt-report-phase, Qopt-report-phase
qopt-report-format, Qopt-report-format
Optimization Reports: Viewing
enabling in Visual Studio*
Options: Optimization Reports dialog box
Optimization Reports: Enabling in Microsoft Visual Studio*
option displaying phases for
option generating for routine names with specified substring
option generating from subset
option generating in separate file per object
option generating to stderr
option including loop annotations
option specifying level of detail for
option specifying mangled or unmangled names
option specifying name for
option specifying phase to use for
option specifying the format for
viewing in Visual Studio*
optimization_level
optimization_parameter
optimizations
High-Level Optimization (HLO)
Od
Ot
Os
guide-opts, Qguide-opts
Profile-Guided Optimization (PGO)
high-level language
option disabling all
option enabling all speed
option enabling many speed
option generating recommendations to improve
overview of
profile-guided
optimize
option mapping tool
options passed to offload compilation
watch
option listing all
Options: Optimization Reports dialog box
Options: Profile Guided Optimization dialog box
output files
o
option specifying name for
overflow
OpenMP* Run-time Library Routines
call to a runtime library routine
overview
padding
qopt-assume-safe-padding, Qopt-assume-safe-padding
option specifying assumptions for dynamically allocated memory
option specifying assumptions for variables
parallel
parallel pragma
Enabling Further Loop Parallelization for Multicore Platforms
lastprivate clause
private clause
parallel processing
Parallel Processing Model
thread model
parallel project builds
Performing Parallel Project Builds
performing
parallel region
par-num-threads, Qpar-num-threads
option specifying number of threads to use in
parallel regions
parallelism
Using Intel® Performance Libraries with Microsoft Visual Studio*
Guided Auto Parallelism
Automatic Parallelization
C/C++ Extensions for Array Notations Overview
OpenMP* Run-time Library Routines
Changing the Selected Intel® Performance Libraries
parallelization
Guided Auto Parallelism
Automatic Parallelization
C/C++ Extensions for Array Notations Overview
Programming with Auto-parallelization
Using Guided Auto Parallelism
C/C++ Extensions for Array Notations
parentheses in expressions
fprotect-parens, Qprotect-parens
option determining interpretation of
pedigrees
__cilkrts_get_pedigree
Pedigrees
performance
performance issues with IPO
PGO
Profile-Guided Optimization (PGO)
dialog box
in Microsoft Visual Studio*
using
PGO API
Dumping Profile Information
Dumping and Resetting Profile Information
Resetting the Dynamic Profile Counters
Resetting Profile Information
Interval Profile Dumping
Getting Coverage Summary Information on Demand
PGO API Support
_PGOPTI_Prof_Dump_All
_PGOPTI_Prof_Dump_And_Reset
_PGOPTI_Prof_Reset
_PGOPTI_Prof_Reset_ALL
_PGOPTI_Set_Interval_Prof_Dump
:Getting coverage summary on demand
enable
PGO dialog box
PGO reports
PGO tools
PGO Tools Overview
code coverage tool
profmerge
proforder
test prioritization tool
pgopti.spi file
pgouser.h header file
platform toolset
pointer aliasing
qopt-multi-version-aggressive, Qopt-multi-version-aggressive
option using aggressive multi-versioning check for
pointer checker
Checking Arrays
Checking Bounds
Checking Custom Memory Allocators
Checking for Dangling Pointers
Checking Multi-Threaded Code
Checking Run-Time Library Functions
Pointer Checker Feature Summary
Finding and Reporting Out-of-Bounds Errors
How the Compiler Defines Bounds Information for Pointers
Pointer Checker Overview
Passing and Returning Bounds
Storing Bounds Information
Working with Enabled and Non-Enabled Modules
Writing a Wrapper
checking arrays
checking bounds
checking custom memory allocator
checking for dangling pointers
checking multi-threaded code
checking run-time library functions
feature summary
finding and reporting errors
how bounds are defined
overview
passing and returning bounds
storing bounds information
working with enabled and non-enabled modules
wrappers
Checking Run-Time Library Functions
Writing a Wrapper
wrapping run-time library functions
porting applications
Overview: Porting from the Microsoft* Compiler to the Intel® C++ Compiler
from the Microsoft* C++ Compiler
to the Intel® C++ Compiler
porting applications
Overview: Porting from gcc* to the Intel® C++ Compiler
from gcc* to the Intel® C++ Compiler
position-independent code
fpic
fpie
option generating
fpic
fpie
position-independent external references
mdynamic-no-pic
option generating code with
power consumption
diag-enable=power, Qdiag-enable:power
option enabling diagnostics for
pragma alloc_section
alloc_section
var
pragma block_loop
block_loop/noblock_loop
factor
level
pragma cilk grainsize
pragma code_align
pragma distribute_point
pragma forceinline
inline, noinline, forceinline
recursive
pragma inline
inline, noinline, forceinline
recursive
pragma intel_omp_task
pragma intel_omp_taskq
pragma ivdep
pragma loop_count
loop_count
avg
max
min
n
pragma noblock_loop
pragma nofusion
pragma noinline
pragma noparallel
pragma noprefetch
prefetch/noprefetch
var
pragma nounroll
pragma nounroll_and_jam
pragma novector
pragma offload
offload
offload-parameter
target
pragma offload_attribute
offload_attribute
none
pop
push
target
pragma offload_transfer
offload_transfer
offload-parameter
target
pragma offload_wait
offload_wait
target-name
wait
pragma omp simdoff
pragma optimization_level
optimization_level
GCC
intel
n
pragma optimization_parameter
optimization_parameter
target_arch
pragma optimize
optimize
off
on
pragma parallel
parallel/noparallel
always
firstprivate
lastprivate
num_threads
private
pragma prefetch
prefetch/noprefetch
distance
hint
var
pragma simd
User-Mandated or SIMD Vectorization
assert
firstprivate
lastprivate
linear
noassert
novecremainder
private
reduction
vecremainder
vectorlength
vectorlengthfor
pragma unroll
pragma unroll_and_jam
pragma unused
pragma vector
vector
aligned
always
mask_readwrite
nomask_readwrite
nontemporal
novecremainder
temporal
unaligned
vecremainder
pragmas
help-pragma, Qhelp-pragma
option displaying
Pragmas
Intel-supported Pragma Reference
Pragmas
gcc* compatible
HP* compatible
Intel-supported
Microsoft* compatible
overview
Pragmas: Intel-specific
preallocated modifier
precompiled header files
predefined macros
Additional Predefined Macros
GCC* Compatibility and Interoperability
ISO Standard Predefined Macros
preempting functions
prefetch
prefetch distance
qopt-prefetch-distance, Qopt-prefetch-distance
option specifying for prefetches inside loops
prefetch insertion
qopt-prefetch, Qopt-prefetch
option enabling
prefetchW instruction
qopt-prefetch-issue-excl-hint, Qopt-prefetch-issue-excl-hint
option supporting
preprocessor variables
Writing Target-Specific Code Using a Pragma
Writing Code that Should Not Be Built for CPU-Only Execution
prioritizing application tests
private variables
privatization of static data for the MPC unified parallel runtime
fmpc-privatize
option enabling
processor
mtune, tune
option optimizing for specific
processor features
x, Qx
option telling which to target
prof-gen-sampling compiler option
prof-use-sampling compiler option
profile data records
prof-src-dir, Qprof-src-dir
prof-src-root-cwd, Qprof-src-root-cwd
prof-src-root, Qprof-src-root
option affecting search for
option letting you use relative paths when searching for
prof-src-root-cwd, Qprof-src-root-cwd
prof-src-root, Qprof-src-root
Profile Guided Optimization
Profile Guided Optimization dialog box
Using Profile Guided Optimization in Microsoft Visual Studio*
dialog box
in Microsoft Visual Studio*
using
Profile Guided Optimization dialog box
profile information
Profmerge and Proforder Tools
.dyn
profile information
Profmerge and Proforder Tools
Dumping Profile Information
Dumping and Resetting Profile Information
.dyn file
Profmerge and Proforder Tools
Dumping Profile Information
Dumping and Resetting Profile Information
profile information arrays
Code Coverage Tool
.dyn file
profile-guided optimization
Profile-Guided Optimization (PGO)
API support
data ordering optimization
dumping profile information
example of
function grouping optimization
function order lists optimization
function ordering optimization
funtion/loop execution time
interval profile dumping
overview
phases
reports
resetting dynamic profile counters
Resetting the Dynamic Profile Counters
Resetting Profile Information
resetting profile information
support
usage model
profile-optimized code
PGO API Support
dumping
Dumping Profile Information
Interval Profile Dumping
generating information
resetting dynamic counters for
Resetting the Dynamic Profile Counters
Resetting Profile Information
profiling
prof-use, Qprof-use
prof-gen, Qprof-gen
prof-dir, Qprof-dir
prof-file, Qprof-file
option enabling use of information from
option instrumenting a program for
option specifying directory for output files
option specifying name for summary
profiling an application
Profile an Application with Instrumentation
.dyn
profiling feedback compilation
Profile an Application with Instrumentation
source
profiling information
prof-func-order, Qprof-func-order
prof-data-order, Qprof-data-order
option enabling function ordering
option using to order static data items
profmerge
profmerge tool
Test Prioritization Tool
Code Coverage Tool
.dpi file
Test Prioritization Tool
Code Coverage Tool
.dyn file
program loops
Guided Auto Parallelism
Automatic Parallelization
Using Guided Auto Parallelism
parallel processing model
programs
fast
falias, Oa
option maximizing speed in
option specifying aliasing should be assumed in
projects
Creating a New Project
adding files
creating
in Microsoft Visual Studio*
property pages in Microsoft Visual Studio*
redistributable package
redistributing libraries
references to global function symbols
Bsymbolic-functions
option binding to shared library definitions
references to global symbols
Bsymbolic
option binding to shared library definitions
register allocator
qopt-ra-region-strategy, Qopt-ra-region-strategy
option selecting method for partitioning
regparm parameter passing convention
mregparm-version
option specifying ABI for
relative error
fimf-accuracy-bits, Qimf-accuracy-bits
fimf-max-error, Qimf-max-error
option defining for math library function results
option defining maximum for math library function results
release configuration
remarks
Werror-all
option changing to errors
removed compiler options
report generation
Resetting the Dynamic Profile Counters
Resetting Profile Information
Intel® Compiler Extension Routines to OpenMP*
OpenMP* Run-time Library Routines
Dumping and Resetting Profile Information
Requesting Compiler Reports with the xi* Tools
dynamic profile counters
Resetting the Dynamic Profile Counters
Resetting Profile Information
Intel® Compiler extensions
OpenMP* run-time routines
profile information
timing
using xi* tools
response files
restricted transactional memory
Intrinsics for Restricted Transactional Memory Operations
Intrinsics for Intel® Transactional Synchronization Extensions (Intel® TSX)
restrictions on offloaded code
Restrictions on Offloaded Code Using a Pragma
Restrictions on Offloaded Code Using Shared Virtual Memory
restrictions on offloaded code, Windows*
routine entry
falign-stack
option specifying the stack alignment to use on
routines
qopt-args-in-regs, Qopt-args-in-regs
option passing parameters in registers
RTM
Function Prototype and Macro Definitions
function prototypes
macro definitions
run-time environment variables
run-time performance
Overview: Tuning Performance
improving
runtime dispatch
fimf-force-dynamic-target, Qimf-force-dynamic-target
option using in calls to math functions
scalar replacement
scalar-rep, Qscalar-rep
qopt-multi-version-aggressive, Qopt-multi-version-aggressive
option enabling during loop transformation
option using aggressive multi-versioning check for
SDLT
Example 4
Example 2
Example 3
accessors
Accessors
Bounds
example programs
Example 1
Examples
Example 5
indexes
number representation
proxy objects
SDLT_DEBUG
SDLT_INLINE
SDLT Layouts
Layouts
sdlt layout namespace
set stack size on coprocessors
setjmp
setting compiler options
setting options
Setting Options for a Project or File
in Eclipse*
shared functions
shared libraries
Using Shared Libraries on macOS*
Creating Libraries
shared memory
shared memory, about using
shared object
shared
option producing a dynamic
shared scalars
shared variables
short vector math library
fimf-use-svml, Qimf-use-svml
option specifying for math library functions
Short Vector Math Library (SVML) Intrinsics
Overview: Intrinsics for Short Vector Math Library (SVML) Functions
overview
Short Vector Random Number Generator Library
signed infinity
signed zero
simd
vectorization
simd
function annotations
simd pragmas
simd, Qsimd
option disabling compiler interpretation of
SMP systems
soa1d_container
soa1d_container::accessor
n_bounds_t
Accessor Concept
n_bounds_generator
soa1d_container::accessor and aos1d_container::accessor
bounds_t
bounds_d Template Function
sdlt::bounds Template Function
specifying file names
Specifying Object Files
for object files
specifying file names
Specifying Assembly Files
for assembly files
stack
F (Windows*)
option specifying reserve amount
stack alignment
Qsfalign
option specifying for functions
stack checking routine
Gs
option controlling threshold for call of
stack size, setting
stack variables
ftrapuv, Qtrapuv
option initializing to NaN
standard directories
X
option removing from include search path
standards conformance
static libraries
Creating Libraries
option invoking tool to generate
streaming stores
qopt-streaming-stores, Qopt-streaming-stores
option generating for optimization
streams in offloading
subnormal numbers
subroutines in the OpenMP* run-time library
OpenMP* Run-time Library Routines
for OpenMP*
parallel run-time
Supplemental Streaming SIMD Extensions 3
Absolute Value Intrinsics
Addition Intrinsics
Concatenate Intrinsics
Multiplication Intrinsics
Negation Intrinsics
Overview: Supplemental Streaming SIMD Extensions 3 (SSSE3)
Shuffle Intrinsics
Subtraction Intrinsics
absolute value intrinsics
addition intrinsics
concatenate intrinsics
multiplication intrinsics
negation intrinsics
Negation Intrinsics
_mm_sign_epi16
_mm_sign_epi32
_mm_sign_epi8
_mm_sign_pi16
_mm_sign_pi32
_mm_sign_pi8
overview
shuffle intrinsics
_mm_shuffle_epi8
_mm_shuffle_pi8
subtraction intrinsics
Subtraction Intrinsics
_mm_hsub_epi16
_mm_hsub_epi32
_mm_hsub_pi16
_mm_hsub_pi32
_mm_hsubs_epi16
_mm_hsubs_pi16
supported tools
SVML
symbol names
Quse-msasm-symbols
option using dollar sign when producing
symbol visibility
fvisibility
option specifying
synchronization
Automatic Parallelization
parallel processing model for
thread sleep time
synchronization between cpu and target
sysroot target directory
print-sysroot
option returning
target
Applying the target Attribute to Multiple Declarations
target
Managing Memory Allocation for Pointer Variables
applying to multiple declarations
attribute
memory management
target-specific code, writing
Writing Target-Specific Code Using a Pragma
Offload Using a Pragma
Writing Target-Specific Code with _Cilk_offload
targeting processors manually
Processor Targeting
cpu_dispatch
target (GCC attribute)
targetptr modifier
test prioritization tool
examples
options
requirements
thread affinity
Thread Affinity Interface (Linux* and Windows*)
option specifying
thread pooling
thread-private variables
threads
Using Intel® Performance Libraries with Microsoft Visual Studio*
Changing the Selected Intel® Performance Libraries
threshold control for auto-parallelization
OpenMP* Run-time Library Routines
Programming Guidelines for Vectorization
OpenMP* routines for
reordering
timing offload regions
to Microsoft Visual Studio* projects
tool options
PGO Tools Overview
code coverage tool
profmerge
proforder
test prioritization
tools
PGO Tools Overview
option passing options to
option specifying directory for supporting
topology maps
traceback information
traceback
option providing
transcendental functions
fast-transcendentals, Qfast-transcendentals
option replacing calls to
transferring data asynchronously
About Asynchronous Computation (Intel® MIC Architecture)
About Asynchronous Data Transfer (Intel® MIC Architecture)
tselect tool
Qcov-gen
Qcov-dir
Qcov-file
option producing an instrumented file for
option specifying a directory for profiling output for
option specifying a file name for summary files for
universal binaries
unroll
unroll/nounroll
n
unroll_and_jam
unroll_and_jam/nounroll_and_jam
n
unused
unwind information
fasynchronous-unwind-tables
option determining where precision occurs
use PGO
user functions
Developer Directed Inline Expansion of User Functions
Compiler Directed Inline Expansion of Functions
Inlining Report
auto-parallelization
dynamic libraries
OpenMP*
profile-guided optimization
Profile an Application with Instrumentation
Profile Function or Loop Execution Time
using
Using Configuration Files
Using Response Files
using Intel® Performance Libraries
Using Intel® Performance Libraries with Xcode*
in Eclipse*
Using OpenMP*
using property pages in Microsoft Visual Studio*
utilities
PGO Tools Overview
profmerge
proforder
valarray implementation
Using Intel's valarray Implementation
compiling code
using in code
value-profiling
Resetting the Dynamic Profile Counters
Resetting Profile Information
variable length arrays
Qvla
option enabling
variables
fzero-initialized-in-bss, Qzero-initialized-in-bss
no-bss-init, Qnobss-init
fkeep-static-consts, Qkeep-static-consts
Placing Variables and Functions on the Coprocessor
Special Cases
option placing explicitly zero-initialized in DATA section
fzero-initialized-in-bss, Qzero-initialized-in-bss
no-bss-init, Qnobss-init
option placing uninitialized in DATA section
option saving always
placing on the coprocessor
thread-private
variables, moving data from one to another
variables, shared
vector
vector
attribute
vector copy
Programming Guidelines for Vectorization
non-vectorizable copy
programming guidelines
vector function application binary interface
vecabi, Qvecabi
option specifying compatibility for
vector_variant
attribute
vectorization
Using Automatic Vectorization
vec, Qvec
vec-threshold, Qvec-threshold
compiler options
compiler pragmas
keywords
obstacles
option disabling
option setting threshold for loops
speed-up
what is
Vectorization
Programming Guidelines for Vectorization
Function Annotations and the SIMD Directive for Vectorization
simd
User-Mandated or SIMD Vectorization
auto-parallelization
Programming Guidelines for Vectorization
reordering threshold control
general compiler directives
Intel® Streaming SIMD Extensions
language support
loop unrolling
pragma
pragma simd
SIMD
user-mandated
vector copy
Programming Guidelines for Vectorization
non-vectorizable copy
programming guidelines
vectorizing
Profile-Guided Optimization (PGO)
Loop Constructs
loops
Profile-Guided Optimization (PGO)
Loop Constructs
version
sox
option saving in executable or object file
visibility declaration attribute
Visual Studio*
Selecting a Configuration
Performing Parallel Project Builds
Building Your Intel® C++ Project
Specifying Directory Paths
Optimization Reports: Viewing
Selecting the Compiler Version
Converting Projects to Use a Selected Compiler from the Command Line
Code Coverage dialog box
Code Coverage Settings dialog box
Options: Compilers dialog box
Options: Converter dialog box
Options: Guided Auto Parallelism dialog box
Options: Intel® Performance Libraries dialog box
Options: Code Coverage dialog box
Use Intel® C++ dialog box
Options: Optimization Reports dialog box
Changing the Selected Intel® Performance Libraries
Including MPI Support
Switching Back to the Visual C++* Compiler
build configuration
build options
building multiple projects
building parallel projects
building with Intel® C++
changing directory paths
Compiler Inline Report window
Compiler Optimization Report window
compiler selection
converting projects
debug configuration
dialog boxes
Code Coverage dialog box
Code Coverage Settings dialog box
Options: Compilers dialog box
Options: Converter dialog box
Options: Guided Auto Parallelism dialog box
Options: Intel® Performance Libraries dialog box
Options: Code Coverage dialog box
Use Intel® C++ dialog box
Code Coverage dialog box
Code Coverage Settings
Compilers
Converter
GAP
Intel® Performance Libraries
Options: Code Coverage
Options: Guided Auto Parallelism
Use Intel C++
enabling optimization reports
Intel® Performance Libraries
MPI support
optimization reports, enabling
optimization reports, viewing
Options: Optimization Reports dialog box
release configuration
selecting the compiler
selecting the Visual C++* compiler
viewing optimization reports
warnings
GCC-Compatible Warning Options
Werror, WX
Werror-all
gcc-compatible
option changing to errors
Werror, WX
Werror-all
warnings and errors
whole program analysis
Windows* compiler options
Specifying Object Files
Specifying Include Files
Specifying Alternate Tools and Paths
Fo
I
Qlocation
Qoption
X
Windows* compiler options
Specifying Assembly Files
Fa
worker thread
worksharing
OpenMP* Pragmas Summary
Automatic Parallelization
Xcode*
Using Xcode* (macOS*)
building the target
creating a project
Creating an Xcode* Project
in Xcode*
projects
Creating an Xcode* Project
Running the Executable
Selecting the Intel® Compiler
Setting Compiler Options
creating
running the executable
selecting Intel's Clang-based C++ Compiler
selecting the Intel® C++ Compiler
setting compiler options
using dynamic libraries
xiar
IPO-Related Performance Issues
Creating a Library from IPO Objects
xild
Interprocedural Optimization (IPO)
IPO-Related Performance Issues
Creating a Library from IPO Objects
xilib
xilibtool
xilink
Interprocedural Optimization (IPO)
IPO-Related Performance Issues
Creating a Library from IPO Objects
zmm registers usage
qopt-zmm-usage, Qopt-zmm-usage
option defining a level of