The Advantages of SIMD Programming - TickerTape Part 2

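Ticker Tape is a tech demo intended to encourage developers to perform more complex operations in particle systems. The demo uses a range of techniques to improve performance, including multithreading and optimization for Intel® Streaming SIMD Extensions (SSE).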
Author: admin, Last updated: 24/01/2018 - 12:12

Image Processing Acceleration Techniques Using Intel® Streaming SIMD Extensions and Intel® Advanced Vector Extensions

This article details optimized implementations of data transformations and algorithms, together with an analysis that compares performance. It provides speedup measurements for Intel® SSE optimized code and estimates for Intel® AVX optimized code.
Author: Larsson, Petter (Blackbelt), Last updated: 25/05/2018 - 15:30

Wiener Filtering Using Intel Advanced Vector Extensions

Wiener filtering (also known as Least Mean Square filtering) is a technique for removing unwanted noise from an image. This article discusses Wiener filtering and includes example code that has been optimized using Intel® AVX.
Last updated: 11/12/2018 - 13:12

Using AVX Without Writing AVX Code

1. Introduction and Tools
Author: admin, Last updated: 05/07/2019 - 20:45

Avoiding AVX-SSE Transition Penalties

Avoiding AVX-SSE Transition Penalties (PDF 678 KB)

Author: Patrick Konsor (Intel), Last updated: 05/07/2019 - 20:48

Intel® AVX Optimized Code in Intel® MKL

Starting with Intel MKL 10.3, AVX code is dispatched as one of the supported platforms in MKL and no longer requires the special activation that MKL 10.2 needed.
Author: Vipin Kumar E K (Intel), Last updated: 27/03/2019 - 12:20

Using Intel® AVX Without Writing AVX

Intel® AVX is a new 256-bit instruction set extension to Intel® Streaming SIMD Extensions and is designed for applications that are floating point intensive. This paper discusses options to integrate Intel® AVX into an application via use of intrinsics.
Author: richard-hubbard (Intel), Last updated: 07/06/2017 - 12:12


Photo-realistic rendering requires accurate simulation of light propagation according to the laws of physics. The best-known way to solve this problem is Monte Carlo ray tracing. We describe a state-of-the-art photo-realistic Monte Carlo rendering engine.
Author: Sven Woop (Intel), Last updated: 02/08/2019 - 17:30

Diagnostic 15532: loop was not vectorized: compile time constraints prevent loop optimization

Product version: Intel(R) Visual Fortran Compiler XE


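When compiling with the Visual Fortran compiler optimization options (-O2 -Qopt-report:2), the vectorization report states that compile time constraints prevented optimization.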

Author: Devorah H. (Intel), Last updated: 05/07/2019 - 14:23