Filtros

Article

利用英特尔® SIMD 流指令扩展和英特尔® 高级矢量扩展指令集的图像处理加速技术

This article details optimized implementations of data transformations and algorithms together with analysis comparing performance and providing speedup measurements for Intel® SSE optimized code and estimates for Intel® AVX optimized code.
Autor Larsson, Petter (Blackbelt) Última actualización 25/05/2018 - 15:30
Article

避免 AVX-SSE 转换造成的性能损失

避免 AVX-SSE 转换造成的性能损失 (PDF 678 KB)

Autor Patrick Konsor (Intel) Última actualización 05/07/2019 - 20:48
Article

英特尔® MKL 中的英特尔® AVX 优化代码

Starting from Intel MKL 10.3, AVX code will be dispatched as one of the platforms in MKL and does not require special activation as in MKL 10.2.
Autor Vipin Kumar E K (Intel) Última actualización 27/03/2019 - 12:20
Article

Embree:照片级光线追踪内核

Photo-realistic rendering requires accurate simulation of light propagation according to physics laws. The best known way to solve this problem is Monte Carlo ray tracing. We describe a state-of-the-art photo-realistic Monte Carlo rendering engine.
Autor Sven Woop (Intel) Última actualización 02/08/2019 - 17:30
Article

诊断信息 15532: 循环无法进行矢量化处理:编译时间不足妨碍了循环进行优化

产品版本: Intel(R) Visual Fortran 编译器 XE 15.0.0.070

原因:

使用 Visual Fortran 编译器的优化选项 ( -O2  -Qopt-report:2 )  时出现矢量化报告,表示编译时间不足妨碍了优化。

Autor Devorah H. (Intel) Última actualización 05/07/2019 - 14:23
Article

英特尔® 至强融核™ 协处理器(代号 “Knights Landing”)— 应用就绪

为了将来在英特尔® 至强™ 处理器和英特尔® 至强融核™ 协处理器(代号 Knights Landing)上实现部分应用就绪,开发人员主要希望从两个方面改进工作负载:

矢量化/代码生成 线程并行性

本文主要讨论矢量化/代码生成,并介绍了一些有用的线程并行工具和资源。

Autor Última actualización 06/07/2019 - 16:40
Article

Vectorization Advisor 助您一臂之力

Vectorization Advisor is like having a trusted friend look over your code and give you advice based on what he sees. As you’ll see in this article, user feedback on the tool has included, “there are significant speedups produced by following advisor output, I'm already sold on this tool!”
Autor Última actualización 06/07/2019 - 16:40
Article

整理您的数据和代码: 优化和内存 — 第 1 部分

This series of two articles discusses how data and memory layout affect performance and suggests specific steps to improve software performance. The basic steps shown in these two articles can yield significant performance gains. These two articles are designed at an intermediate level. It is assumed the reader desires to optimize software performance using common C, C++ and Fortran* programming...
Autor David M. Última actualización 12/12/2018 - 18:00
Article

准确预报各种天气:英特尔五步框架帮助实现代码现代化

天气预报是现代生活的一个重要方面,它可在出现恶劣天气状况时即时发出警报,从而帮助有效制定计划和安排物流,并可保护生命财产安全。 但是,准确预测长期的天气情况非常复杂,通常涉及到大量数据集,并且要求对代码进行优化以利用最高级的计算机硬件功能。

Autor Última actualización 21/03/2019 - 12:00
Article

了解面向三维同性有限差分 (3DFD) 波动方程代码的 NUMA

本文将介绍一些技巧,帮助软件开发人员识别并修复使用最新英特尔软件开发工具时遇到的与 NUMA 相关的应用性能问题。

Autor Sunny G. (Intel) Última actualización 05/07/2019 - 20:13