Article

Putting Your Data and Code in Order: Data and Layout - Part 2

Apply the concepts of parallelism and distributed memory computing to your code to improve software performance. This paper expands on the concepts discussed in Part 1 to cover parallelism, both vectorization (single instruction, multiple data, or SIMD) and shared-memory parallelism (threading), as well as distributed memory computing.
Created by David M. Last updated on 06/07/2019 - 16:40
Article

Caffe* Optimized for Intel® Architecture: Applying Modern Code Techniques

This paper demonstrates a special version of Caffe* — a deep learning framework originally developed by the Berkeley Vision and Learning Center (BVLC) — that is optimized for Intel® architecture.
Last updated on 06/07/2019 - 16:40
Article

Optimizing Data Layout with SIMD Data Layout Templates

Financial services customers need to improve the performance of financial algorithms for models such as Monte Carlo and Black-Scholes. SIMD programming can speed up these workloads. In this paper, we apply data layout optimizations, using two approaches, to a Black-Scholes workload for European options valuation from the open-source QuantLib library.
Created by Nimisha R. (Intel). Last updated on 12/12/2018 - 18:00
Article

Improving SIMD Efficiency in Animation with SIMD Data Layout Templates and Data Preprocessing

In this paper, we walk through a 3D animation algorithm example and describe techniques and methodologies that may benefit your next vectorization effort. We also integrate the algorithm with SIMD Data Layout Templates (SDLT), a feature of the Intel® C++ Compiler, to improve data layout and SIMD efficiency. Includes a code sample.
Last updated on 25/03/2019 - 11:40
Article

Intel® Xeon Phi™ Processor Optimization Tutorial

In this tutorial, we demonstrate several ways to optimize an application to run on the Intel® Xeon Phi™ processor.
Created by Nguyen, Loc Q (Intel). Last updated on 21/03/2019 - 12:00
Article

Offload over Fabric Tutorial for Intel® Xeon Phi™ Processors

This tutorial shows how to install Offload over Fabric (OoF) software on a 2nd generation Intel® Xeon Phi™ processor, configure the hardware, test the basic configuration, and enable OoF.
Created by Nguyen, Loc Q (Intel). Last updated on 21/03/2019 - 12:00
Article

How to Use MPI-3 Shared Memory on Intel® Xeon Phi™ Processors

Learn how to use MPI-3 shared memory on Intel® Xeon Phi™ processors.
Created by Nguyen, Loc Q (Intel). Last updated on 06/07/2019 - 16:30