Article

Use Tasks Instead of Threads

Tasks are a lightweight alternative to threads that provide faster startup and shutdown times, better load balancing, more efficient use of available resources, and a higher level of abstraction.
Authored by admin Last updated on 07/05/2019 - 09:51
Article

Caffe* Optimized for Intel® Architecture: Applying Modern Code Techniques

This paper demonstrates a special version of Caffe*, a deep learning framework originally developed by the Berkeley Vision and Learning Center (BVLC), that is optimized for Intel® architecture.
Authored by Last updated on 07/06/2019 - 16:40
Article

Offload over Fabric Tutorial for the Intel® Xeon Phi™ Processor

This tutorial shows how to install Offload over Fabric (OoF) software on a 2nd generation Intel® Xeon Phi™ processor, configure the hardware, test the basic configuration, and enable OoF.
Authored by Nguyen, Loc Q (Intel) Last updated on 03/21/2019 - 12:00
Article

Using Tasks Instead of Threads

Tasks are a lightweight alternative to threads that provide faster startup and shutdown times, better load balancing, more efficient use of available resources, and a higher level of abstraction.
Authored by admin Last updated on 07/05/2019 - 09:53
Article

Use Modern C++ Techniques to Enhance Multicore Optimization

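Multicore processors are now common in PCs, and core counts keep growing, so software engineers must adapt. By learning how to handle potential performance bottlenecks and concurrency issues, engineers can future-proof their code so that it seamlessly takes advantage of the additional cores added to consumer systems.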
Authored by Last updated on 08/02/2018 - 00:18
Article

Putting Your Data and Code in Order: Data and Layout - Part 2

Apply the concepts of parallelism and distributed memory computing to your code to improve software performance. This paper expands on concepts discussed in Part 1 to consider parallelism, covering both vectorization (single instruction, multiple data, or SIMD) and shared memory parallelism (threading), as well as distributed memory computing.
Authored by David M. Last updated on 07/06/2019 - 16:40
Article

Exploiting Data Parallelism in Ordered Data Streams

This article identifies some of the challenges of exploiting data parallelism in ordered data streams and illustrates strategies for addressing them while maintaining parallel performance.
Authored by admin Last updated on 07/05/2019 - 14:50
Article

Intel® Xeon Phi™ Processor Optimization Tutorial

In this tutorial, we demonstrate some possible ways to optimize an application to run on the Intel® Xeon Phi™ processor.
Authored by Nguyen, Loc Q (Intel) Last updated on 03/21/2019 - 12:00
Article

Optimizing Data Layout with SIMD Data Layout Templates

Financial services customers need to improve the performance of financial algorithms for models such as Monte Carlo, Black-Scholes, and others. SIMD programming can speed up these workloads. In this paper, we perform data layout optimizations using two approaches on a Black-Scholes workload for European options valuation from the open source QuantLib library.
Authored by Nimisha R. (Intel) Last updated on 12/12/2018 - 18:00
Article

Installing the Intel® Software Optimization for Theano* and Supporting Tools

Theano* is a Python* library developed at the LISA lab to define, optimize, and evaluate mathematical expressions, including the ones with multi-dimensional arrays. Theano can be installed and used with several combinations of development tools and libraries on a variety of platforms. This tutorial provides one such recipe describing steps to build and install Intel-optimized Theano with Intel®...
Authored by Sunny G. (Intel) Last updated on 05/08/2018 - 10:50