Article

Parallelization Using Intel® Threading Building Blocks (Intel® TBB)

Compiler Methodology for Intel® MIC Architecture

Author: Admin Last updated: 2019/08/01 - 09:30
Article

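Parallelization Using Intel® Threading Building Blocks (Intel® TBB)

Compiler Methodology for Intel® MIC Architecture

Parallelization Using Intel® Threading Building Blocks (Intel® TBB)

Overview

Author: Ronald W Green (Blackbelt) Last updated: 2019/08/01 - 09:30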
Article

Measuring performance in HPC

This is the first in a series of articles about high-performance computing with the Intel® Xeon Phi™ coprocessor.

Last updated: 2019/07/06 - 16:10
Blog

Power Configuration Part 0: Introduction: Yikes, there is a lot that is not documented

I was hoping to write a brief two-part overview of how to configure the various power settings for the Intel® Xeon Phi™ coprocessor.

Last updated: 2019/07/06 - 17:00
Blog

BKMs on the use of the SIMD directive

We received a request at one of the "Birds of a Feather" meetings that Intel® holds at venues such as the Super Computing* (SC) and International Super Computing* (ISC) conferences.

Last updated: 2019/07/06 - 17:00
Article

A Brief Survey of NUMA (Non-Uniform Memory Architecture) Literature

This document presents a list of articles on NUMA (Non-uniform Memory Architecture) that the author considers particularly useful. The document is divided into categories corresponding to the type of article being referenced. Often the referenced article could have been placed in more than one category. In this situation, the reference to the article is placed in what the author thinks is the...
Last updated: 2019/07/06 - 16:40
Article

Putting Your Data and Code in Order: Data and layout - Part 2

Apply the concepts of parallelism and distributed memory computing to your code to improve software performance. This paper expands on the concepts discussed in Part 1 to cover parallelism, including both vectorization (single instruction, multiple data, or SIMD) and shared memory parallelism (threading), as well as distributed memory computing.
Author: David M. Last updated: 2019/07/06 - 16:40
Article

Improve Performance with Vectorization

This article focuses on the steps to improve software performance with vectorization. Included are examples of full applications along with some simpler cases to illustrate the steps to vectorization.
Author: David M. Last updated: 2019/07/06 - 16:40
Article

Putting Your Data and Code in Order: Data and Layout - Part 2

Apply the concepts of parallelism and distributed memory computing to your code to improve software performance. This paper expands on the concepts discussed in Part 1 to cover parallelism, including both vectorization (single instruction, multiple data, or SIMD) and shared memory parallelism (threading), as well as distributed memory computing.
Author: David M. Last updated: 2019/07/06 - 16:40
Blog

Introduction to Embree 2.1 - Part 1

This is part of a series of blogs on Embree, a collection of high-performance ray tracing kernels. Embree has been open source since version 1.0.

Author: Louis F. (Intel) Last updated: 2019/09/30 - 16:50