博客

Vectorization Series, Part 3 - What are the Benefits?

This will be the final post in my planned short vectorization series. Although I reserve the right to post more on vectorization in the future!

作者: Shannon Cepeda (Blackbelt) 最后更新时间: 2017/06/14 - 15:56
博客

Vectorization Series, Part 2- Who Can Use It?

In my last blog, I introduced the concept of vectorization, which is parallelism across data elements in a regi

作者: Shannon Cepeda (Blackbelt) 最后更新时间: 2017/06/14 - 16:16
博客

Some Performance Advantages of Using a Task-Based Parallelism Model

As part of my focus on software performance, I also support and consult on implementing scalable parallelism in applications.

作者: Shannon Cepeda (Blackbelt) 最后更新时间: 2019/02/04 - 10:40
博客

Let's rename "for" to "serial_for"...

Proposal: rename for in C and C++ to serial_for No more incumbent "for." (it was voted off the island)

作者: James R. (Blackbelt) 最后更新时间: 2017/06/14 - 16:06
博客

Parallelism as a First Class Citizen in C and C++, the time has come

It is time to make Parallelism a full First Class Citizen in C and C++.  Hardware is once again ahead of software, and we need to close the gap so that application development is better able to uti

作者: James R. (Blackbelt) 最后更新时间: 2017/06/14 - 16:04
博客

Graduate Intern at Intel - Parallel Ray-Tracing

Ray-tracing is a classic example of an embarrassingly parallel algorithm; since each pixel is typically independent of the rest, theoretically every pixel can be done in parallel (given enough core

作者: 最后更新时间: 2017/06/14 - 15:37
博客

Graduate Intern at Intel - Parallel N-Body

The N-Body problem is a classic example used frequently to demonstrate parallelization and how it improves performance.

作者: 最后更新时间: 2017/06/14 - 15:46
Article

Visualizing Parallel Speedup with Cilkview

Posted by Will Leiserson originally on www.cilk.com on Tue, Jun 30, 2009

作者: 最后更新时间: 2018/05/24 - 20:49
Article

Intel Cilk++ SDK Resource Library

The technical articles listed below supplement the information provided in the Cilk-Programmers-Guide (pdf) included in the

作者: 最后更新时间: 2017/10/11 - 11:28
Article

Superscalar Programming 101 (Matrix Multiply) Part 1 of 5

Part one of a five-part series, this article teaches a methodology to interpret statistics gathered during test runs and use those interpretations to improve parallel code.
作者: jimdempseyatthecove (Blackbelt) 最后更新时间: 2019/07/04 - 22:00